DAOS-19212 object: client retry modification if TX_RESTART repeatedly - b28 #18601

Draft
Nasf-Fan wants to merge 1 commit into release/2.8 from Nasf-Fan/DAOS-19212_1_b28

Conversation

Nasf-Fan commented Jul 2, 2026

Contributor

On the server side, when an IO handler repeatedly hits -DER_TX_RESTART, the failure is quite possibly related to server overload or to RPC delays caused by congestion. In that case, retrying on the server with a newer epoch may add to the server's workload and congestion. Instead, ask the client to retry after a backoff delay.

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

… - b28

On the server side, when an IO handler repeatedly hits -DER_TX_RESTART,
the failure is quite possibly related to server overload or to RPC
delays caused by congestion. In that case, retrying on the server with
a newer epoch may add to the server's workload and congestion. Instead,
ask the client to retry after a backoff delay.

Signed-off-by: Fan Yong <fan.yong@hpe.com>

github-actions Bot commented Jul 2, 2026

Ticket title is 'Aurora: performance jobs keep timing out with v2.8.0-rc1 and MDonSSD'
Status is 'In Progress'
Labels: '2.8.0rc1,md_on_ssd,scrubbed_2.8,test_2.8.0rc'
https://daosio.atlassian.net/browse/DAOS-19212

Labels

None yet

1 participant