fail_hard is not working as described #2847

ximinez · 2019-02-05T22:27:13Z

The fail_hard parameter to submit (https://developers.ripple.com/submit.html) is described as

If true, and the transaction fails locally, do not retry or relay the transaction to other servers

This is not the current rippled behavior. Instead, only ter result codes (except for terQUEUED) are not retried or relayed. All other failure codes do retry. This is unlikely to be a problem because those failure codes will not succeed on most retry events, BUT some of them could potentially succeed if the protocol rules change while it's retrying, eg. an amendment goes live.

The text was updated successfully, but these errors were encountered:

ximinez · 2019-02-05T22:33:31Z

To keep implementation simple for now, continue to treat terQUEUED specially. ie. those transactions will still be relayed, and retried locally if necessary.

intelliot · 2019-02-06T01:04:51Z

As a client of rippled, I care about which category the result code falls under:

The transaction has been forwarded (or will be forwarded) to other server(s).
The transaction was NOT forwarded to other servers, and never will be.

In case of (1), I'll watch fully validated ledgers to see if they included my transaction, and if one does, I'll look to see what effect my transaction had on the ledger.

In case of (2), I'll know that my transaction "failed" with finality, so I can feel free to create/submit a new transaction re-using that tx's sequence number.

Currently, I have to assume everything is (1) because even tem txs get held for a few tries. With fail_hard: true, I would like to have an additional field in the submit response that indicates which of the above two categories applies to the transaction. It could be a boolean called hard_failed.

mDuo13 · 2019-02-06T22:45:04Z

My 2¢:

Yeah, terQUEUED should be treated as not a hard failure. All other ter, tem, tef, and tel codes should be hard failures.
Maybe another "stronger" option could instruct the server to skip the queue, so that terQUEUED would also be a hard failure.
Possibly even tec codes could be treated as hard failures and not relayed (but I guess removing them from the open ledger could be messy?). That way, if you're using your own server, you don't have to destroy XRP for every time you screw up and got tecPATH_DRY because you forgot to include proper Paths or something. You might want to limit this to admins if it's significantly more work than rejecting tel / tef / etc. Or just assume that tec transactions are forwarded and will probably make it into a ledger.
temDISABLED should really be more like a terDISABLED because the transaction is (at least somewhat) likely to become valid in the foreseeable future. That said, it doesn't really make sense to keep and retry those unless the amendment in question is nearing the point of becoming enabled. This whole point is probably outside the scope of this issue, but it's related. Obviously we can't guarantee that a tem code is truly final without knowing every future amendment that could ever be invented and enabled, but I'd like to move as close to "tem codes are final" as we can.
- Crazy-talk: we could totally have a tefDISABLED which means, "The amendment you need for this transaction can't possibly become enabled until after this transaction's LastLedgerSequence has passed." Of course, without knowing how much realtime a flag ledger takes, you would only know this if the LLS is less than 1 flag ledger away. (I'm assuming that the minimum number of ledger versions for an amendment to become enabled is two flag ledgers—one for it to gain a majority, one for it to still have that majority, with two weeks' time in between.)

ximinez · 2019-02-07T17:17:03Z

In case of (2), I'll know that my transaction "failed" with finality, so I can feel free to create/submit a new transaction re-using that tx's sequence number.

Just adding a friendly reminder that you can't know 100% that your transaction is in (2), if you don't trust / control the server you're submitting to, or the wire you're submitting over. That aside, you can always attempt to submit a new transaction with the same sequence number. Even if the old transaction is in the local list, the new one may go into the queue or open ledger, which would cause later attempts on the old one to fail with tefPAST_SEQ. As long as you do it before something happens to let that old, "bad" transaction succeed, you're good to go.

Currently, I have to assume everything is (1) because even tem txs get held for a few tries. With fail_hard: true, I would like to have an additional field in the submit response that indicates which of the above two categories applies to the transaction. It could be a boolean called hard_failed.

I would not object to adding such a field, but I'm not sure if it's necessary if we implement things correctly.

Yeah, terQUEUED should be treated as not a hard failure. All other ter, tem, tef, and tel codes should be hard failures.

You know... While we're in there making changes, it would be "easy" enough to force those types of failures to never be retried, regardless of the value of fail_hard.

Possibly even tec codes could be treated as hard failures and not relayed (but I guess removing them from the open ledger could be messy?). That way, if you're using your own server, you don't have to destroy XRP for every time you screw up and got tecPATH_DRY because you forgot to include proper Paths or something. You might want to limit this to admins if it's significantly more work than rejecting tel / tef / etc. Or just assume that tec transactions are forwarded and will probably make it into a ledger.

I think that's technically possible with flags, so it wouldn't be super difficult to implement. However, it would not be a good idea unless it was limited to admins. On a more public node, we want anybody who can pay a fee to pay the fee to discourage them from experimenting or abusing that node. This merits further discussion. I think this would be a breaking change requiring an amendment, because it changes whether a transaction gets put into the open ledger at all, even if that is only in a local context.

temDISABLED should really be more like a terDISABLED because the transaction is (at least somewhat) likely to become valid in the foreseeable future. That said, it doesn't really make sense to keep and retry those unless the amendment in question is nearing the point of becoming enabled. This whole point is probably outside the scope of this issue, but it's related. Obviously we can't guarantee that a tem code is truly final without knowing every future amendment that could ever be invented and enabled, but I'd like to move as close to "tem codes are final" as we can.

Crazy-talk: we could totally have a tefDISABLED which means, "The amendment you need for this transaction can't possibly become enabled until after this transaction's LastLedgerSequence has passed." Of course, without knowing how much realtime a flag ledger takes, you would only know this if the LLS is less than 1 flag ledger away. (I'm assuming that the minimum number of ledger versions for an amendment to become enabled is two flag ledgers—one for it to gain a majority, one for it to still have that majority, with two weeks' time in between.)

I agree about temDISABLED being the wrong code, but I disagree that should make any decision based on whether the amendment is about to go active. I think it's fair to have a ruling that if you jump the gun on an amendment, then it's on you to hold on to that transaction and resubmit it yourself when the amendment goes live. Also, on a technical level, making that decision is probably unnecessarily difficult.

intelliot · 2019-02-08T05:38:55Z

Just adding a friendly reminder that you can't know 100% that your transaction is in (2), if you don't trust / control the server you're submitting to, or the wire you're submitting over.

That's not a big deal because you always have to use https/wss and trust the server you're using. fail_hard is irrelevant here; if the server is not trusted, then it could lie to you about anything, including transaction results. For example, the server could claim your transaction was finalized as a tec when it was actually tes, tricking you into re-sending money.

You know... While we're in there making changes, it would be "easy" enough to force those types of failures to never be retried, regardless of the value of fail_hard.

That sounds fine.

Possibly even tec codes could be treated as hard failures and not relayed (but I guess removing them from the open ledger could be messy?). That way, if you're using your own server, you don't have to destroy XRP for every time you screw up and got tecPATH_DRY because you forgot to include proper Paths or something. You might want to limit this to admins if it's significantly more work than rejecting tel / tef / etc. Or just assume that tec transactions are forwarded and will probably make it into a ledger.

I like this idea. For admins, with fail_hard: true, I would like tec to be a hard failure and not relayed. Basically treating/converting them to a tem (or maybe tef).

I think this would be a breaking change requiring an amendment, because it changes whether a transaction gets put into the open ledger at all, even if that is only in a local context.

No, I don't think this should require an amendment because it should only affect the local context. We can avoid a breaking change by using an API version parameter, called e.g. api_version and using the new behavior for api_version: 2.

I think it's fair to have a ruling that if you jump the gun on an amendment, then it's on you to hold on to that transaction and resubmit it yourself when the amendment goes live.

Agreed.

JoelKatz · 2019-02-08T16:04:55Z

The 'fail_hard' feature was, unfortunately, poorly thought out and not very well conceived. For transactions that are perfect, it doesn't help you, they'll succeed no matter what. For transactions that are totally busted, it doesn't help you, they'll fail no matter what. It was supposed to help you in the squishy middle, but unfortunately the middle is quite squishy. It predates the transaction fee queue and so doesn't really handle that right at all. There's no clear right way to handle that. Broadly, the problem is that the context in which the server executes the transaction when it is submitted and the context in which the transaction might run if it reaches a network consensus and is sequenced can differ. A 'tec' at submission time does not mean a 'tec' at later execution. A 'tes' at submission does not mean a 'tes' at later execution. The original thinking was that it would be nice if you could treat a 'tef' as final. The transaction is very unlikely to succeed if retried anyway. So it would be nice to be able to treat that as a "can't ever happen" condition and have one less edge case to worry about. It doesn't make much sense to treat a 'tec' as final because there's definitely no guarantee the transaction will get a 'tec' later anyway (it's not necessarily even likely). It is very important for everyone to keep in mind that whatever result you get as a result of submitting a transaction, it is the result of that submission ONLY. It in no way tells you much about the final disposition of that transaction. And, of course, the fee queue makes this even more difficult. My advice has always been that if you sign a transaction, you should never assume that it can't wind up in a ledger later until you see a fully-validated ledger that either has a transaction from the same account with the same sequence number or you see a fully-validated ledger with a ledger sequence number equal to or grater than the last valid ledger for the transaction (if it has one). Nothing else should really be treated as final because it's not hard to create edge cases where results change. It would definitely be nice to have a flag in the response that told you whether or not the server promises not to continue trying to process the transaction.

intelliot · 2019-08-22T18:38:17Z

I would like to emphasize:

It would definitely be nice to have a flag in the response that told you whether or not the server promises not to continue trying to process the transaction.

That would be very helpful.

sublimator · 2019-08-22T18:40:23Z

FirstLedgerSequence ^^^^^

intelliot · 2019-08-22T18:41:25Z

I do not think FirstLedgerSequence is relevant here

sublimator · 2019-08-22T18:46:36Z

with the same sequence number or you see a fully-validated ledger with a

ledger sequence number equal to or grater than the last valid ledger for the transaction (if it has one).

intelliot · 2019-09-13T19:15:31Z

When we fix fail_hard, I think we should make it so that tec codes are treated as hard failures and not relayed.

However, it would not be a good idea unless it was limited to admins. On a more public node, we want anybody who can pay a fee to pay the fee to discourage them from experimenting or abusing that node. This merits further discussion.

I understand the reasoning here, as it's an anti-abuse mechanism and theoretically cuts down on spam by claiming transaction fees and destroying XRP. But in practice, I don't think this is is a valuable defense. If a node can be abused by sending invalid transactions to it, then we have bigger problems. This should be solved with rate limiting instead.

After all, there is already processing involved for transactions that end up with tem or tef, and users aren't charged XRP for those. I'm not sure what the existing rate limits are, but we should make sure they are restrictive enough that it's fine for users to submit transactions that ultimately tec without harming the node.

I think this would be a breaking change requiring an amendment, because it changes whether a transaction gets put into the open ledger at all, even if that is only in a local context.

I do not think it should require an amendment since it only affects the local context, and only when fail_hard is set. In general, there will still be tec transactions in the open ledger and in validated ledgers, since not every transaction will be submitted with the fail_hard option.

JoelKatz · 2019-09-16T22:53:42Z

That's not a big deal because you *always* have to use https/wss and trust the server you're using. fail_hard is irrelevant here; if the server is not trusted, then it could lie to you about anything, including transaction results. For example, the server could claim your transaction was finalized as a tec when it was actually tes, tricking you into re-sending money.

True, but only because clients are not very smart. I don't want to design for dumb clients. The original plan was that human readable APIs are for humans and that clients should be able to minimize server load by parsing binary data. A server can provide validations and proof trees to a smart client. Maybe that ship has sailed. DS

…

MarkusTeufelberger · 2019-09-17T06:13:22Z

Well, servers don't provide proof trees so far, so nobody wrote clients that would understand them.

FIXES: XRPLF#2847 * Transactions that are submitted with the fail_hard flag and that result in any TER code besides tesSUCCESS shall not be queued or held.

FIXES: XRPLF#2847 * Transactions that are submitted with the fail_hard flag and that result in any TER code besides tesSUCCESS shall be neither queued nor held.

FIXES: XRPLF#2847 * Transactions that are submitted with the fail_hard flag and that result in any TER code besides tesSUCCESS shall be neither queued nor held. [FOLD] Keep tec results out of the open ledger when fail_hard: * Improve TransactionStatus const correctness, and remove redundant `local` check * Check open ledger tx count in fail_hard tests * Fix some wrapping * Remove duplicate test

ximinez self-assigned this Feb 5, 2019

ximinez added API Change Good First Issue Great issue for a new contributor Bug labels Feb 5, 2019

gituser mentioned this issue May 4, 2019

rippled report me telINSUF_FEE_P even the transaction validate in ledger. #2427

Closed

intelliot mentioned this issue Jul 11, 2019

Clarify that terQUEUED can change ledger state XRPLF/xrpl-dev-portal#623

Merged

mDuo13 mentioned this issue Jul 30, 2019

Charge fees for use of memo #3007

Closed

intelliot mentioned this issue Aug 22, 2019

Adding Fail_hard option in submit Command XRPLF/xrpl.js#1029

Merged

intelliot mentioned this issue Oct 11, 2019

Multiple Payments tefMAX_LEDGER Tentative Message: Ledger sequence too high. XRPLF/xrpl.js#1047

Closed

intelliot added this to the 2019-10 milestone Oct 15, 2019

undertome mentioned this issue Nov 18, 2019

Change how fail_hard transactions are handled. #3160

Closed

mellery451 closed this as completed in cd9732b Jan 14, 2020

ximinez removed their assignment Jan 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fail_hard is not working as described #2847

fail_hard is not working as described #2847

ximinez commented Feb 5, 2019 •

edited

Loading

ximinez commented Feb 5, 2019

intelliot commented Feb 6, 2019

mDuo13 commented Feb 6, 2019

ximinez commented Feb 7, 2019

intelliot commented Feb 8, 2019

JoelKatz commented Feb 8, 2019 via email •

edited

Loading

intelliot commented Aug 22, 2019

sublimator commented Aug 22, 2019 via email

intelliot commented Aug 22, 2019

sublimator commented Aug 22, 2019 via email

intelliot commented Sep 13, 2019

JoelKatz commented Sep 16, 2019 via email

MarkusTeufelberger commented Sep 17, 2019

fail_hard is not working as described #2847

fail_hard is not working as described #2847

Comments

ximinez commented Feb 5, 2019 • edited Loading

ximinez commented Feb 5, 2019

intelliot commented Feb 6, 2019

mDuo13 commented Feb 6, 2019

ximinez commented Feb 7, 2019

intelliot commented Feb 8, 2019

JoelKatz commented Feb 8, 2019 via email • edited Loading

intelliot commented Aug 22, 2019

sublimator commented Aug 22, 2019 via email

intelliot commented Aug 22, 2019

sublimator commented Aug 22, 2019 via email

intelliot commented Sep 13, 2019

JoelKatz commented Sep 16, 2019 via email

MarkusTeufelberger commented Sep 17, 2019

ximinez commented Feb 5, 2019 •

edited

Loading

JoelKatz commented Feb 8, 2019 via email •

edited

Loading