From subhra.mazumdar1993 at gmail.com  Mon Apr 13 03:39:44 2020
From: subhra.mazumdar1993 at gmail.com (Subhra Mazumdar)
Date: Mon, 13 Apr 2020 09:09:44 +0530
Subject: [Lightning-dev] Proof-of-closure as griefing attack mitigation
In-Reply-To: <CEHIx_VK7oXu1vmI3SDTO5RW0DfqNX89WcCJ5O6ZIyxAjLL5XI8juk1-XVUgbMWmmqPlMEDtJONU0xcBSL70tii4YaPVNnJ8XeD8gLOVwr4=@protonmail.com>
References: <3N2W7o-6QbX0lG38elnlTiG5bcgI5yRhSORS7ibxgZEtLe2mGyI4gW6jVK1IP6cXF-F97uSDWTlKIygsCcx_yUkIiYt2COYBTRLTLH3ofV0=@protonmail.com>
 <CAPvY=RiTeBqbXT5gyeeP1WRWN5b8=3_eR7qJW4Y2QSYM0KwSKg@mail.gmail.com>
 <CEHIx_VK7oXu1vmI3SDTO5RW0DfqNX89WcCJ5O6ZIyxAjLL5XI8juk1-XVUgbMWmmqPlMEDtJONU0xcBSL70tii4YaPVNnJ8XeD8gLOVwr4=@protonmail.com>
Message-ID: <CAPvY=RjOn=PJHAs_8+D5VO5_wBdiWpBJwPx=qwpsrhn5WJtoWQ@mail.gmail.com>

Ok. But this is a worse situation where C pays money to D but bound to keep
its resource locked for a longer duration, unlike D not responding and C
being able to unlock after the elapse of lock time.

On Mon, Apr 13, 2020, 08:21 ZmnSCPxj <ZmnSCPxj at protonmail.com> wrote:

> Good morning Subhra,
>
> > Hello,
> >       So based on what you have stated as possible scenario of griefing
> attack, does delay in providing the preimage also counted as a form of
> griefing in htlc? Like given the path A->B->C->D, what if C and D has a
> lock time of 144 blocks and D responds after 142 block time elapses,
> claiming the money locked with D?
>
> That ***is*** the griefing attack.
>
> Regards,
> ZmnSCPxj
>
> >
> > On Wed, Apr 1, 2020, 11:49 ZmnSCPxj via Lightning-dev <
> lightning-dev at lists.linuxfoundation.org> wrote:
> >
> > > Introduction
> > > ============
> > >
> > > Given the fact that contracts on offchain protocols need to be
> enforceable onchain as well, timelocks involved in multi-hop payments are
> measured in blocks.
> > > This is because the blockchain can only (third-party-verifiably)
> enforce timeouts in units of entire blocks.
> > > This leads to very long timeouts for payment delivery, thus multi-hop
> offchain payment attempts can be, deliberately or accidentally, be in a
> "pending" state up to the very large timeouts involved.
> > >
> > > Deliberately setting up a multi-hop payment such that it will be in a
> "pending" state for long periods of time is colloquially known as a
> "griefing attack".
> > > In this article, we assess various proposed solutions to mitigate the
> effects of griefing attacks, and propose a particular solution,
> proof-of-closure, as well, that requires significant changes to the update
> state machine.
> > >
> > > Digression: Why Grief?
> > > ======================
> > >
> > > Before embarking on our investigation for solutions to the griefing
> problem, we should first wonder if griefing is, in fact, a problem.
> > >
> > > This brings up the question of: why would anybody grief at all?
> > >
> > > Humans, like cats and other less-sapient pieces of walking meat, often
> find enjoyment in causing the suffering of others for no immediate direct
> gain to themselves, as a public demonstration of dominance over those they
> make suffer (aka "shits and giggles", which, if executed correctly, can
> lead to eventual direct gains to themselves or their progeny or relatives
> or allies, but such details are often outside the ken of the very beings
> who execute such survival strategies: brains are pieces of meat that have
> been hacked to act as action-reaction engines, but are not sophisticated
> enough to execute as pure rationality engines at all times).
> > > Fortunately, in the Bitcoin world, only purely rational beings of pure
> selfishness can exist in the long run, thus we can neglect such motivations
> as mere noise.
> > >
> > > First, let us investigate *how* griefing attacks can be performed.
> > >
> > > * An intermediate node in a multi-hop attempt can delay forwarding or
> failing an incoming HTLC.
> > > * A final node in a payment attempt can delay claiming an incoming
> HTLC.
> > >
> > > Let us consider a purely rational intermediate node of pure
> selfishness:
> > >
> > > * If it forwards as soon as possible, it can earn fees, and also speed
> up the release of the HTLC-locked funds so that they can reuse those funds
> as liquidity for further payment attempts.
> > > * Thus, delaying an HTLC is not selfishly-rational for an intermediate
> node.
> > >
> > > Thus, for an intermediate node, it seems there is no
> selfishly-rational motivation to execute a griefing attack on an arbitrary
> payment attempt.
> > > We can then conclude that an intermediate that delays a payment would
> do so, not of its own rational self-interest, but as an accident, such as
> an unforeseen connectivity or power failure.
> > >
> > > However, things are different when we consider a non-arbitrary payment.
> > > Suppose a node were to make a payment attempt to itself, and
> deliberately delay claiming this self-payment.
> > > This lets any single node, *who happens to own large liquidity*, to
> lock up the liquidity of other nodes.
> > >
> > > The motivation to lock up the liquidity of other nodes is to
> *eliminate competition*.
> > > Suppose we have a network as below:
> > >
> > >     A -- B -- C
> > >       \     /
> > >        \   /
> > >         \ /
> > >          E
> > >
> > > When A and C want to transact with one another, they may choose to
> route via either B or E.
> > > B and E are therefore competitors in the business of forwarding
> payments.
> > >
> > > But suppose E has much larger channels AE and CE than the channels of
> AB and CB.
> > > For example, suppose E has 100mBTC perfectly-balanced channels while B
> has only 10mBTC perfectly-balanced channels, as all things should be in
> simplified models of reality.
> > > E can then "take out the competition" by making a 5mBTC self-payment
> along E->A->B->C->E and a 5mBTC self-payment along E->C->B->A->E, then
> refusing to claim the payment, tying up all the liquidity of the channels
> of B.
> > > By doing so, it can ensure that A and C will always fail to pay via B,
> even if they wish to transact in amounts less than 5mBTC.
> > > E thereby eliminates B as a competitor.
> > >
> > > This demonstrates that griefing attacks will be motivated, such that
> such attacks will be performed by payers and payees *against intermediate
> nodes*.
> > > Intermediate nodes have no motivation to attack payers and payees
> (those are their potential customers in the business of forwarding
> payments, and attacking potential customers is bad business: such attacking
> intermediate nodes will be removed economically in the long run).
> > > However, payers and payees can become motivated to attack intermediate
> nodes, if the "payer" and "payee" are actually competitor intermediate
> nodes.
> > >
> > > (We can observe that this is always a possibility even outside of
> Lightning: a service or product provider has no incentive to attack its
> customers ("the customer is always right"), but have an incentive to
> *pretend* to be a customer of a competitor and attack them.)
> > >
> > > We will keep this fact in mind: active griefing attacks are attacks
> *on* intermediate nodes, not *by* intermediate nodes, because there is no
> economic incentive for intermediate nodes to attack their customers.
> > >
> > > Previous Proposed Solutions
> > > ===========================
> > >
> > > Time-Spent Reporting
> > > --------------------
> > >
> > > At each channel along the route, the time spent by a node to handle
> its forwarding is recorded, and reported upstream in the route.
> > >
> > > Unfortunately, this solution protects payers from intermediate nodes
> and payees: it does not protect intermediate nodes from colluding payers
> and payees.
> > > Even if an intermediate node knows that a particular node is
> consistently slow via a previous time-spent report, it will not be able,
> with our current onion routing, determine if an onion packet it just
> received will or will not go through the known-slow node.
> > > Thus, an intermediate node would not be able to defend against distant
> payees that, with a colluding payer, will not claim a particular payment.
> > >
> > > As we have established, an active griefing atttack will never be
> deliberately performed by a selfishly-rational intermediate node.
> > > Thus, this solution protects against the wrong thing: it protects
> payers against slow/unreliable intermediate nodes, it does not protect
> intermediate nodes against malicious payer/payee collusions.
> > > It protects only against intermediate nodes that inadvertently go
> offline during forwarding, but such nodes will inevitably lose out on the
> forwarding market anyway, and will disappear in the long run.
> > >
> > > Up-Front Payment
> > > ----------------
> > >
> > > Payers pay for an attempt, not just the successful completion of an
> attempt.
> > >
> > > A variation on this is that the payer (or payee) continuously pays as
> long as the payment is pending.
> > > Further variations include paying by other means, such as just locking
> funds or paying with proof-of-work.
> > >
> > > While it certainly erects economic barriers against payer/payee
> collusions attacking intermediate nodes, it *also* erects economic barriers
> against normal, non-malicious payments.
> > >
> > > We can consider that economic barriers against non-malicious,
> low-value, high-frequency payments ("micropayments") may be enough that
> such payments become infeasible if we impose up-front payment for mere
> attempts.
> > > Thus, while this solution is certainly something we can consider, we
> must be reluctant to use it due to its up-front, strict-evaluation behavior.
> > >
> > > Proof-Of-Closure
> > > ================
> > >
> > > Observing the above, we want the properties for a "good" solution to
> griefing attacks to be:
> > >
> > > * It should protect intermediate nodes against payer/payee collusions.
> > > * It should only come into play upon detection of an attack.
> > >
> > > We now present proof-of-closure, which (we hope) has the above
> properties.
> > >
> > > We can consider instead a softer timeout, distinct from the HTLC
> block-based timeout.
> > > This softer timeout is measurable in fractions of a second, e.g. units
> of 0.1 seconds.
> > >
> > > Each node on the network advertises, in addition to a block-based
> `cltv_delta`, a `timeout_delta` in units of 0.1 seconds.
> > > Further, each invoice contains, in addition to a block-based
> `final_cltv`, a `final_timeout` in units of 0.1 seconds.
> > >
> > > Thus, there are two timeouts:
> > >
> > > * The current "hard" block-based timeout that is enforceable onchain.
> > > * A new "soft" sidereal-time-based timeout that is not onchain
> enforceable.
> > >
> > > The soft timeout, as mentioned, is not enforceable onchain.
> > > Instead, enforcement of the soft timeout *is* the act of putting the
> channel state onchain.
> > >
> > > Now, for the current "hard" block-based timeout, we already have a
> reaction.
> > > If the HTLC "hard" timeout is approaching:
> > >
> > > * Drop the channel onchain and enforce the hard timeout onchain to
> reclaim the funds in the HTLCs.
> > > * Wait for the onchain action to be deeply resolved (either timelock
> or hashlock branch is confirmed deeply) and report the result (success or
> fail) upstream.
> > >
> > > What happens if the "soft" timeout is violated?
> > >
> > > * Drop the channel onchain.
> > > * Report the channel closure upstream.
> > >
> > > The "hard" timeout is cancelled in any of these two conditions:
> > >
> > > * A success is reported via `update_fulfill_htlc`, OR,
> > > * A failure is reported via `update_fail_htlc` AND the HTLC is
> irrevocably removed from the latest commitments/state(s) of the channel.
> > >
> > > The "soft" timeout is cancelled in any of these three conditions, the
> first two of which are the same as above:
> > >
> > > * A success is reported via `update_fulfill_htlc`, OR,
> > > * A failure is reported via `update_fail_htlc` AND the HTLC is
> irrevocably removed from the latest commitments/state(s) of the channel, OR
> > > * A channel closure is reported.
> > >
> > > Let us fill this in more detail.
> > >
> > > Suppose we have a payment route A->B->C->E.
> > >
> > > Both the "hard" block timeouts and the "soft" second timeouts
> decrement monotonically at each hop.
> > > Thus, the payee E has the shortest "hard" and "soft" timeouts (as
> normal).
> > >
> > > * Suppose E then delays claiming the payment and violates the "soft"
> timeout.
> > > * C then drops the CE channel onchain.
> > > * C reports, before its own timeout (slightly larger than the timeout
> imposed on E), the closing of the channel CE, to B.
> > > * B validates this report, and if valid, propagates the report to A.
> > > * A validates this report, and if valid, accepts that the payment will
> be "stuck" for up to the hard timeout it imposed on B.
> > >
> > > C has to report back to B in order to prevent B from closing the BC
> channel, and B has to report back to A in order to prevent A from closing
> the AB channel.
> > > The decrementing seconds-unit timeouts are needed for each hop, for
> the same reason that decrementing block-unit timeouts are needed.
> > >
> > > Since E is motivated to attack intermediate nodes because it wants to
> redirect payment forwards through itself rather than its competitotrs,
> having one of its channels closed (which prevents it from being used for
> forwarding) is directly opposed to its end goal of getting more money,
> thus, we can believe the action of closing a channel involved in a griefing
> attack is sufficient disincentive.
> > >
> > > The major drawback is that enforcement of the soft timeout *is* a
> channel closure, which is generally a negative for the network.
> > > This is not a remote attack vector, since a node can only trigger this
> closure if it is able to stall the fulfillment or failure of an HTLC on a
> channel, which generally means the node triggering this closure can only do
> so for its own channels (or it is able to, via a separate mechanism,
> remotely crash a different node).
> > >
> > > Proving Channel Closes
> > > ----------------------
> > >
> > > What C *really* needs to prove is that:
> > >
> > > * It is *willing* to close a channel due to a violation of the soft
> timeout.
> > > * The channel it is willing to close was, in fact, involved in the
> same payment attempt.
> > >
> > > With the above, B can believe that C was innocent of wrongdoing,
> because:
> > >
> > > * C would only be wiling to close a channel in case of a protocol
> violation, in this case, a violation of the soft timeout.
> > > * The channel it closed was closed because of this payment attempt,
> and not because of another payment attempt, or some other unrelated channel
> being unilaterally closed.
> > >
> > > First, what C needs to prove is *NOT*, in fact, actual channel
> closure: it needs to prove a *willingness* to close a channel.
> > > Thus, it does not require the channel to actually be *closed* yet,
> i.e. it does not have to wait for onchain activity that the channel closure
> is in a mempool and is confirmed deeply onchain etc etc.
> > >
> > > Thus, to prove a *willingness to close* rather than an actual close, C
> can provide the unilateral close of the channel CE.
> > > The act of unilaterally closing a channel is the publication of the
> transaction(s) making up the unilateral close.
> > > Thus, if C is *willing* to close the channel, it is willing to publish
> the transaction(s) involved, and thus, providing the unilateral close to B
> and further upstream, shows a willingness to close the channel.
> > >
> > > B then validates the provided proof-of-closure by checking that the
> unilateral close transaction is either onchain, in the mempool, or that it
> spends a TXO that is not currently spent by another transaction.
> > > In the case the unilateral close transaction is not confirmed and in
> the mempool, B can speed up its propagation on the Bitcoin layer by putting
> it in its own mempool as well --- after all, C is willing to close the
> channel to exonerate itself and punish the actual culprit, and B putting
> the unilateral close in its own mempool can only help C in what it is
> willing to do.
> > >
> > > Secondly, C needs to prove that the channel it is willing to close
> involves the payment attempt, and is not some other channel closure that it
> is attempting to use to fulfill its own soft timeout.
> > > Since the unilateral close transaction *is* the proof-of-closure, B
> (and A) can inspect the transaction outputs and see (with some additional
> data from C) that one of the outputs is to an HTLC that matches the payment
> hash.
> > >
> > > Thus, B (and A) can believe that the proof-of-closure proves that
> whoever is presenting it is free of wrongdoing, as whoever is actually
> causing the delay has been punished (by someone being willing to close a
> channel with the culprit), and that the proof-of-closure commits to this
> particular payment attempt and no other (because it commits to a particular
> payment hash).
> > >
> > > Further, if CE is closed by E dropping it onchain rather than C, C
> will still be able to fulfill its own soft timeout by taking the closing
> transaction from E, which should still contain the HTLC.
> > > Indeed, neither A nor B will particularly care (nor need to know) who
> dropped the channel onchain, or (for A) that the channel participants are C
> and E.
> > >
> > > Update State Shenanigans
> > > ------------------------
> > >
> > > Bitcoin update mechanisms are complicated things, and it may be
> possible for an attacking payee E to fool around with the update state
> machine to make it difficult for C to report a willingness to close CE.
> > >
> > > In particular, I quote here the relevant passages from
> `lightning-rfc`, `02-peer-protocol.md`, which is an implementation of the
> Poon-Dryja update mechanism:
> > >
> > > > Thus each update traverses through the following states:
> > > >
> > > > 1. pending on the receiver
> > > > 2. in the receiver's latest commitment transaction
> > > > 3. ... and the receiver's previous commitment transaction has been
> revoked,
> > > >    and the update is pending on the sender
> > > > 4. ... and in the sender's latest commitment transaction
> > > > 5. ... and the sender's previous commitment transaction has been
> revoked
> > >
> > > The payee E is the "receiver" in this context.
> > >
> > > In this case, once the update has reached step 2, then E has a
> commitment transaction that it can put onchain, that contains an HTLC it
> can claim.
> > > From this step onward, C cannot send a failure (i.e. it cannot send
> back an `update_fail_htlc`) back to B, because E could drop its latest
> commitment onchain and claim the HTLC onchain.
> > >
> > > However, until step 4, C does not have a unilateral close containing
> the HTLC, and thus cannot provide a proof-of-closure that contains an HTLC
> that refers to the payment.
> > >
> > > Thus, between steps 2 to 4, C cannot safely respond to its own soft
> timeout.
> > > C cannot respond with a failure, as E could then drop its latest
> commitment transaction onchain and claim the payment from C, and extract
> money from C that way.
> > > C also cannot respond with a proof-of-closure, as it does not have a
> transaction that it can use to provide this proof.
> > >
> > > The best that C can do would be to impose an even shorter timeout
> between steps 2 and 4 above, and to drop its current commitment transaction
> (which does not contain the HTLC yet and thus does not constitute a valid
> proof-of-closure) onchain.
> > > In between the time it drops the commitment transaction and its own
> incoming soft timeout, there is a chance, however small, that this
> transaction will be confirmed, and the channel will (with high probability)
> settle in a state where the HTLC is not instantiated, thus C can safely
> fail its incoming HTLC (not show a proof-of-closure, since that is not
> possible for C to do) without risk of loss, just prior to its own soft
> timeout.
> > >
> > > Of course, C is still at risk here: E could collude with miners via a
> side-channel fee offer to confirm its commitment transaction with the HTLC
> present, and ensure that C is liable for the HTLC value.
> > >
> > > With Decker-Russell-Osuntokun, we can remove this risk by requiring a
> ritual as follows:
> > >
> > > 1.  C requests exclusive access to update their single shared state.
> > >   * This can be done via a variety of sub-protocols, including a fair
> coin toss in case of near-simultaneous requests for exclusive locks on both
> sides.
> > > 2.  C provides the details of the new HTLC to E.
> > > 3.  C and E generate the new state transaction and exchange signatures
> for it.
> > > 4.  C and E generate (without signing) the new update transaction.
> > > 5.  E provides the signature (or share of signature, if MuSig) for the
> new update transaction to C.
> > > 6.  C provides the signature for the new update transaction to E,
> which releases the exclusive lock on the shared state atomically with the
> finalization of the new update transaction.
> > >
> > > Prior to step 5, C can simply fail the incoming HTLC from B in case
> its own soft timeout is near.
> > > Even if E performs step 5 after C has already failed the incoming HTLC
> from B, C can simply not perform step 6 and drop the channel onchain with
> the previous update and state transactions.
> > >
> > > With Poon-Dryja, we will have to rearrange the order in which we
> perform things, effectively adding an extra communications turnaround
> between the participants.
> > > Specifically, the order would have to be revised to:
> > >
> > > > 1. pending on the sender
> > > > 2. in the sender's latest commitment transaction
> > > > 3. ... and the sender's previous commitment transaction has been
> revoked,
> > > >    and the update is pending on the receiver
> > > > 4. ... and in the receiver's latest commitment transaction
> > > > 5. ... and the receiver's previous commitment transaction has been
> revoked
> > >
> > > This allows the sender (C in our context) to provide a
> proof-of-closure after step 2, and before step 2, C can safely return a
> failure with `update_fail_htlc` (and refuse to proceed beyond step 2, thus
> ensuring it can still use the previous commitment that still has no HTLC).
> > >
> > > Of course, this change will require redesigning the update state
> machine, increasing the number of communication turnarounds, and creating a
> subtle incompatbility when transitioning a payment from a hop that knows
> only the old update state machine to a hop that knows the new update state
> machine.
> > >
> > > Purely Falsified Proof-Of-Closure
> > > ---------------------------------
> > >
> > > Of course, the attacking node E might want to create a false
> proof-of-closure.
> > > E can do this by simulating a Lightning channel: lock an amount of
> funds in a 2-of-2 (where E controls both keys), then spend it in a set of
> transactions mimicking the unilateral close.
> > >
> > > We observe, however, that the overhead of simulating a Lightning
> channel is the same as the overhead of actually creating and closing a
> Lightning channel.
> > > Since the punishment of proof-of-closure is to force attackers to have
> their channels closed, we can consider that this simulation of a channel
> open and close is sufficient as well.
> > >
> > > Future-Proofing
> > > ---------------
> > >
> > > This sketch of proof-of-closure can be used for any update mechanism:
> > >
> > > * With Poon-Dryja, C can use its own commitment transaction as the
> proof-of-closure.
> > > * With Decker-Wattenhofer, C can give all the offchain transactions up
> to the last stage in the multi-stage decrementing-`nSequence` mechanism.
> > > * With Deckker-Russell-Osuntokun, C can give the latest update and
> state trnsaction.
> > >
> > > Basically, we expect that for now, and in the future, any update
> mechanism worth consideration will have a concept of "unilateral close"
> where a channel can be dropped onchain, using data that only one of the
> channel participants holds.
> > >
> > > Such a unilateral close will be a sequence of one or more valid
> transactions, terminating in a transaction containing an HTLC-like contract
> in one of its outputs.
> > >
> > > Thus, to validate the unilateral close, it is only required to
> validate all the transactions contained in the proof-of-closure, and see
> that the last transaction has an HTLC output.
> > >
> > > The limitations are thus:
> > >
> > > * The acceptable forms of HTLC would need to be agreed-upon by the
> entire network.
> > > * Implementations would need to be able to assess, in a
> Bitcoin-consensus-compatible way, whether a transaction is valid or not.
> > >
> > > Payment Decorrelation and Payment Points
> > > ----------------------------------------
> > >
> > > Of course, having a single payment hash for the entire payment attempt
> is a privacy loss, which we intend to fix in the near future by using
> payment points, and adding a blinding scalar at each hop, aka. payment
> decorrelation.
> > >
> > > Thus, in the future, there will not be any HTLC, but instead a PTLC.
> > > Further, the payment point at each hop will be changed at each hop, in
> order to prevent decorrelation.
> > >
> > > Thus, C needs to provide proofs:
> > >
> > > * That an apparent singlesig on the unilateral close output is in fact
> a PTLC.
> > >   C needs to provide:
> > >   * A target point P.
> > >   * A partial signature that would spend that singlesig for a
> particular sighash.
> > >   * An adaptor signature which, with knowledge of the completed
> signature, adaptor signature, and sighash message, would have revealed the
> scalar behind P.
> > > * That the PTLC belongs to the same payment attempt as what B offered
> to C.
> > >   C needs to provide:
> > >   * The C-only blinding factor that is the difference between the
> payment point of the B-to-C PTLC and the C-to-E PTLC on the unilateral
> close.
> > >
> > > Then, when B needs to propagate the proof-of-closure back to A, B
> simply adds its own blinding factor to the reported blinding factor, in
> order to convince A that this is the same payment attempt.
> > >
> > > As we have brought up privacy, we observe that, when this mechanism
> triggers, there is a mild privacy loss, in that intermediate nodes now know
> some channel closure that is related to this payment, and can thus
> determine the exact path that the payment attempt went through, at least
> until the channel being closed.
> > > However, proof-of-closure is only propagated in case of violation of
> the soft timeout, so for normal non-malicious payments, proof-of-closure
> does not cause any privacy loss.
> > > _______________________________________________
> > > Lightning-dev mailing list
> > > Lightning-dev at lists.linuxfoundation.org
> > > https://lists.linuxfoundation.org/mailman/listinfo/lightning-dev
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.linuxfoundation.org/pipermail/lightning-dev/attachments/20200413/ae2be073/attachment-0001.html>