Return-Path: Received: from smtp1.linuxfoundation.org (smtp1.linux-foundation.org [172.17.192.35]) by mail.linuxfoundation.org (Postfix) with ESMTPS id D0D33F35 for ; Fri, 8 Jan 2016 01:26:25 +0000 (UTC) X-Greylist: from auto-whitelisted by SQLgrey-1.7.6 Received: from mail.bluematt.me (mail.bluematt.me [192.241.179.72]) by smtp1.linuxfoundation.org (Postfix) with ESMTPS id D0F36165 for ; Fri, 8 Jan 2016 01:26:24 +0000 (UTC) Received: from [172.17.0.2] (gw.vpn.bluematt.me [162.243.132.6]) by mail.bluematt.me (Postfix) with ESMTPSA id 6A3505BC69; Fri, 8 Jan 2016 01:26:23 +0000 (UTC) To: Gavin Andresen , Ethan Heilman References: From: Matt Corallo Message-ID: <568F103E.1050909@mattcorallo.com> Date: Fri, 8 Jan 2016 01:26:22 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=ham version=3.3.1 X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on smtp1.linux-foundation.org Cc: Bitcoin Dev Subject: Re: [bitcoin-dev] Time to worry about 80-bit collision attacks or not? X-BeenThere: bitcoin-dev@lists.linuxfoundation.org X-Mailman-Version: 2.1.12 Precedence: list List-Id: Bitcoin Development Discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 08 Jan 2016 01:26:25 -0000 So just because other attacks are possible we should weaken the crypto we use? You may feel comfortable weakening crypto used to protect a few billion dollars of other peoples' money, but I dont. On 01/07/16 23:39, Gavin Andresen via bitcoin-dev wrote: > Thanks, Ethan, that's helpful and I'll stop thinking that collision > attacks require 2^(n/2) memory... > > So can we quantify the incremental increase in security of > SHA256(SHA256) over RIPEMD160(SHA256) versus the incremental increase in > security of having a simpler implementation of segwitness? > > I'm going to claim that the difference in the first case is very, very, > very small-- the risk of an implementation error caused by having > multiple ways of interpreting the segwitness hash in the scriptPubKey is > much, much greater. > > And even if there IS some risk of collision attack now or at some point > in the future, I claim that it is easy for wallets to mitigate that > risk. In fact, the principle of security in depth means wallets that > don't completely control the scriptPubKeys they're creating on behalf of > users SHOULD be coded to mitigate that risk (e.g. not allowing arbitrary > data around a user's public key in a Script so targeted substring > attacks are eliminated entirely). > > Purely from a security point of view, I think a single 20-byte > segwitness in the scriptPubKey is the best design. > "Keep the design as simple and small as possible" > https://www.securecoding.cert.org/confluence/plugins/servlet/mobile#content/view/2426 > > Add in the implied capacity increase of smaller scriptPubKeys and I > still think it is a no-brainer. > > > On Thu, Jan 7, 2016 at 5:56 PM, Ethan Heilman > wrote: > > >Ethan: your algorithm will find two arbitrary values that collide. That isn't useful as an attack in the context we're talking about here (both of those values will be useless as coin destinations with overwhelming probability). > > I'm not sure exactly the properties you want here and determining > these properties is not an easy task, but the case is far worse than > just two random values. For instance: (a). with a small modification > my algorithm can also find collisions containing targeted substrings, > (b). length extension attacks are possible with RIPEMD160. > > (a). targeted cycles: > > target1 = "str to prepend" > target2 = "str to end with" > > seed = {0,1}^160 > x = hash(seed) > > for i in 2^80: > ....x = hash(target1||x||target2) > x_final = x > > y = hash(tartget1||x_final||target2) > > for j in 2^80: > ....if y == x_final: > ........print "cycle len: "+j > ........break > ....y = hash(target1||y||target2) > > If a collision is found, the two colliding inputs must both start with > "str to prepend" and end with the phrase "str to end with". As before > this only requires 2^81.5 computations and no real memory. For an > additional 2**80 an adversary has an good change of finding two > different targeted substrings which collide. Consider the case where > the attacker mixes the targeted strings with the hash output: > > hash("my name is=0x329482039483204324423"+x[1]+", my favorite number > is="+x) where x[1] is the first bit of x. > > (b). length extension attacks > > Even if all the adversary can do is create two random values that > collide, you can append substrings to the input and get collisions. > Once you find two random values hash(x) = hash(y), you could use a > length extension attack on RIPEMD-160 to find hash(x||z) = hash(y||z). > > Now the bitcoin wiki says: > "The padding scheme is identical to MD4 using Merkle–Damgård > strengthening to prevent length extension attacks."[1] > > Which is confusing to me because: > > 1. MD4 is vulnerable to length extension attacks > 2. Merkle–Damgård strengthening does not protect against length > extension: "Indeed, we already pointed out that none of the 64 > variants above can withstand the 'extension' attack on the MAC > application, even with the Merkle-Damgard strengthening" [2] > 3. RIPEMD-160 is vulnerable to length extension attacks, is Bitcoin > using a non-standard version of RIPEMD-160. > > RIPEMD160(SHA256()) does not protect against length extension attacks > on SHA256, but should protect RIPEMD-160 against length extension > attacks as RIPEMD-160 uses 512-bit message blocks. That being said we > should be very careful here. Research has been done that shows that > cascading the same hash function twice is weaker than using HMAC[3]. I > can't find results on cascading RIPEMD160(SHA256()). > > RIPEMD160(SHA256()) seems better than RIPEMD160() though, but security > should not rest on the notion that an attacker requires 2**80 memory, > many targeted collision attacks can work without much memory. > > [1]: https://en.bitcoin.it/wiki/RIPEMD-160 > [2]: "Merkle-Damgard Revisited: How to Construct a Hash Function" > https://www.cs.nyu.edu/~puniya/papers/merkle.pdf > [3]: https://www.cs.nyu.edu/~dodis/ps/h-of-h.pdf > > On Thu, Jan 7, 2016 at 4:06 PM, Gavin Andresen via bitcoin-dev > > wrote: > > Maybe I'm asking this question on the wrong mailing list: > > > > Matt/Adam: do you have some reason to think that RIPEMD160 will be > broken > > before SHA256? > > And do you have some reason to think that they will be so broken > that the > > nested hash construction RIPEMD160(SHA256()) will be vulnerable? > > > > Adam: re: "where to stop" : I'm suggesting we stop exactly at > the current > > status quo, where we use RIPEMD160 for P2SH and P2PKH. > > > > Ethan: your algorithm will find two arbitrary values that > collide. That > > isn't useful as an attack in the context we're talking about here > (both of > > those values will be useless as coin destinations with overwhelming > > probability). > > > > Dave: you described a first preimage attack, which is 2**160 cpu > time and no > > storage. > > > > > > -- > > -- > > Gavin Andresen > > > > _______________________________________________ > > bitcoin-dev mailing list > > bitcoin-dev@lists.linuxfoundation.org > > > https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev > > > > > > > -- > -- > Gavin Andresen > > > _______________________________________________ > bitcoin-dev mailing list > bitcoin-dev@lists.linuxfoundation.org > https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev >