MIME-Version: 1.0
References: <CAApLimjfPKDxmiy_SHjuOKbfm6HumFPjc9EFKvw=3NwZO8JcmQ@mail.gmail.com>
	<CAAS2fgTHTK8Dve9xHW9yULa1yObWtmwmeKKcD_BMjON=RAw8Sg@mail.gmail.com>
	<E5BD6DC6-281B-46E5-ABD3-71B2D5549902@sprovoost.nl>
In-Reply-To: <E5BD6DC6-281B-46E5-ABD3-71B2D5549902@sprovoost.nl>
From: Jim Posen <jim.posen@gmail.com>
Date: Sun, 10 Jun 2018 16:07:07 -0700
Message-ID: <CADZtCSiPkah3sX5yw_29mcnFxFm_SrgzFxOuaQwffMHhRDaobg@mail.gmail.com>
To: Bitcoin Protocol Discussion <bitcoin-dev@lists.linuxfoundation.org>
Content-Type: multipart/alternative; boundary="000000000000f00327056e51b1ee"
Subject: Re: [bitcoin-dev] UHS: Full-node security without maintaining a
 full UTXO set
Precedence: list

--000000000000f00327056e51b1ee
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

I generally like the direction of this proposal in terms of allowing full
nodes to run with a different storage/bandwidth tradeoff. Cory, were this
implemented, would you expect Core to support both operating modes (full
UTXO set and UHS) depending on user configuration, or would UHS be
mandatory?

Also, given that Bram Cohen's TXO bitfield proposal was an inspiration for
this, could you comment on why the UHS is preferable to that approach? An
alternative that goes even further in the direction of more bandwidth, less
storage, would be for nodes to simply maintain a Merkle Mountain Range over
all TXOs in order of creation and a spentness bitfield. Blocks could be
requested with the prev outputs and a Merkle proof linking them into the
MMR root. Since the Merkle proof is deterministic, it could be computed by
archive nodes and miners and saved alongside the block data for relay.
Another benefit of this is the TXO MMR root may be independently useful if
committed into the coinbase transaction.

On Thu, Jun 7, 2018 at 7:02 AM Sjors Provoost via bitcoin-dev <
bitcoin-dev@lists.linuxfoundation.org> wrote:

> eMMC storage, which low end devices often use, come in 2x increments.
> Running a pruned full node on 8 GB is difficult if not impossible (the UT=
XO
> set peaked at 3.5 GB in January, but a full node stores additional stuff)=
.
>
> However, 16 GB is only =E2=82=AC10 more expensive and presumably standard=
 by the
> time this would be rolled out.
>
> On AWS every GB of SSD storage avoided saves $1 per year, not end of the
> world stuff, but not negligible either. Outbound traffic costs $0.10 / GB
> (ignoring free allowance), so when uploading 200 GB per year, the 5% woul=
d
> offset $1 of storage cost savings.
>
> The above seems marginal, probably not worth it unless there=E2=80=99s re=
ally no
> downside.
>
> What I find attractive about this proposal is the ability to squeeze more
> out of limited RAM (typically only 1 or 2 GB on these low end devices). I=
=E2=80=99d
> have to test Cory=E2=80=99s branch to see if that actually matters in pra=
ctice.
>
> It=E2=80=99s also useful to distinguish benefits during initial sync from=
 ongoing
> operation. The former I=E2=80=99ve almost given up on for  low end device=
s (can
> take weeks), in favor of doing it on a faster computer and copying the
> result. The latter needs far less RAM, so perhaps this proposal doesn=E2=
=80=99t
> help much there, but that would be useful to measure.
>
> Did you try the recent SHA256 optimizations on your branch?
>
> Sjors
>
> > Op 17 mei 2018, om 18:56 heeft Gregory Maxwell via bitcoin-dev <
> bitcoin-dev@lists.linuxfoundation.org> het volgende geschreven:
> >
> > On Wed, May 16, 2018 at 4:36 PM, Cory Fields via bitcoin-dev
> > <bitcoin-dev@lists.linuxfoundation.org> wrote:
> >> Tl;dr: Rather than storing all unspent outputs, store their hashes.
> >
> > My initial thoughts are it's not _completely_ obvious to me that a 5%
> > ongoing bandwidth increase is actually a win to get something like a
> > 40% reduction in the size of a pruned node (and less than a 1%
> > reduction in an archive node) primarily because I've not seen size of
> > a pruned node cited as a usage limiting factor basically anywhere. I
> > would assume it is a win but wouldn't be shocked to see a careful
> > analysis that concluded it wasn't.
> >
> > But perhaps more interestingly, I think the overhead is not really 5%,
> > but it's 5% measured in the context of the phenomenally inefficient tx
> > mechanisms ( https://bitcointalk.org/index.php?topic=3D1377345.0 ).
> > Napkin math on the size of a txn alone tells me it's more like a 25%
> > increase if you just consider size of tx vs size of
> > tx+scriptpubkeys,amounts.  If I'm not missing something there, I think
> > that would get in into a very clear not-win range.
> >
> > On the positive side is that it doesn't change the blockchain
> > datastructure, so it's something implementations could do without
> > marrying the network to it forever.
> >
>
> _______________________________________________
> bitcoin-dev mailing list
> bitcoin-dev@lists.linuxfoundation.org
> https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev
>

--000000000000f00327056e51b1ee
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr">I generally like the direction of this proposal in terms o=
f allowing full nodes to run with a different storage/bandwidth tradeoff. C=
ory, were this implemented, would you expect Core to support both operating=
 modes (full UTXO set and UHS) depending on user configuration, or would UH=
S be mandatory?<div><br></div><div>Also, given that Bram Cohen&#39;s TXO bi=
tfield proposal was an inspiration for this, could you comment on why the U=
HS is preferable to that approach? An alternative that goes even further in=
 the direction of more bandwidth, less storage, would be for nodes to simpl=
y maintain a Merkle Mountain Range over all TXOs in order of creation and a=
 spentness bitfield. Blocks could be requested with the prev outputs and a =
Merkle proof linking them into the MMR root. Since the Merkle proof is dete=
rministic, it could be computed by archive nodes and miners and saved along=
side the block data for relay. Another benefit of this is the TXO MMR root =
may be independently useful if committed into the coinbase transaction.</di=
v></div><br><div class=3D"gmail_quote"><div dir=3D"ltr">On Thu, Jun 7, 2018=
 at 7:02 AM Sjors Provoost via bitcoin-dev &lt;<a href=3D"mailto:bitcoin-de=
v@lists.linuxfoundation.org">bitcoin-dev@lists.linuxfoundation.org</a>&gt; =
wrote:<br></div><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8e=
x;border-left:1px #ccc solid;padding-left:1ex">eMMC storage, which low end =
devices often use, come in 2x increments. Running a pruned full node on 8 G=
B is difficult if not impossible (the UTXO set peaked at 3.5 GB in January,=
 but a full node stores additional stuff).<br>
<br>
However, 16 GB is only =E2=82=AC10 more expensive and presumably standard b=
y the time this would be rolled out.<br>
<br>
On AWS every GB of SSD storage avoided saves $1 per year, not end of the wo=
rld stuff, but not negligible either. Outbound traffic costs $0.10 / GB (ig=
noring free allowance), so when uploading 200 GB per year, the 5% would off=
set $1 of storage cost savings.<br>
<br>
The above seems marginal, probably not worth it unless there=E2=80=99s real=
ly no downside.<br>
<br>
What I find attractive about this proposal is the ability to squeeze more o=
ut of limited RAM (typically only 1 or 2 GB on these low end devices). I=E2=
=80=99d have to test Cory=E2=80=99s branch to see if that actually matters =
in practice.<br>
<br>
It=E2=80=99s also useful to distinguish benefits during initial sync from o=
ngoing operation. The former I=E2=80=99ve almost given up on for=C2=A0 low =
end devices (can take weeks), in favor of doing it on a faster computer and=
 copying the result. The latter needs far less RAM, so perhaps this proposa=
l doesn=E2=80=99t help much there, but that would be useful to measure.<br>
<br>
Did you try the recent SHA256 optimizations on your branch?<br>
<br>
Sjors<br>
<br>
&gt; Op 17 mei 2018, om 18:56 heeft Gregory Maxwell via bitcoin-dev &lt;<a =
href=3D"mailto:bitcoin-dev@lists.linuxfoundation.org" target=3D"_blank">bit=
coin-dev@lists.linuxfoundation.org</a>&gt; het volgende geschreven:<br>
&gt; <br>
&gt; On Wed, May 16, 2018 at 4:36 PM, Cory Fields via bitcoin-dev<br>
&gt; &lt;<a href=3D"mailto:bitcoin-dev@lists.linuxfoundation.org" target=3D=
"_blank">bitcoin-dev@lists.linuxfoundation.org</a>&gt; wrote:<br>
&gt;&gt; Tl;dr: Rather than storing all unspent outputs, store their hashes=
.<br>
&gt; <br>
&gt; My initial thoughts are it&#39;s not _completely_ obvious to me that a=
 5%<br>
&gt; ongoing bandwidth increase is actually a win to get something like a<b=
r>
&gt; 40% reduction in the size of a pruned node (and less than a 1%<br>
&gt; reduction in an archive node) primarily because I&#39;ve not seen size=
 of<br>
&gt; a pruned node cited as a usage limiting factor basically anywhere. I<b=
r>
&gt; would assume it is a win but wouldn&#39;t be shocked to see a careful<=
br>
&gt; analysis that concluded it wasn&#39;t.<br>
&gt; <br>
&gt; But perhaps more interestingly, I think the overhead is not really 5%,=
<br>
&gt; but it&#39;s 5% measured in the context of the phenomenally inefficien=
t tx<br>
&gt; mechanisms ( <a href=3D"https://bitcointalk.org/index.php?topic=3D1377=
345.0" rel=3D"noreferrer" target=3D"_blank">https://bitcointalk.org/index.p=
hp?topic=3D1377345.0</a> ).<br>
&gt; Napkin math on the size of a txn alone tells me it&#39;s more like a 2=
5%<br>
&gt; increase if you just consider size of tx vs size of<br>
&gt; tx+scriptpubkeys,amounts.=C2=A0 If I&#39;m not missing something there=
, I think<br>
&gt; that would get in into a very clear not-win range.<br>
&gt; <br>
&gt; On the positive side is that it doesn&#39;t change the blockchain<br>
&gt; datastructure, so it&#39;s something implementations could do without<=
br>
&gt; marrying the network to it forever.<br>
&gt; <br>
<br>
_______________________________________________<br>
bitcoin-dev mailing list<br>
<a href=3D"mailto:bitcoin-dev@lists.linuxfoundation.org" target=3D"_blank">=
bitcoin-dev@lists.linuxfoundation.org</a><br>
<a href=3D"https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev" =
rel=3D"noreferrer" target=3D"_blank">https://lists.linuxfoundation.org/mail=
man/listinfo/bitcoin-dev</a><br>
</blockquote></div>

--000000000000f00327056e51b1ee--