summaryrefslogtreecommitdiff
path: root/de/c74d7cdcedbbf77aead280a4ff57e741291f3c
blob: 2ae84c3884a49e0ec2f4fd6eca215e08ccc49a2b (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
Return-Path: <elombrozo@gmail.com>
Received: from smtp1.linuxfoundation.org (smtp1.linux-foundation.org
	[172.17.192.35])
	by mail.linuxfoundation.org (Postfix) with ESMTPS id 6122C959
	for <bitcoin-dev@lists.linuxfoundation.org>;
	Thu,  1 Jun 2017 21:00:05 +0000 (UTC)
X-Greylist: whitelisted by SQLgrey-1.7.6
Received: from mail-yb0-f179.google.com (mail-yb0-f179.google.com
	[209.85.213.179])
	by smtp1.linuxfoundation.org (Postfix) with ESMTPS id 41735163
	for <bitcoin-dev@lists.linuxfoundation.org>;
	Thu,  1 Jun 2017 21:00:04 +0000 (UTC)
Received: by mail-yb0-f179.google.com with SMTP id 202so13743948ybd.0
	for <bitcoin-dev@lists.linuxfoundation.org>;
	Thu, 01 Jun 2017 14:00:04 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025;
	h=mime-version:in-reply-to:references:from:date:message-id:subject:to
	:cc; bh=kkl865D/0dJcs258H3MYGNIgVARzc3lff/6/tTXSqyc=;
	b=AD9981OD73aa54R4zs1lBJtuaUeHvbedMYHD8abZjRgmoYzYHwC64ZG3MKBFdibxMI
	boITOEUzUzac6Ski/6skC1gbRFTpQOTYMKuE9HfpvaA6JsJ85y4vghRx3I/AaNTxUUxA
	hyYJSzCgbPFtfydhj5vkWYN1LsloGmvJzxsATP5Q3T3LDL9VPVPRmSsUxtCtNFN8+yl4
	46bZOYVinNm0JK+sdnUM8tOzSUvQqGRWHNA77IEAtP+nIy7x9DPYsUfwDkZFcesYrPdL
	X/qpmfzeI7e1zHBYz7JiMmF//Pi5MA/UDf7OwZVShaVp/jRFP6fZ55Qohr746qiOkiZh
	JmzQ==
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
	d=1e100.net; s=20161025;
	h=x-gm-message-state:mime-version:in-reply-to:references:from:date
	:message-id:subject:to:cc;
	bh=kkl865D/0dJcs258H3MYGNIgVARzc3lff/6/tTXSqyc=;
	b=juvnEeaAxUYNmto6VDCyj8BkatYW8RHy4OFm79PTNb+CXmW/OcLp/epUzX4XtGntk7
	c4ZVE2PxOxN1YiiveGwIysmWsl8pERO9/Qw8AxwtvIt9lXEmJtEKFFr2PP9/vh2YZFZh
	M+P2Sx8snS/wchzrajIcsPYoJCK9HP6k+CeM32kipfbL+TNw3HaFawMfuzonVQ6Anxae
	m0C+QUNGWiGDJ2QnG66xuK9XPLygN5IRguQo/uRFzXa5R9m74zdkpkREMKVEFRuyWw8e
	kGAJ5DtlM6AqZhpt9jURNeZydjZnEN+G7gHmwmENKlG8eoobT0/lDnmDAzBuwSQzc1ux
	LyaQ==
X-Gm-Message-State: AODbwcBJNLdYCJoEWaOENZhlEV9m0sLfmGKXemJwuhD0eW24molBwlx9
	NFOPYWwIrA9Y217zGcrzYFIC+S9hIw==
X-Received: by 10.37.181.201 with SMTP id d9mr14032719ybg.87.1496350803317;
	Thu, 01 Jun 2017 14:00:03 -0700 (PDT)
MIME-Version: 1.0
Received: by 10.37.210.206 with HTTP; Thu, 1 Jun 2017 14:00:02 -0700 (PDT)
In-Reply-To: <CAO3Pvs8ccTkgrecJG6KFbBW+9moHF-FTU+4qNfayeE3hM9uRrg@mail.gmail.com>
References: <CAO3Pvs8ccTkgrecJG6KFbBW+9moHF-FTU+4qNfayeE3hM9uRrg@mail.gmail.com>
From: Eric Lombrozo <elombrozo@gmail.com>
Date: Thu, 1 Jun 2017 14:00:02 -0700
Message-ID: <CABr1YTdQy9=XiCQOuSnLq_y1PDWg7mC=Wquv9YEFDW832J2NhQ@mail.gmail.com>
To: Olaoluwa Osuntokun <laolu32@gmail.com>
Content-Type: multipart/alternative; boundary="089e0821e844175c3f0550ec5228"
X-Spam-Status: No, score=-1.5 required=5.0 tests=BAYES_00,DKIM_SIGNED,
	DKIM_VALID, DKIM_VALID_AU, FREEMAIL_FROM, HTML_MESSAGE,
	RCVD_IN_DNSWL_NONE, 
	RCVD_IN_SORBS_SPAM autolearn=no version=3.3.1
X-Spam-Checker-Version: SpamAssassin 3.3.1 (2010-03-16) on
	smtp1.linux-foundation.org
Cc: Arnoud Kouwenhoven - Pukaki Corp via bitcoin-dev
	<bitcoin-dev@lists.linuxfoundation.org>
Subject: Re: [bitcoin-dev] BIP Proposal: Compact Client Side Filtering for
 Light Clients
X-BeenThere: bitcoin-dev@lists.linuxfoundation.org
X-Mailman-Version: 2.1.12
Precedence: list
List-Id: Bitcoin Protocol Discussion <bitcoin-dev.lists.linuxfoundation.org>
List-Unsubscribe: <https://lists.linuxfoundation.org/mailman/options/bitcoin-dev>,
	<mailto:bitcoin-dev-request@lists.linuxfoundation.org?subject=unsubscribe>
List-Archive: <http://lists.linuxfoundation.org/pipermail/bitcoin-dev/>
List-Post: <mailto:bitcoin-dev@lists.linuxfoundation.org>
List-Help: <mailto:bitcoin-dev-request@lists.linuxfoundation.org?subject=help>
List-Subscribe: <https://lists.linuxfoundation.org/mailman/listinfo/bitcoin-dev>,
	<mailto:bitcoin-dev-request@lists.linuxfoundation.org?subject=subscribe>
X-List-Received-Date: Thu, 01 Jun 2017 21:00:05 -0000

--089e0821e844175c3f0550ec5228
Content-Type: text/plain; charset="UTF-8"

Thanks for sending this proposal! I look forward to having a great
discussion around this.

- Eric

On Thursday, June 1, 2017, Olaoluwa Osuntokun via bitcoin-dev <
bitcoin-dev@lists.linuxfoundation.org> wrote:

> Hi y'all,
>
> Alex Akselrod and I would like to propose a new light client BIP for
> consideration:
>    * https://github.com/Roasbeef/bips/blob/master/gcs_light_
> client.mediawiki
>
> This BIP proposal describes a concrete specification (along with a
> reference implementations[1][2][3]) for the much discussed client-side
> filtering reversal of BIP-37. The precise details are described in the
> BIP, but as a summary: we've implemented a new light-client mode that uses
> client-side filtering based off of Golomb-Rice coded sets. Full-nodes
> maintain an additional index of the chain, and serve this compact filter
> (the index) to light clients which request them. Light clients then fetch
> these filters, query the locally and _maybe_ fetch the block if a relevant
> item matches. The cool part is that blocks can be fetched from _any_
> source, once the light client deems it necessary. Our primary motivation
> for this work was enabling a light client mode for lnd[4] in order to
> support a more light-weight back end paving the way for the usage of
> Lightning on mobile phones and other devices. We've integrated neutrino
> as a back end for lnd, and will be making the updated code public very
> soon.
>
> One specific area we'd like feedback on is the parameter selection. Unlike
> BIP-37 which allows clients to dynamically tune their false positive rate,
> our proposal uses a _fixed_ false-positive. Within the document, it's
> currently specified as P = 1/2^20. We've done a bit of analysis and
> optimization attempting to optimize the following sum:
> filter_download_bandwidth + expected_block_false_positive_bandwidth. Alex
> has made a JS calculator that allows y'all to explore the affect of
> tweaking the false positive rate in addition to the following variables:
> the number of items the wallet is scanning for, the size of the blocks,
> number of blocks fetched, and the size of the filters themselves. The
> calculator calculates the expected bandwidth utilization using the CDF of
> the Geometric Distribution. The calculator can be found here:
> https://aakselrod.github.io/gcs_calc.html. Alex also has an empirical
> script he's been running on actual data, and the results seem to match up
> rather nicely.
>
> We we're excited to see that Karl Johan Alm (kallewoof) has done some
> (rather extensive!) analysis of his own, focusing on a distinct encoding
> type [5]. I haven't had the time yet to dig into his report yet, but I
> think I've read enough to extract the key difference in our encodings: his
> filters use a binomial encoding _directly_ on the filter contents, will we
> instead create a Golomb-Coded set with the contents being _hashes_ (we use
> siphash) of the filter items.
>
> Using a fixed fp=20, I have some stats detailing the total index size, as
> well as averages for both mainnet and testnet. For mainnet, using the
> filter contents as currently described in the BIP (basic + extended), the
> total size of the index comes out to 6.9GB. The break down is as follows:
>
>     * total size:  6976047156
>     * total avg:  14997.220622758816
>     * total median:  3801
>     * total max:  79155
>     * regular size:  3117183743
>     * regular avg:  6701.372750217131
>     * regular median:  1734
>     * regular max:  67533
>     * extended size:  3858863413
>     * extended avg:  8295.847872541684
>     * extended median:  2041
>     * extended max:  52508
>
> In order to consider the average+median filter sizes in a world worth
> larger blocks, I also ran the index for testnet:
>
>     * total size:  2753238530
>     * total avg:  5918.95736054141
>     * total median:  60202
>     * total max:  74983
>     * regular size:  1165148878
>     * regular avg:  2504.856172982827
>     * regular median:  24812
>     * regular max:  64554
>     * extended size:  1588089652
>     * extended avg:  3414.1011875585823
>     * extended median:  35260
>     * extended max:  41731
>
> Finally, here are the testnet stats which take into account the increase
> in the maximum filter size due to segwit's block-size increase. The max
> filter sizes are a bit larger due to some of the habitual blocks I
> created last year when testing segwit (transactions with 30k inputs, 30k
> outputs, etc).
>
>      * total size:  585087597
>      * total avg:  520.8839608674402
>      * total median:  20
>      * total max:  164598
>      * regular size:  299325029
>      * regular avg:  266.4790836307566
>      * regular median:  13
>      * regular max:  164583
>      * extended size:  285762568
>      * extended avg:  254.4048772366836
>      * extended median:  7
>      * extended max:  127631
>
> For those that are interested in the raw data, I've uploaded a CSV file
> of raw data for each block (mainnet + testnet), which can be found here:
>      * mainnet: (14MB): https://www.dropbox.com/s/
> 4yk2u8dj06njbuv/mainnet-gcs-stats.csv?dl=0
>      * testnet: (25MB): https://www.dropbox.com/s/
> w7dmmcbocnmjfbo/gcs-stats-testnet.csv?dl=0
>
>
> We look forward to getting feedback from all of y'all!
>
> -- Laolu
>
>
> [1]: https://github.com/lightninglabs/neutrino
> [2]: https://github.com/Roasbeef/btcd/tree/segwit-cbf
> [3]: https://github.com/Roasbeef/btcutil/tree/gcs/gcs
> [4]: https://github.com/lightningnetwork/lnd/
>
> -- Laolu
>
>

--089e0821e844175c3f0550ec5228
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Thanks for sending this proposal! I look forward to having a great discussi=
on around this.<div><br></div><div>- Eric</div><div><br>On Thursday, June 1=
, 2017, Olaoluwa Osuntokun via bitcoin-dev &lt;<a href=3D"mailto:bitcoin-de=
v@lists.linuxfoundation.org">bitcoin-dev@lists.linuxfoundation.org</a>&gt; =
wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;bord=
er-left:1px #ccc solid;padding-left:1ex"><div dir=3D"ltr"><div>Hi y&#39;all=
,=C2=A0</div><div><br></div><div>Alex Akselrod and I would like to propose =
a new light client BIP for</div><div>consideration:=C2=A0</div><div>=C2=A0 =
=C2=A0* <a href=3D"https://github.com/Roasbeef/bips/blob/master/gcs_light_c=
lient.mediawiki" target=3D"_blank">https://github.com/Roasbeef/<wbr>bips/bl=
ob/master/gcs_light_<wbr>client.mediawiki</a></div><div><br></div><div>This=
 BIP proposal describes a concrete specification (along with a</div><div>re=
ference implementations[1][2][3]) for the much discussed client-side</div><=
div>filtering reversal of BIP-37. The precise details are described in the<=
/div><div>BIP, but as a summary: we&#39;ve implemented a new light-client m=
ode that uses</div><div>client-side filtering based off of Golomb-Rice code=
d sets. Full-nodes</div><div>maintain an additional index of the chain, and=
 serve this compact filter</div><div>(the index) to light clients which req=
uest them. Light clients then fetch</div><div>these filters, query the loca=
lly and _maybe_ fetch the block if a relevant</div><div>item matches. The c=
ool part is that blocks can be fetched from _any_</div><div>source, once th=
e light client deems it necessary. Our primary motivation</div><div>for thi=
s work was enabling a light client mode for lnd[4] in order to</div><div>su=
pport a more light-weight back end paving the way for the usage of</div><di=
v>Lightning on mobile phones and other devices. We&#39;ve integrated neutri=
no</div><div>as a back end for lnd, and will be making the updated code pub=
lic very</div><div>soon.</div><div><br></div><div>One specific area we&#39;=
d like feedback on is the parameter selection. Unlike</div><div>BIP-37 whic=
h allows clients to dynamically tune their false positive rate,</div><div>o=
ur proposal uses a _fixed_ false-positive. Within the document, it&#39;s</d=
iv><div>currently specified as P =3D 1/2^20. We&#39;ve done a bit of analys=
is and</div><div>optimization attempting to optimize the following sum:</di=
v><div>filter_download_bandwidth + expected_block_false_positive_<wbr>bandw=
idth. Alex</div><div>has made a JS calculator that allows y&#39;all to expl=
ore the affect of</div><div>tweaking the false positive rate in addition to=
 the following variables:</div><div>the number of items the wallet is scann=
ing for, the size of the blocks,</div><div>number of blocks fetched, and th=
e size of the filters themselves. The</div><div>calculator calculates the e=
xpected bandwidth utilization using the CDF of</div><div>the Geometric Dist=
ribution. The calculator can be found here:</div><div><a href=3D"https://aa=
kselrod.github.io/gcs_calc.html" target=3D"_blank">https://aakselrod.github=
.io/<wbr>gcs_calc.html</a>. Alex also has an empirical</div><div>script he&=
#39;s been running on actual data, and the results seem to match up</div><d=
iv>rather nicely.</div><div><br></div><div>We we&#39;re excited to see that=
 Karl Johan Alm (kallewoof) has done some</div><div>(rather extensive!) ana=
lysis of his own, focusing on a distinct encoding</div><div>type [5]. I hav=
en&#39;t had the time yet to dig into his report yet, but I</div><div>think=
 I&#39;ve read enough to extract the key difference in our encodings: his</=
div><div>filters use a binomial encoding _directly_ on the filter contents,=
 will we</div><div>instead create a Golomb-Coded set with the contents bein=
g _hashes_ (we use</div><div>siphash) of the filter items.</div><div><br></=
div><div>Using a fixed fp=3D20, I have some stats detailing the total index=
 size, as</div><div>well as averages for both mainnet and testnet. For main=
net, using the</div><div>filter contents as currently described in the BIP =
(basic + extended), the</div><div>total size of the index comes out to 6.9G=
B. The break down is as follows:</div><div><br></div><div>=C2=A0 =C2=A0 * t=
otal size: =C2=A06976047156</div><div>=C2=A0 =C2=A0 * total avg: =C2=A01499=
7.220622758816</div><div>=C2=A0 =C2=A0 * total median: =C2=A03801</div><div=
>=C2=A0 =C2=A0 * total max: =C2=A079155</div><div>=C2=A0 =C2=A0 * regular s=
ize: =C2=A03117183743</div><div>=C2=A0 =C2=A0 * regular avg: =C2=A06701.372=
750217131</div><div>=C2=A0 =C2=A0 * regular median: =C2=A01734</div><div>=
=C2=A0 =C2=A0 * regular max: =C2=A067533</div><div>=C2=A0 =C2=A0 * extended=
 size: =C2=A03858863413</div><div>=C2=A0 =C2=A0 * extended avg: =C2=A08295.=
847872541684</div><div>=C2=A0 =C2=A0 * extended median: =C2=A02041</div><di=
v>=C2=A0 =C2=A0 * extended max: =C2=A052508</div><div><br></div><div>In ord=
er to consider the average+median filter sizes in a world worth</div><div>l=
arger blocks, I also ran the index for testnet:=C2=A0</div><div><br></div><=
div>=C2=A0 =C2=A0 * total size: =C2=A02753238530</div><div>=C2=A0 =C2=A0 * =
total avg: =C2=A05918.95736054141</div><div>=C2=A0 =C2=A0 * total median: =
=C2=A060202</div><div>=C2=A0 =C2=A0 * total max: =C2=A074983</div><div>=C2=
=A0 =C2=A0 * regular size: =C2=A01165148878</div><div>=C2=A0 =C2=A0 * regul=
ar avg: =C2=A02504.856172982827</div><div>=C2=A0 =C2=A0 * regular median: =
=C2=A024812</div><div>=C2=A0 =C2=A0 * regular max: =C2=A064554</div><div>=
=C2=A0 =C2=A0 * extended size: =C2=A01588089652</div><div>=C2=A0 =C2=A0 * e=
xtended avg: =C2=A03414.1011875585823</div><div>=C2=A0 =C2=A0 * extended me=
dian: =C2=A035260</div><div>=C2=A0 =C2=A0 * extended max: =C2=A041731</div>=
<div><br></div><div>Finally, here are the testnet stats which take into acc=
ount the increase</div><div>in the maximum filter size due to segwit&#39;s =
block-size increase. The max</div><div>filter sizes are a bit larger due to=
 some of the habitual blocks I</div><div>created last year when testing seg=
wit (transactions with 30k inputs, 30k</div><div>outputs, etc).</div><div><=
br></div><div>=C2=A0 =C2=A0 =C2=A0* total size: =C2=A0585087597</div><div>=
=C2=A0 =C2=A0 =C2=A0* total avg: =C2=A0520.8839608674402</div><div>=C2=A0 =
=C2=A0 =C2=A0* total median: =C2=A020</div><div>=C2=A0 =C2=A0 =C2=A0* total=
 max: =C2=A0164598</div><div>=C2=A0 =C2=A0 =C2=A0* regular size: =C2=A02993=
25029</div><div>=C2=A0 =C2=A0 =C2=A0* regular avg: =C2=A0266.4790836307566<=
/div><div>=C2=A0 =C2=A0 =C2=A0* regular median: =C2=A013</div><div>=C2=A0 =
=C2=A0 =C2=A0* regular max: =C2=A0164583</div><div>=C2=A0 =C2=A0 =C2=A0* ex=
tended size: =C2=A0285762568</div><div>=C2=A0 =C2=A0 =C2=A0* extended avg: =
=C2=A0254.4048772366836</div><div>=C2=A0 =C2=A0 =C2=A0* extended median: =
=C2=A07</div><div>=C2=A0 =C2=A0 =C2=A0* extended max: =C2=A0127631</div><di=
v><br></div><div>For those that are interested in the raw data, I&#39;ve up=
loaded a CSV file</div><div>of raw data for each block (mainnet + testnet),=
 which can be found here:</div><div>=C2=A0 =C2=A0 =C2=A0* mainnet: (14MB): =
<a href=3D"https://www.dropbox.com/s/4yk2u8dj06njbuv/mainnet-gcs-stats.csv?=
dl=3D0" target=3D"_blank">https://www.dropbox.com/s/<wbr>4yk2u8dj06njbuv/ma=
innet-gcs-<wbr>stats.csv?dl=3D0</a></div><div>=C2=A0 =C2=A0 =C2=A0* testnet=
: (25MB): <a href=3D"https://www.dropbox.com/s/w7dmmcbocnmjfbo/gcs-stats-te=
stnet.csv?dl=3D0" target=3D"_blank">https://www.dropbox.com/s/<wbr>w7dmmcbo=
cnmjfbo/gcs-stats-<wbr>testnet.csv?dl=3D0</a></div><div><br></div><div><br>=
</div><div>We look forward to getting feedback from all of y&#39;all!</div>=
<div><br></div><div>-- Laolu</div><div><br></div><div><br></div><div>[1]: <=
a href=3D"https://github.com/lightninglabs/neutrino" target=3D"_blank">http=
s://github.com/<wbr>lightninglabs/neutrino</a></div><div>[2]: <a href=3D"ht=
tps://github.com/Roasbeef/btcd/tree/segwit-cbf" target=3D"_blank">https://g=
ithub.com/Roasbeef/<wbr>btcd/tree/segwit-cbf</a></div><div>[3]: <a href=3D"=
https://github.com/Roasbeef/btcutil/tree/gcs/gcs" target=3D"_blank">https:/=
/github.com/Roasbeef/<wbr>btcutil/tree/gcs/gcs</a></div><div>[4]: <a href=
=3D"https://github.com/lightningnetwork/lnd/" target=3D"_blank">https://git=
hub.com/<wbr>lightningnetwork/lnd/</a></div><div><br></div><div>-- Laolu</d=
iv><div><br></div></div>
</blockquote></div>

--089e0821e844175c3f0550ec5228--