Received: from sog-mx-3.v43.ch3.sourceforge.com ([172.29.43.193] helo=mx.sourceforge.net) by sfs-ml-4.v29.ch3.sourceforge.com with esmtp (Exim 4.76) (envelope-from ) id 1Xj8B3-00022V-BA for bitcoin-development@lists.sourceforge.net; Tue, 28 Oct 2014 14:55:17 +0000 Received-SPF: pass (sog-mx-3.v43.ch3.sourceforge.com: domain of gmail.com designates 209.85.213.182 as permitted sender) client-ip=209.85.213.182; envelope-from=morcos@gmail.com; helo=mail-ig0-f182.google.com; Received: from mail-ig0-f182.google.com ([209.85.213.182]) by sog-mx-3.v43.ch3.sourceforge.com with esmtps (TLSv1:RC4-SHA:128) (Exim 4.76) id 1Xj8B2-0008VR-7U for bitcoin-development@lists.sourceforge.net; Tue, 28 Oct 2014 14:55:17 +0000 Received: by mail-ig0-f182.google.com with SMTP id hn18so1168234igb.15 for ; Tue, 28 Oct 2014 07:55:11 -0700 (PDT) MIME-Version: 1.0 X-Received: by 10.42.190.196 with SMTP id dj4mr4286412icb.35.1414508110899; Tue, 28 Oct 2014 07:55:10 -0700 (PDT) Received: by 10.50.223.146 with HTTP; Tue, 28 Oct 2014 07:55:10 -0700 (PDT) In-Reply-To: References: Date: Tue, 28 Oct 2014 10:55:10 -0400 Message-ID: From: Alex Morcos To: Gavin Andresen Content-Type: multipart/alternative; boundary=20cf303f6e0c7b9b3005067cd516 X-Spam-Score: -0.6 (/) X-Spam-Report: Spam Filtering performed by mx.sourceforge.net. See http://spamassassin.org/tag/ for more details. -1.5 SPF_CHECK_PASS SPF reports sender host as permitted sender for sender-domain 0.0 FREEMAIL_FROM Sender email is commonly abused enduser mail provider (morcos[at]gmail.com) -0.0 SPF_PASS SPF: sender matches SPF record 1.0 HTML_MESSAGE BODY: HTML included in message -0.1 DKIM_VALID_AU Message has a valid DKIM or DK signature from author's domain 0.1 DKIM_SIGNED Message has a DKIM or DK signature, not necessarily valid -0.1 DKIM_VALID Message has at least one valid DKIM or DK signature X-Headers-End: 1Xj8B2-0008VR-7U Cc: Bitcoin Dev Subject: Re: [Bitcoin-development] Reworking the policy estimation code (fee estimates) X-BeenThere: bitcoin-development@lists.sourceforge.net X-Mailman-Version: 2.1.9 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Oct 2014 14:55:17 -0000 --20cf303f6e0c7b9b3005067cd516 Content-Type: text/plain; charset=UTF-8 Sorry, perhaps I misinterpreted that question. The estimates will be dominated by the prevailing transaction rates initially, so the estimates you get for something like "what is the least I can pay and still be 90% sure I get confirmed in 20 blocks" won't be insane, but they will still be way too conservative. I'm not sure what you meant by reasonable. You won't get the "correct" answer of something significantly less than 40k sat/kB for quite some time. Given that the half-life of the decay is 2.5 days, then within a couple of days. And in fact even in the steady state, the new code will still return a much higher rate than the existing code, say 10k sat/kB instead of 1k sat/kB, but that's just a result of the sorting the existing code does and the fact that no one places transactions with that small fee. To correctly give such low answers, the new code will require that those super low feerate transactions are occurring frequently enough, but the bar for enough datapoints in a feerate bucket is pretty low, an average of 1 tx per block. The bar can be made lower at the expense of a bit of noisiness in the answers, for instance for priorities I had to make the bar significantly lower because there are so many fewer transactions confirmed because of priorities. I'm certainly open to tuning some of these variables. On Tue, Oct 28, 2014 at 10:30 AM, Alex Morcos wrote: > Oh in just a couple of blocks, it'll give you a somewhat reasonable > estimate for asking about every confirmation count other than 1, but it > could take several hours for it to have enough data points to give you a > good estimate for getting confirmed in one block (because the prevalent > feerate is not always confirmed in 1 block >80% of the time) Essentially > what it does is just combine buckets until it has enough data points, so > after the first block it might be treating all of the txs as belonging to > the same feerate bucket, but since the answer it returns is the "median"* > fee rate for that bucket, its a reasonable answer right off the get go. > > Do you think it would make sense to make that 90% number an argument to > rpc call? For instance there could be a default (I would use 80%) but then > you could specify if you required a different certainty. It wouldn't > require any code changes and might make it easier for people to build more > complicated logic on top of it. > > *It can't actually track the median, but it identifies which of the > smaller actual buckets the median would have fallen into and returns the > average feerate for that median bucket. > > > > > > On Tue, Oct 28, 2014 at 9:59 AM, Gavin Andresen > wrote: > >> I think Alex's approach is better; I don't think we can know how much >> better until we have a functioning fee market. >> >> We don't have a functioning fee market now, because fees are hard-coded. >> So we get "pay the hard-coded fee and you'll get confirmed in one or two or >> three blocks, depending on which miners mine the next three blocks and what >> time of day it is." >> >> git HEAD code says you need a fee of 10,0000 satoshis/kb to be pretty >> sure you'll get confirmed in the next block. That looks about right with >> Alex's real-world data (if we take "90% chance" as 'pretty sure you'll get >> confirmed'): >> >> Fee rate 100000 Avg blocks to confirm 1.09 NumBlocks:% confirmed 1: 0.901 >> 2: 1.0 3: 1.0 >> >> My only concern with Alex's code is that it takes much longer to get >> 'primed' -- Alex, if I started with no data about fees, how long would it >> take to be able to get enough data for a reasonable estimate of "what is >> the least I can pay and still be 90% sure I get confirmed in 20 blocks" ? >> Hours? Days? Weeks? >> >> -- >> -- >> Gavin Andresen >> > > --20cf303f6e0c7b9b3005067cd516 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable
Sorry, perhaps I misinterpreted that question.=C2=A0 The e= stimates will be dominated by the prevailing transaction rates initially, s= o the estimates you get for something like=C2=A0"what is the least I can pay and still be 90% sure I get confirm= ed in 20 blocks"=C2=A0 won't be insane, but they will still be way= too conservative.=C2=A0 I'm not sure what you meant by reasonable.=C2= =A0 You won't get the "correct" answer of something significa= ntly less than 40k sat/kB for quite some time.=C2=A0 Given that the half-li= fe of the decay is 2.5 days, then within a couple of days.=C2=A0 And in fac= t even in the steady state, the new code will still return a much higher ra= te than the existing code, say 10k sat/kB instead of 1k sat/kB, but that= 9;s just a result of the sorting the existing code does and the fact that n= o one places transactions with that small fee. =C2=A0 To correctly give suc= h low answers, the new code will require that those super low feerate trans= actions are occurring frequently enough, but the bar for enough datapoints = in a feerate bucket is pretty low, an average of 1 tx per block.=C2=A0 The = bar can be made lower at the expense of a bit of noisiness in the answers, = for instance for priorities I had to make the bar=C2=A0significantly=C2=A0l= ower because there are so many fewer transactions confirmed=C2=A0because of= priorities.=C2=A0 I'm certainly open to tuning some of these variables= .





On Tue, Oct 28, 2014 at 10:= 30 AM, Alex Morcos <morcos@gmail.com> wrote:
Oh in just a couple of blocks, it'll= give you a somewhat reasonable estimate for asking about every confirmatio= n count other than 1, but it could take several hours for it to have enough= data points to give you a good estimate for getting confirmed in one block= (because the prevalent feerate is not always confirmed in 1 block >80% = of the time) =C2=A0 Essentially what it does is just combine buckets until = it has enough data points, so after the first block it might be treating al= l of the txs as belonging to the same feerate bucket, but since the answer = it returns is the "median"* fee rate for that bucket, its a reaso= nable answer right off the get go.=C2=A0

Do you think it= would make sense to make that 90% number an argument to rpc call?=C2=A0 Fo= r instance there could be a default (I would use 80%) but then you could sp= ecify if you required a different certainty.=C2=A0 It wouldn't require = any code changes and might make it easier for people to build more complica= ted logic on top of it.

*It can't actually track= the median, but it identifies which of the smaller actual buckets the medi= an would have fallen into and returns the average feerate for that median b= ucket.





On Tue, Oct 28, 2014 at 9:59 AM, Gavin Andr= esen <gavinandresen@gmail.com> wrote:
I think Alex&= #39;s approach is better; I don't think we can know how much better unt= il we have a functioning fee market.

We don't have a functioning fee market n= ow, because fees are hard-coded. So we get "pay the hard-coded fee and= you'll get confirmed in one or two or three blocks, depending on which= miners mine the next three blocks and what time of day it is."
<= div class=3D"gmail_extra">
git HEAD cod= e says you need a fee of 10,0000 satoshis/kb to be pretty sure you'll g= et confirmed in the next block. That looks about right with Alex's real= -world data (if we take "90% chance" as 'pretty sure you'= ll get confirmed'):

Fee rate 100000 Avg blocks to= confirm 1.09 NumBlocks:% confirmed 1: 0.901 2: 1.0 =C2=A0 3: 1.0

My only concern with Alex's code is that it= takes much longer to get 'primed' -- Alex, if I started with no da= ta about fees, how long would it take to be able to get enough data for a r= easonable estimate of "what is the least I can pay and still be 90% su= re I get confirmed in 20 blocks" ? Hours? Days? Weeks?

--
--
Gavin Andresen<= br>


--20cf303f6e0c7b9b3005067cd516--