They have an example that calculates the expected information gained by truth booths, and all of the top ones are giving more than one bit. How can this be? It's a yes/no question, so a max of 1 bit should be possible.
We have to make a distinction between "expected information gain" and "maximum information gain". An answer of "yes" generally gains >1 bit, but an answer of "no" generally gains <1 bit, and the average ends up <1 bit. It is impossible for a single yes/no question to have an expected gain of >1 bit; the maximum possible is exactly 1.
The total probabilities add up to 1. But I’m not following how that relates to the average bits.
Despite summing to 1, the exact values of P(true) and P(false) depend on which options have previously been ruled out. Those probabilities then get multiplied by the amount of information gained by each answer.
It is definitional, which I mean in the strictest mathematical sense: the information content of a result is directly derived from how “unexpected” it is.
A result which conveys 2 bits of information should occur with 25% expected probability. Because that’s what we mean by “two bits of information.”
So, you have n options, you ask a question, and now you're down to m options.
The number of bits of information you gained is -log₂ (m/n).
If you ask a question which always eliminates half of the options, you will always gain -log₂ (1/2) = 1 bit of information.
If you go with the dumber approach of taking a moonshot, you can potentially gain more than that, but in expectation you'll gain less.
If your question is a 25-75 split, you have a 25% chance of gaining -log₂ (1/4) = 2 bits, and a 75% chance of gaining -log₂ (3/4) = 0.415 bits. On average, this strategy will gain you (0.25)(2) + (0.75)(0.415) = 0.8113 bits, which is less than 1 bit.
The farther away you get from 50-50, the more bits you can potentially gain in a single question, but - also - the lower the number of bits you expect to gain becomes. You can never do better than an expectation of 1 bit for a trial with 2 outcomes.
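A quick Python sketch (mine, not from the article) that checks this arithmetic and the 1-bit ceiling:

```python
from math import log2

def expected_bits(p):
    """Expected information gain from a yes/no question whose
    'yes' branch has probability p (and 'no' has 1 - p)."""
    gain = 0.0
    for q in (p, 1 - p):
        if q > 0:
            gain += q * -log2(q)  # P(outcome) * bits learned from that outcome
    return gain

print(expected_bits(0.5))   # 50-50 split: exactly 1 bit
print(expected_bits(0.25))  # 25-75 split from above: ~0.811 bits
# no split ever beats 1 bit in expectation:
print(max(expected_bits(p / 100) for p in range(101)))
```

Sweeping over every split confirms the expectation peaks at exactly 1 bit for the 50-50 question.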
(All of this is in the article; see footnote 3 and its associated paragraph.)
The article explicitly calls out the expectational maximum of one bit:
>> You'll also notice that you're never presented with a question that gives you more than 1 expected information, which is backed up by the above graph never going higher than 1.
So it's strange that it then goes on to list an example of a hypothetical (undescribed, since the scenario is impossible) yes/no question with an expected gain of 1.6 bits.
The article states "Suppose we have calculated the expected information gained by potential truth booths like below: Expected information: 1.60 bits ..." This is impossible because of the general fact in information theory that (p(true) * bits_if_true) + (p(false) * bits_if_false) <= 1. If they had said "Suppose we have calculated the maximum information gained...", then 1.6 bits would be valid. They said "expected information" though, so 1.6 bits is invalid.
Do you mean the diagram following the sentence "Suppose we have calculated the expected information gained by potential truth booths like below:"?
Yes, that looks like a mistake -- a truth booth only has 2 outcomes, so it can produce at most 1 bit of information.
Regarding the other mentions on the page of information levels exceeding 1 bit: those are OK, since they involve match-ups, which for 6 people have 7 possible outcomes and thus can yield up to log2(7) ≈ 2.81 bits.
That's not how information works. Learning more from one outcome than the other decreases the probability of that outcome occurring, so the expected information (which is the sum of the outcome probability times the outcome information for each of the two possible outcomes) is always less than or equal to one.
If all you can get is a "true" or "false" you expect, at most, one bit of information.
Right - but coming back to the original question, if I'm not mistaken, the explanation is that the blogpost is measuring information gained from an actual outcome, as opposed to _expected_ information gain. An example will help:
Say you're trying to guess the number on a 6-sided die that I've rolled. If I wanted to outright tell you the answer, that would be 2.58 bits of information I need to convey. But you're trying to guess it without me telling, so suppose you can ask a yes or no question about the outcome. The maximum _expected_ information gain is 1 bit. If you ask "was it 4 or greater?", that is an optimal question, because the expected information gain is min-maxed. That is, the minimum information you can gain is also the maximum: 1 bit. However, suppose you ask "was it a 5?". This is a bad question, because if the answer is no, there are still 5 numbers it could be. Plus, the likelihood of it being 'no' is high: 5/6. Despite these downsides, 1/6 of the time the answer WILL be yes, and you will gain all 2.58 bits of information in one go. But the downside case more than counteracts this and preserves the rules of information theory: the _expected_ information gain is still < 1 bit.
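The die example above can be checked numerically; here's a minimal sketch (mine, just redoing the arithmetic):

```python
from math import log2

SIDES = 6

def expected_gain(yes_count):
    """Expected bits from a yes/no question that is true for
    yes_count of the 6 equally likely die faces."""
    p_yes = yes_count / SIDES
    p_no = 1 - p_yes
    bits_yes = -log2(p_yes)  # surprise of hearing 'yes'
    bits_no = -log2(p_no)    # surprise of hearing 'no'
    return p_yes * bits_yes + p_no * bits_no

print(log2(SIDES))       # ~2.585 bits to name the face outright
print(expected_gain(3))  # "was it 4 or greater?" -> exactly 1 bit
print(expected_gain(1))  # "was it a 5?" -> ~0.65 bits
```

The rare "yes" to "was it a 5?" really does deliver all 2.585 bits at once, but the 5/6-likely "no" drags the expectation down to about 0.65 bits.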
EDIT: D'oh, nevermind. Re-reading the post, it's definitely talking about >1 bit expectations of potential matchings. So I don't know!
It's not a yes/no per contestant, it's per edge between contestants. There are n(n-1)/2 of these.
A "true" answer for a potential match is actually a state update for every edge touching either contestant: that's 2(n-2) other edges that can be updated to false. Some of these may already be known from previous rounds' match-ups, but that's still more than a single binary.
An answer of "yes" will generally eliminate many edges, with potential for >1 bit. However, an answer of "no" will generally eliminate just that one edge, which is generally <1 bit.
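A toy count of that asymmetry (my sketch, assuming a simple perfect-matching setup where each of n contestants has exactly one partner):

```python
def total_edges(n):
    """All candidate pairings (edges) among n contestants."""
    return n * (n - 1) // 2

def resolved_edges(n, answer):
    """How many candidate edges one truth-booth answer settles."""
    if answer == "yes":
        # the confirmed pair itself, plus 2(n-2) other edges
        # touching either contestant, all now known to be false
        return 1 + 2 * (n - 2)
    return 1  # a "no" settles only the one tested edge

print(total_edges(6))            # 15 candidate edges for 6 contestants
print(resolved_edges(6, "yes"))  # a "yes" settles 9 of them
print(resolved_edges(6, "no"))   # a "no" settles just 1
```

The edge counts match the intuition above: "yes" is a windfall, "no" barely moves you, and the low probability of "yes" is what keeps the expectation under 1 bit.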
But you don't receive more than a single binary value; you get a yes or no.
If both of these are equally likely, you gain one bit of information, the maximum possible amount. If you already have other information about the situation, you might gain _less_ than one bit on average (because it confirms something you already knew, but doesn't provide any new information), but you can't gain more.
The claim was that one bit was the maximum amount of information you could gain, which is clearly false.
Just to make this unambiguous: If you ask me to guess a number between one and one billion, and by fantastic luck I guess right, your “yes/no” answer obviously gives me more than one bit of information as to the right answer.
> The claim was that one bit was the maximum amount of information you could gain, which is clearly false.
That's not what I see.
https://news.ycombinator.com/item?id=46282007

> They have an example that calculates the expected information gained by truth booths and all of the top ones are giving more than one bit. How can this be? It is a yes/no question a max of 1 bit should be possible

https://news.ycombinator.com/item?id=46282343

> the expected information (which is the sum of the outcome probability times the outcome information for each of the two possible outcomes) is always less than or equal to one.
The specific comment you replied to had one sentence that didn't say "expected" or "average", but the surrounding sentences and comments give context. The part you objected to was also trying to talk about averages, which makes it not false.
> If both of these are equally likely, you gain one bit of information, the maximum possible amount. If you already have other information about the situation, you might gain _less_ than one bit on average (because it confirms something you already knew, but doesn't provide any new information), but you can't gain more.
Can’t gain more!
The core confusion is this idea that the answer to a yes/no question can’t provide more than one bit of information, no matter what the question or answer. This is false. The question itself can encode multiple bits of potential information and the answer simply verifies them.
I’m not arguing with that, it’s basic information theory.
One bit, however, is not “the maximum possible amount” you can gain from an oracular answer to a yes/no question. The OP covers exactly this point re: the “Guess Who?” game.
Yes, you learn more than 1 bit in that case. However, if you are told A is false, you still don't know whether B or C is true, so you gain less than 1 bit. Assuming A, B and C all have equal probability, your average/expected information gain is <1 bit.
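Working out the numbers for that A/B/C case (my arithmetic, assuming a uniform prior over the three possibilities and the question "is A true?"):

```python
from math import log2

p_yes = 1 / 3              # A, B, C equally likely
gain_yes = log2(3)         # "yes" pins down A: ~1.585 bits
gain_no = log2(3 / 2)      # "no" leaves B or C: ~0.585 bits
expected = p_yes * gain_yes + (1 - p_yes) * gain_no
print(expected)            # ~0.918 bits, still under 1
```

So even though the "yes" branch delivers more than a bit, the expectation lands at roughly 0.918 bits.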
If you ask the question "which of A, B, or C is true?" then you're not asking a yes/no question, and it's not surprising that you expect to gain more than 1 bit of information.
But that’s all consistent. "Expected" gain is less than 1 bit for the truth booths and sometimes > 1 for actual outcomes; and it's > 1 in expectation for the match-ups, which aren't binary questions.
Sure, and the issue is that the article says "Suppose we have calculated the expected information gained by potential truth booths like below:" and then lists some values >1
edit: just saw that the article fixed this recently, and the values are now <1
No, that doesn't make sense either. For a truth booth, you're taking all the possible pairing arrangements, and dividing them into two sets. After the answer, one of those two sets is false. There is no way that this can provide more than 1 bit of information.
The match-ups can however give more information, as they aren't giving a yes/no answer.