Probability is in the Mind
Followup to: The Mind Projection Fallacy
Yesterday I spoke of the Mind Projection Fallacy, giving the example of the alien monster who carries off a girl in a torn dress for intended ravishing - a mistake which I imputed to the artist's tendency to think that a woman's sexiness is a property of the woman herself, woman.sexiness, rather than something that exists in the mind of an observer, and probably wouldn't exist in an alien mind.
The term "Mind Projection Fallacy" was coined by the late great Bayesian Master, E. T. Jaynes, as part of his long and hard-fought battle against the accursèd frequentists. Jaynes was of the opinion that probabilities were in the mind, not in the environment - that probabilities express ignorance, states of partial information; and if I am ignorant of a phenomenon, that is a fact about my state of mind, not a fact about the phenomenon.
I cannot do justice to this ancient war in a few words - but the classic example of the argument runs thus:
You have a coin.
The coin is biased.
You don't know which way it's biased or how much it's biased. Someone just told you, "The coin is biased" and that's all they said.
This is all the information you have, and the only information you have.
You draw the coin forth, flip it, and slap it down.
Now - before you remove your hand and look at the result - are you willing to say that you assign a 0.5 probability to the coin having come up heads?
The frequentist says, "No. Saying 'probability 0.5' means that the coin has an inherent propensity to come up heads as often as tails, so that if we flipped the coin infinitely many times, the ratio of heads to tails would approach 1:1. But we know that the coin is biased, so it can have any probability of coming up heads except 0.5."
The Bayesian says, "Uncertainty exists in the map, not in the territory. In the real world, the coin has either come up heads, or come up tails. Any talk of 'probability' must refer to the information that I have about the coin - my state of partial ignorance and partial knowledge - not just the coin itself. Furthermore, I have all sorts of theorems showing that if I don't treat my partial knowledge a certain way, I'll make stupid bets. If I've got to plan, I'll plan for a 50/50 state of uncertainty, where I don't weigh outcomes conditional on heads any more heavily in my mind than outcomes conditional on tails. You can call that number whatever you like, but it has to obey the probability laws on pain of stupidity. So I don't have the slightest hesitation about calling my outcome-weighting a probability."
I side with the Bayesians. You may have noticed that about me.
Even before a fair coin is tossed, the notion that it has an inherent 50% probability of coming up heads may be just plain wrong. Maybe you're holding the coin in such a way that it's just about guaranteed to come up heads, or tails, given the force at which you flip it, and the air currents around you. But, if you don't know which way the coin is biased on this one occasion, so what?
I believe there was a lawsuit where someone alleged that the draft lottery was unfair, because the slips with names on them were not being mixed thoroughly enough; and the judge replied, "To whom is it unfair?"
To make the coinflip experiment repeatable, as frequentists are wont to demand, we could build an automated coinflipper, and verify that the results were 50% heads and 50% tails. But maybe a robot with extra-sensitive eyes and a good grasp of physics, watching the autoflipper prepare to flip, could predict the coin's fall in advance - not with certainty, but with 90% accuracy. Then what would the real probability be?
There is no "real probability". The robot has one state of partial information. You have a different state of partial information. The coin itself has no mind, and doesn't assign a probability to anything; it just flips into the air, rotates a few times, bounces off some air molecules, and lands either heads or tails.
So that is the Bayesian view of things, and I would now like to point out a couple of classic brainteasers that derive their brain-teasing ability from the tendency to think of probabilities as inherent properties of objects.
Let's take the old classic: You meet a mathematician on the street, and she happens to mention that she has given birth to two children on two separate occasions. You ask: "Is at least one of your children a boy?" The mathematician says, "Yes, he is."
What is the probability that she has two boys? If you assume that the prior probability of a child being a boy is 1/2, then the probability that she has two boys, on the information given, is 1/3. The prior probabilities were: 1/4 two boys, 1/2 one boy one girl, 1/4 two girls. The mathematician's "Yes" response has probability ~1 in the first two cases, and probability ~0 in the third. Renormalizing leaves us with a 1/3 probability of two boys, and a 2/3 probability of one boy one girl.
But suppose that instead you had asked, "Is your eldest child a boy?" and the mathematician had answered "Yes." Then the probability of the mathematician having two boys would be 1/2. Since the eldest child is a boy, and the younger child can be anything it pleases.
Likewise if you'd asked "Is your youngest child a boy?" The probability of their being both boys would, again, be 1/2.
Now, if at least one child is a boy, it must be either the oldest child who is a boy, or the youngest child who is a boy. So how can the answer in the first case be different from the answer in the latter two?
Or here's a very similar problem: Let's say I have four cards, the ace of hearts, the ace of spades, the two of hearts, and the two of spades. I draw two cards at random. You ask me, "Are you holding at least one ace?" and I reply "Yes." What is the probability that I am holding a pair of aces? It is 1/5. There are six possible combinations of two cards, with equal prior probability, and you have just eliminated the possibility that I am holding a pair of twos. Of the five remaining combinations, only one combination is a pair of aces. So 1/5.
Now suppose that instead you asked me, "Are you holding the ace of spades?" If I reply "Yes", the probability that the other card is the ace of hearts is 1/3. (You know I'm holding the ace of spades, and there are three possibilities for the other card, only one of which is the ace of hearts.) Likewise, if you ask me "Are you holding the ace of hearts?" and I reply "Yes", the probability I'm holding a pair of aces is 1/3.
But then how can it be that if you ask me, "Are you holding at least one ace?" and I say "Yes", the probability I have a pair is 1/5? Either I must be holding the ace of spades or the ace of hearts, as you know; and either way, the probability that I'm holding a pair of aces is 1/3.
How can this be? Have I miscalculated one or more of these probabilities?
If you want to figure it out for yourself, do so now, because I'm about to reveal...
That all stated calculations are correct.
As for the paradox, there isn't one. The appearance of paradox comes from thinking that the probabilities must be properties of the cards themselves. The ace I'm holding has to be either hearts or spades; but that doesn't mean that your knowledge about my cards must be the same as if you knew I was holding hearts, or knew I was holding spades.
It may help to think of Bayes's Theorem:
P(H|E) = P(E|H)P(H) / P(E)
That last term, where you divide by P(E), is the part where you throw out all the possibilities that have been eliminated, and renormalize your probabilities over what remains.
Now let's say that you ask me, "Are you holding at least one ace?" Before I answer, your probability that I say "Yes" should be 5/6.
But if you ask me "Are you holding the ace of spades?", your prior probability that I say "Yes" is just 1/2.
So right away you can see that you're learning something very different in the two cases. You're going to be eliminating some different possibilities, and renormalizing using a different P(E). If you learn two different items of evidence, you shouldn't be surprised at ending up in two different states of partial information.
Similarly, if I ask the mathematician, "Is at least one of your two children a boy?" I expect to hear "Yes" with probability 3/4, but if I ask "Is your eldest child a boy?" I expect to hear "Yes" with probability 1/2. So it shouldn't be surprising that I end up in a different state of partial knowledge, depending on which of the two questions I ask.
The only reason for seeing a "paradox" is thinking as though the probability of holding a pair of aces is a property of cards that have at least one ace, or a property of cards that happen to contain the ace of spades. In which case, it would be paradoxical for card-sets containing at least one ace to have an inherent pair-probability of 1/5, while card-sets containing the ace of spades had an inherent pair-probability of 1/3, and card-sets containing the ace of hearts had an inherent pair-probability of 1/3.
Similarly, if you think a 1/3 probability of being both boys is an inherent property of child-sets that include at least one boy, then that is not consistent with child-sets of which the eldest is male having an inherent probability of 1/2 of being both boys, and child-sets of which the youngest is male having an inherent 1/2 probability of being both boys. It would be like saying, "All green apples weigh a pound, and all red apples weigh a pound, and all apples that are green or red weigh half a pound."
That's what happens when you start thinking as if probabilities are in things, rather than probabilities being states of partial information about things.
Probabilities express uncertainty, and it is only agents who can be uncertain. A blank map does not correspond to a blank territory. Ignorance is in the mind.

It seems to me you're using "perceived probability" and "probability" interchangeably. That is, you're "defining" probability as the probability that an observer assigns based on certain pieces of information. Is it not true that when one rolls a fair 1d6, there is an actual 1/6 probability of getting any one specific value? Or using your biased coin example: our information may tell us to assume a 50/50 chance, but the man may be correct in saying that the coin has a bias--that is, the coin may really come up heads 80% of the time, but we must assume a 50% chance to make the decision, until we can be certain of the 80% chance ourselves. What am I missing?
I would say that the Gomboc (http://tinyurl.com/2rffxs) has a 100% chance of righting itself, inherently. I do not understand how this is incorrect.
Posted by: GBM | March 12, 2008 at 02:19 AM
GBM:
Q: What is the probability for a pseudo-random number generator to generate a specific number as his next output?
A: 1 or 0 because you can actually calculate the next number if you have the available information.
Q: What probability do you assign to a specific number as being it's next output if you don't have the information to calculate it?
Replace pseudo-random number generator with dice and repeat.
Posted by: Roland | March 12, 2008 at 02:28 AM
So therefore a person with perfect knowledge would not need probability. Is this another interpretation of "God does not play dice?" :-)
Posted by: Ian C. | March 12, 2008 at 02:33 AM
Alas, the coin was part of an erroneous stamping, and is blank on both sides.
Posted by: Caledonian | March 12, 2008 at 09:10 AM
In other words, probability is not likelihood.
Posted by: Jef Allbright | March 12, 2008 at 10:10 AM
Here is another example me, my dad and my brother came up with when we were discussing probability.
Suppose there are 4 card, an ace and 3 kings. They are shuffled and placed face side down. I didn't look at the cards, my dad looked at the first card, my brother looked at the first and second cards. What is the probability of the ace being one of the last 2 cards.
For me: 1/2
For my dad: If he saw the ace it is 0, otherwise 2/3.
For my brother: If he saw the ace it is 0, otherwise 1.
How can there be different probabilities of the same event? It is because probability is something in the mind calculated because of imperfect knowledge. It is not a property of reality. Reality will take only a single path. We just don't know what that path is. It is pointless to ask for "the real likelihood" of an event. The likelihood depends on how much information you have. If you had all the information, the likelihood of the event would be 100% or 0%.
Posted by: PK | March 12, 2008 at 12:32 PM
The competent frequentist would presumably not be befuddled by these supposed paradoxes. Since he would not be befuddled (or so I am fairly certain), the "paradoxes" fail to prove the superiority of the Bayesian approach. Frankly, the treatment of these "paradoxes" in terms of repeated experiments seems to straightforward that I don't know how you can possibly think there's a problem.
Posted by: Constant | March 12, 2008 at 12:41 PM
"Probabilities express uncertainty, and it is only agents who can be uncertain. A blank map does not correspond to a blank territory. Ignorance is in the mind."
Eliezer, in quantum mechanics, one does not say that one does not have knowledge of both position and momentum of a particle simultaneously. Rather, one says that one CANNOT have such knowledge. This contradicts your statement that ignorance is in the mind. If quantum mechanics is true, then ignorance/uncertainty is a part of nature and not just something that agents have.
Posted by: Sudeep | March 12, 2008 at 12:42 PM
Constant: The competent frequentist would presumably not be befuddled by these supposed paradoxes.
Not the last two paradoxes, no. But the first case given, the biased coin whose bias is not known, is indeed a classic example of the difference between Bayesians and frequentists. The frequentist says:
According to the frequentist, apparently there is no rational way to manage your uncertainty about a single flip of a coin of unknown bias, since whatever you do, someone else will be able to criticize your belief as "subjective" - such a devastating criticism that you may as well, um, flip a coin. Or consult a magic 8-ball.
Sudeep: If quantum mechanics is true, then ignorance/uncertainty is a part of nature and not just something that agents have.
A common misconception - Jaynes railed against that idea too, and he wasn't even equipped with the modern understanding of decoherence. In quantum mechanics, it's an objective fact that the blobs of amplitude making up reality sometimes split in two, and you can't predict what "you" will see, when that happens, because it is an objective fact that different versions of you will see different things. But all this is completely mechanical, causal, and deterministic - the splitting of observers just introduces an element of anthropic pseudo-uncertainty, if you happen to be one of those observers. The splitting is not inherently related to the act of measurement by a conscious agent, or any kind of agent; it happens just as much when a system is "measured" by a photon bouncing off and interacting with a rock.
There are other interpretations of quantum mechanics, but they don't make any sense. Making this fully clear will require more prerequisite posts first, though.
Posted by: Eliezer Yudkowsky | March 12, 2008 at 12:57 PM
Maybe I'm stupid here... what difference does it make?
Sure, if we had a coin-flip-predicting robot with quick eyes it might be able to guess right/predict the outcome 90% of the time. And if we were precognitive we could clean up at Vegas.
In terms of non-hypothetical real decisions that confront people, what is the outcome of this line of reasoning? What do you suggest people do differently and in what context? Mark cards?
B/c currently, as far as I can see, you're saying, "The coin won't end up 'heads or tails' -- it'll end up heads, or it'll end up tails." True but uninformative.
Conrad.
ps - The thought experiment with the trick coin is ungrounded. If I'm being asked to lay even odds on a dollar bet that the coin is heads, then that's rational -- since the coin could be biased for heads, or tails (and the guy proposing the bet doesn't know the bias). If I'm being asked to accept or reject a number meant to correspond to the calculated or measured likelihood of heads coming up, and I trust the information about it being biased, then the only correct move is to reject the 0.5 probability. It has nothing to do with frequentist, Bayesian, or any other suchlike.
C.
Posted by: Conrad | March 12, 2008 at 01:00 PM
Sudeep: the inverse certainy of the position and momentum is a mathematical artifact and does not depend upon the validity of quantum mechanics. (Er, at least to the extent that math is independent of the external world!)
PK: I like your posts, and don't take this the wrong way, but, to me, your example doesn't have as much shocking unintuitiveness as the ones Eliezer Yudkowsky (no underscore) listed.
Posted by: Silas | March 12, 2008 at 01:03 PM
I'd like to understand: Are frequentist "probability" and subjective "probability" simply two different concepts, to be distinguished carefully? Or is there some true debate here?
I think that Jaynes shows a derivation follownig Bayesian principles of the frequentist probability from the subjective probability. I'd love to see one of Eliezer's lucid explanations on that.
Posted by: Joshua Fox | March 12, 2008 at 01:27 PM
You can derive frequentist probabilities from subjective probabilities but not the other way around.
Posted by: Eliezer Yudkowsky | March 12, 2008 at 01:30 PM
Silas: My post wasn't meant to be "shockingly unintuitive", it was meant to illustrate Eliezer's point that probability is in the mind and not out there in reality in a ridiculously obvious way.
Am I somehow talking about something entirely different than what Eliezer was talking about? Or should I complexificationafize my vocabulary to seem more academic? English isn't my first language after all.
Posted by: PK | March 12, 2008 at 01:42 PM
Alas, no. Here's the deal: implicit in all the coin toss toy problems is the idea that the observations may be modeled as exchangeable. It really really helps to have a grasp on what the math looks like when we assume exchangeability.
In models where (infinite) exchangeability is assumed, the concept of long-run frequency can be sensibly defined. (Long-run frequency may or may not be a cogent concept in models without exchangeability.) The probability of heads in any one toss is the expectation of a probability density function (pdf) which encodes our knowledge about the long run frequency. (Roughly. There are some technical conditions for the existence of a pdf that I'm ignoring.)
Conrad, your idea that 0.5 is not an allowable probability is almost correct. In fact, the correct expression of this idea is that the pdf of the long-run frequency must be equal to zero at 0.5. But! -- its values in the neighborhood of 0.5 are not constrained, so the pdf may have a removable singularity.
Suppose our information about bias in favour of heads is equivalent to our information about bias in favour of tail. Our pdf for the long-run frequency will be symmetrical about 0.5 and its expectation (which is the probability in any single toss) must also be 0.5. It is quite possible for an expectation to take a value which has zero probability density. We can refuse to believe that the long-run frequency will converge to exactly 0.5 while simultaneously holding a probability of 0.5 for any specific single toss in isolation.
Posted by: Cyan | March 12, 2008 at 01:46 PM
Eliezer, I have no argument with the Bayesian use of the probability calculus and so I do not side with those who say "there is no rational way to manage your uncertainty", but I think I probably do have an argument with the insistence that it is the one true way. None of the problems you have so far outlined, including the coin one, really seem to doom either frequentism specifically, or more generally, an objective account of probability. I agree with this:
but I question whether it really captures the frequentist position. To address the specifics, you seem to be talking about how the coin is held in a specific concrete toss. But frequentists emphatically are not talking about individual tosses. They are talking about infinitely repeated tosses. Alternatively, you might be talking about an infinitely repeated experiment in which the coin is tossed "in such a way", but here too I see no problem for the frequentists. Since the way of holding the coin is part of the experiment, then in this case they will predict a long term frequency of mostly heads. So they won't get this one wrong.
Posted by: Constant | March 12, 2008 at 02:11 PM
(Replace the link to "removable singularity" with one to removable discontinuity.)
Posted by: Cyan | March 12, 2008 at 02:16 PM
No way to do it other way around? Nothing along the lines of, say, considering a set of various "things to be explained" and for each a hypothesis explaining it, and then talk about subsets of those? ie, a subset in which 1/10 of the hypothesies in that subset are objectively true would be a set of hypothesies assigned .1 probability, or something?
Yeah, the notion of how to do this exactly is, admittedly, fuzzy in my head, but I have to say that it sure does seem like there ought to be some way to use the notion of frequentist probability to construct subjective probability along these lines.
I may be completely wrong though.
Posted by: Psy-Kosh | March 12, 2008 at 02:17 PM
"Suppose our information about bias in favour of heads is equivalent to our information about bias in favour of tail. Our pdf for the long-run frequency will be symmetrical about 0.5 and its expectation (which is the probability in any single toss) must also be 0.5. It is quite possible for an expectation to take a value which has zero probability density."
What I said: if all you know is that it's a trick coin, you can lay even odds on heads.
"We can refuse to believe that the long-run frequency will converge to exactly 0.5 while simultaneously holding a probability of 0.5 for any specific single toss in isolation."
Again what I said: if the question is, "This is a trick coin: I've rigged it. I have written down here the probability that it'll come up heads. Do you accept that the number I've written down is .5?" -- You've got to say no. Since they've just told you it was rigged.
And if what they've written down is .50000000000001 and come back at you for it, then they stretched a point to say it was rigged.
So your problem is you haven't grounded the example in terms of what we're being asked to do.
Again, what difference does it make?
Conrad.
ps - Ofc, knowing, or even just suspecting, the coin is rigged, on the *second* throw you'd best bet on a repeat of the outcome of the *first*.
C.
Posted by: Conrad | March 12, 2008 at 02:31 PM
In other words, they are talking about tail events. That a frequentist probability (i.e., a long-run frequency) even exists can be a zero-probability event -- but you have to give axioms for probability before you can even make this claim. (Furthermore, I'm never going to observe a tail event, so I don't much care about them.)
Posted by: Cyan | March 12, 2008 at 02:33 PM
Conrad,
Okay, so unpack "ungrounded" for me. You've used the phrases "probability" and "calculated or measured likelihood of heads coming up", but I'm not sure how you're defining them.
I'm going to do two things. First, I'm going to Taboo "probability" and "likelihood" (for myself -- you too, if you want). Second, I'm going to ask you exactly which specific observable event it is we're talking about. (First toss? Twenty-third toss? Infinite collection of tosses?) I have a definite feeling that our disagreement is about word usage.
Posted by: Cyan | March 12, 2008 at 02:48 PM
If you honestly subscribe to this view of probability, please never give the odds for winning the lottery again. Or any odds for anything else.
What does telling me your probability that you assign something actually tell me about the world? If I don't know the information you are basing it on, very little.
I'm also curious about a formulation of probability theory that completely ignores random numbers and other theories that are based upon them (e.g. The law of large numbers, Central limit theorem).
Heck a re-write of http://en.wikipedia.org/wiki/Probability_theory with all mention of probabilities in the external world removed might be useful.
Posted by: Will Pearson | March 12, 2008 at 02:55 PM
I'm not sure the many-worlds interpretation fully eliminates the issue of quantum probability as part of objective reality. You can call it "anthropic pseudo-uncertainty" when you get split and find that your instances face different outcomes. But what determines the *probability* you will see those various outcomes? Just your state of knowledge? No, theory says it is an objective element of reality, the amplitude of the various elements of the quantum wave function. This means that probability, or at least its close cousin amplitude, is indeed an element of reality and is more than just a representation of your state of knowledge.
For aficionados of interpretations of QM, this relates to an old debate, whether the so-called "Born rule" can be derived from the MWI. Various arguments have been offered for this, including one by Robin, and some have claimed that these now work so well that the argument is settled. However I don't think the larger physics/philosophy community is convinced.
Posted by: Hal Finney | March 12, 2008 at 02:58 PM
Will Pearson, I'm having trouble determining to whom your comment is addressed.
Posted by: Cyan | March 12, 2008 at 03:04 PM
Roland and Ian C. both help me understand where Eliezer is coming from. And PK's comment that "Reality will only take a single path" makes sense. That said, when I say a die has a 1/6 probability of landing on a 3, that means: Over a series of rolls in which no effort is made to systematically control the outcome (e.g. by always starting with 3 facing up before tossing the die), the die will land on a 3 about 1 in 6 times. Obviously, with perfect information, everything can be calculated. That doesn't mean that we can't predict the probability of a specific event.
Also, I didn't get a response to the Gomboc ( http://tinyurl.com/2rffxs ) argument. I would say that it has an inherent 100% probability of righting itself. Even if I knew nothing about the object, the real probability of it righting itself is 100%. Now, I might not bet on those odds, without previous knowledge, but no matter what I know, the object will right itself. How is this incorrect?
Posted by: GBM | March 12, 2008 at 03:13 PM
::Okay, so unpack "ungrounded" for me. You've used the phrases "probability" and "calculated or measured likelihood of heads coming up", but I'm not sure how you're defining them.::
Ungrounded: That was a good movie. Grounded: That movie made money for the investors. Alternatively: I enjoyed it and recommend it. -- is for most purposes grounded enough.
::I'm going to do two things. First, I'm going to Taboo "probability" and "likelihood" (for myself -- you too, if you want). Second, I'm going to ask you exactly which specific observable event it is we're talking about. (First toss? Twenty-third toss? Infinite collection of tosses?) I have a definite feeling that our disagreement is about word usage.::
You yourself said that we're dealing with one throw of a rigged coin, of unknown riggage. I don't think we have have a disagreement, exactly, except it looks to me like the discussion's moot.
But look: if I can back up a bit, the notion that we can be dealing with a rigged coin, know that it's rigged, and say that the --er, chances-- of getting a heads is "really" 50%, because we Just Don't Know, is useless. At that point you're using 50-50 because we have two possible known outcomes:
But in fact we deal with unknown probabilities *all the time*. Probabilities are by default unknown, until we measure them by repeated trial and a lot of scratch-work. What about when you're dealing with a medication that might kill someone, or not: in the absence of any information, do you say that's 50-50?
Conrad.
Posted by: Conrad | March 12, 2008 at 03:36 PM
GBM:: ..That said, when I say a die has a 1/6 probability of landing on a 3, that means: Over a series of rolls in which no effort is made to systematically control the outcome (e.g. by always starting with 3 facing up before tossing the die), the die will land on a 3 about 1 in 6 times.::
--Well, no: it does mean that, but don't let's get tripped up that a measure of probability requires a series of trials. It has that same probability even for one roll. It's a consequence of the physics of the system, that there are 6 stable distinguishable end-states and explosively many intermediate states, transitioning amongst each other chaotically.
Conrad.
Posted by: Conrad | March 12, 2008 at 03:48 PM
I have to say that it sure does seem like there ought to be some way to use the notion of frequentist probability to construct subjective probability along these lines.
Assign a measure to each possible world (the prior probabilities). For some state of knowledge K, some set of worlds Ck is consistent with K (say, the set in which there is a brain containing K). For some proposition X, X is true in some set of worlds Cx. The subjective probability P(X|K) = measure(intersection(Ck,Cx)) / measure(Ck). Bayesian updating is equivalent to removing worlds from K. To make it purely frequentist, give each world measure 1 and use multisets.
Does that work?
Posted by: Nick Tarleton | March 12, 2008 at 04:12 PM
Who else thinks we should Taboo "probability", and replace it two terms for objective and subjective quantities, say "frequency" and "uncertainty"?
The frequency of an event depends on how narrowly the initial conditions are defined. If an atomically identical coin flip is repeated, obviously the frequency of heads will be either 1 or 0 (modulo a tiny quantum uncertainty).
Posted by: Nick Tarleton | March 12, 2008 at 04:24 PM
Oops, removing worlds from Ck, not K.
Posted by: Nick Tarleton | March 12, 2008 at 04:27 PM
GBM, I think you get the idea. The reason we don't want to say that the gomboc has an inherent probability of one for righting itself (besides that we, um, don't use probability one), is that as it is with the gomboc, so it is with the die or anything else in the universe. The premise is that determinism, in the form of some MWI, is (probably!) true, and so no matter what you or anyone else knows, whatever will happen is sure to happen. Therefore, when we speak of probability, we can only be referring to a state of knowledge. It is still of course the case that if you toss a fair die a very large number of times, the proportion of threes you get will tend towards 1/6--we're just not using such cases as a definition of what probability means.
Having already written the above, I must add that I like Nick's just-posted frequency/uncertainty breakdown.
Posted by: Z. M. Davis | March 12, 2008 at 04:32 PM
Cyan, sorry. My comment was to Eliezer and statements such as
"that probabilities express ignorance, states of partial information; and if I am ignorant of a phenomenon, that is a fact about my state of mind, not a fact about the phenomenon."
Posted by: Will Pearson | March 12, 2008 at 04:35 PM
I think there's still room for a concept of objective probability -- you'd define it as anything that obeys David Lewis's "Principal Principle" which this page tries to explain (with respect to some natural distinction between "admissible" and "inadmissible" information).
Posted by: steven | March 12, 2008 at 04:52 PM
Before accepting this view of probability and the underlying assumptions about the nature of reality one should look at the experimental evidence.
Try Groeblacher, Paterek, et al arXiv.0704.2529 (Aug 6 2007)
These experiments test various assumptions regarding non=local realism and conclude=
"...giving up the concept of locality is not sufficient to be consistent with quantum experiments, unless certain intuitive features of realism are abandoned"
Posted by: sonic | March 12, 2008 at 04:55 PM
Standard reply from MWIers is that MWI keeps realism and locality by throwing away a different hidden assumption called "counterfactual definiteness".
Posted by: steven | March 12, 2008 at 05:02 PM
Nick Tarleton:
Who else thinks we should Taboo "probability", and replace it two terms for objective and subjective quantities, say "frequency" and "uncertainty"?
I second that, this would probably clear a lot of the confusion and help us focus on the real issues.
Posted by: roland | March 12, 2008 at 05:34 PM
The "probability" of an event is how much anticipation you have for that event occurring. For example if you assign a "probability" of 50% to a tossed coin landing heads then you are half anticipating the coin to land heads.
Posted by: PK | March 12, 2008 at 05:56 PM
You've already given me information by using the word medication -- implicity, you're asking me to recall what I know about medications before I render an answer. So no, those outcomes aren't *necessarily* equally plausible to me. Here's a situation which is a much better approximation(!) of total absence of information: either event Q or event J has happened just now, and I will tell you which in my next comment. The asymmetry in your information is just that I chose the label Q for the first event and J for the second event. Which event do you find more plausible? I'd like you to justify your choice, pretending (if necessary) that I am honest in this instance.
Posted by: Cyan | March 12, 2008 at 05:58 PM
Now, if at least one child is a boy, it must be either the oldest child who is a boy, or the youngest child who is a boy. So how can the answer in the first case be different from the answer in the latter two?
Because they obviously aren't exclusive cases. I simply don't see mathematically why it's a paradox, so I don't see what this has to do with thinking that "probabilities are a property of things."
The "paradox" is that people want to compare it to a different problem, the problem where the cards are ordered. In that case, if you ask "Is your first card an ace," "Is your first card the ace of hearts," or "Is your first card the ace of spades," then there is the same probability of 1/3 in all three cases that both cards are aces given an answer "Yes." In that case the averaging makes sense because the cases are exclusive. In the "paradox," you can't average by saying that, "well, if there's one it's either the Ace of Spades or the Ace of Hearts, and in either case the answer would be 1/3, so it averages to 1/3." The problem is that you're double-counting.
I'm a Bayesian, but I don't see what this particular example has to do with subjectivity and agents. Probability is a result of the measure and the universe one is dealing with, and that may lead to results that seem unintuitive to those who don't grasp the mathematical principles (that seem obvious to me), but that has nothing to do with needing an agent. Define the measure space as you have done, claim that the probabilities are cold hard inherent facts about the objects themselves, and the result is independent of an agent.
This "paradox" seems on the same level to me as the confusion as to why the chances of rolling a 6 in three rolls of a die is not 1/2, or the problem that if one takes an outbound trip averaging 30 mph, then it is impossible to make the inbound trip so as to average 60 mph without teleporting instantaneously.
Posted by: John Thacker | March 12, 2008 at 06:48 PM
Or, I suppose, I would compare it to the other noted statistical paradox, whereby a famous hospital has a better survival rate for both mild and severe cases of a disease than a less-noted hospital, but a worse overall survival rate because it sees more of the worst cases. Merely because people don't understand how to do averages has little to do with them requiring an agent.
Posted by: John Thacker | March 12, 2008 at 06:51 PM
The estimated Bayesian probability has nothing to do with the coin. If it did, assigning a probability of 0.5 to one of the two possible outcomes would be necessarily incorrect, because one of the few things we know about the coin is that it's not fair.
The estimate is of our confidence in using that outcome as an answer. "How confident can I be that choosing this option will turn out to be correct?" We know that the coin is biased, but we don't know which outcome is more likely. As far as we know, then, guessing one way is as good as guessing the other.
The sides of the coin do have an actual probability associated with them, which is why it's wrong to say that one particular outcome is more likely. That's a truth statement that we can't justify with the available data. Without knowing more about the coin, we can't speak about it. We can only speak to our confidence and how justified it is with the data we know.
The assertion that uncertainty is not an aspect of reality goes far beyond what anyone can justify, and is an example of gross overconfidence in one's opinions, btw.
Posted by: Caledonian | March 12, 2008 at 07:00 PM
Another way to look at it: if you repeatedly select a coin with a random bias (selected from any distribution symmetric about .5) and flip it, H/T will come out 50/50.
Posted by: Nick Tarleton | March 12, 2008 at 07:28 PM
Silas: The uncertainty principle comes from the fact that position and momentum are related by Fourier transform. Or, in laymans terms, the fact that particles act like waves. This is one of the fundamental principles of QM, so yeah, it sort of does depend on the validity thereof. Not the Schrodinger equation itself perhaps, but other concepts.
As for whether QM proves that all probabilities are inherent in a system, it doesn't. It just prevents mutual information in certain situations. In coin flips or dice rolls, theoretically you could predict the outcome with enough information. Most probabilistic situations are that way; they're probabilistic because you don't have that info. QM is a bit different, and scientists still argue about it, but the fine detail of behavior of atoms doesn't have any effect on a poker game.
Posted by: Enginerd | March 12, 2008 at 08:56 PM
Z. M. Davis: Thank you. I get it now.
Posted by: GBM | March 12, 2008 at 09:29 PM
Follow-up question: If Bob believes he has a >50% chance of winning the lottery tomorrow, is his belief objectively wrong? I would tentatively propose that his belief is unfounded, "unattached to reality", unwise, and unreasonable, but that it's not useful to consider his belief "objectively wrong".
If you disagree, consider this: suppose he wins the lottery after all by chance, can you still claim the next day that his belief was objectively wrong?
Posted by: Rolf Nelson | March 12, 2008 at 11:41 PM
Nick Tarleton: Not sure I entirely correctly understood your suggestion, need to think about it more.
However, my initial thought is that it may require/assume logical omnicience.
ie, what of updating based on "subjective guesses" of which worlds are consistent or inconsistent with the data. That is, as consistent as you can tell, given bounded computational resources. I'm not sure, but your model, at least at first glance, may not be able to say useful stuff about those that are not logically ominicent.
Also, I'm unclear, could you clarify what it is you'd be using a multiset for? Do you mean "increase measure only by increasing number of copies of this in the multiset, and no other means allowed" or did you intend something else?
(incidentally, I think I do prefer coherence/dutch book/vulnerability style constructions of epistemic probability. Especially the ones that build up decision theory along the way, so one ends up starting with utilities almost. Such have very much of a "mathematical karma" flavor, as I've expressed elsewhere.)
Posted by: Psy-Kosh | March 13, 2008 at 03:56 AM
Hal, I'd say probability could be both part of objective physics and a mental state in this sense: Given our best understanding of objective physics, for any given mental state (including the info it has access to) there is a best rational set of beliefs. In quantum mechanics we know roughly the best beliefs, and we are trying to use that to infer more about the underlying set of states and info.
Posted by: Robin Hanson | March 13, 2008 at 06:45 AM
Rolf Nelson:
"Follow-up question: If Bob believes he has a >50% chance of winning the lottery tomorrow, is his belief objectively wrong? I would tentatively propose that his belief is unfounded, "unattached to reality", unwise, and unreasonable, but that it's not useful to consider his belief "objectively wrong"."
It all depends on what information Bob has. He might have carefully doctored the machines and general setup of the lottery draw to an extent that he might have enough information to have that probability. Now if Bob says he thinks he has a greater than 50% chance of winning the lottery because he is feeling lucky, and that is it, you can probably say that is unattached to reality or ignoring lots of relevant information.
Posted by: Will Pearson | March 13, 2008 at 09:04 AM
However, my initial thought is that it may require/assume logical omnicience.
Probably. Bayes is also easier to work with if you assume logical omniscience (i.e. knowledge of P(evidence|X) and P(E|~X)).
Also, I'm unclear, could you clarify what it is you'd be using a multiset for? Do you mean "increase measure only by increasing number of copies of this in the multiset, and no other means allowed" or did you intend something else?
Yes, using multisets of worlds with identical measure is equivalent to (for rational measures only) but 'more frequentist' than sets of worlds with variable measure.
incidentally, I think I do prefer coherence/dutch book/vulnerability style constructions of epistemic probability. Especially the ones that build up decision theory along the way, so one ends up starting with utilities almost. Such have very much of a "mathematical karma" flavor, as I've expressed elsewhere.
Yeah, my idea was only meant as an existence proof and is probably an inferior formal construction, although it is how I tend to personally think about subjective probability. I guess at heart I'm still a frequentist.
(You could think about Rolf's problem this way; if the vast majority of the measure of possible worlds given Bob's knowledge is in worlds where he loses, he's objectively wrong.)
Posted by: Nick Tarleton | March 13, 2008 at 09:39 AM
Nick Tarleton: Yeah, I know assuming logical omniciense makes some stuff easier, I simply wanted a frequentist derivation that didn't require it. That is, that could handle, what would one call it, computational uncertainty?
And yes, I know it was an existance proof. I was also arguing that, as a response to Eliezer's claim about it being impossible to get epistemic probability from frequentist probability. It sure at least _seems_ like something similar to what you suggested ought to work.
The thingie about vulnerability arguments was just an aside. I just happen to like them, they tend to have that "mathematical karma" flavor (you don't have to listen, but if you don't, "mathematical karma's going to get you and take away your pennies") not to mention that they generally are comparatively simple. Each step to produce another of the rules only takes a few lines of argument, and these sorts of derivations seem to not require anything tougher than basic linear algeabra.
Posted by: Psy-Kosh | March 13, 2008 at 12:26 PM