<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Different meanings of Bayesian statistics</title>
	<atom:link href="http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html</link>
	<description>Overcoming Bias is economist Robin Hanson’s blog, on honesty, signaling, disagreement, forecasting, and the far future.</description>
	<lastBuildDate>Fri, 12 Mar 2010 18:47:19 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Alexander Kruel &#183; A Guide to Bayes&#8217; Theorem &#8211; A few links</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-443416</link>
		<dc:creator>Alexander Kruel &#183; A Guide to Bayes&#8217; Theorem &#8211; A few links</dc:creator>
		<pubDate>Sat, 27 Feb 2010 11:44:14 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-443416</guid>
		<description>[...] Different meanings of Bayesian statistics: overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html [...]</description>
		<content:encoded><![CDATA[<p>[...] Different meanings of Bayesian statistics: overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cyan</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388194</link>
		<dc:creator>Cyan</dc:creator>
		<pubDate>Thu, 29 Jan 2009 01:39:26 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388194</guid>
		<description>Daniel, thanks for your perspective; it gives me lots to ponder.
</description>
		<content:encoded><![CDATA[<p>Daniel, thanks for your perspective; it gives me lots to ponder.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Daniel Burfoot</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388193</link>
		<dc:creator>Daniel Burfoot</dc:creator>
		<pubDate>Thu, 29 Jan 2009 01:22:49 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388193</guid>
		<description>Cyan,

See, this kind of terminological disagreement illustrates why I think it&#039;s better to use the codelength idea :-)

Can normalized maximum likelihood be used to send data? If so, then it implies an implicit prior over data sets which is exactly 2^(-l(x)), where l(x) is the length of the code. Whether or not this means it is &quot;equivalent&quot; to Bayes would seem to depend on what the word &quot;Bayesian&quot; means to you; in my lexicon it means a philosophical commitment to the necessity of using prior distributions that are essentially arbitrary. Once you&#039;ve accepted that priors are necessary, then the rules for updating them are mathematical theorems which are no longer disputable.

Note that the above argument &quot;Can method X be used to send data? If so, then it implies an implicit prior over data sets...&quot; works for a wide range of methods X (e.g. Support Vector machines, Belief nets) which various people have claimed are not explicitly Bayesian.

It also means they are ALL subject to the mighty &lt;b&gt;No Free Lunch Theorem&lt;/b&gt; which says roughly that in general, data compression cannot be achieved. All modeling and statistical learning techniques should therefore be prefaced by disclaimers noting that &quot;this method does not work in general, but if we make certain assumptions about the nature of the process generating the data...&quot;

Andrew, thanks for starting this discussion, looking forward to future OB posts from you (don&#039;t tell Eliezer that you&#039;re into things like the Gibbs sampler and Metropolis algorithm, though).



</description>
		<content:encoded><![CDATA[<p>Cyan,</p>
<p>See, this kind of terminological disagreement illustrates why I think it&#8217;s better to use the codelength idea <img src='http://www.overcomingbias.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>Can normalized maximum likelihood be used to send data? If so, then it implies an implicit prior over data sets which is exactly 2^(-l(x)), where l(x) is the length of the code. Whether or not this means it is &#8220;equivalent&#8221; to Bayes would seem to depend on what the word &#8220;Bayesian&#8221; means to you; in my lexicon it means a philosophical commitment to the necessity of using prior distributions that are essentially arbitrary. Once you&#8217;ve accepted that priors are necessary, then the rules for updating them are mathematical theorems which are no longer disputable.</p>
<p>Note that the above argument &#8220;Can method X be used to send data? If so, then it implies an implicit prior over data sets&#8230;&#8221; works for a wide range of methods X (e.g. Support Vector machines, Belief nets) which various people have claimed are not explicitly Bayesian.</p>
<p>It also means they are ALL subject to the mighty <b>No Free Lunch Theorem</b> which says roughly that in general, data compression cannot be achieved. All modeling and statistical learning techniques should therefore be prefaced by disclaimers noting that &#8220;this method does not work in general, but if we make certain assumptions about the nature of the process generating the data&#8230;&#8221;</p>
<p>Andrew, thanks for starting this discussion, looking forward to future OB posts from you (don&#8217;t tell Eliezer that you&#8217;re into things like the Gibbs sampler and Metropolis algorithm, though).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mjgeddes</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388192</link>
		<dc:creator>mjgeddes</dc:creator>
		<pubDate>Thu, 29 Jan 2009 00:13:48 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388192</guid>
		<description>&lt;i&gt;It sounds to me as though you have your concept of reason muddled-in with ideas from decision theory. &quot;What is true&quot; and &quot;what to do&quot; are rather different issues.&lt;/i&gt;

You don&#039;t have to be a genius to see that there&#039;s something suspiciously incomplete about Bayes.  The Anthropic puzzles have not been solved using Bayesian methods, and Bayesian experts report being &#039;confused&#039; in certin cases such as the Doomsday argument.  This hints that there may be more powerful reasoning methods.

We know that Deduction is merely a special case of Induction (namely the case where the probabilities are set to 100%).  In other words, Deduction is merely a &lt;i&gt;shadow&lt;/i&gt; of Induction.  Could it be that Induction in turn is merely a &lt;i&gt;shadow&lt;/i&gt; of some as yet undiscovered more powerful method still?  If so, there would be some shifting parameter which is not &lt;b&gt;probability&lt;/b&gt;, but that when set to some special case would &lt;b&gt;look like&lt;/b&gt; a probability.

&lt;b&gt;semantic distance&lt;/b&gt; perhaps?

</description>
		<content:encoded><![CDATA[<p><i>It sounds to me as though you have your concept of reason muddled-in with ideas from decision theory. &#8220;What is true&#8221; and &#8220;what to do&#8221; are rather different issues.</i></p>
<p>You don&#8217;t have to be a genius to see that there&#8217;s something suspiciously incomplete about Bayes.  The Anthropic puzzles have not been solved using Bayesian methods, and Bayesian experts report being &#8216;confused&#8217; in certin cases such as the Doomsday argument.  This hints that there may be more powerful reasoning methods.</p>
<p>We know that Deduction is merely a special case of Induction (namely the case where the probabilities are set to 100%).  In other words, Deduction is merely a <i>shadow</i> of Induction.  Could it be that Induction in turn is merely a <i>shadow</i> of some as yet undiscovered more powerful method still?  If so, there would be some shifting parameter which is not <b>probability</b>, but that when set to some special case would <b>look like</b> a probability.</p>
<p><b>semantic distance</b> perhaps?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Kennaway</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388191</link>
		<dc:creator>Richard Kennaway</dc:creator>
		<pubDate>Wed, 28 Jan 2009 21:30:36 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388191</guid>
		<description>@Tyrrell, intuitionism:

Intuitionistic logic is the only one I can think of that has any possibility of competing for the office of &lt;i&gt;calculus ratiocinator&lt;/i&gt;. I can just about imagine conducting all one&#039;s thought on every meta-level without ever assuming that if a proposition cannot be false, it must be true. But a classicist can pass among intuitionists just by prefixing everything with not-not, and who&#039;s to know if a professed intuitionist isn&#039;t doing the same? Personally, I think intuitionism was a historical accident that would never have happened if Babbage&#039;s machines had been more practical, but that is another story.

@Sebastian: &lt;i&gt;My personal probability of Bayes&#039; theorem being incorrect is &#039;epsilon&#039;, i.e. nonzero but too low for me to bother tracking the exact order of magnitude.&lt;/i&gt;

If I&#039;m not assigning actual numbers, I&#039;m not doing probabilistic inference, even if I believe I am. Even if I have an actual epsilon not plucked out of the air, if I throw it away, I&#039;ve reverted to POML.

BTW, combining the last two points, I see from Google that there is such a thing as intuitionistic Bayesianism. I do not know how well known this is.

@Andrew: &lt;i&gt;All these systems can work, and they all have logical holes too.&lt;/i&gt;

Logical holes in POML?  Gödel&#039;s completeness theorem proves their absence.  (His incompleteness theorems talk about theories expressed in POML, not POML itself.)  Did you have something else in mind?
</description>
		<content:encoded><![CDATA[<p>@Tyrrell, intuitionism:</p>
<p>Intuitionistic logic is the only one I can think of that has any possibility of competing for the office of <i>calculus ratiocinator</i>. I can just about imagine conducting all one&#8217;s thought on every meta-level without ever assuming that if a proposition cannot be false, it must be true. But a classicist can pass among intuitionists just by prefixing everything with not-not, and who&#8217;s to know if a professed intuitionist isn&#8217;t doing the same? Personally, I think intuitionism was a historical accident that would never have happened if Babbage&#8217;s machines had been more practical, but that is another story.</p>
<p>@Sebastian: <i>My personal probability of Bayes&#8217; theorem being incorrect is &#8216;epsilon&#8217;, i.e. nonzero but too low for me to bother tracking the exact order of magnitude.</i></p>
<p>If I&#8217;m not assigning actual numbers, I&#8217;m not doing probabilistic inference, even if I believe I am. Even if I have an actual epsilon not plucked out of the air, if I throw it away, I&#8217;ve reverted to POML.</p>
<p>BTW, combining the last two points, I see from Google that there is such a thing as intuitionistic Bayesianism. I do not know how well known this is.</p>
<p>@Andrew: <i>All these systems can work, and they all have logical holes too.</i></p>
<p>Logical holes in POML?  Gödel&#8217;s completeness theorem proves their absence.  (His incompleteness theorems talk about theories expressed in POML, not POML itself.)  Did you have something else in mind?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrew</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388190</link>
		<dc:creator>Andrew</dc:creator>
		<pubDate>Wed, 28 Jan 2009 16:30:38 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388190</guid>
		<description>This has been an interesting discussion, revealing to me that the participants in this forum have a much different perspective on Bayes, compared to the perspectives of Christian Robert and I have.

I have little to add to the discussion except to comment on Richard Kennaway&#039;s statement that &quot;standard mathematical logic does [model valid reasoning in general].&quot;

One thing I&#039;ve learned in applied statistics is that there are lots of different logical frameworks that can work well.  So, sure, mathematical logic can model valid reasoning, so can Bayes, so can fuzzy sets, machine learning, etc.  All these systems can work, and they all have logical holes too.  It&#039;s the nature of inference.
</description>
		<content:encoded><![CDATA[<p>This has been an interesting discussion, revealing to me that the participants in this forum have a much different perspective on Bayes, compared to the perspectives of Christian Robert and I have.</p>
<p>I have little to add to the discussion except to comment on Richard Kennaway&#8217;s statement that &#8220;standard mathematical logic does [model valid reasoning in general].&#8221;</p>
<p>One thing I&#8217;ve learned in applied statistics is that there are lots of different logical frameworks that can work well.  So, sure, mathematical logic can model valid reasoning, so can Bayes, so can fuzzy sets, machine learning, etc.  All these systems can work, and they all have logical holes too.  It&#8217;s the nature of inference.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cyan</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388189</link>
		<dc:creator>Cyan</dc:creator>
		<pubDate>Wed, 28 Jan 2009 14:49:30 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388189</guid>
		<description>Thanks, Daniel and Manuel!

Daniel, I&#039;ve read Grünwald&#039;s tutorial, and it&#039;s clear that MDL is not Bayesian: just look at the normalized maximum likelihood distribution, which when it exists solves the MDL problem -- and violates the likelihood principle, as it requires a summation over the data space. (...And does not require an explicit prior; not sure if there&#039;s an implicit prior determined by the choice of data format, per your statement.) Rissanen is pretty contemptuous of the Bayesian approach (and the frequentist approach and the likelihood approach; dude has some strong opinions). I was hoping for a nice 70 page Grünwald-style tutorial on MML, as I have not been able to find one myself.
</description>
		<content:encoded><![CDATA[<p>Thanks, Daniel and Manuel!</p>
<p>Daniel, I&#8217;ve read Grünwald&#8217;s tutorial, and it&#8217;s clear that MDL is not Bayesian: just look at the normalized maximum likelihood distribution, which when it exists solves the MDL problem &#8212; and violates the likelihood principle, as it requires a summation over the data space. (&#8230;And does not require an explicit prior; not sure if there&#8217;s an implicit prior determined by the choice of data format, per your statement.) Rissanen is pretty contemptuous of the Bayesian approach (and the frequentist approach and the likelihood approach; dude has some strong opinions). I was hoping for a nice 70 page Grünwald-style tutorial on MML, as I have not been able to find one myself.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sebastian Hagen</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388188</link>
		<dc:creator>Sebastian Hagen</dc:creator>
		<pubDate>Wed, 28 Jan 2009 13:35:16 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388188</guid>
		<description>&lt;blockquote&gt;What is the probability that P(A&#124;B) = P(B&#124;A) P(A)/P(B)? One.&lt;/blockquote&gt;
This is just an approximation.&lt;br/&gt;
The only way to check a mathematical proof is by feeding it to a proof-verifying physical system. That system could e.g. be a program running on a digital computer or a human (including yourself). Unless you have perfect and absolutely reliable knowledge about both the physical system and the physical laws of this universe - which you can&#039;t, in practice - there&#039;s always the chance that the system in question does it wrong; in this case, that it outputs &quot;proof is correct&quot; even though the proof isn&#039;t.&lt;br/&gt;
Feeding the same possible proof to very many different proof-verifying physical systems can dramatically decrease the probability of an incorrect verdict like that. It can&#039;t push it to exactly zero.&lt;br/&gt;
My personal probability of Bayes&#039; theorem being incorrect is &#039;epsilon&#039;, i.e. nonzero but too low for me to bother tracking the exact order of magnitude.
</description>
		<content:encoded><![CDATA[<blockquote><p>What is the probability that P(A|B) = P(B|A) P(A)/P(B)? One.</p></blockquote>
<p>This is just an approximation.<br />
The only way to check a mathematical proof is by feeding it to a proof-verifying physical system. That system could e.g. be a program running on a digital computer or a human (including yourself). Unless you have perfect and absolutely reliable knowledge about both the physical system and the physical laws of this universe &#8211; which you can&#8217;t, in practice &#8211; there&#8217;s always the chance that the system in question does it wrong; in this case, that it outputs &#8220;proof is correct&#8221; even though the proof isn&#8217;t.<br />
Feeding the same possible proof to very many different proof-verifying physical systems can dramatically decrease the probability of an incorrect verdict like that. It can&#8217;t push it to exactly zero.<br />
My personal probability of Bayes&#8217; theorem being incorrect is &#8216;epsilon&#8217;, i.e. nonzero but too low for me to bother tracking the exact order of magnitude.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tyrrell McAllister</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388187</link>
		<dc:creator>Tyrrell McAllister</dc:creator>
		<pubDate>Wed, 28 Jan 2009 12:24:16 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388187</guid>
		<description>@Richard Kennaway

I think the question of whether probabilistic reasoning or POML reasoning is more fundamental is not so straightforward.  When one searches for a &quot;most&quot; fundamental logic, one finds that the structure of the candidate systems becomes &quot;loopy&quot;, with each perfectly capable of being embedded within the others.

For example, should you go with POML or intuitionistic reasoning in mathematics?  There&#039;s no formal criterion.  Every intuitionistic theorem is a classical theorem once you restrict the quantifiers properly.  Every classical theorem is an intuitionistic theorem if you replace &quot;P or Q&quot; with &quot;not-(not P and not-Q).
</description>
		<content:encoded><![CDATA[<p>@Richard Kennaway</p>
<p>I think the question of whether probabilistic reasoning or POML reasoning is more fundamental is not so straightforward.  When one searches for a &#8220;most&#8221; fundamental logic, one finds that the structure of the candidate systems becomes &#8220;loopy&#8221;, with each perfectly capable of being embedded within the others.</p>
<p>For example, should you go with POML or intuitionistic reasoning in mathematics?  There&#8217;s no formal criterion.  Every intuitionistic theorem is a classical theorem once you restrict the quantifiers properly.  Every classical theorem is an intuitionistic theorem if you replace &#8220;P or Q&#8221; with &#8220;not-(not P and not-Q).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Kennaway</title>
		<link>http://www.overcomingbias.com/2009/01/different-meanings-of-bayesian-statistics.html#comment-388186</link>
		<dc:creator>Richard Kennaway</dc:creator>
		<pubDate>Wed, 28 Jan 2009 11:34:07 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2009/01/different-meanings-of-bayesian-statistics.html#comment-388186</guid>
		<description>anonym: &lt;i&gt;I think part of the &quot;mystical&quot; feeling probably comes from the realization of how widely applicable Bayes&#039; theorem is and a sense that it can function as something like the foundation of a calculus of thought such as people like Leibniz and Boole have sought for so long and can be applied to every aspect of thought.&lt;/i&gt;

&quot;Calculus of thought&quot; is a solved problem.  What Leibniz sought, Boole found, and Frege, Russell, and Whitehead brought to completion: the calculus of thought is propositional and first-order predicate calculus.  That is, the calculus with which we must reason, to reason validly, not the ways in which we do reason, which is whatever our meatware does, valid or not, and is not a calculus.

A problem with regarding Bayesian reasoning or anything else -- quantum logic, modal logic, or whatever --  as the calculus of thought is that on the metalevel, we always go on reasoning in POML: plain old mathematical logic, where things are simply true, or false. Bayes&#039; theorem itself is of this nature. It is about probabilities, but is not a probabilistic statement.  What is the probability that P(A&#124;B) = P(B&#124;A) P(A)/P(B)?  One.  Bayesian and other calculi are mathematically accurate models of certain aspects of the world, but they do not model valid reasoning in general.  Standard mathematical logic does.
</description>
		<content:encoded><![CDATA[<p>anonym: <i>I think part of the &#8220;mystical&#8221; feeling probably comes from the realization of how widely applicable Bayes&#8217; theorem is and a sense that it can function as something like the foundation of a calculus of thought such as people like Leibniz and Boole have sought for so long and can be applied to every aspect of thought.</i></p>
<p>&#8220;Calculus of thought&#8221; is a solved problem.  What Leibniz sought, Boole found, and Frege, Russell, and Whitehead brought to completion: the calculus of thought is propositional and first-order predicate calculus.  That is, the calculus with which we must reason, to reason validly, not the ways in which we do reason, which is whatever our meatware does, valid or not, and is not a calculus.</p>
<p>A problem with regarding Bayesian reasoning or anything else &#8212; quantum logic, modal logic, or whatever &#8212;  as the calculus of thought is that on the metalevel, we always go on reasoning in POML: plain old mathematical logic, where things are simply true, or false. Bayes&#8217; theorem itself is of this nature. It is about probabilities, but is not a probabilistic statement.  What is the probability that P(A|B) = P(B|A) P(A)/P(B)?  One.  Bayesian and other calculi are mathematically accurate models of certain aspects of the world, but they do not model valid reasoning in general.  Standard mathematical logic does.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
