<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Fig Leaf Models</title>
	<atom:link href="http://www.overcomingbias.com/2007/02/fig_leaf_models.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html</link>
	<description>Overcoming Bias is economist Robin Hanson’s blog, on honesty, signaling, disagreement, forecasting, and the far future.</description>
	<lastBuildDate>Sat, 11 Feb 2012 23:23:58 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
	<item>
		<title>By: Richard Hollerith</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421745</link>
		<dc:creator>Richard Hollerith</dc:creator>
		<pubDate>Sat, 24 Feb 2007 17:56:11 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421745</guid>
		<description>Stuart: it is testing a model against predictions (not retrodictions) that guards against human biases (in the choice of model).
</description>
		<content:encoded><![CDATA[<p>Stuart: it is testing a model against predictions (not retrodictions) that guards against human biases (in the choice of model).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stuart Armstrong</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421744</link>
		<dc:creator>Stuart Armstrong</dc:creator>
		<pubDate>Sat, 24 Feb 2007 09:21:00 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421744</guid>
		<description>Another take entirely (just one page):

&lt;a href=&quot;http://www.dartmouth.edu/~jcarey/simple%20models%20(Carey).pdf&quot; rel=&quot;nofollow&quot;&gt;A Note on Simple Models&lt;/a&gt;

What implicitly come out from the example he uses is that models are often simple from the point of view of those making them, but only them (Bayseian analysis, for instance, is only simple once you&#039;ve properly understood it). Might be another bias; not sure of its implications here.
</description>
		<content:encoded><![CDATA[<p>Another take entirely (just one page):</p>
<p><a href="http://www.dartmouth.edu/~jcarey/simple%20models%20(Carey).pdf" rel="nofollow">A Note on Simple Models</a></p>
<p>What implicitly come out from the example he uses is that models are often simple from the point of view of those making them, but only them (Bayseian analysis, for instance, is only simple once you&#8217;ve properly understood it). Might be another bias; not sure of its implications here.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stuart Armstrong</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421743</link>
		<dc:creator>Stuart Armstrong</dc:creator>
		<pubDate>Sat, 24 Feb 2007 09:09:32 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421743</guid>
		<description>&lt;i&gt;You are positing an undocumented bias to too easily believe simple models, and suggesting we correct for that bias by avoiding simple models, relative to their other benefits.&lt;/i&gt;

Ah, I see. You were suggesting that complicated models are prone to bias because of what their creators put into them; I was suggesting that simple models are prone to bias because they are less likely to be rejected when contrary evidence arrives.

Is this the case? Politically, it does seem that people do cling to simpler models rather than complicated ones - the enthusiasm generated by models like &quot;invade iraq to bring peace&quot; and &quot;get out of iraq to bring peace&quot; is all out of proportion to that of &quot;let us negociate a rational compromise, involving this, this and this and even this, to bring peace.&quot; On a personal note, my bosses would continually try and get the simpler model they understood adopted, rather than the more complicated ones that were really called for.

But it would be good to get this studied properly (could not easily find any studies online, don&#039;t have access to most journals). Then we could put up procedures to avoid both types of bias, if warranted (restricting complex models to retrodictions, not predictions, and publicly announcing the failure of a simple model when it does fail - rather than the usual &quot;the model needs to be refined&quot;; maybe, but the model, as is, has failed).

But I think the most important is a splitting of the predictions of a model from its assumptions, as the second are prone to bias and the first are testable. I feel that this is more easy to do in complicated models.
</description>
		<content:encoded><![CDATA[<p><i>You are positing an undocumented bias to too easily believe simple models, and suggesting we correct for that bias by avoiding simple models, relative to their other benefits.</i></p>
<p>Ah, I see. You were suggesting that complicated models are prone to bias because of what their creators put into them; I was suggesting that simple models are prone to bias because they are less likely to be rejected when contrary evidence arrives.</p>
<p>Is this the case? Politically, it does seem that people do cling to simpler models rather than complicated ones &#8211; the enthusiasm generated by models like &#8220;invade iraq to bring peace&#8221; and &#8220;get out of iraq to bring peace&#8221; is all out of proportion to that of &#8220;let us negociate a rational compromise, involving this, this and this and even this, to bring peace.&#8221; On a personal note, my bosses would continually try and get the simpler model they understood adopted, rather than the more complicated ones that were really called for.</p>
<p>But it would be good to get this studied properly (could not easily find any studies online, don&#8217;t have access to most journals). Then we could put up procedures to avoid both types of bias, if warranted (restricting complex models to retrodictions, not predictions, and publicly announcing the failure of a simple model when it does fail &#8211; rather than the usual &#8220;the model needs to be refined&#8221;; maybe, but the model, as is, has failed).</p>
<p>But I think the most important is a splitting of the predictions of a model from its assumptions, as the second are prone to bias and the first are testable. I feel that this is more easy to do in complicated models.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eliezer Yudkowsky</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421742</link>
		<dc:creator>Eliezer Yudkowsky</dc:creator>
		<pubDate>Fri, 23 Feb 2007 20:48:18 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421742</guid>
		<description>All,

A substantial amount of effort in statistics and machine learning goes into just &lt;i&gt;figuring out&lt;/i&gt; how much freedom a model has, so that it can be penalized accordingly - for example, the &quot;effective degrees of freedom&quot; of regularized linear regressions.

Once you know the number of the effective degrees of freedom, there&#039;s the question of how to penalize them - two popular approaches are the Akaike Information Criterion (which underestimates the penalty) and the Bayesian Information Criterion (which overestimates it).  Some major approaches don&#039;t count degrees of freedom at all.  Vapnik-Chervonenkis dimension views the affair from the standpoint of the data, not the model, asking what is the most complex data that the model class can fit exactly - if a model involves a hundred variables, but can only fit a very limited class of potential outcomes, we should pay more attention to it than a hundred-variable model which can precisely fit a much broader class of outcomes.  The more advanced versions of Minimum Description Length take account of the precisions of parameters and predictions, not just their number (but are often extremely hard to compute).

However, I think everyone pretty much agrees on that:

(1) All else being equal, having more &lt;i&gt;effective&lt;/i&gt; freedom (not the same as the raw number of variables in the model) is bad for you, just as, all else being equal, having more data is good for you;
(2) There is a deep connection between the &lt;i&gt;relative&lt;/i&gt; simplicity of a model, compared to the amount of training data it is fitted to, and that model&#039;s ability to generalize to new data.
</description>
		<content:encoded><![CDATA[<p>All,</p>
<p>A substantial amount of effort in statistics and machine learning goes into just <i>figuring out</i> how much freedom a model has, so that it can be penalized accordingly &#8211; for example, the &#8220;effective degrees of freedom&#8221; of regularized linear regressions.</p>
<p>Once you know the number of the effective degrees of freedom, there&#8217;s the question of how to penalize them &#8211; two popular approaches are the Akaike Information Criterion (which underestimates the penalty) and the Bayesian Information Criterion (which overestimates it).  Some major approaches don&#8217;t count degrees of freedom at all.  Vapnik-Chervonenkis dimension views the affair from the standpoint of the data, not the model, asking what is the most complex data that the model class can fit exactly &#8211; if a model involves a hundred variables, but can only fit a very limited class of potential outcomes, we should pay more attention to it than a hundred-variable model which can precisely fit a much broader class of outcomes.  The more advanced versions of Minimum Description Length take account of the precisions of parameters and predictions, not just their number (but are often extremely hard to compute).</p>
<p>However, I think everyone pretty much agrees on that:</p>
<p>(1) All else being equal, having more <i>effective</i> freedom (not the same as the raw number of variables in the model) is bad for you, just as, all else being equal, having more data is good for you;<br />
(2) There is a deep connection between the <i>relative</i> simplicity of a model, compared to the amount of training data it is fitted to, and that model&#8217;s ability to generalize to new data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Robin Hanson</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421741</link>
		<dc:creator>Robin Hanson</dc:creator>
		<pubDate>Fri, 23 Feb 2007 19:15:22 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421741</guid>
		<description>Stuart, I was referring to your saying &quot;Simple models are ... very dangerous - people can understand them, find them plausible, and then decide they are true. And then defend the model even if it turns out to be false.&quot;  You are positing an undocumented bias to too easily believe simple models, and suggesting we correct for that bias by avoiding simple models, relative to their other benefits.
</description>
		<content:encoded><![CDATA[<p>Stuart, I was referring to your saying &#8220;Simple models are &#8230; very dangerous &#8211; people can understand them, find them plausible, and then decide they are true. And then defend the model even if it turns out to be false.&#8221;  You are positing an undocumented bias to too easily believe simple models, and suggesting we correct for that bias by avoiding simple models, relative to their other benefits.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stuart Armstrong</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421740</link>
		<dc:creator>Stuart Armstrong</dc:creator>
		<pubDate>Fri, 23 Feb 2007 19:11:21 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421740</guid>
		<description>&lt;i&gt;Stuart, I am reluctant to endorse explicit biases supposedly to correct poorly documented biases.&lt;/i&gt;

I&#039;m sorry, I don&#039;t quite get what you mean. I don&#039;t think I was advocating explicit biases (just the fact that lay people shouldn&#039;t feel they need to understand every assumption of a model). Can you clarify?
</description>
		<content:encoded><![CDATA[<p><i>Stuart, I am reluctant to endorse explicit biases supposedly to correct poorly documented biases.</i></p>
<p>I&#8217;m sorry, I don&#8217;t quite get what you mean. I don&#8217;t think I was advocating explicit biases (just the fact that lay people shouldn&#8217;t feel they need to understand every assumption of a model). Can you clarify?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stuart Armstrong</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421739</link>
		<dc:creator>Stuart Armstrong</dc:creator>
		<pubDate>Fri, 23 Feb 2007 19:08:01 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421739</guid>
		<description>&lt;i&gt;I would appreciate people sharing their impressions on the desirability and feasibility of taking humans out of the model-picking and model-refinement loop in selected fields, e.g., climate science.&lt;/i&gt;

I&#039;m entirely for it (as long as precautions are taken to test the models on enough results, vis Anders information point). It would take the ideology and biases out of the assumptions, and let the model live or die only on its results.

However, once you&#039;re got a model, then humans will pick it apart, try and understand it, and build new models. You need that for science to progress, but new biases will then be introduced - to hopefully be eliminated later.

As for feasibility... no idea! :-)
</description>
		<content:encoded><![CDATA[<p><i>I would appreciate people sharing their impressions on the desirability and feasibility of taking humans out of the model-picking and model-refinement loop in selected fields, e.g., climate science.</i></p>
<p>I&#8217;m entirely for it (as long as precautions are taken to test the models on enough results, vis Anders information point). It would take the ideology and biases out of the assumptions, and let the model live or die only on its results.</p>
<p>However, once you&#8217;re got a model, then humans will pick it apart, try and understand it, and build new models. You need that for science to progress, but new biases will then be introduced &#8211; to hopefully be eliminated later.</p>
<p>As for feasibility&#8230; no idea! <img src='http://www.overcomingbias.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Stuart Armstrong</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421738</link>
		<dc:creator>Stuart Armstrong</dc:creator>
		<pubDate>Fri, 23 Feb 2007 19:02:12 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421738</guid>
		<description>&lt;i&gt;I think the problem with complex models, as I said before, is that most &quot;predictions&quot; are actually retrodictions.&lt;/i&gt;

Very good point there. In the absence of easy testing, then simple models do have an advantage.

&lt;i&gt;It is the complex models for simple predictions that really should be avoided.&lt;/i&gt;

That feels generally true. But what about, say, old-fashioned military ballistics? A complicated model, fed with many variables (temperature, pressure, wind speed, etc...) whose sole prediction is where a shell will land on a two dimensional plane.

(And in principle (as long as it&#039;s tested against predictions, not retrodictions), there&#039;s nothing wrong with a model that would take the information of the entire world stock exchanges, plus the murder rate in every Brazilian town, to &lt;i&gt;accurately&lt;/i&gt; predict the price of fish tomorrow.)

A slight change of your information model would put the input information as &quot;the number of free parameters in the system&quot; and the output as &quot;the amount of tested predictions&quot;.

Then ballistics models are fine, as the amount of information output is basically infinite - just keep firing the gun, again and again.

Complicated economics models would be a problem though, as you might have to wait a long time to test enough predictions to have more information that your free parameters input. So complex models should have higher standards of proof than simple ones - requiring either complicated results, or lots and lots of simple results.
</description>
		<content:encoded><![CDATA[<p><i>I think the problem with complex models, as I said before, is that most &#8220;predictions&#8221; are actually retrodictions.</i></p>
<p>Very good point there. In the absence of easy testing, then simple models do have an advantage.</p>
<p><i>It is the complex models for simple predictions that really should be avoided.</i></p>
<p>That feels generally true. But what about, say, old-fashioned military ballistics? A complicated model, fed with many variables (temperature, pressure, wind speed, etc&#8230;) whose sole prediction is where a shell will land on a two dimensional plane.</p>
<p>(And in principle (as long as it&#8217;s tested against predictions, not retrodictions), there&#8217;s nothing wrong with a model that would take the information of the entire world stock exchanges, plus the murder rate in every Brazilian town, to <i>accurately</i> predict the price of fish tomorrow.)</p>
<p>A slight change of your information model would put the input information as &#8220;the number of free parameters in the system&#8221; and the output as &#8220;the amount of tested predictions&#8221;.</p>
<p>Then ballistics models are fine, as the amount of information output is basically infinite &#8211; just keep firing the gun, again and again.</p>
<p>Complicated economics models would be a problem though, as you might have to wait a long time to test enough predictions to have more information that your free parameters input. So complex models should have higher standards of proof than simple ones &#8211; requiring either complicated results, or lots and lots of simple results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Richard Hollerith</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421737</link>
		<dc:creator>Richard Hollerith</dc:creator>
		<pubDate>Fri, 23 Feb 2007 18:15:03 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421737</guid>
		<description>So, having scientists generate scientific models is prone to severe biases.  Has anyone tried to remove those biases by having a computer program pick and refine the models according to the algorithm first published by Solomonoff?  Scientists working in some field would labor long and hard to put most of the relevant knowledge about the field into a formal model, then spend their time feeding empirical data into the computer program that refines the model and interpreting the model for engineers, policy makers and other &quot;consumers&quot; of scientific knowledge.

Economics would be a good field to try this with because of the difficulty of acquiring new empirical data (leading economists to rely  heavily on retrodictions, e.g., of stock market data going back to the start of stock markets) and the strong tendency of humans to have prejudices about economics and to form factions of economic belief.  Then again, economics would be a bad field to try this in because the economic realm is affected by a vast tangle of complex causal chains; e.g., the very complex realm of human psychology strongly affects the economic realm.  Thus the cost (in researcher&#039;s time) of creating a formal model of economics that includes most of the relevant knowledge that economists have about economics would be sky high.

Climate science is another area that necessarily relies heavily on retrodictions and is prone to factionalization, but is not nearly as complex as economics (but might still be too complex to codify as a single formal model).

I would appreciate people sharing their impressions on the desirability and feasibility of taking humans out of the model-picking and model-refinement loop in selected fields, e.g., climate science.
</description>
		<content:encoded><![CDATA[<p>So, having scientists generate scientific models is prone to severe biases.  Has anyone tried to remove those biases by having a computer program pick and refine the models according to the algorithm first published by Solomonoff?  Scientists working in some field would labor long and hard to put most of the relevant knowledge about the field into a formal model, then spend their time feeding empirical data into the computer program that refines the model and interpreting the model for engineers, policy makers and other &#8220;consumers&#8221; of scientific knowledge.</p>
<p>Economics would be a good field to try this with because of the difficulty of acquiring new empirical data (leading economists to rely  heavily on retrodictions, e.g., of stock market data going back to the start of stock markets) and the strong tendency of humans to have prejudices about economics and to form factions of economic belief.  Then again, economics would be a bad field to try this in because the economic realm is affected by a vast tangle of complex causal chains; e.g., the very complex realm of human psychology strongly affects the economic realm.  Thus the cost (in researcher&#8217;s time) of creating a formal model of economics that includes most of the relevant knowledge that economists have about economics would be sky high.</p>
<p>Climate science is another area that necessarily relies heavily on retrodictions and is prone to factionalization, but is not nearly as complex as economics (but might still be too complex to codify as a single formal model).</p>
<p>I would appreciate people sharing their impressions on the desirability and feasibility of taking humans out of the model-picking and model-refinement loop in selected fields, e.g., climate science.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Hal Finney</title>
		<link>http://www.overcomingbias.com/2007/02/fig_leaf_models.html#comment-421736</link>
		<dc:creator>Hal Finney</dc:creator>
		<pubDate>Fri, 23 Feb 2007 17:49:15 +0000</pubDate>
		<guid isPermaLink="false">http://prod.ob.trike.com.au/2007/02/fig-leaf-models.html#comment-421736</guid>
		<description>I think the problem with complex models, as I said before, is that most &quot;predictions&quot; are actually retrodictions.

If a model is making truly accurate predictions, and they are informative and useful, then you shouldn&#039;t care how many parameters it has. If someone can consistently predict tomorrow&#039;s stock prices today, you don&#039;t care how many parameters he is using. His model is good, by virtue of its predictive success in a difficult domain.
</description>
		<content:encoded><![CDATA[<p>I think the problem with complex models, as I said before, is that most &#8220;predictions&#8221; are actually retrodictions.</p>
<p>If a model is making truly accurate predictions, and they are informative and useful, then you shouldn&#8217;t care how many parameters it has. If someone can consistently predict tomorrow&#8217;s stock prices today, you don&#8217;t care how many parameters he is using. His model is good, by virtue of its predictive success in a difficult domain.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using disk (enhanced)
Database Caching using disk
Object Caching 442/459 objects using disk
Content Delivery Network via Amazon Web Services: S3: overcomingbias-assets.s3.amazonaws.com

Served from: www.overcomingbias.com @ 2012-02-11 18:49:34 -->
