<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: When Is A Small Sample Really A Small Sample?</title>
	<atom:link href="http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/feed/" rel="self" type="application/rss+xml" />
	<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/</link>
	<description>Get Marooned</description>
	<lastBuildDate>Fri, 23 Mar 2012 20:15:02 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: chadrem</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18749</link>
		<dc:creator><![CDATA[chadrem]]></dc:creator>
		<pubDate>Wed, 08 Oct 2008 14:33:18 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18749</guid>
		<description><![CDATA[Matt, good work writing that code.]]></description>
		<content:encoded><![CDATA[<p>Matt, good work writing that code.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shalmanese</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18748</link>
		<dc:creator><![CDATA[Shalmanese]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 21:25:36 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18748</guid>
		<description><![CDATA[People whose macbooks have died are more likely to tell you the status of their macbook than those who haven&#039;t.]]></description>
		<content:encoded><![CDATA[<p>People whose macbooks have died are more likely to tell you the status of their macbook than those who haven&#39;t.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mattmaroon</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18747</link>
		<dc:creator><![CDATA[mattmaroon]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 18:48:18 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18747</guid>
		<description><![CDATA[Hmm, the probability that 3 out of 4 Macbooks would fail for anyone, with a normal laptop failure rate of, let&#039;s say, 5%, is probably still pretty small. I don&#039;t think they sell that much.]]></description>
		<content:encoded><![CDATA[<p>Hmm, the probability that 3 out of 4 Macbooks would fail for anyone, with a normal laptop failure rate of, let&#39;s say, 5%, is probably still pretty small. I don&#39;t think they sell that much.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mattmaroon</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18746</link>
		<dc:creator><![CDATA[mattmaroon]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 18:06:39 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18746</guid>
		<description><![CDATA[They&#039;re the only 4 who I know what happened to. I have lots of friends with macbooks, and I&#039;ve heard lots of complaints, but I have no idea whose went to the shop and whose did not.]]></description>
		<content:encoded><![CDATA[<p>They&#39;re the only 4 who I know what happened to. I have lots of friends with macbooks, and I&#39;ve heard lots of complaints, but I have no idea whose went to the shop and whose did not.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: tom</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18745</link>
		<dc:creator><![CDATA[tom]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 14:12:29 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18745</guid>
		<description><![CDATA[seconded. &lt;br&gt;&lt;br&gt;the fact that you did this analysis *because* you saw a high failure rate invalidates the quality of your sample. &lt;br&gt;&lt;br&gt;to look at it another way: think of how many people there are out there who have 4 friends who own macbooks: lots. now, what is the probability that not one of them knows 4 macbook owners, of which 3 are faulty: very very low. it had to happen to someone...&lt;br&gt;&lt;br&gt;to look at it one more way: what is the probability that a lottery winner is rich? does that mean that everyone is rich?&lt;br&gt;&lt;br&gt;tom saffell]]></description>
		<content:encoded><![CDATA[<p>seconded. </p>
<p>the fact that you did this analysis *because* you saw a high failure rate invalidates the quality of your sample. </p>
<p>to look at it another way: think of how many people there are out there who have 4 friends who own macbooks: lots. now, what is the probability that not one of them knows 4 macbook owners, of which 3 are faulty: very very low. it had to happen to someone&#8230;</p>
<p>to look at it one more way: what is the probability that a lottery winner is rich? does that mean that everyone is rich?</p>
<p>tom saffell</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shalmanese</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18744</link>
		<dc:creator><![CDATA[Shalmanese]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 09:00:01 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18744</guid>
		<description><![CDATA[Do you really only have 4 friends with macbooks? Or are those simply the most memorable 4 precisely because something notable happened to them.]]></description>
		<content:encoded><![CDATA[<p>Do you really only have 4 friends with macbooks? Or are those simply the most memorable 4 precisely because something notable happened to them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mattmaroon</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18743</link>
		<dc:creator><![CDATA[mattmaroon]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 05:50:09 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18743</guid>
		<description><![CDATA[What about if I google for &quot;broken Macbook&quot; and the top 8 links all involve one? :)]]></description>
		<content:encoded><![CDATA[<p>What about if I google for &#8220;broken Macbook&#8221; and the top 8 links all involve one? <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gareth Allen</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18742</link>
		<dc:creator><![CDATA[Gareth Allen]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 05:47:21 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18742</guid>
		<description><![CDATA[If you were talking about Macbooks and THEN realized that 3/4 of the ones you knew of were broken, I think that makes it alright (which sounds like the case for you). But if you only discussed it because you had this evidence, then its not a random sample, especially on an online forum, since 1. perhaps 50 people were reading, and only the few that had data like yours chime in, 2. you read discussions about many products and might not chime in unless you had data like this.&lt;br&gt;&lt;br&gt;However, if you posted here every time you had data like this about a product, we couldn&#039;t infer much from it as readers (and certainly not to 3.7%), since you have knowledge of a lot of products. It&#039;s sort of a survivorship bias.]]></description>
		<content:encoded><![CDATA[<p>If you were talking about Macbooks and THEN realized that 3/4 of the ones you knew of were broken, I think that makes it alright (which sounds like the case for you). But if you only discussed it because you had this evidence, then its not a random sample, especially on an online forum, since 1. perhaps 50 people were reading, and only the few that had data like yours chime in, 2. you read discussions about many products and might not chime in unless you had data like this.</p>
<p>However, if you posted here every time you had data like this about a product, we couldn&#39;t infer much from it as readers (and certainly not to 3.7%), since you have knowledge of a lot of products. It&#39;s sort of a survivorship bias.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: mattmaroon</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18741</link>
		<dc:creator><![CDATA[mattmaroon]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 02:10:12 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18741</guid>
		<description><![CDATA[The probability of Aces getting cracked 10 out of 10 times is so close to 0 that if it happened to me at a site, I&#039;d probably stop playing there. That&#039;s not the same as 10 in a row over a large number of pocket aces though, since a sample of X pocket aces has X-9 consecutive groups of 10 contained in it. 100 pocket aces has 91 groups of 10 (1-10, 2-11...91-100). That&#039;s what&#039;s happening when people cry foul on poker sites (and even then, their standards for calling shenanigans are usually far lower than 10 aces in a row getting cracked). I probably saw something like 2000 pocket aces a year. I&#039;m too lazy to figure out what the odds were that 10 in a row got cracked in there are, but it&#039;s probably substantial. Especially when, as you say, I include kings, sets on the flop, etc., as those people do.&lt;br&gt;&lt;br&gt;My Apple sample is akin to the 10/10 though. I wasn&#039;t looking for product failures, we were talking about Macbooks being bricked, and I realized that I only knew the status of 4 of them definitively, and 3 of them had needed repairs shortly after purchase. If I put the purchases of hundreds or thousands of them in chronological order, then found 4 in a row that had failed, it wouldn&#039;t mean much (like the Aces example). If I have a sample of only 4, and 3 failed, that means quite a bit more.]]></description>
		<content:encoded><![CDATA[<p>The probability of Aces getting cracked 10 out of 10 times is so close to 0 that if it happened to me at a site, I&#39;d probably stop playing there. That&#39;s not the same as 10 in a row over a large number of pocket aces though, since a sample of X pocket aces has X-9 consecutive groups of 10 contained in it. 100 pocket aces has 91 groups of 10 (1-10, 2-11&#8230;91-100). That&#39;s what&#39;s happening when people cry foul on poker sites (and even then, their standards for calling shenanigans are usually far lower than 10 aces in a row getting cracked). I probably saw something like 2000 pocket aces a year. I&#39;m too lazy to figure out what the odds were that 10 in a row got cracked in there are, but it&#39;s probably substantial. Especially when, as you say, I include kings, sets on the flop, etc., as those people do.</p>
<p>My Apple sample is akin to the 10/10 though. I wasn&#39;t looking for product failures, we were talking about Macbooks being bricked, and I realized that I only knew the status of 4 of them definitively, and 3 of them had needed repairs shortly after purchase. If I put the purchases of hundreds or thousands of them in chronological order, then found 4 in a row that had failed, it wouldn&#39;t mean much (like the Aces example). If I have a sample of only 4, and 3 failed, that means quite a bit more.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gareth Allen</title>
		<link>http://mattmaroon.com/2008/10/06/when-is-a-small-sample-really-a-small-sample/#comment-18740</link>
		<dc:creator><![CDATA[Gareth Allen]]></dc:creator>
		<pubDate>Tue, 07 Oct 2008 01:32:38 +0000</pubDate>
		<guid isPermaLink="false">http://mattmaroon.com/?p=520#comment-18740</guid>
		<description><![CDATA[While your analysis is technically correct, you are making a fundamental mistake. Your decision to post and to run the analysis was based on the fact that you already had evidence that Macbooks suck. It&#039;s essentially the same as an individual claiming that a poker site is rigged because he lost with aces ten times in a row. While it may be true, and the chances are abysmally small, it fails to look at the true &#039;space&#039; of outcomes that would result in a post. For instance, losing with kings ten times in a row, losing with sets ten times in a row, etc (now multiply each space by the number of individuals who obsessively look for such anomalies). In your case, you would probably post somewhere if a high proportion of any product your friends used failed. So if the failure rate for ALL products is 10%, and 20 products fit the category (owned by at least 4 friends) the chance you will post here against at least one product is something like 1-.963^20 = 53%.&lt;br&gt;&lt;br&gt;This is a very common fallacy in statistical analysis, and one reason its tough to trust most social sciences&#039; studies. They find a large dataset, throw a few hundred variables together, and see what ends up as significant to a 5% alpha level. By definition, 5% of the variables will be significant even if there is absolutely no relation, so it is easy to find relationships that are significant, simplify the model, and then claim to have found some truth.&lt;br&gt;&lt;br&gt;Macbooks probably do suck, but if you want a more random sample, call someone up out of the blue and ask them about their friend&#039;s Macbooks. If 3/4 of them have had problems, you can use Baye&#039;s Theorem then.]]></description>
		<content:encoded><![CDATA[<p>While your analysis is technically correct, you are making a fundamental mistake. Your decision to post and to run the analysis was based on the fact that you already had evidence that Macbooks suck. It&#39;s essentially the same as an individual claiming that a poker site is rigged because he lost with aces ten times in a row. While it may be true, and the chances are abysmally small, it fails to look at the true &#39;space&#39; of outcomes that would result in a post. For instance, losing with kings ten times in a row, losing with sets ten times in a row, etc (now multiply each space by the number of individuals who obsessively look for such anomalies). In your case, you would probably post somewhere if a high proportion of any product your friends used failed. So if the failure rate for ALL products is 10%, and 20 products fit the category (owned by at least 4 friends) the chance you will post here against at least one product is something like 1-.963^20 = 53%.</p>
<p>This is a very common fallacy in statistical analysis, and one reason its tough to trust most social sciences&#39; studies. They find a large dataset, throw a few hundred variables together, and see what ends up as significant to a 5% alpha level. By definition, 5% of the variables will be significant even if there is absolutely no relation, so it is easy to find relationships that are significant, simplify the model, and then claim to have found some truth.</p>
<p>Macbooks probably do suck, but if you want a more random sample, call someone up out of the blue and ask them about their friend&#39;s Macbooks. If 3/4 of them have had problems, you can use Baye&#39;s Theorem then.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

