Sunday, November 19, 2006

I Think I'm a Radical Baysean

To learn something that isn't math, I've been picking up the readings from some of my friends' classes if they look interesting. This last week I read a pair of articles by Kahneman and Tversky about flaws in heuristic reasoning and an objection by the frequentist Gigerenzer.

K&T illustrate one of the flaws of heuristic reasoning by asking people to evaluate the probability that a particular woman is a librarian or a librarian and feminist, given a description of her that fits the stereotype of a feminist. Many people say the latter is more probable, when in fact the probability of any subset (that someone is both A and B) must be less than the probability of the superset (that someone is A). They call this the "representativeness" error -- that is, that if a particular event is very representative of a particular class, then people overestimate its likelihood. An example of the same effect is that people consider the sequence of flips of a fair coin "H-T-H-H-T" more likely than the sequence "H-H-H-H-H" because the first evokes randomness more, even though we know that any particular sequence must have equal probability.

Gigerenzer objects to this in a couple of ways. One is that people, when asked to evaluate two possibilities, A or A and B, they naturally assume that the asker actually means "A and not B" or "A and B", since otherwise it's kind of a stupid question. This makes sense to me. His second objection is the one I really want to talk about, though, and that is that it doesn't make sense to ascribe probability to a single, non-repeatable event.

In case you don't know, like I didn't before I came here, there are two different schools of thought concerning probability: the Frequentists and the Bayseans. The former believe, like Gigerenzer, that in order for an event to have meaningful probability, it must be repeatable, and that the probability is interpreted as the limit of the frequency of occurances as the number of samples goes to infinity. Bayseans allow one to speak about the probability even of events that occur only once, such as a particular team winning the twentieth Superbowl, and interpret probability as a measure of certainty concerning a particular outcome.

I take great objection to the Frequentist point of view, not only in cases where we are evaluating the probability of a single event, but also when the measured outcome is "repeatable". For example, take the flipping of a coin. The fact that when I flip a coin, we reasonably describe the probability of it coming up heads has nothing to do with the fact that many coins have been flipped before, or the possibility of flipping the coin many more times. It has to do with the symmetry between the two sides, the information loss introduced by the flipping action (negating the asymmetry of the coin starting either heads up or heads down), and a reasonable physical model of the world that suggests that symmetric outcomes should happen with equal likelihood. Were this theory in doubt, a coin has added benefit that we can flip it many times and get a more rigorous estimate of the likelihood of it coming up heads through the reasonable assumptions of statistical inference, but this possibility can't reasonably be said to inform our estimate of the probability of heads. The first coin that was ever flipped, long before statistics textbooks, had just as much of a fifty-fifty chance of heads and tails as any coin today. (Ignoring recent improvements in the manufacture of symmetric coins, I suppose.)

In fact, the moment the coin has left your thumb, the outcome is as determined as the profession of an unknown person. Depending on what your ideas about free will are, you could say it is determined even before that. We describe its outcome in probabilistic terms because of a lack of information, not because of its similarity to possible future situations. Looking at it this way, there's no basic philosophical difference between a coin flip and evaluating the probability that a single person has a particular profession -- we are just able, through simplifying physical assumptions and statistical inference, to make a much better estimate of the probability of the latter knowing very little about the particular event.

Statistics and probability are primarily concerned with trying to make inferences about situations about which there is insufficient information. The notion of "probability" is simply a convenient way of thinking about deterministic and determined events that we cannot observe completely. When we say an event is "repeatable", we simply mean that a series of events in time are similar enough with respect to the variables we can observe and think are salient to its outcome. This allows us to use convenient mathematical models and make accurate predictions about the future. Random events have a continuum of repeatability, but they all have one thing in common, whether they happen once or many times: we don't know enough about them to predict their outcomes, and this is what allows us to speak of their probability.

I think this makes me a radical Baysean, a label that I think sounds pretty cool. (Ever since my friend Fred described himself as a "radical agronomist" this summer, I think I've wanted to be a radical something.) Of course, my understanding of this issue is still pretty shallow, and probably biased by the fact that I've now read heavyweight Bayseans (Kahneman won the Nobel prize in economics) and a less prestigious Frequentist. I'm sure you're all sitting on the edge of your seats, wondering how this will all turn out.

5 Comments:

At 8:56 PM, Mike Rolig said...

This all goes to show how academics can fall into the trap of arrogance - showing how much smarter they are with their "logic" than "normal" people are without it.

This is right up there with the asinine people who correct you're usage of "who" instead of "whom" -- grammar is an empirical science, not a judging criteria to write-off "uneducated" people. If you understand me well enough to give me the "correct" word, there's no need for your stupid extra "m"! Language evolves, get over it!

 
At 8:15 AM, Melissa said...

I'm one of those people that can't wrap my mind around stating the probability of unrepeatable events. What are the implications of stating a probability that cannot be tested?

The same can be said for weather predictions as for football games. Given circumstances X, Y, and Z, we can say that we think A will happen N% of the time and A will not happen 100-N% of the time. But, with a world so complicated with so many different variables (El nino, global warming, how much beer Manning drank the night before, and wind speed at the time of a field goal kick), there are probably never going to be the same conditions repeated ever. There will be similarities in conditions that help a probabilist make a calculated guess. With a coin flip, roll of the dice, spin of the wheel, there are a much more limited number of influences on the result. You could eliminate many of them by placing the subject in an environment void of air pressure and test your probability.

Is there a difference between probability (in my eyes revered as simply a matter of adding up numbers) and a calculated hypothesis pertaining to unrepeatable events? You and Bayes seem to say not. But how much legitimacy can be placed in an estimate if you can't get events to repeat themselves? This is only to say that I think probability estimates of repeatable events are slightly more legitimate than unrepeatable ones.

 
At 9:39 AM, thegio said...

Obviously there is less legitimacy in something that cannot be repeated, in the sense that our estimation of the probability is less accurate. However, I don't see a fundamental difference between the repeatable and non-repeatable events, only their predictability.

Also, remember that there is a continuum of repeatability. For example, consumer survey data (to predict the probability that a given person will like a new brand of chips, say), is less repeatable than a single event but more repeatable than a coin flip. Is there some magical cutoff point where probability stops making sense at all? No, only diminishing ability on our part to make accurate statistical predictions.

 
At 3:50 PM, Jon said...

What do you mean by "repeatable event?" Maybe everything that happens just happens once and you can't re-do that event no matter what. You can try to approximate, simulate or recreate, but it still isn't the same event. Hell, I've even seen a re-enactment of the civil war, and let me tell you, it wasn’t very convincing. I've heard a comedian discuss his plans to orchestrate a re-enactment of the same war which only involves prostitutes (the problem is finding enough confederate hotpants). So, how is flipping a coin any more or less repeatable than any other event. You can't re-flip, you can only flip again and you still don't know what the outcome of that flip will be. Sure, you can conceive of a set of outcomes and when you one is observed you say: "Oh yeah, I'm not surprised to see heads at all because I knew that it could be heads about half the time." If you're lucky enough to observe a really unlikely event, like the coin standing on edge you will undoubtedly relish in your lucky joy to see something so rare. Sometimes an event is observed which wasn’t in the original set of outcomes – this can be very exciting, or depressing depending on the reason the outcome wasn’t considered possible.

I claim that there are no repeatable events.

In regards to the comment ‘Language evolves, get over it!’ I have this to say:

I refuse to get over it and likely won't. Explanations which involve the non-word "expecially" don’t heighten my understanding of a concept; they highlight the explainer’s dumbness. Also, they are negatively correlated with my attention span. I also refuse to say "blog" and always insist on saying "web log" in a deliberate and obnoxious fashion. I’ve found that the best way to accentuate my non-conformity is to pause slightly and then announce “web log” in a louder voice. This helps, just in case someone wasn’t sure what I was doing.

If you’ve noticed that I referred to a negative correlation, understand that correlation requires an expectation of a random variable to be calculated, and that this is inconsistent with my prior claim that there are no repeatable events, then you ought to pat yourself on the back for being one smart (and possibly delicious) cookie. You’re also smarter than I am because I’ve been working on this remark for far too long (and it is too long) and have just discovered the problem. I tried to repackage it in a way that makes me look really knowledgeable, but I’m pretty sure that it’s not going to work.

 
At 3:23 PM, thegio said...

I think Jon it it on the head with respect to the repeatability of events. "Repeatable" events are simply events that have enough in common with one another that we treat them as multiple manifestations of the same thing. As such, there is a continuum of "repeatability". Now, repeatability is a very, very powerful idea in statistics, and in all science for that matter. But the notion of "probability" should be applicable to all uncertain events, not just those who fit the axioms necessary for certain kinds of statistical analysis. In particular, -all- probabilities, whether they refer to repeatable or non-repeatable events, should obey the law of subsets (the probability of a proper subset is less than that of the superset).

As far as language changing, I couldn't agree less. Expression an communication are paramount. Rigid grammar facilitates that sometimes, but often inhibits it as well.

And let me add that sample correlation is a perfectly acceptable notion even if you deny the existence of any underlying probability.

In conclusion, I am indeed a delicious cookie.

 

Post a Comment

Links to this post:

Create a Link

<< Home