On Polling Models, Skewed & Unskewed

Great article below linking this polling data / Presidential forecasting and Moneyball baseball statistics-driven analysis. This is one of the reasons I am drawn to it like a moth to a flame. And why I'm posting it on a baseball-oriented blog.

That and I do think this is the most important Presidential election of all-time (even though we seem to say that all the time now) and some things are more important than baseball.

There I said it out loud. Are you happy?

Anyway, on to the stats.

Put me down officially for:
Romney 50.8%
Obama 48.2%
Other 1%

On the Electoral College front, I'm at 291-247 Romney, with MI, PA and NV staying with Obama despite a late push. If any of those flip to Romney, it could be over early. I have WI and OH joining CO, FL, VA, NH and IA on the Romney side. That may be a bit optimistic, but....I call them as I see them.

If the Dems are right and the enthusiasm for Obama has built back up to almost 2008 levels and the Republicans are as (un) enthusiastic for Romney as they appeared to be for McCain -- I can get to 50-49% Obama.

That's assuming the D-R-I (Dems - Repubs-Indies) breakdown reverts to 2008 levels of 37.5% D - 34.0% R and 28.5% I with the traditional 91.5% party hold rate -- Dems vote for Dems 91.5% and cross-over at 8.5% rate and same for Repubs. In 2008, Obama held Dems 93-6% while McCain held Repubs 95-4%, they just didn't get off the couch in numbers like the D's did for Obama.

That's the hardest factor to quantify in advance IMO, the "off the couch", enthusiasm, intensity rate. It's well known in hindsight, but hard to predict in advance.

The 2010 elections changed the D-I-R composition to 31-29-40 according to Gallup and the enthusiasm shifted to the Repubs a bit in the mid-terms. I don't think the hold rate shifts more than 93% R and 90% D according to IBD/Tipp polling which breaks down the "enthusiasm" somewhat. The Independents appear to be 55% - 45% in favor of Romney. He needs that number to hold to have any chance.

The best case for Obama:

( 2008-10 model ) D=37.5 R=34.0 I=28.5 Hold Rate 91.5% I's split=55-45

Romney Total using 2008 model=(34*0.915)+(37.5*0.085)+(28.5*0.55) or 50.03%

Obama Total using the 2008 model=(37.5*0.915)+(34*0.085)+(28.5*0.45) or 49.97%

The best case for Romney:

( 2012 model ) D = 32 R = 28 I = 38+2 (Gallup 2012) Hold Rate R=90 D=85, I's split=55-45

Here Romney gets to 52.00%
Obama gets to 48.00%

Using a probability grid of best case / worst case possibilities:

25% R=52% O= 47%
40% R=51% O=48%
25% R=50% O=49%
10% R=49% O=50%

gets me to my final answer of 50.8% - 48.2%. The possibility remains for Romney to pull an AlGore and win the popular vote, but lose the Electoral College. As a Romney lead grows above 1.5% - 2.) the odds of that diminish.

I don't see Other getting over 1.0%. Sorry to all you Ron Paul and Mickey Mouse fans out there.

On the comparable election front, I get the comparison between this one and Reagan -Carter, I think it's Romney's best-case and wildest dream but unfortunately -- Romney is not quite Reagan and Obama is not suite considered as bad as Carter. So that one is fading.

The Dems seem to be hoping a 2004 replay is in order with Obama as Bush (the incumbent hold) and Romney as John Kerrey -- the Richie Rich comparison. Bush held on barely and Kerrey was history. Aside from the irony of being linked to Bush in any way, much less for political survival, this one also falls a bit short on the comp front even if it is fairly comparable on the statistical front. Mainly approval ratings which translate well to final vote. Obama made a late push to 50% in approvals, If he gets close to 50% popular vote, he wins the Electoral College.

The comp that makes the most case in my mind is 2000, with Obama playing the role of AlGore, the quasi-incumbent, carrying the Clinton mantle of success versus the national neophyte George Bush. Gore actually had better sales material to close the deal and still lost. Obama ironically -- since he touts the term Romnesia -- has to hope the electorate gets a healthy does of Obama-nesia and forget the last four years even happened, bringing him closer to the relative Shangri-La of 2008.

We'll know soon.

from RedState.com
On Polling Models, Skewed & Unskewed | RedState:

"I haven’t pulled apart all the pieces of Chambers’ model, but my main objection to UnskewedPolls is that it re-weights the electorate twice:

The QStarNews poll works with the premise that the partisan makeup of the electorate 34.8 percent Republicans, 35.2 percent Democrats and 30.0 percent independent voters. Additionally, our model is based on the electorate including approximately 41.0 percent conservatives, 20.0 percent moderates and 39.0 percent liberals.

Republicans are 89 percent conservative, 9 percent moderate and 2 percent liberal. Among Democrats, 3 percent are conservative, 23 percent are moderate and 74 percent are liberal. Independents include 33 percent conservatives, 49 percent moderates and 18 percent liberals.
Our polls are doubly-weighted, to doubly insure the results are most accurate and not skewed, by both party identification and self-identified ideology. For instance, no matter how many Republicans answer our survey, they are weighted at 34.8 percent. If conservatives are over-represented among Republicans in the raw sample, they are still weighted at 89 percent of Republicans regardless."

'via Blog this'

Moneyball and PECOTA’s World

Let me use an analogy from baseball statistics, which I think is appropriate here because it’s where both I and Nate Silver first learned to read statistics critically and first got an audience on the internet: in terms of their predictive power, poll toplines are like pitcher win-loss records or batter RBI.

At a very general level, the job of a baseball batter is to make runs score, and the job of a baseball pitcher is to win games, so traditionally people looked at W-L records and RBI as evidence of who was good at their jobs. And it’s true that any group of pitchers with really good W-L record will, on average, be better than a group with bad ones; any group of batters with a lot of RBI will, on average, be better than a group with very few RBI. If you built a model around those numbers, you’d be right more often than you’d be wrong.

But wins and RBI are not skills; they are the byproducts of other skills (striking people out, hitting home runs, etc.) combined with opportunities: you can’t drive in runners who aren’t on base, and you can’t win games if your team doesn’t score runs. If you build your team around acquiring guys who get a lot of RBI and wins, you may end up making an awful lot of mistakes. Similarly, you can’t win the votes of people who don’t come to the polls.

Baseball analysis has come a long way in recent decades, because baseball is a closed system: nearly everything is recorded and quantified, so statistical analysis is less likely to founder on hidden, uncounted variables. Yet, even highly sophisticated baseball models can still make mistakes if they rest on mistaken assumptions. Baseball Prospectus.com’s PECOTA player projection system – designed by Nate Silver and his colleagues at BP – is one of the best state-of-the-art systems in the business. But one of PECOTA’s more recent, well-known failures presents an object lesson. In 2009, PECOTA projected rookie Orioles catcher Matt Wieters to hit .311/.395/.546 (batting/on base percentage/slugging). As regular consumers of PECOTA know, this is just a probabilistic projection of his most likely performance, and the actual projection provided a range of possible outcomes. But the projection clearly was wrong, and not just unsuccessful. While Wieters has developed into a good player, nothing in his major league performance since has justfied such optimism: Wieters hit .288/.340/.412 as a rookie, and .260/.328/.421 over his first four major league seasons. What went wrong? Wieters had batted .355/.454/.600 between AA and A ball in 2008, and systems like PECOTA are supposed to adjust those numbers downward for the difference in the level of competition between A ball, AA ball and the major leagues. But as Colin Wyers noted at the time, the problem was that the context adjustments used by PECOTA that season used an unusually generous translation, assuming that the two leagues Wieters had played in – the Eastern League and the Carolina League – were much more competitive in 2008 than they had been in previous years. By getting the baseline of the 2008 environment Wieters played in wrong, PECOTA got the projection wrong, a projection that was out of step with what other models were much more realistically projecting at the time. The sophistication of the PECOTA system was no match for two bad inputs in the historical data.

My point is not to beat up on PECOTA, which as I said is a fantastic system and much better than anything I could design. Let’s consider for a further example one of PECOTA’s most notable successes, one where I questioned Nate Silver at the time and was wrong; I think it also illustrates the differing approaches at work here. In 2008, PECOTA projected the Tampa Bay Rays to win 88-89 games, a projection that Nate Silver touted in a widely-read Sports Illustrated article. It was a daring projection, seeing as the Rays had lost 95 or more games three years running and never won more than 70 games in franchise history. As Silver wrote, “[i]t’s in the field…that the Rays will make their biggest gains…the Rays’ defense projects to be 10 runs above average this year, an 82-run improvement.” I wrote at the time: “this is nuts. Last season, Tampa allowed 944 runs (5.83 per game), the highest in the majors by a margin of more than 50 runs. This season, BP is projecting them to allow 713 runs (4.40 per game), the lowest in the AL, third-lowest in the majors…and a 32% reduction from last season…it’s an incredibly ambitious goal.”

PECOTA was right, and if anything was too conservative. The Rays won 97 games and went to the World Series, without any improvement by their offense, almost entirely on the strength of an improved defense. I later calculated that their one-year defensive improvement was the largest since 1878. Looking at history and common sense, I was right that PECOTA was projecting an event nearly unprecedented in the history of the game, and I would raise the same objection again. But the model was right in seeing it coming.

If you looked closely, you could see why: the frontiers of statistical analysis had shifted. Michael Lewis’ book Moneyball, following the 2002 Oakland A’s, captured the era when statistical analysts stressed hitting and de-emphasized fielding on the theory that it was easier to use sophisticated metrics to find better hitters, but harder to quantify the benefits of defense. By 2008, the metrics were creating more opportunities to study defense, and – as captured in Jonah Keri’s book The Extra 2% (about the building of that Rays team) – the Rays took advantage.

But for the Rays, the 2008 environment was not so easily repeated in subsequent years. While still a successful club with a solid defense in a pitcher’s park (and still far better defensively than in 2007) they have led the league in “Defensive Efficiency Rating” only once in the past four years. It’s what Bill James called the Law of Competitive Balance: unsuccessful teams adapt more quickly to imitate the successes of the successful teams, bringing both sides closer to parity. Trende, in his book The Lost Majority, applies the same essential lesson to political coalitions. Assuming that the 2008 turnout models, which depended heavily on unusually low Republican turnout, still apply to Obama’s current campaign ignores the extent to which multiple factors favor a balance swinging back to the Republicans. And the polls that make up the averages – averages upon which Nate Silver’s model rests – are doing just that. Nate’s model might well work in an election where the relationship between the internals and the toplines was unchanged from 2008. But because that assumption is an unreasonable one, yet almost by definition not subject to question in his model, the model is delivering a conclusion at odds with current, observable political reality.

The Slav's Baseball Blog - BASEBALL 24-7-365

Pages

Tuesday, November 06, 2012

On Polling Models, Skewed & Unskewed

No comments:

Post a Comment