Comments on Advanced Football Analytics (formerly Advanced NFL Stats): NFL Win Prediction Methodology
Feed updated 2018-06-02T14:19:34.554-04:00

Comment, 2014-02-05T01:12:47.965-05:00
hi Brian,

good stuff here… but i was just looking at your 4th down calculator and i'm wondering one thing…

how can you predict win percentages etc without entering timeouts??

end of game scenarios, impact of going for it, would be hugely affected by timeouts..

right?

Posted by tito puente

Comment, 2012-10-02T09:43:15.068-04:00
Python,

The calculator uses the 35-yard line as the cutoff between a FG attempt and a punt on a 4th down. It's not 100% realistic, I realize, but that's the explanation.

Posted by Brian Burke

Comment, 2012-10-01T21:58:18.605-04:00
Brian,

Love site, go here all the time, tell my friends about it, etc.

Was looking at WP Calculator. Used the following inputs:

Score Diff: -1 (team with ball trailing by 1)
Time Left: 4 mins in 4th Qtr
Field Position: 36 yard line of opposition
2nd Down and 10 to go.

This gives a WP of .49.

If you change Field position to 35 yard line of opposition, WP changes to .59.

Seems like an extreme swing for 1 yard. Was wondering if I was missing something. Please explain how 1 yard changes 10% of games with 4 minutes left.

Posted by Python

Comment, 2011-11-29T14:19:13.023-05:00
What are the odds that all teams lose?

Posted by Anonymous

Comment, 2011-10-22T01:02:44.032-04:00
Brian, looking over the week 7 predictions (for 2011), I was surprised not just by Dallas and SF, teams whose records suggest different placings, but by the bottom. I find it hard to imagine Seattle losing at a neutral site to Indy, or that Tampa is worse than Saint Louis.

That led me to read your methodology. I can't criticize it: I don't have the statistical or historical football knowledge to offer a substantive challenge, but I am curious why points and wins (or wins vs adj. opponent quality) aren't included.

With SF this year, their efficiencies might look poor, but their point differential is third. While the blow-out win against Tampa skews that, I would expect that consistently winning by decent margins would be an indicator of a good team. Likewise, Dallas has a losing record and a negative point differential, not qualities that suggest a good team.
Saint Louis hasn't won a game, is -88 points and still ranked above teams with winning records and not entirely pathetic point differentials.

Have you written about why you don't include wins or points in your model (or by not reading the detailed model, did I miss that)? If not, would you address it some time?

Posted by Anonymous

Comment, 2011-02-02T12:08:59.207-05:00
I know you posted this article some time ago, but would you lend some insight as to how you generate the in-game win probabilities? How do you take into account the time left? What does the dataset look like before you fit the model? Many thanks in advance! This stuff is great!

Posted by Datamonkey3

Comment, 2009-10-29T15:06:42.346-04:00
Your approach is very interesting. As a statistician I think it's neat. I have been working on a similar type of statistical model that predicts teams' winning percentages and also winning margins against the point spreads. The variables I use are slightly different from yours, however. I am particularly interested in beating point spreads which my model has done 55% of the time between 2004 and 2008. This year (2009) so far my model has beaten the point spreads 62% of the time.
You can check out my website at: www.nflforecasts.com

Posted by Steve - The NFL Modelman (http://www.nflforecasts.com)

Comment, 2008-11-02T20:08:00.000-05:00
Dan- Here (http://www.advancednflstats.com/2008/03/singal-vs-noise-in-football-stats.html) is the article you're looking for.

Posted by Brian Burke

Comment, 2008-11-02T19:42:00.000-05:00
Hi Brian;
I just have a question regarding how you tested your chosen statistics (above) for your model. I remember in one of your other posts you outlined this. I believe you collected each statistic through eight games then measured how well they predicted (the next eight games).
Could you remind me how this is done (or direct me to your past post)?
Did you run a correlation to 2nd half wins?
or points?
thx dan

Posted by Mr.Ceraldi

Comment, 2008-10-21T18:19:00.000-04:00
what would the chart look like for the buffalo-houston, 1992 AFC wildcard game; 35-3 with a 41-38 final?

Posted by Anonymous

Comment, 2008-09-03T16:03:00.000-04:00
So now that you know the stabilized value for Arizona's opponents' generic average win probability and its corresponding logit value, you can now include that logit value in the equation for their win probability in their upcoming matchup, correct?

Unrelated to all this, I have one more question and I promise this is my last one on this article. In your article titled "Why the Chargers Defense Will Decline in '08" you described how defensive interception rates are more due to luck than any defensive skill. However, you include it as an independent variable in your prediction model. Since you have proven defensive interception rate is no indication of future performance, wouldn't the model be better off without it?

Posted by Brian

Comment, 2008-09-03T14:41:00.000-04:00
Brian-Your equation is sound. I just picked those numbers out of a hat for an example of the process. The math won't add up.

I'll use a real world example. After week 13 last year, ARI's own unadjusted generic win probability was 0.47 (from a logit value of -.14). Their opponent average win probability was 0.45 (a logit value of -.21).

So I added -.21 to -.14 for an adjusted logit value of -.35.
The adjusted win probability then becomes 0.41.

It gets a little trickier, too. Now that I have an adjusted logit value for each team, that changes everyone's average opponent strength. So I iterate the process until the probabilities converge on a stabilized solution.

Ultimately, ARI's generic win probability stabilized at 0.40 for week 13. It dropped from .41 to .40 because their opponents had slightly weaker schedules themselves.

Posted by Brian Burke

Comment, 2008-09-03T13:36:00.000-04:00
Brian,

Thanks for the quick response. Wouldn't a 0.55 win probability correspond to a logit value of 0.2 (not 1.2)? Tell me if this is correct. Set 0.55 = 1/[1+(e^(-z))] and then solve for z. Doing this gets me z = 0.2. You got 1.2. What am I doing wrong?

Posted by Brian

Comment, 2008-08-31T11:48:00.000-04:00
The simple explanation is: find each team's opponents' average generic win probability. Say the Cardinals' is 0.55. That translates into a logit value of 1.2 (or so). This means that Arizona is underrated before accounting for opponent strength. So, in the logit equation that computes the win probability in their next game, I add 1.2 to their estimation. I would do the same thing for every opponent. Say they're playing Seattle and their opponent average win probability is 0.45. This works out to a logit value of 0.8. I'd add 0.8 to the Seahawks' logistic estimate. In all, there's a net 0.4 advantage for Arizona because of opponent considerations.
This works out to, say, a 3% adjustment in win probability in favor of ARI.

Sorry, this doesn't translate well without writing out the equations.

Posted by Brian Burke

Comment, 2008-08-31T02:16:00.000-04:00
Brian,

I have a few questions about how you are adjusting for opponent strength.

First, you said that you calculate each opponent's win probability against an average opponent at a neutral site. Doesn't your model require that the AHome variable equal 0 or 1? How can you model a neutral site?

You said that you average the generic win probability for each opponent together and then include it back into the win model. How does this work? Are you creating a new independent variable for opponent strength?

I'm curious because the "Game Model Coefficients" post does not mention adjustments made for strength of opponent.

Thanks,
Brian

Posted by Brian

Comment, 2008-08-17T16:07:00.000-04:00
Brian,

A friend and I are in the process of developing a regression model for predicting the probability that a team will win an upcoming game. I've looked around the web and have found your site to be one of the most thorough and systematic.

I was hoping you could help answer a few of our questions or direct us to some useful resources. First, I've read your four part series on determining how many wins a team should get over the course of the season. I was wondering if you have a similar set of articles that outline how you determined the coefficients for the week-to-week efficiency model.
Second, compiling all the stats available from the past five seasons is quite an undertaking. I was wondering if there is some site you could direct me to that has the stats already set up in a spreadsheet? That would sure save us a lot of time!

Keep up the good work. Your site's always interesting to read. It's good to know that other people are out there trying to understand football in a systematic way.

Posted by Anonymous

Comment, 2008-07-17T18:05:00.000-04:00
SSR turned out to be ok as a predictor as I recall, but not better than a yardage efficiency model. Just like points for/against it captures a lot of luck and unique game situations. Yes, points for/against captures ALL team data, but it captures even more noise. The finer the resolution in the picture, the clearer it will be.

Posted by Brian Burke

Comment, 2008-07-17T17:36:00.000-04:00
If you're attempting to use SSR as a predictor why not just use points for and against adjusted for strength of opponent? I ask because you say SSR is a simple way of capturing a lot of data about a team...well points for and against is a simple way of capturing ALL of the data about a team.

Posted by Anonymous

Comment, 2007-10-01T19:03:00.000-04:00
I agree for the most part. I'm working on something just as you suggest, but I'm letting the season generate some more data before finalizing it or posting anything.

I'm building a model around series success rates (SSR).
It's the percentage of series in which a team gets a 1st down, or prevents one on defense. The average rate is 65% in the NFL. I would think that each team's offensive and defensive SSR is a very simple, handy method of capturing a lot of data about a team. One way or another it captures run and pass efficiencies, turnovers, sacks, penalties, and coaching tactics.

So far, however, it's not as predictive as efficiency stats.

Some of FO's stuff is really good, but some of it leaps to conclusions after a couple of interesting correlations.

Posted by Brian Burke

Comment, 2007-10-01T18:38:00.000-04:00
Rather than assuming that including something like, say, first downs or TDs will overfit the model, why not test for the predictive power of these statistics?

I sense some disdain for the FO methodology here. It's true they haven't done (or at least they haven't claimed to have done) any rigorous statistical analysis of the factors they consider. But they do claim that all changes to the model are tested by whether they improve the correlation of the statistics from one year to the next, or the correlation of last year's DVOA to next year's wins. They are not testing correlation of this year's stats to this year's wins, which would obviously lead to severe overfit.

Posted by Tarr
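
The logit arithmetic discussed in the 2008-09-03 and 2008-08-31 comments can be checked with a short sketch. It is not the site's model code: the function names `logit`, `inv_logit`, and `odds` are illustrative assumptions, and the only inputs are numbers quoted in the thread (0.55, 0.45, and the ARI logit values -.14 and -.21). It solves 0.55 = 1/[1+(e^(-z))] for z, maps the combined ARI logit of -.35 back to a probability, and also prints the plain odds p/(1-p) for comparison.

```python
import math


def logit(p: float) -> float:
    """Log-odds of a probability p in (0, 1)."""
    return math.log(p / (1.0 - p))


def inv_logit(z: float) -> float:
    """Logistic function: maps a logit value back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def odds(p: float) -> float:
    """Plain odds p / (1 - p), which equals exp(logit(p))."""
    return p / (1.0 - p)


# Solve 0.55 = 1 / (1 + e^(-z)) for z, as asked in the thread.
z_055 = logit(0.55)

# ARI example using the logit values quoted in the thread (not recomputed):
# own logit -0.14 combined with the opponent-average logit -0.21.
ari_adjusted_logit = -0.14 + -0.21
ari_adjusted_wp = inv_logit(ari_adjusted_logit)

print(f"logit(0.55) = {z_055:.2f}")
print(f"adjusted ARI logit = {ari_adjusted_logit:.2f}, win prob = {ari_adjusted_wp:.2f}")
print(f"odds(0.55) = {odds(0.55):.1f}, odds(0.45) = {odds(0.45):.1f}")
```

Run as written, this gives a logit of about 0.20 for a 0.55 win probability and an adjusted ARI win probability of about 0.41, matching the figures in the comments. The plain odds come out near 1.2 and 0.8, which happen to match the "out of a hat" values in the 2008-08-31 comment. That is only an observation about the numbers, and nothing in the thread confirms it.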