tag:blogger.com,1999:blog-7965585998403674176.post7065520173913381599..comments2008-04-27T18:24:55.661-04:00Comments on Vegas Watch: Finding The Happy MediumVegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comBlogger18125tag:blogger.com,1999:blog-7965585998403674176.post-78397930001491271282008-04-27T18:24:00.000-04:002008-04-27T18:24:00.000-04:00Wish I could add something as lucid as those comme...Wish I could add something as lucid as those commenters above, but all I have is a request to keep us updated at the 40 game mark.<BR/><BR/>Great work.Kyle B.http://www.blogger.com/profile/08789056887333531061noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-57274870460823665292008-04-25T14:28:00.000-04:002008-04-25T14:28:00.000-04:00What I would want to do if we incorporated strengt...What I would want to do if we incorporated strength of schedule is take the PECOTA predictions and figure out how difficult each team's schedule has been so far. Then, incorporate it either as a separate variable, or to help figure out how much weight to give to Pythag W%.Vegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-68162770477756047012008-04-25T08:55:00.000-04:002008-04-25T08:55:00.000-04:00If you have the MLB schedule in a database, it is ...If you have the MLB schedule in a database, it is not that hard to implement chuckdickens's approach. But would it really be more predictive? The sample size for a single team v team matchup is vanishingly small, especially for interdivisional (or even interleague) matchups, and hence especially prone to results very deviant from the mean. Would the deviations even out over an entire schedule? Perhaps, but isn't that a tacit admission that the value of Pythag prediction is in the aggregate?skoormithttp://www.blogger.com/profile/14018318250262889801noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-61680953373227378022008-04-24T21:23:00.000-04:002008-04-24T21:23:00.000-04:00"It would be a nightmare to compile all of the nec..."It would be a nightmare to compile all of the necessary data, but I wonder if weighting the pythagorean projections based upon the schedule a given team has already played versus the schedule a given team has yet to play would provide more accurate predictions."<BR/><BR/>I think it probably would be more accurate. I also think it'd probably take hundreds of hours to do that. I'd like to incorporate some kind of schedule component, but have yet to figure out some kind of reasonable way to do it.Vegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-68476479185955995322008-04-24T21:16:00.000-04:002008-04-24T21:16:00.000-04:00Interesting article. It would be a nightmare to co...Interesting article. <BR/><BR/>It would be a nightmare to compile all of the necessary data, but I wonder if weighting the pythagorean projections based upon the schedule a given team has already played versus the schedule a given team has yet to play would provide more accurate predictions. <BR/><BR/>For instance, the Twins have played the Royals 6 times thus far. In those games, they scored a total of 19 runs and allowed 15. The pythagorean expectation would project them to win about 62% of their remaining 12 games against the Royals. Given that they have 140 games left, you could give .62 a weight of 12/140. Rinse and repeat for the rest of the teams, applying the overall pythagorean projection towards any teams that they haven't played. Then, sum up the respective pythagorean shares to determine a weighted pythagorean expectation. <BR/><BR/>Then again, that's a lot of work. In any case, great article.ChuckDickenshttp://www.blogger.com/profile/11709571849561444523noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-87773083344668585772008-04-23T13:51:00.000-04:002008-04-23T13:51:00.000-04:00"That's a concern of mine, but it's not the only o..."That's a concern of mine, but it's not the only one. You are also using Pythag based on actual RS/RA, rather than AEQR/AEQRA (expected RS/RA via the batting lines produced, adjusted by SOS)."<BR/><BR/>I wanted to use that, but figured I wouldn't be able to find it for the past data.<BR/><BR/>What I did is essentially the same thing as combining Davenport's original Playoff Odds, and his PECOTA Playoff odds. For example, the original odds have the Diamondbacks winning 95.4 games, and PECOTA has them winning 90.9 games. My prediction of 93.3 comes down right in between- it's higher than (90.9+(95.4-90.9)*.1) because he is regressing to .500 to get that 95.4 number; if he was using straight Pythag or straight AEQR/AEQRA, I'd assume they would be expected to win more than 95.4 games at this point.Vegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-46254460484626953502008-04-23T10:00:00.000-04:002008-04-23T10:00:00.000-04:00"My only concern is that the PECOTA numbers alread..."My only concern is that the PECOTA numbers already account for SOS, so teams that play hard schedules would be getting penalized twice."<BR/><BR/>That's a concern of mine, but it's not the only one. You are also using Pythag based on actual RS/RA, rather than AEQR/AEQRA (expected RS/RA via the batting lines produced, adjusted by SOS). Both of these concerns relate to #2 on my list of things needed to do a Monte Carlo sim. <BR/><BR/>Davenport's solution is to regress the AEQR/AEQRA Pythag to .500, then take a random sample from a normal distribution around the regressed WPCT. The flaw in that, IMHO, is that teams do not regress to .500. Your work above indicates that teams regress strongly to their PECOTA projections. And that's exactly what Davenport does in his PECOTA-adjusted Playoff Odds. Perhaps we should find out if he is using regression values similar to the ones you derived.skoormithttp://www.blogger.com/profile/14018318250262889801noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-74689858290464850342008-04-22T21:46:00.000-04:002008-04-22T21:46:00.000-04:00"These take into account the team's record so far;..."These take into account the team's record so far; the W% column is their expected winning percentage the rest of the way."<BR/><BR/>Actual record is what is used for games already played. The Pythag record is only used to help determine their winning percentage from here on out.Vegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-83734437944228787222008-04-22T21:41:00.000-04:002008-04-22T21:41:00.000-04:00You should be using actual record, not pythad. If...You should be using actual record, not pythad. If you are trying to predict what the actual record of the team is going to be at the end of the season at this stage, just take their actual record and pro-rate the rest of the season using whatever projection system you want. Pythag tells us how many games the team should have won up until this point, but frankly, we don't care how many games the team should have won since we KNOW how many games the team has won. Whether a team is 21-1 by winning all 1 run games or 21-1 by blowing out their opponents, I don't care. I know the team won 21 games. I don't need to, nor should I correct for that. The proration of the projection system over the next 140 games is going to take care of that regression to the mean in luck.Ryanhttp://www.blogger.com/profile/10206573228455542540noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-69199017400178765182008-04-22T18:07:00.000-04:002008-04-22T18:07:00.000-04:00Fastness, Evilmonkeycma, DCThrowback, Passive Voic...Fastness, Evilmonkeycma, DCThrowback, Passive Voice, and Bobby S- Thanks for the insightful comments. (I'm kidding, I'm glad you like the post.)<BR/><BR/>Sky- Looking at <A HREF="http://www.matchbook.com" REL="nofollow">Matchbook</A>, the ones that jump out are ATL O85.5 +125, and KC U72.5 +150. I'm not sure where else win totals are still posted.<BR/><BR/>Skoor- This would be fun. My only concern is that the PECOTA numbers already account for SOS, so teams that play hard schedules would be getting penalized twice. I could do #1, although it would take awhile unless there is something like that freely available.Vegas Watchhttp://www.blogger.com/profile/02353166004125421683noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-16461200489105784352008-04-22T17:52:00.000-04:002008-04-22T17:52:00.000-04:00Awesome.Now, how do those projected win totals com...Awesome.<BR/><BR/>Now, how do those projected win totals compare to current online odds? Can we make some money by taking advantage of others' overreactions?Skyhttp://skyking162.comnoreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-42292748881811391932008-04-22T13:50:00.000-04:002008-04-22T13:50:00.000-04:00"I am not smart enough to run a Monte Carlo simula..."I am not smart enough to run a Monte Carlo simulation."<BR/><BR/>Hogwash. If you are smart enough to do regression analyses, you are twice smart enough to do a Monte Carlo. Take your projected wpct and run through the remaining schedule an arbitrarily large number of times. (BP does one million.) The log5 method is a quick and effective way to get a result for each game based on the two team's wpct.<BR/><BR/>All you need:<BR/>1) A database with the 2008 MLB schedule.<BR/>2) Your projected wpct for each team for the remainder of the schedule.<BR/>3) A program to read the database and your projections, run the simulations, and report the results.<BR/>4) A snack to munch on while the program runs.<BR/><BR/>If you are saying you are not smart enough for #3, I still say hogwash. But if you'll give me (or point me to) #1, I'll make you #3.Skoorhttp://www.blogger.com/profile/14018318250262889801noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-91301751516919348752008-04-22T13:26:00.000-04:002008-04-22T13:26:00.000-04:00Excellent, excellent post. Thanks.Excellent, excellent post. Thanks.Bobby Shttp://www.blogger.com/profile/03280124122225159775noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-39294099587594441572008-04-22T12:54:00.000-04:002008-04-22T12:54:00.000-04:00Check-plus.Check-plus.Passive Voicehttp://www.blogger.com/profile/06911624103621257431noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-7582399511193359422008-04-22T12:10:00.000-04:002008-04-22T12:10:00.000-04:002nd to evil monkey. muchas gracias2nd to evil monkey. muchas graciasDCThrowbackhttp://www.blogger.com/profile/02595347138052680931noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-34218144966825531592008-04-22T10:40:00.000-04:002008-04-22T10:40:00.000-04:00Excellent post. Nothing else needs to be saidExcellent post. Nothing else needs to be saidEvilmonkeycmahttp://www.blogger.com/profile/12659818492125894265noreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-11810383508927090322008-04-22T06:07:00.000-04:002008-04-22T06:07:00.000-04:00Good work. It's interesting how pecota is still w...Good work. It's interesting how pecota is still weighted much more heavily even late into the season. <BR/><BR/>I'm surprised pct3 is the same. On the bottom of the pecota adjusted postseason odds page, it says they are taking the aeqr pythag wpct and regressing that toward the pecota projections, but they obviously aren't.<BR/><BR/>One more thing - the pecota adjusted playoff odds page says it is regressing performance toward the pecota predicted winning percent, then running a simulation of the rest of the season using the log5 method to determine who wins the game, but the pecota projected winning percent is based on simulations of the entire season. So the pecota projected winning percent is not an approximation of a team's adjusted winning percent (like for example, the kenpom.com pythag wpct), but an approximation of their winning percent given their schedule. So it would then assume the cubs and red sox (both projected to go 91-71) are equal by pecota, while pecota actually finds the red sox better, but having a harder schedule.Matthewfastfoodreviews.netnoreply@blogger.comtag:blogger.com,1999:blog-7965585998403674176.post-60378556015992923112008-04-22T03:10:00.000-04:002008-04-22T03:10:00.000-04:00This post is the kind of stuff to shove up Rick Re...This post is the kind of stuff to shove up Rick Reilly's ass. Good stuff.Fastnesshttp://www.blogger.com/profile/04250511345868226554noreply@blogger.com