Order NowThe Hardball Times Baseball Annual 2010 is now in development and will ship in mid November! This year's book will feature articles by THT's staff as well as Bill James, Rob Neyer, Tom Tango and Craig Wright. If you use this link to purchase the Annual, you will be in the first group to receive it and you'll be supporting THT. ![]() Derek Ambrosino
John Burnson Derek Carty Marco Fujimoto Eriq Gardner Matt Hagen Jonathan Halket Rob McQuown Troy Patterson Mike Silver Paul Singman Michael Street And here's the full roster. Got a question for our fantasy baseball experts? Email us:
Heater MagazineAdd 10 MPH to your fantasy team — see for yourself
HEATER MAGAZINE Winner, 2008 CBS Sportsline Fantasy League of Experts ![]() Plus our Statistical Definitions Most Recent Comments
Top 10 prospects for 2010: Toronto Blue Jays and Kansas City Royals (8)
Top 10 prospects for 2010: Chicago White Sox and Detroit Tigers (4) Waiver Wire Offseason: NL (2) Acey-deucey (1) Waiver Wire Offseason: NL (12) Monthly Archives
November, 2009
October, 2009 September, 2009 August, 2009 July, 2009 June, 2009 May, 2009 April, 2009 March, 2009 February, 2009 January, 2009 December, 2008 November, 2008 October, 2008 September, 2008 August, 2008 July, 2008 June, 2008 May, 2008 April, 2008 March, 2008 February, 2008 January, 2008 December, 2007 November, 2007 October, 2007 September, 2007 August, 2007 July, 2007 June, 2007 May, 2007 Gear up for baseball season with Chicago White Sox tickets and New York Yankees tickets. LA Angels tickets, Houston Astros tickets, and Atlanta Braves tickets are hot sellers! You can get Boston Red Sox tickets, San Diego Padres tickets or Chicago Cubs tickets for your favorite baseball fan. Coast to Coast Tickets has the best MLB tickets like Minnesota Twins tickets, LA Dodgers tickets, Milwaukee Brewers tickets, New York Met tickets and St. Louis Cardinals tickets. Find premium Chicago Cubs tickets and other Chicago tickets at JustGreatTickets.com. Chicago Cubs Tickets Chicago Tickets ![]() All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License. |
Most Recent Posts
Thursday, August 27, 2009In a sentimental moodPosted by Jonathan Halket at 3:36amToday I'd like to take a second-cut (in a second-best fashion) at John Burnson's nice article last week on near-sighted fantasy players. In his article, Burnson used two forecasting systems, Marcels, and Near-sighted Marcels (NSM, for short), to find some baseball players for next year that were likely to be overvalued by fantasy players that focused mainly on this year's performance. After reading his article, I was interested in answering some questions: How much worse off are near-sighted fantasy players? And is near-sightedness actually helpful in forecasting certain types of players? As a reminder, Tom Tango's Marcels forecasting system is intentionally simple (so simple that Marcel the Monkey could do it). It takes a weighted average of a player's last three seasons and regresses it to the mean somewhat. All NSM does is put more weight on a player's more recent performance than Marcels does. Here's the percent weight that each system puts on past performance:
How much better or worse is NSM for fantasy? To answer this, I have to make a few adjustments, some of which are harmless while others are necessary but unfortunate. First, I need a metric to measure performance. I'm going to use mean absolute deviation (MAD) (aka mean absolute forecast error). Another possible metric is root mean squared error (RMSE), but that's for another day. I'm going to compare OPSs, since that's what John did in his article and also it saves me a few computational steps. Instead of forecasting 2010 like John did, I'm going to use 2006-2008 to forecast 2009 and compare the forecasts with the actual data we have so far (since I'm using OPS, which is a rate, it doesn't really matter that we haven't completed the season yet). Next I have to "fantasize" the OPS forecasts and sample. Fantasizing the sample means dropping players that are projected to have too few plate appearances or are projected to perform too poorly to be fantasy relevant in most leagues. For today, I dropped any player with fewer than 200 projected plate appearances or less than a .700 OPS (for these projections I used Marcels only so that I would have a consistent sample across forecasting systems—nothing really changes if you use NSM for this step instead). Alas this introduces the possibility of sample selection bias. I also didn't include players without three years of usable data - just like John did. Fantasizing the forecasts implies removing the means of each forecast. Player values are based on performance relative to league average—so that is how we should measure the value of our forecast systems (though in this case, the average is a sample average and not a league average). Lastly, I'm going to weight each player by the number of plate appearances he's actually had in 2009 (though this doesn't make a qualitative difference for any result). So what's the MAD?
Indeed, NSM has a higher MAD and thus a worse performance than Marcels. But the difference is numerically negligible and not statistically significant. The difference is not statistically significant because, frankly, reweighting doesn't do all that much compared to the overall forecast errors that are inherent in either system. Both NSM and Marcels "regress to the mean" of league performance identically- a step which probably accounts for a large part of either system's success. I will follow Burnson in calling the difference between the forecasts for each player as that player's "sentiment". A positive sentiment means NSM is more optimistic about the stat than is Marcels. One thing I wondered was how sentiment varied by age and whether there was any variation in forecast performance by age.
We can see a slight age based pattern here: NSM is sentimental about younger players whereas Marcels is more "nostalgic" about about older players. Interestingly enough, this sentimentality and nostalgia are somewhat appropriate. NSM does a little better at forecasting the younger players while Marcels does better with the older ages. In neither case is the difference statistically significant though. What about variation by player performance? Is NSM more sentimental about better players? As we can see in the scatterplot with the fitted line, the answer is a qualified yes (again there's too much noise in the data for statistical significance). This isn't terribly surprising. For NSM to be sentimental about a player, that player must have done particularly well in 2008 relative to his 2007 and 2006 campaigns. Some of this is luck perhaps, but to the extent that any of the improved performance is persistent, this luck/skill will carry over into 2009 as well. ![]() To conclude: Being sentimental can hurt you, but being a sentimental monkey (that is using Marcels with different weights) doesn't hurt you that much and actually may help you with young players. That said, sentimental monkeys are pretty smart since they regress to the mean. Tango made his monkey fairly simple-minded and that includes his 5/4/3 weighting system. In his explanation of Marcels, Tango does not explain where he came up with the particular weights. Probably, in order to keep it simple, it is just a rule of thumb. So it would be interesting to see what the optimal weighting system would be (i.e. the one that provided the best forecasting performance using a certain metric). It might not be all that far from NSM. Jonathan Halket is an economist in New York. He welcomes questions and comments here.
Toffer Peak said...
Jeremy B. - Not sure where I’ve read it but I’m pretty sure that Marcel already weights pitchers by 3/2/1 rather than 5/4/3 like he does for hitters. I also agree with you observation that different positions (or at least catchers) should probably be weighted slightly differently. Catchers just seem to break down much more quickly. Same with large, oafish DHs/1Bs. I’m sure he won’t change it though in order to keep it as simple as possible. Maybe Zips, CHONE, etc. do consider these though. Posted 08/28 at 02:58 PM
Page 1 of 1
Commenting is not available in this weblog entry.
Next Post: Looking forward: Some of the more prominent prospects of 2009>> <<Previous Post: Player Profile: Miguel Montero | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nice article, Jonathan,
I was thinking while reading this that different positions, particularly “Pitchers” versus “Batters”, using fantasy lingo, might have different optimal weights. Catchers might also have different optimal weights from the other “Batters”.
I don’t really have a hypothesis about what the different optimal weights would be for pitchers vs. batters, because looking at it one way, pitchers might supply around 1.25 days per week of data while hitters supply around 5 days per week of data, but looking at it another way, they both pitch/bat approximately the same number of at bats per week… except for catchers…