May 21, 2013

THT Essentials:
Fangraphs Player Search:


And here's the full roster.

Now available


You can now purchase the Hardball Times Baseball Annual 2013, with 300 pages of great content. It's also available on Amazon and Kindle. Read more about it here.

THT's latest e-book


Third Base: The Crossroads is THT's new e-book, available for $3.99 from the Kindle store. The good news is that anyone can read a Kindle book, even on a PC. So enjoy the best from THT in a new format.

Most Recent Comments





Get your very own THT merchandise from our CafePress store. We've got baseball caps, t-shirts, coffee mugs and even wall clocks with the classy THT logo prominently displayed. Also, check out the THT Bookstore. Please support your favorite baseball site by purchasing something today.



Or you can search by:


Creative Commons License
All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License.
Roll mouse over date for entries
THT Live Calendar
May 2013
S M T W T F S



1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31

Sunday, February 19, 2012

Park factor fix for Forecasts

Posted by Brian Cartwright
The Forecasts update posted yesterday, Feb. 18, will have different numbers for every player from the week before, as I discovered a logic error in my code which was preventing park factors from being applied to each player's batting and pitching projections. Recently published Forecasts were park neutral but now are as intended, specific to the player's team.

Major league players' projections are customized to the weighted mean of all the ballparks his team played in the previous season. Projections for players in the minor leagues are based on their parent major league team. Schedules for 2012 may now be downloaded from mlb.com, and as soon as those are imported into the Oliver database, I will use the number of games scheduled to be played in each ballpark in the coming season instead of the previous.

I do not expect using 2012 instead of 2011 to produce major changes in the projections, but some players who play in extreme parks did see sizable differences when their park factors were correctly applied. Troy Tulowitzki solidified his rank as the overall best position player; playing half his games in Coors Field inflates his batting projection from a park neutral .285/.358/.506 to .302/.371/.544. On the other end of the ballpark spectrum, half a season in Petco Park (as well as a disproportionate number in parks such as Dodger Stadium and AT&T Park) drop Chase Headley's projection from .283/.353/.408 to .269/.342/.383.

I apologize for any inconvenience and welcome any comments from subscribers who suspect something may be amiss. Sometimes we'll have an explanation, but other times we have been able to catch errors. As a result of Matt Swartz's testing at FanGraphs of several projections, including Oliver, I am currently at work on some improvements on projecting ERA, as well as an existing project to be able to project a pitcher as either a starter or a reliever for those times when a player's role changes.



Brian got his start in amateur baseball way back in the 1970's as the statistician for his local college summer league in Johnstown, Pa, which also hosts the annual All-American Amateur Baseball Association. A longtime APBA and Strat-o-Matic player, he still tends to look at everything as a simulation. He has also written for StatSpeak and Fangraphs, was runnerup in the Baseball Prospectus Idol competition, and has consulted for a major league team. You can contact him at .(JavaScript must be enabled to view this email address).


Comments

evo34 said...

Can you explain what process you use to project park factors for the upcoming season?  Thanks.

Posted 02/22  at  01:51 AM
Brian Cartwright said...

For each team, I take a weighted mean of the park factors of all the ball parks they are scheduled to play in during the coming season.

Those are multi-year factors, calculated from past season games, and split for right and left handed batters. For pitching projections, these are applied based on the expected percent of right and left handed batters faced, by whether the pitcher throws right or left.

For the new park in Miami and the reconfigured Citi Field in New York, I do not have an estimate of their factors. At the beginning of this season, the factors for those parks will start at 1.00, but as more games are played the factors will approach their ‘true’ value, as the observed data is regressed to the mean of 1.00.

With some recoding, it would be possible to seed the new parks with a value other than 1.00, allowing me to make an initial guess as to how each new park will play.

Posted 02/22  at  02:03 AM
evo34 said...

Thanks.  How many past seasons are used to calc. the factors?  Would you mind posting a table with all the park factors you are using this season?

Posted 02/22  at  02:09 AM
Brian Cartwright said...

Seasons are 1998-2011.

https://docs.google.com/spreadsheet/ccc?key=0Akieb136KCz2dGVrcjFkNTBEaXk0aEdJaE85LTd0VlE

Posted 02/22  at  02:41 AM
Page 1 of 1

Leave a comment:

Commenting is not available in this weblog entry.