Order NowThe Hardball Times Baseball Annual 2010 is now in development and will ship in mid November! This year's book will feature articles by THT's staff as well as Bill James, Rob Neyer, Tom Tango and Craig Wright. If you use this link to purchase the Annual, you will be in the first group to receive it and you'll be supporting THT. ![]() Derek Ambrosino
John Burnson Derek Carty Marco Fujimoto Eriq Gardner Matt Hagen Jonathan Halket Rob McQuown Troy Patterson Mike Silver Paul Singman Michael Street And here's the full roster. Got a question for our fantasy baseball experts? Email us:
Heater MagazineAdd 10 MPH to your fantasy team — see for yourself
HEATER MAGAZINE Winner, 2008 CBS Sportsline Fantasy League of Experts ![]() Plus our Statistical Definitions Most Recent Comments
Top 10 prospects for 2010: Toronto Blue Jays and Kansas City Royals (8)
Top 10 prospects for 2010: Chicago White Sox and Detroit Tigers (4) Waiver Wire Offseason: NL (2) Acey-deucey (1) Waiver Wire Offseason: NL (12) Monthly Archives
November, 2009
October, 2009 September, 2009 August, 2009 July, 2009 June, 2009 May, 2009 April, 2009 March, 2009 February, 2009 January, 2009 December, 2008 November, 2008 October, 2008 September, 2008 August, 2008 July, 2008 June, 2008 May, 2008 April, 2008 March, 2008 February, 2008 January, 2008 December, 2007 November, 2007 October, 2007 September, 2007 August, 2007 July, 2007 June, 2007 May, 2007 Gear up for baseball season with Chicago White Sox tickets and New York Yankees tickets. LA Angels tickets, Houston Astros tickets, and Atlanta Braves tickets are hot sellers! You can get Boston Red Sox tickets, San Diego Padres tickets or Chicago Cubs tickets for your favorite baseball fan. Coast to Coast Tickets has the best MLB tickets like Minnesota Twins tickets, LA Dodgers tickets, Milwaukee Brewers tickets, New York Met tickets and St. Louis Cardinals tickets. Find premium Chicago Cubs tickets and other Chicago tickets at JustGreatTickets.com. Chicago Cubs Tickets Chicago Tickets ![]() All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License. |
Most Recent Posts
Wednesday, January 28, 2009Introducing: CAPS Road Park FactorsPosted by Derek Carty at 1:05am
As I'm sure all of our regular readers know (and all the ones Rob Neyer sent over here), a couple of weeks ago I unveiled a new stat that I called CAPS (Context Adjusted Pitching Statistics). If you aren't familiar, I'd definitely suggest checking out that article, but to briefly summarize, CAPS adjusts a pitcher's peripheral numbers based on a number of different contexts to give us a better idea about what that pitcher should be expected to do going forward. Up until now, CAPS adjusted for home ballpark, quality of batters faced, and any league change. Today, I'd like to add one more adjustment to the mix: road ballpark factors. As we know (and as David Gassko covered thoroughly in this article) ballparks can have a significant effect on just about every stat we fantasy players look at: everything from runs and home runs to strikeouts, walks and ground balls. For the majority of baseball players, we tend to ignore these effects because they remain on the same team from one year to the next. The context remains exactly the same, so it has no bearing on our expectations. Have you ever considered, though, what the effects might be of all of the games that are played on the road? While a player may play all of his home games in the same environment, his mix of road stadiums will undoubtedly differ from year to year. When most everyone talks about ballpark factors, they talk about the home ballpark impacting the numbers, working under the assumption that the road side is completely neutral. This, however, is simply not true. A pitcher who happens to throw a disproportionate number of times in PETCO and AT&T Park will be helped in the home run department simply as a matter of context, the same as a pitcher who throws too often in Coors and Chase Field will be hurt. So with the help of Retrosheet, I've calculated an individualized "Road Park Factor" for each pitcher using his exact blend of road ballparks (and the time spent in each) for every year back to 2004 and for every stat we care about, neutralized it, and then applied a 2009 factor based on the exact 2009 road schedule for every team. MethodOnly read this if you're interested in hearing a little more about exactly what I did. If you're not interested, you know all you need to and can skip to the next section. The method used is pretty intuitive, but to elaborate just a bit further, I calculated each pitcher's road park factor by weighting each park he played in depending on the number of opportunities he had to accumulate each stat. To come up with the strikeout factor, for instance, I looked at every non-HBP batter faced. For all types of hits and batted balls, I looked at all fair, contacted balls. (If it makes it easier, think about it in terms of Pizza Cutter's flow chart.) Once I arrived at the factor, I simply applied it to each pitcher's road stat line. The one other note I need to make deals with batted-ball types (ground balls, infield flies, etc.). Because Retrosheet classifies these differently than Baseball Info Solutions does, I wasn't able to apply these factors to the pitcher's road stat line. Instead, for batted balls, I had to cut the pitcher's full-year line in half, applying the road factors to one half and the home factors to the other half. It shouldn't make that much difference, but it does need to be noted. After coming up with these factors and neutralizing the player's line for each year, I then took each team's 2009 road schedule and combined the ballparks appropriately. I then applied this factor to every year we'll look at to put all of the numbers into the context of 2009, which is what we care about. CAPS: Where we're atTo summarize, the CAPS numbers you'll be seeing going forward take all of the following into account:
How large is the road ballpark impact?As I noted earlier, baseball analysts have long ignored road park factors, assuming these things are neutral. While logically we know this isn't true, could it just be that the effects are so small that this is a fair assumption to make? Let's take a look at the leaders and trailers for 2008 and find out. Note: The "a" before each stat in the third column stands for "adjusted." This is what the player's stat would look like if it was neutralized for road park. Also, because more strikeouts are good and fewer walks, homers, and hits are bad, the tables are arranged so that the five unluckiest are always on top and the five luckiest are always on the bottom, regardless of stat. +------------------+-----+-------+------+ +-----------------+-----+------+------+ | PLAYER | K | aK | DIFF | | PLAYER | BB | aBB | DIFF | +------------------+-----+-------+------+ +-----------------+-----+------+------+ | Gil Meche | 104 | 108.5 | 4.5 | | Miguel Batista | 34 | 33.0 | 1.0 | | Ubaldo Jimenez | 100 | 104.5 | 4.5 | | Jeff Suppan | 33 | 32.1 | 0.9 | | Jorge de la Rosa | 62 | 65.9 | 3.9 | | Manny Parra | 30 | 29.1 | 0.9 | | Zack Greinke | 93 | 96.8 | 3.8 | | Felix Hernandez | 32 | 31.1 | 0.9 | | Josh Beckett | 105 | 108.2 | 3.2 | | Dave Bush | 24 | 23.1 | 0.9 | | ..................................... | | ................................... | | Chad Billingsley | 89 | 86.3 | 2.7 | | Zach Duke | 21 | 21.6 | 0.6 | | Felix Hernandez | 78 | 75.3 | 2.7 | | Phil Dumatrait | 24 | 24.7 | 0.7 | | Jake Peavy | 67 | 64.2 | 2.8 | | Paul Maholm | 29 | 29.8 | 0.8 | | Cole Hamels | 106 | 102.7 | 3.3 | | Tom Gorzelanny | 34 | 34.9 | 0.9 | | Ricky Nolasco | 106 | 101.5 | 4.5 | | Ian Snell | 44 | 45.3 | 1.3 | +------------------+-----+-------+------+ +-----------------+-----+------+------+ +----------------+------+--------+------+ +-----------------+-----+------+------+ | PLAYER | H-HR | aH-aHR | DIFF | | PLAYER | HR | aHR | DIFF | +----------------+------+--------+------+ +-----------------+-----+------+------+ | Jon Lester | 93 | 90.1 | 2.9 | | Javier Vazquez | 14 | 12.6 | 1.4 | | Jon Garland | 99 | 96.3 | 2.7 | | Shaun Marcum | 14 | 12.6 | 1.4 | | Josh Beckett | 78 | 75.8 | 2.2 | | Brett Myers | 16 | 14.6 | 1.4 | | Joe Saunders | 75 | 72.9 | 2.1 | | Aaron Harang | 16 | 14.7 | 1.3 | | Jeff Weaver | 88 | 86.0 | 2.1 | | Gavin Floyd | 12 | 10.7 | 1.3 | | ..................................... | | ................................... | | Kyle Kendrick | 97 | 99.6 | 2.6 | | Scott Olsen | 18 | 19.0 | 1.0 | | Mike Mussina | 87 | 89.8 | 2.8 | | Jon Garland | 12 | 13.0 | 1.0 | | Andy Pettitte | 103 | 106.0 | 3.0 | | Nate Robertson | 16 | 17.0 | 1.0 | | Cliff Lee | 113 | 116.4 | 3.4 | | Todd Wellemeyer | 13 | 14.0 | 1.0 | | Carlos Silva | 121 | 124.8 | 3.8 | | Brian Bannister | 14 | 15.0 | 1.0 | +----------------+------+--------+------+ +-----------------+-----+------+------+ Looking at our four leaderboards (the one on the bottom left represents all singles, doubles and triples, if it isn't clear), we can see that the effects aren't huge, but they are there. Obviously the biggest raw differences are seen with strikeouts and hits because they are more numerous to begin with, but these effects are pretty large even in a relative sense. With 4.5 more strikeouts, Gil Meche's K/9 would have jumped 0.2 points from 7.8 to 8.0. Twenty previously unaccounted for points of K/9 is huge. In terms of walks, the effects are much smaller, with Felix Hernandez's BB/9 falling from 3.59 to just 3.55 and Miguel Batista's from 6.18 to 6.11. Even Ian Snell's would only have risen 0.08 points. Looking at home runs, though, we see some big changes. Aaron Harang's HR/FB would have fallen from 15.3 to 14.7, which explains a sizable portion of his unlucky-looking HR/FB this year. It's very nice to be able to write it off to a specific cause instead of simply to "bad luck" (although it wouldn't really be wrong to do). Of course, we're dealing with the extremes, but you can see that the assumption that road effects are neutral is simply not true. Also, while these effects won't be very large for many players, the whole point is to add this onto our current CAPS system. When we combine all of the different effects—even if any one is small in isolation—we can see some big differences in value. And that, I believe, is what fantasy leaguers care about. If this can highlight for us just a few undervalued players or help us to avoid a few overvalued ones, this becomes a powerful, powerful tool. Also of interest (albeit perhaps more to the non-fantasy crowd) is the groupings, which some of you may have picked up on. If you notice, all five of the luckiest in walks are Pirates. Three of the unluckiest with walks are Brewers. The unluckiest with hits are all Red Sox and Angels. Two of the unluckiest with strikeouts are Royals and two are Rockies. As this started as an exercise to determine "divisional park effects" (the inspiration for which came from commenter Nick on the original CAPS article), it's not surprising to see players of the same team appear on the lists together. Derek LoweI didn't get a chance to post a full article about Lowe when he signed with the Braves, so we'll take a quick look at him now. +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ | YEAR | LAST | FIRST | TEAM | IP | QERA | K/9 | BB/9 | K/BB RI | xGB% | BABIP | HR/FB | +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ | 2005 | Lowe | Derek | Dodgers | 222.0 | 3.88 | 5.92 | 2.23 | 0.04 | 59.4 | 0.286 | 21.4 | | 2005 | Lowe | Derek | Braves* | 222.0 | 3.91 | 6.06 | 2.16 | 0.05 | 59.2 | 0.276 | 20.0 | +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ | 2006 | Lowe | Derek | Dodgers | 218.0 | 4.09 | 5.08 | 2.27 | -0.19 | 63.8 | 0.293 | 12.3 | | 2006 | Lowe | Derek | Braves* | 218.0 | 4.15 | 5.05 | 2.17 | -0.22 | 63.6 | 0.281 | 11.0 | +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ | 2007 | Lowe | Derek | Dodgers | 199.3 | 3.75 | 6.64 | 2.66 | 0.13 | 62.8 | 0.292 | 17.9 | | 2007 | Lowe | Derek | Braves* | 199.3 | 3.76 | 6.62 | 2.54 | 0.11 | 62.6 | 0.281 | 15.8 | +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ | 2008 | Lowe | Derek | Dodgers | 211.0 | 3.65 | 6.27 | 1.92 | 0.15 | 57.8 | 0.286 | 10.1 | | 2008 | Lowe | Derek | Braves* | 211.0 | 3.62 | 6.16 | 1.62 | 0.18 | 58.3 | 0.276 | 9.0 | +------+------+-------+---------+-------+------+------+------+---------+------+-------+-------+ Nothing of much interest here. Lowe's adjustments are minimal, as will be the case with a lot of players. I normally try writing about the players who are more interesting, but as Lowe is a guy I'm sure many of you have been wondering about ... ta-da! As I said earlier, though, the value in the CAPS system won't be the guys that it values the same, but rather the guys who it sees a big difference in. Check out our next player for a case like that. Lowe is obviously an extreme groundball pitcher, though he does manage to strike out about as many batters as a league-average pitcher. This has value in fantasy leagues, as does his mid-3.00s ERA. Overall, taking Lowe in the round 12-to-15-area of a traditional, 12-team mixed league should get you a fine player. He seems to struggle a little with home runs, but he keeps his BABIP pretty low, and the move from the Dodgers to the Braves could help. The UZR difference between the two was 4.8 per 150 last year. Dan HarenWe covered Haren in the original CAPS article, but he seems to have also caught a string of bad luck on the road—particularly with strikeouts—and we didn't look at all of his numbers last time. +------+-------+----------+-------+------+-----+------+---------+------+-------+-------+ | YEAR | LAST | TEAM | IP | QERA | K/9 | BB/9 | K/BB RI | xGB% | BABIP | HR/FB | +------+-------+----------+-------+------+-----+------+---------+------+-------+-------+ | 2006 | Haren | A's | 223.0 | 3.75 | 7.1 | 1.8 | 0.44 | 45 | 0.292 | 14.2 | | 2006 | Haren | D'Backs* | 223.0 | 3.49 | 7.6 | 1.6 | 0.57 | 45 | 0.299 | 16.0 | +------+-------+----------+-------+------+-----+------+---------+------+-------+-------+ | 2007 | Haren | A's | 222.7 | 3.71 | 7.8 | 2.2 | 0.52 | 44 | 0.292 | 10.5 | | 2007 | Haren | D'Backs* | 222.7 | 3.54 | 8.4 | 2.3 | 0.63 | 43 | 0.301 | 11.9 | +------+-------+----------+-------+------+-----+------+---------+------+-------+-------+ | 2008 | Haren | A's | 216.0 | 3.17 | 8.6 | 1.7 | 0.81 | 45 | 0.308 | 9.7 | | 2008 | Haren | D'Backs* | 216.0 | 2.76 | 9.5 | 1.4 | 1.09 | 46 | 0.310 | 9.3 | +------+-------+----------+-------+------+-----+------+---------+------+-------+-------+ As you can see, Haren has had some terrible luck for a few years now. In terms of his strikeout rate, he's probably the unluckiest pitcher in baseball over the past three years. This bad luck wasn't quite as pronounced in past years (and part of those past years' numbers are due to the league change, so he really shouldn't have been expected to post them with the A's), but in 2008 Haren really deserved much better. His strikeout rate was almost a point too low, and his QERA was a ridiculous 2.76. 2008's actual QERA leader was C.C. Sabathia's Brewers stint at 2.89, to put things into perspective. We shouldn't expect him to post identical numbers in 2009, but he has steadily risen four years in a row, will be 28 years old, and should have a good deal of luck catching up with him. His current Mock Draft Central ADP is 57.39, which would put him at the end of the fourth round in a 12-team league, though I have seen him go in the sixth. I'm not a fan of taking pitchers that early, but if Haren falls into the eighth or ninth round, I don't imagine I'll be passing him up. If the strategy you're employing allows you to take starters earlier than that, Haren seems like a very good choice. Concluding thoughtsAs I said in the original CAPS article, if you guys have any ideas for further things we could adjust for, feel free to contact me. If you have any questions about CAPS or anything fantasy baseball related, also don't hesitate. ErrataIn the original CAPS article, I accidentally applied the home ballpark factors to Javier Vazquez's entire line instead of just the home side. This has been fixed, and the new CAPS numbers (with road ballpark adjustments included) are displayed below. As you can tell, very little changes, and my evaluation remains the same; Vazquez makes a great fantasy pick this year. Javier Vazquez +------+-------+------+------+------+---------+------+-------+-------+ | YEAR | IP | QERA | K/9 | BB/9 | K/BB RI | xGB% | BABIP | HR/FB | +------+-------+------+------+------+---------+------+-------+-------+ | 2006 | 202.7 | 3.84 | 8.2 | 2.5 | 0.59 | 40 | 0.311 | 10.7 | | 2006 | 202.7 | 3.37 | 9.4 | 2.3 | 0.87 | 40 | 0.311 | 9.0 | +------+-------+------+------+------+---------+------+-------+-------+ | 2007 | 216.7 | 3.34 | 8.8 | 2.1 | 0.84 | 38 | 0.294 | 12.1 | | 2007 | 216.7 | 2.89 | 10.1 | 2.0 | 1.15 | 39 | 0.298 | 10.1 | +------+-------+------+------+------+---------+------+-------+-------+ | 2008 | 208.3 | 3.76 | 8.6 | 2.6 | 0.62 | 39 | 0.320 | 11.3 | | 2008 | 208.3 | 3.30 | 9.7 | 2.4 | 0.95 | 39 | 0.319 | 9.6 | +------+-------+------+------+------+---------+------+-------+-------+ Derek Carty is a 22-year old fantasy baseball analyst residing in New Jersey. In addition to writing for THTF, his work has appeared at Rotoworld (NBC), Sports Illustrated, FOX Sports, and Heater Magazine. In his two years competing in expert leagues, he has won 2 titles with 4 four top three finishes, including a LABR NL title in 2009, making him the youngest person to ever win a major expert league title. Derek is a proud graduate of the MLB Scouting Bureau's Scout Development Program and is a firm believer in the importance of combining stats and scouting. He welcomes questions via e-mail. Comments
Andrew said...
Great piece, Derek. Do you still plan on writing about that experts mock draft from last week? I was just curious about a few selections and also wanted to know if any picks were guys you would not have taken in a real draft. Posted 01/28 at 04:59 AM
Andrew B. said...
I’d always thought that it was important to know what road parks a player was playing in. When Matt Holliday was traded this offseason, everyone cited his significant home-road splits in arguing that he could not maintain a high level of performance outside of Colorado. While this is certainly possible, and while there also is a great deal of variability in 200-300 PA samples, I think the fact that Holliday played a significant portion of his road games in pitcher’s paradises like Los Angeles, San Francisco, and San Diego plays factors in as well. Posted 01/28 at 04:47 PM
Nick said...
Thanks, Derek. Again, outstanding work. I had the same thought as Victor above: in creating the super-all-powerful-grand-master projection system, we need to first adjust stat lines using something like CAPS. Posted 01/29 at 12:24 AM
Derek Carty said...
Andrew, Andrew B, Posted 01/29 at 01:57 AM
Andrew said...
I was curious about the following picks: Doumit, Ibanez, and Joba. I’m interested because all of those players are on my keeper league roster, and I’m in the midst of considering off-season trades. Do you think Doumit’s skills from last year were legit? Do you like Ibanez this year in hitter-friendly Philly? Do you see a big year from Joba? Thanks, Derek. I figured you took the draft seriously; I just wanted to make sure. Posted 01/29 at 02:14 AM | ||
Derek,
This is really great stuff. Have you thought about applying this to any projection systems? Ideally, we could apply this stuff to get a projection of a player in a completely neutral environment (adjusting for home park factor, road park factor, quality, etc.) and then take that projection and adjust for those various things.