Friday, May 01, 2009
Hit Rate Observer
Posted by John Burnson at 1:52amSo you want a starting pitcher who’ll get you a win. Which do you choose?
“A good one.” Yes—but pitchers who are good at getting wins might be less good at other things…
Let’s start here: Suppose you have a 4.50 ERA pitcher. How many wins can he be expected to get? I am sure we could run a calculation based on overall runs allowed and runs scored and get an approximate total for the season. However, since wins are accumulated by game, it may be fruitful to stick to that level. (For the discussion that follows, I will use ERA as our guide, though certainly RA would be a touch better.)
Now, a “W” is a matter of bookkeeping; it goes to the pitcher who was in the game when a team took its final lead. As such, the critical component for a starter to get the “W” is how long he lasts. Obviously, skill plays a role in that. And events don’t always correlate perfectly—in a blow-out win, a thriving pitcher might be pulled early to rest his arm, while in a losing but low-scoring affair, a manager may ride his ace. In general, though, the deeper into a game that a starter pitches, the better his chances for the Win.
So our scheme is two-step:
1. How deep into a game is our 4.50 ERA pitcher expected to go?
2. What is the win likelihood for a pitcher who goes that distance?
We can get an idea of the answer to the first question by looking at the aggregate ERAs of starters who pitched games of different lengths. We collected data from 2006-2009. The answer turns out to be neat:
Outs = 27 - (2*ERA)
That is, the expected number of outs is equal to 27 minus twice the starter's ERA. That’s a close re-statement of the trend line from this graph:

(Note: In this graph, the dependent variable is actually Outs Lasted; I’ve rotated the graph for our purposes.) By this rule of thumb, a 6.00 ERA starter would give us 15 outs (5 innings), a 0.00 ERA starter would go nine innings, and our 4.50-ERA guy can be expected to last six innings.
That answers the first part of our question. Now: All things equal, what’s the win likelihood for a six-inning start? Here are the numbers from 2006-09:
Win likelihood goes from roughly 1-in-4 in a five-inning start to greater than 9-in-10 (but well less than 100%) for a nine-inning start. (We removed the data points for 25 and 26 outs because there are so few examples.)
The graph is actually slightly better fit by an exponential curve than by a straight line:

A non-linear curve is reasonable: Holding a lead deeper into a game not only deprives the opponent of more chances to recover but also lets the starter hand off the game to his better relievers.
If we read a six-inning start from the trend line, we find that our 4.50 ERA starter has a 35.5% chance of getting the win. (This is not considering the efficiency of the two offenses or of the bullpen.)
That’s informative, but the exponential curve of our second graph has a deeper implication, which is that you really want a starter who can go deep into games, even if he also sometimes flames out. Consider two starts of a combined 14 innings: A starter who goes seven innings in each start has an expected total in the two games of 0.98 Wins, whereas a starter who goes five innings in one start but nine innings in the other has an expected total of 1.18 Wins—a fifth of a Win more. In fantasy, erratic genius pays.
Does this genius have an identifiable quality? What skill or skills are good for the long haul? To find out, we calculated the aggregate K/9 and BB/9 of all starters who reached X or more outs in a game:

The above is not a typical line graph; instead, the line here tracks the shifting skills exhibited in starts of increasing length. The path consists of 28 points—from 0 outs (shown by the orange dot) to 27 outs (shown by the green dot). Each point gives the strikeout and walk rates posted by starters who recorded that many outs or more. (The blue dot marks 21 outs—the end of the seventh inning.)
The graph has two sets of arcs. The main arc sweeps from the upper right to the lower left, from the first out to the last. There are a number of fascinating aspects. For one thing, there’s little distinction in strikeout rate from basically the start of the game to the seventh inning; at each step, we’re looking at 6.3-6.4 K/9.
In starts that last more than seven innings, strikeout rate does takes a turn—but lower, not higher. Starts that go into the eighth inning are characterized by an overall strikeout rate of just 6.2 K/9. And the rate in nine-inning starts is well below 6.0 K/9.
A higher strikeout rate is no help to going deeper into games. In fact, to reach a start longer than seven innings, it’s a downright hindrance. One reason is certain: Strikeouts cost pitches, generally more than the number required to post an equivalent number of non-K outs. We also wonder if, even apart from the ballooning pitch count, power pitchers lose control earlier in the game than do finesse pitchers.
What is important—far more important—for pitching deep into a game is a low walk rate. We can show this more clearly in a second graph, this one a straightforward plot of K/9 and K/BB versus Outs Reached:

In nine-inning starts, K/BB nearly eclipses K/9!
Now, there is some backwardness here: The data show that long starts are characterized by high K/BB, not that pitchers with high K/BB are fated to pitch long. Still, I think the point holds.
Fantasy leaguers love strikeouts. And it’s a fair point that a high strikeout rate can contribute to a high K/BB. However, these graphs say that a low walk rate deserves allegiance on its own. A walk rate under 2.0 BB/9 is a sign of not only control but also good health and, within a game, an absence of fatigue. And it’s the pitcher’s walk rate, not his strikeout rate, that determines whether he will stay in a game in which he is pitching well.
If you’re scouting for wins, you should leap at the chance to roster a starter with a 2.5 K/BB, even if he has a sub-6.0 K/9. What he costs in Ks, he could recoup in Ws.
(What about the second set of arcs? Those are the epicycles, the curious switchbacks in which strikeout rate creeps up and then suddenly drops. The drops mark the start of new innings. The decision to send a starter who has completed N innings back into the (N+1)th inning relies on factors other than his arm—whether the game is close, whether the bullpen is fresh. However, once a starter has been put in for another inning, then whether he survives to the end of that inning does depend, marginally, on his K rate.)
Compiled by THT Staff.





 
Brilliant work—thank you.