Get It Now!Hardball Times Annual is now available. It's got 300 pages of articles, commentary and even a crossword puzzle. You can buy the Annual at Amazon, for your Kindle or on our own page (which helps us the most financially). However you buy it, enjoy!
And here's the full roster.
Most Recent Comments
A Look at John Buck (1)
THT Fantasy has moved to Rotographs (4)
THT Roundtable: How seriously do experts take mock drafts? (21)
Player-A-Day: Tim Lincecum (3)
Player-A-Day: Josmil Pinto (5)
All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License.
THT's Fantasy Archives
Thursday, June 04, 2009
Jason Giambi hit a three-run home run in today's game vs. the White Sox, making it his third round tripper in five games. In deep mixed leagues (16+ teams) and AL-Only leagues Giambi makes for an intriguing source of cheap home runs. You'll have to stomach the paltry .250 average at best, but for teams desperate for power Giambi can be a smart add if he stays hot.
Posted by Paul Singman at 7:33pm
Pirates OF prospect Andrew McCutchen did indeed lead off today's game, pushing Nyjer Morgan to the #2 spot and leading the team to put Freddy Sanchez in the #3 spot (what?!). This would make him at least a two category contributor (steals and runs) and possibly a three category guy (average), so he can be owned in deep mixed leagues.
Posted by Derek Carty at 1:30pm (0) Comments
Rockies 3B Garrett Atkins may get demoted as his rough start to the season has continued into June. If he does get demoted, Ian Stewart would probably take most of the playing time at third. Since the team would probably like to trade Atkins, I imagine he'd be called back up shortly after he starts hitting at Triple-A and would be back by July.
Posted by Derek Carty at 1:28pm (2) Comments
Slot machines are pure luck: you put your coin in, you pull the lever, and you take your chances. Repeat as often as you like or until you get a free drink. The longer you play, the more likely you are to end up with about the average outcome (which for slots is a negative amount—the house always wins). This is a version of the law of large numbers.
Now that the season is more than a quarter over, lots of batters have been playing their version of a slot machine for a while. Every time a batter puts a ball in play, he pulls a lever on the fielding slot machine. Sometimes he gets lucky and it is a hit and sometimes he gets unlucky and it is an out. The well known statistic Batting Average on Balls in Play (BABIP) tracks the average number of hits on balls in play.
Equally well known is that players' skills have little impact on their BABIP; once the batter puts the ball in play (home runs don't count), whether or not the ball goes for a hit has little to do with the name on the back of the batter's jersey.
In this article, I'm going to do three things: I'm going to equate the luck on balls in play to a version of a coin flip, I'll then simulate some of these coin flips and show that it looks a lot like the outcomes that batters have thus far in the season, and, then lastly, we will see that players can still be pretty lucky after only a quarter of a season. What are the practical implications? After a quarter of a season, you should still be skeptical (though not necessarily incredulous) towards players' performances.
As we'll see, the slot machine that a player plays when he puts the ball in play doesn't have to be complex. In fact, let's just suppose that this machine is a simple weighted coin flip. Instead of a 50-50 chance of heads and tails, let's suppose the coin is 30-70. So 30 percent of the time the coin comes up heads and the player gets a hit, 70 percent of the time it goes for an out.
A player's total number of hits and his BABIP after, say, 200 balls in play are random (just like the total number of times heads comes up after 200 coin flips is random). In fact, the distribution of hits and BABIPs (a distribution is sort of like the percentage of time we can expect to observe, say, 78 hits on 200 balls in play) is given by the binomial distribution. It is pretty easy to use a computer to simulate outcomes from a binomial distribution and compare it to the data we have so far from the season.
What I've done: I've taken each batter with at least 100 at-bats (243 batters). I've computed the number of hits in play (hits - home runs) for these batters and their BABIP. For each batter, I've then calculated what their batting average could look like if each at-bat was simulated and the outcome determined by a binomial random variable with the same average success rate (30.3 percent).
The graph below shows the number of hits we get from the data (blue) and from the simulation (red). Not bad (if you're really curious, the two distributions are considered statistically identical according to a Komologorov-Smirnov Test). We can smooth things out and compute a distribution for each—that's the next figure. The third graph is the same smoothed distribution, only this time for actual and simulated BABIPs. On this one, the match is even better.
What can we see from these graphs? The average number of balls in play for each batter is fairly high: 127. As far as statistics is concerned, 127 is a lot of coin flips. You might have read or heard other fantasy commentators say something like "Now that we're in June, we don't have to worry as much about small sample sizes." While that is still literally true, the third graph shows that there is still a lot of variation left in the data. In fact, if you look at the CDF (cumulative density function), you can see that as of June 1, fifteen percent of players still have a BABIP below .250 even though their expected BABIP is .303. That is, even though the coin they are flipping should come up heads 30.3 percent of the time, they've gotten unlucky routinely and have only gotten heads less than 25 percent of the time.
My final graph shows what happens if we simulate 500 balls, or roughly four times the number of balls in play. The blue line is the same simulation from before, with on average 127 balls in play per batter. The green line simulates 243 batters with 500 balls in play using the binomial distribution. As we can see, the more balls in play we have, the more likely we are to get the median outcome and the less likely we are to get extreme outcomes.
In other words, in June, after 125 balls in play, a batter can still be lucky and have a high BABIP. In September, it should be far harder to have had a season of luck. So in June you must still be aware of the small sample.
Posted by Jonathan Halket at 1:35am (8) Comments
Wednesday, June 03, 2009
Busy day. The White Sox recalled top prospect MI Gordon Beckham from Triple-A this evening, designating CI Wilson Betemit for assignment. Unless the team wants to bench 2B Chris Getz or 3B Josh Fields, Beckham's playing time will probably come piecemeal, splitting time between 2B, 3B, and SS. He'll also probably have a chance to overtake Getz or Fields at some point if he hits well, but he should probably get close to regular at-bats anyway. I doubt the team would recall him just to sit on the bench.
AL-only leaguers, go get him now if he's not already owned. Mixed leaguers can probably hold off for now. He doesn't really excel in any one category, and he probably won't show much speed at all. His batting average might be serviceable and he has a little power, though it's yet to be seen where he'll hit in the order for RBI and runs. Probably in the 7-8-9 area to start.
MI Jayson Nix will probably lose some at-bats with this move.