May 24, 2013

THT Essentials:
Fangraphs Player Search:


And here's the full roster.

Now available


You can now purchase the Hardball Times Baseball Annual 2013, with 300 pages of great content. It's also available on Amazon and Kindle. Read more about it here.

THT's latest e-book


Third Base: The Crossroads is THT's new e-book, available for $3.99 from the Kindle store. The good news is that anyone can read a Kindle book, even on a PC. So enjoy the best from THT in a new format.

Most Recent Comments





Get your very own THT merchandise from our CafePress store. We've got baseball caps, t-shirts, coffee mugs and even wall clocks with the classy THT logo prominently displayed. Also, check out the THT Bookstore. Please support your favorite baseball site by purchasing something today.



Or you can search by:


Creative Commons License
All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License.
Roll mouse over date for entries
THT Live Calendar
May 2013
S M T W T F S



1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31

Saturday, August 08, 2009

Another look at bimodal distributions

Posted by Colin Wyers
In my last article, I looked at how players regress to the mean, and how players on the borderline between the minors and majors might not have a readily identifiable mean to regress to.

But what if we add in AA players?

It seems that adding AA players does not give us a trimodal distribution; the means for the two leagues (or at least their major league equivelencies) are pretty close to each other. It's possible that this is a selection bias from the way the MLEs are computed, of course. But this also squares pretty well with what we think we know about the difference in league quality, so even if there is such a bias I'm not sure it's large enough to give us a truly bimodal distribution between AA and AAA players.

But while that shifts our second mode over to the left a bit, it also gives us a much larger population of players in the minors than the majors. Here's a helpful illustration, at about 110 PAs:

image

I almost wonder if there's something I'm missing here, though, with my assumptions - if pressed I would guess that in real life the right-side part of the curve on the two distributions line up a lot better than what I'm showing here. There are estimated standard deviations, and so maybe the observed SDs for minor league talent are larger than what I'm showing. I'll have to check into that.



Colin Wyers knows exactly how much of a nerd he is. He is very interested in hearing about any other concerns you may have; you can reach him by e-mail, and he will try his best to respond in a timely fashion. He also blogs at Statistically Speaking.


Comments

The Rabbit said...

What does the curve look like for AA only vs. MLE?  If it’s here somewhere, I’ve missed it.
A THT report earlier this week cited the differences in age at the AAA vs AA levels.
The variation was the greatest (not surprisingly) for contending teams who use AAA as an expanded ML roster for injuries, meltdowns, etc. and are, therefore, populated with older, more experienced players. These players may actually be in the majors if signed to another team.
I would expect a combination of AA/AAA to be slightly skewed given the “philosophical” differences in the purpose of the AAA teams.

Posted 08/08  at  02:30 PM
Colin Wyers said...

You want to see the AA and AAA teams on seperate curves?

Frankly, while I can (and have) graphed that, its really not interesting - I’d say somewhere between 85 and 95 percent of the two cuves overlap. That’s why I feel comfortable graphing that the way I did - we really don’t need to know whether or not a guy is in AA or AAA to regress him to the proper mean, so long as what we are looking at are his translated stats, rather than his actual stats.

Now, of course this relies on the assumption that the DTs are capturing the correct translated means for each league - if the difference in league quality is substantially larger than what the DTs are saying, than this is wrong.

Posted 08/08  at  05:09 PM
The Rabbit said...

Thanks for your response.
Nope, didn’t particularly want to see the curves, but assumed you must have done it.
I’m always curious about the nature of the underlying data…It’s a curse from my career (thankfully, retired) in financial analysis.

Posted 08/08  at  06:16 PM
Page 1 of 1

Leave a comment:

Commenting is not available in this weblog entry.