November 22, 2009

Player Search:

Order Now


The Hardball Times Baseball Annual 2010 is now in development and will ship in mid November! This year's book will feature articles by THT's staff as well as Bill James, Tom Tango and Craig Wright. If you use this link to purchase the Annual, you will be in the first group to receive it and you'll be supporting THT.


And here's the full roster.



Or you can search by:

Sports Tickets

Gear up for baseball season with Chicago White Sox tickets and New York Yankees tickets. LA Angels tickets, Houston Astros tickets, and Atlanta Braves tickets are hot sellers! You can get Boston Red Sox tickets, San Diego Padres tickets or Chicago Cubs tickets for your favorite baseball fan. Coast to Coast Tickets has the best MLB tickets like Minnesota Twins tickets, LA Dodgers tickets, Milwaukee Brewers tickets, New York Met tickets and St. Louis Cardinals tickets.
Find premium Chicago Cubs tickets and other Chicago tickets at JustGreatTickets.com.
Chicago Cubs Tickets
Chicago Tickets
Championship Tickets



Creative Commons License
All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License.

Another look at bimodal distributions

by Colin Wyers
August 08, 2009

In my last article, I looked at how players regress to the mean, and how players on the borderline between the minors and majors might not have a readily identifiable mean to regress to.

But what if we add in AA players?

It seems that adding AA players does not give us a trimodal distribution; the means for the two leagues (or at least their major league equivelencies) are pretty close to each other. It's possible that this is a selection bias from the way the MLEs are computed, of course. But this also squares pretty well with what we think we know about the difference in league quality, so even if there is such a bias I'm not sure it's large enough to give us a truly bimodal distribution between AA and AAA players.

But while that shifts our second mode over to the left a bit, it also gives us a much larger population of players in the minors than the majors. Here's a helpful illustration, at about 110 PAs:

image

I almost wonder if there's something I'm missing here, though, with my assumptions - if pressed I would guess that in real life the right-side part of the curve on the two distributions line up a lot better than what I'm showing here. There are estimated standard deviations, and so maybe the observed SDs for minor league talent are larger than what I'm showing. I'll have to check into that.

Colin Wyers knows exactly how much of a nerd he is. He is very interested in hearing about any other concerns you may have; you can reach him by e-mail, and he will try his best to respond in a timely fashion. He also blogs at Statistically Speaking.


The Rabbit said...

What does the curve look like for AA only vs. MLE?  If it’s here somewhere, I’ve missed it.
A THT report earlier this week cited the differences in age at the AAA vs AA levels.
The variation was the greatest (not surprisingly) for contending teams who use AAA as an expanded ML roster for injuries, meltdowns, etc. and are, therefore, populated with older, more experienced players. These players may actually be in the majors if signed to another team.
I would expect a combination of AA/AAA to be slightly skewed given the “philosophical” differences in the purpose of the AAA teams.

Posted 08/08  at  02:30 PM
Colin Wyers said...

You want to see the AA and AAA teams on seperate curves?

Frankly, while I can (and have) graphed that, its really not interesting - I’d say somewhere between 85 and 95 percent of the two cuves overlap. That’s why I feel comfortable graphing that the way I did - we really don’t need to know whether or not a guy is in AA or AAA to regress him to the proper mean, so long as what we are looking at are his translated stats, rather than his actual stats.

Now, of course this relies on the assumption that the DTs are capturing the correct translated means for each league - if the difference in league quality is substantially larger than what the DTs are saying, than this is wrong.

Posted 08/08  at  05:09 PM
The Rabbit said...

Thanks for your response.
Nope, didn’t particularly want to see the curves, but assumed you must have done it.
I’m always curious about the nature of the underlying data…It’s a curse from my career (thankfully, retired) in financial analysis.

Posted 08/08  at  06:16 PM
Page 1 of 1 Commenting is not available in this weblog entry.

Do you have a general question or comment for one of THT's writers? Send it in to our weekly mailbag We also welcome unsolicited op-ed pieces of approximately 500 words for consideration. We reserve the right to edit for length, clarity and consistency of style. Please include your whole name and location to be considered. If you have a comment about this specific article, please email the writer.



The best online source for major league baseball tickets is Ticket City.