May 25, 2012

Now Available for 2012


THT Essentials:

Now Available



The Hardball Times Baseball Annual 2012, an annual "must buy" for all baseball fans, is now shipping. Read this article to learn more about it.
Fangraphs Player Search:

THT's latest e-book


Third Base: The Crossroads is THT's new e-book, available for $3.99 from the Kindle store. The good news is that anyone can read a Kindle book, even on a PC. So enjoy the best from THT in a new format.

Most Recent Comments




And here's the full roster.

Dish TV Packages options for all televised baseball games.



Or you can search by:

Sports Tickets






Championship Tickets


Creative Commons License
All content on this site (including text, graphs, and any other original works), unless otherwise noted, is licensed under a Creative Commons License.

Part of the USA Today Sports Media Group
Roll mouse over date for entries
THT Live Calendar
May 2012
S M T W T F S


1 2 3 4 5
6 7 8 9 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31

Friday, May 07, 2010

Visual Baseball:  Introducing the BatCode

Posted by Kevin Dame
Happy Friday. Here's an experiment that I'd love some feedback on. I'm using the barcode metaphor to visualize a hitter's performance for each at-bat. Line thickness represents the type of hit (thinnest for walks and singles, thickest for homeruns) with each at-bat separated by a space. Big white areas represent long stretches of outs. Lots of lines represent a string of hits, with thicker patches representing power. Let's take BatCode for a spin.

Here's one of the league's best hitters - Robinson Cano - for the month of April. Pretty consistent and with lots of power throughout.

image

Here's a player - Curtis Granderson - who got off to a hot start but has really tailed off (both in terms of OBP and power).

image

And here's David Ortiz' entire 2009 season with all the ups and downs.

image

Click the link below to see more, including historic seasons of Bonds, Ichiro and Sosa.


I got a request for Barry Bonds' 2001 season, Ichiro's 2004 season, and Sammy Sosa's 1998 season. These BatCodes are pretty remarkable (albeit steroid-laden for two out of the three):

image

image

image

And here are a few more April 2010 BatCodes. What insight do you get from scanning them?

image

image

image

image


"The commonality between science and art is in trying to see profoundly - to develop strategies of seeing and showing." - Edward Tufte.

Feel free to send comments, questions, and suggestions to Kevin via email.



Comments

JK said...

So if a player hits 4 singles in a row does it look just as thick as a home run?

Also, I don’t see the link to click for more.

Posted 05/07  at  04:26 PM
Kevin Dame said...

Hi JK. 

There’s a space between each at bat.  So 4 singles in a row would look like 4 thin lines with spacing in between. If you look at Cano’s BatCode you’ll see he started the season with an out, a double, and a single consecutively. 

I’ve been able to automate these in Excel (which has made it easier and faster to crank out more) but there’s a limitation in Excel.  For thicker lines it doesn’t preserve as much spacing as with thinner lines.  For example, with back to back homers the spacing between at bats is harder to see than for back to back singles.  So there’s some distortion to how this is visualized, although I’d say overall it communicates the basics.  If there’s interest in this as a visual medium I’d want to solve this spacing problem somehow.

There’s a red link that says “click for more” at the bottom of the post.  Click that and you’ll see several more BatCodes…

Posted 05/07  at  04:59 PM
JK said...

Still not seeing the link. I really like this though. It seems that this would be much more helpful than seeing that somebody hit .278 over a season, when that season is probably filled with hot-streaks and cold-streaks and will give a very good visual impression of how consistent a batter is.

I assume this is only going to show hits, and not walks/errors/stolen first bases. Very cool though.

Posted 05/07  at  05:20 PM
Kevin Dame said...

Thanks JK.  If anyone else is having trouble seeing the red “click for more” link, let me know.  Not sure what’s up with that…

Posted 05/07  at  05:33 PM
Kevin Dame said...

Another option (to see more) is to click on the post on the left side of the site under “Most Recent Comments” and you’ll see the entire post.

Posted 05/07  at  05:35 PM
JK said...

Ok, the problem was I was seeing them all from the get-go. So the link wasn’t there, unless you go to THT-Daily. So I was seeing the result of the links to start with…

Anyway, I think it would be really cool to take a streaky player and do year over year to see if there are noticable patterns.

Posted 05/07  at  05:41 PM
Benjamin said...

Interesting stuff! Can’t wait to see a barcode reader that translates them back again.. wink

Just a cosmetic tip though, you should also include a mark that shows where the month/barcode ends. So you can see how many at bats a player has had during the month, that or make the graph as big as the number of at bats so you can still place them together like you did with Ortiz.

Can’t see the click more link either, got a Puljols barcode back there..?

Posted 05/07  at  05:52 PM
JayT said...

This is a cool way to quickly see hot and cold streaks.

I’d love to see Bonds’ 73 homer season, Sosa’s 20 homer June in 1998, and Ichiro’s 262 hit (with 224 singles!) 2004.

Posted 05/07  at  06:04 PM
David Gassko said...

This is awesome!

Posted 05/07  at  07:09 PM
Dave Studeman said...

I agree with David. These are awesome, Kevin.

You don’t have to say “click link for more.”  If there is more, our site says it automatically but only on THT Live.

Posted 05/07  at  07:31 PM
david moon said...

super idea.  you get a quick overview of streaks, power and hitting.  in a sport where stats have become almost intrusive to to the information they’re suppose to clarify, this system synthisizes everything into a quick snapshot of a streak, season, career.  no matter if the lines lose their individual integrity, its a useful tool.

Posted 05/08  at  07:58 AM
Dave Studeman said...

One thought after staring at these for a few seconds: It’s hard to judge these unless you’re looking at several at the same time, because you don’t know what to think of the white space.

I think the full season codes are best (or, three months on a “line”) because there is less wasted white space.  So I’d suggest picking a set dimension (like three months on a line) and keeping to it, even if less time has been played.  You could “gray” the time remaining for players who have only played one year so far.

Posted 05/08  at  10:41 AM
david moon said...

yeah, i’ve been thinking about the white space. does it represent a player scuffling or a dl stint or a trip to the bus leagues. easy fix though; red blocks could represent the dl and..say…green could represent the minors or other designation. then the white space would truely represent a major league slump period. and, yes, i know that is an incorrect use of the semicolon.

Posted 05/08  at  12:49 PM
Page 1 of 1

Leave a comment:

Commenting is not available in this weblog entry.