The Futility Infielder

A Baseball Journal by Jay Jaffe I'm a baseball fan living in New York City. In between long tirades about the New York Yankees and the national pastime in general, I'm a graphic designer.

Sunday, November 23, 2003

 

Another Rich Interview

Rich's Weekend Baseball Beat continues its fine interview series this weekend, checking in with David Pinto of the prolific Baseball Musings blog. What's most interesting about Pinto is that he has a lot more professional experience around baseball than most bloggers. He built software for Stats, Inc., working with Bill James and John Dewan and helping to develop the Zone Rating system. He worked as the head of research for ESPN's Baseball Tonight TV show, creating graphics and putting numbers at the fingertips of the show's hosts. He numbers Peter Gammons and Rob Neyer among his friends. Recently his name even came up in connection with an opening for a stat analysit in the New York Mets front office, though the job apparently went to somebody else. Rich Lederer touches bases with Pinto on all of these topics in his interview.

In addition to his short takes on the news of the day, Pinto has been working on a much larger and more complex project related to defensive statistics which he calls "Probabilistic Model of Range". Here's how he describes his work to Lederer:
Range is the Holy Grail of baseball stats. We all have a feeling for what range represents, but it's really difficult to pin down with a number. Plays per game, plays per nine innings, and zone ratings were all attempts at measuring range, and they all have their flaws. UZR was the first probabilistic model that I know of. It looked at the probability of making a play in a particular zone (area) on the field. Mine is similar to that, although I eliminate the idea of a zone.

Basically, there is a probability distribution of balls put into play. The normal position of fielders should be where those probabilities are densest; in other words, the shortstop should stand where the most ground balls are hit in his area of responsibility. Ground balls hit in the densest region should be easier to field because that's where the SS is usually standing. So if you field a ball there it's no big deal, everyone does that. But as you move left or right from the region of highest density, the balls are more likely to get through for hits. So a SS who consistently fields those balls well should get more credit than someone who doesn't. So the probabilistic model of range tries to model these probabilities and assign them to fielders based on where balls are hit.
For the uninitiated, UZR stands for Ultimate Zone Rating, a system by Mitchell Lichtman which examines defense using play-by-play data including the location and speed of batted balls. Basically, what both Lichtman's and Pinto's systems are asking is, What is the probability of a batted ball becoming an out, given the parameters (direction, how hard, and type) of that batted ball? From Pinto's blog:
I've used the STATS, Inc. database to obtain three parameters for each ball; its direction (a slice of pie fanning out from home plate), its batted type (ground, fly, line, bunt or pop) and how hard the ball was hit (soft, medium or hard). I then did a maximum likelihood estimate of the probability of an out given those three parameters for each of the nine fielders.
In a follow-up post, Pinto explains the difference between the two systems. Ultimately, work such as this will give us a better understanding of just how much influence a pitcher has in influencing the outcome of a ball in play, expanding upon the work of DIPS inventor Voros McCracken.

Pinto is definitely a prominent figure in the world of baseball blogging, one who's clearly got the skills to be employed inside the game. Catch up with him before some team entices him to put his number-crunching skills to work for them.

Comments: Post a Comment

Subscribe to Post Comments [Atom]





<< Home

Archives

June 2001   July 2001   August 2001   September 2001   October 2001   November 2001   December 2001   January 2002   February 2002   March 2002   April 2002   May 2002   June 2002   July 2002   August 2002   September 2002   October 2002   November 2002   December 2002   January 2003   February 2003   March 2003   April 2003   May 2003   June 2003   July 2003   August 2003   September 2003   October 2003   November 2003   December 2003   January 2004   February 2004   March 2004   April 2004   May 2004   June 2004   July 2004   August 2004   September 2004   October 2004   November 2004   December 2004   January 2005   February 2005   March 2005   April 2005   May 2005   June 2005   July 2005   August 2005   September 2005   October 2005   November 2005   December 2005   January 2006   February 2006   March 2006   April 2006   May 2006   June 2006   July 2006   August 2006   September 2006   October 2006   November 2006   December 2006   January 2007   February 2007   March 2007   April 2007   May 2007   June 2007   July 2007   August 2007   September 2007   October 2007   November 2007   December 2007   January 2008   February 2008   March 2008   April 2008   May 2008   June 2008   July 2008   August 2008   September 2008   October 2008   November 2008   December 2008   January 2009   February 2009   March 2009   April 2009   May 2009   June 2009   July 2009   August 2009   September 2009   October 2009   November 2009   December 2009   January 2010   February 2010   March 2010   April 2010   May 2010  

This page is powered by Blogger. Isn't yours?

Subscribe to Posts [Atom]