Baseball Musings: Probabilistic Model of Range, 2007, Teams

November 04, 2007

Probabilistic Model of Range, 2007, Teams

Baseball Info Solutions sent me their final stats for 2007 over the weekend. That means it's time to start presenting the 2007 Probabilistic Model or Range. If you're new to this, you can find explanations in this archive. Basically, for each fieldable (non inside the park home runs) ball put in play, six parameters are used to determine how difficult it was to field the ball. A probability of turning the ball into an out is calculated, and those probabilities are summed. That gives us expected batted balls turned into outs. We turn that into a predicted DER (defensive efficiency record), compare that to the actual DER and calculate a ranking.

The model is based primarily on visiting player data, smoothed, distance on fly balls. Only 2007 data was used to construct the model.

Note that a team can post a poor DER during the season, but do well in this model if the balls put into play were extremely difficult to field. In fact, the team ranked first in 2007 is a bit of a surprise for that very reason.

Probabilistic Model of Range, 2007 Data, Teams, Visit Smooth Distance Model, Ranked by Difference
Team	In Play	Actual Outs	Predicted Outs	DER	Predicted DER	Difference
Yankees	4511	3103	3041.46	0.688	0.674	0.01364
Red Sox	4226	2974	2919.61	0.704	0.691	0.01287
Cubs	4177	2943	2895.51	0.705	0.693	0.01137
Blue Jays	4349	3060	3017.22	0.704	0.694	0.00984
Royals	4528	3093	3058.20	0.683	0.675	0.00768
Angels	4325	2930	2900.79	0.677	0.671	0.00675
Phillies	4505	3085	3056.00	0.685	0.678	0.00644
Rockies	4599	3221	3195.95	0.700	0.695	0.00545
Tigers	4486	3094	3072.58	0.690	0.685	0.00477
Braves	4404	3069	3048.96	0.697	0.692	0.00455
Mets	4362	3050	3033.08	0.699	0.695	0.00388
Giants	4467	3108	3096.80	0.696	0.693	0.00251
Orioles	4403	3017	3006.12	0.685	0.683	0.00247
Rangers	4518	3071	3061.36	0.680	0.678	0.00213
Nationals	4591	3198	3191.04	0.697	0.695	0.00152
Indians	4548	3112	3107.26	0.684	0.683	0.00104
Padres	4476	3131	3128.60	0.700	0.699	0.00054
Mariners	4535	3050	3051.99	0.673	0.673	-0.00044
Diamondbacks	4351	3013	3016.84	0.692	0.693	-0.00088
Dodgers	4310	2942	2945.91	0.683	0.684	-0.00091
Cardinals	4587	3150	3154.99	0.687	0.688	-0.00109
Twins	4384	3003	3014.01	0.685	0.688	-0.00251
Astros	4530	3099	3120.86	0.684	0.689	-0.00483
Reds	4533	3068	3096.08	0.677	0.683	-0.00619
Pirates	4608	3099	3132.67	0.673	0.680	-0.00731
Athletics	4499	3110	3144.35	0.691	0.699	-0.00763
Brewers	4392	2966	3011.82	0.675	0.686	-0.01043
White Sox	4545	3089	3141.16	0.680	0.691	-0.01148
Marlins	4491	2962	3039.28	0.660	0.677	-0.01721
Devil Rays	4378	2867	2943.31	0.655	0.672	-0.01743

That's right, the Yankees are number one. Without running the individual numbers, I'm guessing that a full season of Melky Cabrera and keeping Giambi off first really helped. The Red Sox defense turned a higher percentage of their balls in play into outs, but they also were given easier balls to field in general.

I wondered why the Tampa Bay pitching staff did so poorly with the high number of strikeouts they collected, and the reason is clear in these numbers. The Devil Rays defense was horrible. In fact, the state of Florida just can't play defense, with the Marlins ranking 29th in the majors.

For the second year in a row, the Kansas City Royals look a lot better than their posted DER. If they ever get a good set of pitchers on that team, they're going to post a low ERA.

For those of you who prefer a ranking by ratio of DER/Predicted DER, here's the table with that data.

Probabilistic Model of Range, 2007 Data, Teams, Visit Smooth Distance Model, Ranked by Difference
Team	In Play	Actual Outs	Predicted Outs	DER	Predicted DER	Ratio
Yankees	4511	3103	3041.46	0.688	0.674	102.02
Red Sox	4226	2974	2919.61	0.704	0.691	101.86
Cubs	4177	2943	2895.51	0.705	0.693	101.64
Blue Jays	4349	3060	3017.22	0.704	0.694	101.42
Royals	4528	3093	3058.20	0.683	0.675	101.14
Angels	4325	2930	2900.79	0.677	0.671	101.01
Phillies	4505	3085	3056.00	0.685	0.678	100.95
Rockies	4599	3221	3195.95	0.700	0.695	100.78
Tigers	4486	3094	3072.58	0.690	0.685	100.70
Braves	4404	3069	3048.96	0.697	0.692	100.66
Mets	4362	3050	3033.08	0.699	0.695	100.56
Orioles	4403	3017	3006.12	0.685	0.683	100.36
Giants	4467	3108	3096.80	0.696	0.693	100.36
Rangers	4518	3071	3061.36	0.680	0.678	100.31
Nationals	4591	3198	3191.04	0.697	0.695	100.22
Indians	4548	3112	3107.26	0.684	0.683	100.15
Padres	4476	3131	3128.60	0.700	0.699	100.08
Mariners	4535	3050	3051.99	0.673	0.673	99.93
Diamondbacks	4351	3013	3016.84	0.692	0.693	99.87
Dodgers	4310	2942	2945.91	0.683	0.684	99.87
Cardinals	4587	3150	3154.99	0.687	0.688	99.84
Twins	4384	3003	3014.01	0.685	0.688	99.63
Astros	4530	3099	3120.86	0.684	0.689	99.30
Reds	4533	3068	3096.08	0.677	0.683	99.09
Pirates	4608	3099	3132.67	0.673	0.680	98.93
Athletics	4499	3110	3144.35	0.691	0.699	98.91
Brewers	4392	2966	3011.82	0.675	0.686	98.48
White Sox	4545	3089	3141.16	0.680	0.691	98.34
Marlins	4491	2962	3039.28	0.660	0.677	97.46
Devil Rays	4378	2867	2943.31	0.655	0.672	97.41

Posted by David Pinto at 08:40 PM | Defense • | Probabilistic Model of Range | TrackBack (0)

Comments

This just makes me believe we are still not close with defensive metrics. The Yankees simply are not a good fielding team.

Would love to see how these match up with Tango's "Scouting Report"

Posted by: Lou Poulas at November 4, 2007 11:39 PM

This just makes me believe we are finally close with defensive metrics. The Yankees simply are a good fielding team.

(After years of ineptitude, of course.)

Posted by: dzop at November 5, 2007 01:09 AM

This just makes me deeply unsure whether we are finally close with defensive metrics. After all these years of reading about Derek Jeter's shortcomings on this blog, I'm just not convinced the Yankees are a good fielding team.

I still find it difficult to accept that all of the variables that go into fielding can be reduced to a formula. With all respect, I suspect the results are a little like the SAT scores, which really only measure how well a kid does on SAT exams, a measurement that correlates only roughly to performance in college. In something of the same way, these stats really only measure how well a team does by various measures that are inconsequential in themselves, and it's a leap of faith - how much of a leap is the question - whether that has any correlation to the actual manufacture of outs.

The teams at the bottom had pretty bad years, which suggests some level of correlation. But two and a half (the Padres) playoff teams are in the second half of the list, which makes me a bit dubious.

Posted by: JJ at November 5, 2007 02:05 AM

I think it's funny that faced with David's analysis of reams of data, the main response is that since it doesn't match up with our impressions, it's the analysis that must be wrong! Come on. Look at the methodology and say what's wrong with the method, or else have some integrity and admit that your *impressions* were mistaken.

Look, Jeter's range up the middle is not good. But Cano and Rodriguez were great, and the Yankees' OF (esp. Cabrera) were excellent. You can probably find some details of the Probabilistic Model of Range to quibble over, but even haters should admit that the Yankees were *among* the top-fielding teams last year.

Posted by: James at November 5, 2007 06:32 AM

At the team level, this model concludes that the difference between the best fielding team and the worst fielding team was about 0.85 plays a game (measured against their predicted plays). What it really suggests is that there is really very little overall difference between teams, and that it should difficult to tell them apart based on observation.

Posted by: joe arthur at November 5, 2007 07:15 AM

David - how do park factors come into play with this? Could the high grass at Yankee Stadium be helping the stats for the Yankees? They do play 81 games there. I've always suspected that the grass helps Wang's home/road splits. Could it be helping the Yanks, as a team, here too?

Posted by: Steve Lombardi at November 5, 2007 08:57 AM

Yes, park is a factor.

Posted by: David Pinto at November 5, 2007 08:59 AM

So this rates the Yankees as the 13th best real-life fielding team in baseball, when they were on this model expected to be the 27th "best."

So they were most successful team in "lucking" into outs.

Big deal! All you Yankee suck-ups need to reread the table - or else realize you're beating your chests about outperforming analytical expectations simply in order to become average.

Woo woo - Yankees are average, rock on!!!!

Posted by: Freddie at November 5, 2007 09:23 AM

Hey wait a second, is it any coincidence that the Big Three are the top three here. Any chance the Yankers, Bosux, and Cubs are simply benefitting from bad umpiring here - seems like those three teams get more calls going their way than any other teams in baseball anyway. A call a game and boom your actual DER outperforms the predicted.

When is anyone gonna publish umpire evaluations of this nature?

Posted by: Constantino at November 5, 2007 09:29 AM

Ok maybe not a call a game but bad calls even at a lower rate could still explain the discrepancy que no.

Posted by: Constantino at November 5, 2007 09:30 AM

I don't think it's hard to believe the results from the model - IF you are willing to accept the underlying assumptions surrounding "predicted DER".

The Yankees gave a lot of innings to guys who were pretty hittable: Igawa, Clippard, Henn, Farnsworth, DeSalvo, Rasner. Clemens is showing signs of being near the end of the road, which is reflected in his K rate (by far the lowest of his career); it's not hard to believe that batters are making contact more consistently and with more authority against him. Ditto Pettitte, ditto Mussina.

Posted by: Mike Emeigh at November 5, 2007 09:45 AM

Freddie, you didn't understand the chart. The predicted DER of .674 for the Yanks implies that that's what an average team of fielders, given the Yankees context (batted ball distribution, park, etc) would have achieved. Basically, it's real hard to get an out with the Yankees pitchers and playing half your games at Yankee stadium.

Anyway, that's what PMR is saying. Whether that's true is another question.

Posted by: tangotiger at November 5, 2007 09:53 AM

"I think it's funny that faced with David's analysis of reams of data, the main response is that since it doesn't match up with our impressions, it's the analysis that must be wrong!"

Maybe the impressions are right. PMR is not the only pbp defense analysis out there. When the advanced metrics don't agree then we either have to pick one and trust it implicitly, or go by our impressions.

For the Yanks, here's how certain players did on the John Dewan plus/minus leaders and trailers:

Jeter -34
Melky -22
Abreu -12

Cano +17

If I assume the maximum possible rating for the others: A-Rod didn't make the leader or trailer list, so if the #10 3B is +7, then A-Rod could be at most a +6. So:

A-Rod +6
Damon +7
1B +1

That's -37 for the starters, so plus minus does not see the Yankees as the best defensive team, or anywhere near it. I'm pretty sure UZR did not either.

If your inititial impression was that the Yankees are poor defenders, then you should stick with it unless you have some reason to believe PMR is more accurate than the other systems.

Posted by: Rally at November 5, 2007 09:57 AM

Well, for what it's worth I think PMR does a much better job at taking the actual distribution of balls in play allowed by a team, and modeling defensive expectations against that distribution instead of against a league-average distribution, than either plus/minus or UZR do.

Posted by: Mike Emeigh at November 5, 2007 10:06 AM

Rally, I have no problem with bringing in alternative measures to contest David Pinto's. I was laughing at the idea that one's impressions are a good rebuttal -- that's ludicrous, it's Joe Morganesque.
So, let's ask what the Dewan measure is measuring, and what the Pinto PMR formula is measuring, and then decide which of those better gets at the true defensive ability of a team. (Keeping in mind JJ's point way up near the top of the comments, to the effect that there could just be several different things we could mean by 'defensive ability'.)

If a team's fielders see a lot of very difficult balls to field, should that go into the model? I would say it should. PMR is much more comprehensive than other models in incorporate the difficulty of batted balls in play. So I'll tend to trust PMR when it conflicts with other measures.

Posted by: James at November 5, 2007 10:20 AM

I'd like to see the balls in air/balls on ground distributions. In the absence of hang-time measurements, the ball-in-air type decision is quite subjective and of significant importance.

I am pretty sure that the Yankee infield was far from the best in the league, but as for the outfield, it could have been quite good. There were two centerfielders out there.

This is the "visiting player model". It may be that Yankee opponents fielded fewer balls in play than anticipated, in part because of the park but also because of the quality of the Yankee offence. It will be interesting to see what the results are using the other models.

Posted by: Mike Green at November 5, 2007 10:22 AM

.85 plays per game is definitely a big deal -- that's about .7 runs per game. If league-average ERA is 4.50, that bumps the best teams up to a 4.15 ERA and the worst teams down to 4.85. Sure, that's not as big a difference as hitting or pitching, but I think we already knew that. Defense is important.

Posted by: Sky at November 5, 2007 10:41 AM

Rally,

Melky -22? I'll have to run CF first when I do players. I find that tough to believe. Melky ranked very high last season in PMR among LF. I suppose he could be lost in CF, but he didn't look that bad.

Posted by: David Pinto at November 5, 2007 10:53 AM

It is a little hard to believe, but here it is:

http://www.billjamesonline.net/fieldingbible/2007-plus-minus-leaders.asp

Mike Green,

I see a correlation of 0.5 between difficulty of a team's fielding chances and that team's batters out percentage (1-babip). You should expect some correlation because of the ballparks, whether a .5 is too high I don't know.

Posted by: Rally at November 5, 2007 11:08 AM

Interesting model, Mr. Pinto. I'd love to get a look at the data, but I suppose you've got to pay BIS some good cash for that. :-) Mostly, I'd like to see how standardized the classifications for "line drive"/"fly ball"/"etc. and "slow"/"medium"/"fast".

As for some reasons for the differences from these results and those of the plus/minus:

Since plus/minus measures plays made vs. "expected" plays made, a LF with very good range may take a few fly balls that the CF might otherwise get to. Presumably, Damon's presence in LF could account for some of Melky's low score. (OTOH, this would also mean that Damon should have a similarly higher score, which isn't something we saw.)

Posted by: mraver at November 5, 2007 02:11 PM

"Since plus/minus measures plays made vs. "expected" plays"

Its PMR that compares plays made to expected plays. +/- assigns a positive vlaue for making a play and a negative value for not making it, with the magnitude of that value depending on how difficult a chance was.

Posted by: Rally at November 5, 2007 03:11 PM

With PMR, you may see some instances of a player inflating their rating through ball-hogging. In prior year Orlando Hudson did this by grabbing every infield pop fly. But that doesn't matter for team ratings, for the Yankees it really doesn't matter if Melky or Damon makes the catch, as long as it was made.

Posted by: Rally at November 5, 2007 03:14 PM

Does this mean that a team like the Angels had a staff that was one of the easiest to hit or that they play in a relatively easy hitters park or a little of both? Their park factor this year was 105, but I'm not sure how much of that is balls in play and how much is other factors.

Posted by: Jon at November 5, 2007 04:04 PM

Angel hitters hit .337 at home on balls in play, and .294 on the road. The pitchers allowed .260 at home and .273 on the road.

The park didn't change or anything, it hasn't been a hitters park in the past, but for some reason Angel hitters went wild at home this year.

Posted by: Rally at November 5, 2007 04:49 PM

Jon, that's something I've tried to tease out, but I haven't come up with a good way to do it. I think it is a combination of both.

Posted by: David Pinto at November 5, 2007 04:50 PM

I think Mike Green and Rally have identified a serious problem with the visitor-dominated model: it effectively lowers the defensive bar for teams whose hitters have a high BABIP. I sorted the teams by predicted DER, and the 8 teams with the lowest predicted DER -- i.e. the teams whose pitchers ostensibly allowed harder to field balls -- had an above-average BABIP in every single case. The 8 teams with the highest predicted DER look just the opposite: 6 of the 8 have below-average BABIP, 1 was average (NYM), and just 1 was above average (CO).

The low predicted DER teams had an average BABIP of .313, while the high predicted DER teams were just .295 (MLB average was .303). All figures from B-Ref.

This can't be a coincidence. What is likely happening is that the model, looking at the experience of visiting fielders in Anaheim trying to stop the rockets hit by Angels hitters (.315 BABIP), concludes "boy, it's hard to turn balls into outs in this park." But down the street in S.D. (.291), or up the road in SF (.281), it looks mighty easy to field those same balls.

Posted by: Guy at November 5, 2007 10:10 PM

Well, Guy, I suggest you take this up with the people who insisted that including the home team polluted the model too much.

Posted by: David Pinto at November 5, 2007 10:35 PM

I can appreciate you may feel a bit whip-sawed by contradictory criticisms, David. But I think both approaches pose real problems. The answer may lie in other directions. For example:

1) use multi-year data to create the model;

2) control for quality of home team hitting -- if visiting fielders turn a given ball in Anaheim into an out at a .72 rate, and the Angels hitters are .020 above-average on BABIP, then the expected DER is .74 rather than .72 on that ball.

3) Use park-specific parameters for the OF, but not the IF (assuming you use them for IFs). I could be wrong, but with turf gone it seems infields are similar enough that any park factor you use probably adds more bias than it removes. This would give you much more robust samples for all IF BIP, and reduce the impact of having an especially good or bad group of home hitters in a ballpark.

Posted by: Guy at November 5, 2007 11:24 PM

Mike Green, clearly Rally wasn't going on the general "impression" that Jeter sucks really bad at shortstop. The general "impression" has him winning gold gloves and considering him good. It is the defensive metrics that have pointed out that he's a joke. Dewan's +- system puts him as the 2nd worst defender in the league after Manny, and those two are blowing away the competition in being atrocious. Maybe you consider it common knowledge that Jeter's a horse's ass in the field because you're knowledgeable about this stuff and discuss it with other such informed people. But really you or me or whoever likely haven't just guesstimated that Jeter's worthless out there. We know it from reading the analysis.

If the Yankees are actually interested in being, like, really good, why have Jeter at ss and weak-hitting plus-defense firstbaseman Doug M??? A sane thing to do would be to play Jeter's broke-ass at 1st base and get a slick fielding ss- the premium defensive position.

Posted by: Brittainator at November 6, 2007 02:34 AM

Also, regarding infields. It made me curious when I heard the Colorado had one of if not THE best fielding groups of infielders, and that Colorado, to help diminish the effect of the hitter-favored park, now has one of, if not THE longest infield grass in baseball.

Does adjusting for park eliminate this long grass effect?

Posted by: Brittainator at November 6, 2007 02:44 AM

Overall, there are 33 more actual outs than predicted outs. Is that just accumulation of rounding errors? The presentation of the difference between actual and predicted DER to 5 decimal places implies that the maximum discrepancy from rounding error should be about 1 when adding up 133,000 balls in play. I believe I suggested in last year's discussion that the visiting team model would lead to a net positive outcome, since (for whatever reason), one symptom of the home field advantage is that home teams' batted balls are hits more often; primarily using the visitors' fielding sets a baseline slightly below the overall (home+visitor) average.

I think Mike, Rally, and Guy's critique amounts to saying that if a team hits particularly well at home (on balls staying in play), then this lowers the visitor's performance and the lowers the expectation for the home team's defense at home, making it easier for them to exceed expectations. In road games, the team's offensive performance will have far less to do with the baseline used to measure its defense. So home and road PMR splits for teams may shed some light on how much of an issue this is. [I calculated a correlation of -0.36 between teams' BA on BIP specifically at home and their overall predicted DER; the top 5 teams in BABIP at home averaged a predicted DER of .679; the bottom 5 teams in BABIP at home averaged a predicted DER of .692. Obviously 1 year of data doesn't prove anything, but it does support their hypothesis.]

With only one year of data used to construct this year's model, there's still a lot of room for luck in setting the baselines at each park. I do expect that this is significant, in comparison to the relatively small margins you currently find between the teams.

Multi year data would help with that, as Guy suggested. Before 2006 you used multi-year data in constructing the baselines for PMR, and I believe you changed to a one year basis last year (because a change in the data feed you received from BIS made prior years' data incompatible). Do you have a technical reason for excluding 2006 data this time around, or have you changed your mind about the propriety of a multi-year baseline?

Posted by: joe arthur at November 6, 2007 06:43 AM

Brittainator, the park is figured into every set of parameters, so if long grass helps defense, that should fall out in the model.

Joe Arthur, there is year to year line drive variation, and that ends up making everyone in a season positive or negative. If 2007 was a better defensive year than 2006, everyone looks good in 2007.

Posted by: David Pinto at November 6, 2007 08:00 AM

David,

can't you use single year values for hit types and multi-year for parks?

Say a line drive at location X is fielded 17% in 2007 and 15% in 2006, a multiyear park factor of .95, just multiply the year values by the park factor.

Posted by: Rally at November 7, 2007 09:23 AM

Coming into this late, but isnt the park factored into the model simply by using the park average out for each play each single year, and not using a league wide average out on each play each single year. So how is David using mulityear park factors with single year values for hit type?

Posted by: Darren at November 7, 2007 05:29 PM