Gears of War III: Evaluating Pitching
We talked last time about how we can calculate the number of wins based on the number of runs created or produced. If you missed it, the equation is pretty simple:
Wins ~ Runs/10.
The "~" sign there means "approximately," and that's an approximation because (amongst other things) the pitcher affects the run-scoring environment. Think of it this way: If all baseball games were won 1-0, then each run would be worth a win. If all baseball games were won 100-99, it would take a LOT of runs to increase your team's win total. So the number of runs/game affects the number of runs it takes to generate a win. Because a pitcher will have the ball in his hands for half the time he's in the game, he has a large influence on the run-scoring environment. In other words, good pitchers change the game so that each run becomes more valuable. This is part of the reason why managers are more likely to make sacrifices in games involving two great pitchers. Now that we know this, we can figure out how many wins a pitcher contributes to a team by calculating the number of runs they save compared to some baseline or compared to some other player, and calculating how many wins those prevented runs are worth. Follow me below the fold for more on how this works.
We use replacement-level production for the baseline in WAR (it's the "R" in WAR, Wins Above Replacement), with "replacement-level" being defined as the expected contribution a team could get from a readily available player making league-minimum. You can think of replacement-level players as the ones that "ride the Iowa-Chicago shuttle" as they get promoted when needed and demoted when their arm (or Lou's patience) is used up. We use this baseline because it represents the minimum investment the team could make to replace that player's innings. Now, there are particular cases where a team has replacements that are above or below this definition of "replacement-level." (Hopefully, Starlin Castro serves as an example of an "above replacement-level replacement.") When that is the case, you measure both players - the one "being replaced" and the one "doing the replacing" from the same baseline, so that when you take the difference between the two the baseline cancels out. When you are calculating the baseline you also make corrections for the environment in which the pitcher plays in, including their league, role, and ballpark. A pitcher that allows 4.0 runs/game pitching half his games at Great America Ballpark is doing a better job than a pitcher that allows 4.0 runs/game for pitching half his games at Wrigley Field, because more runs are scored at Great America Ballpark. Likewise, a pitcher that allows 4.0 runs/game in the American League is doing a better job than a pitcher that allows 4.0 runs/game in the National League, both because the American League is better AND because the run-scoring environment is higher. Finally, a starter that allows 4.0 runs/game is considered more valuable than a reliever that allows 4.0 runs game, because the starter has to pitch more innings, which means a larger repertoire of pitches and the need to "save himself" for later innings and tense situations. Thus, "replacement-level" means different things for different pitchers, depending on their role and their environment.
We also have to decide what metric to use when evaluating the pitcher's performance. Since a pitcher's job is to prevent runs, usually some metric is used that measures runs allowed compared to the replacement-level baseline discussed above. But we don't normally use ERA or even RA (total runs allowed/9 IP). Why not? Well, we're trying to get a handle on the pitcher's ability to prevent runs, not the team the pitcher is playing on. In other words, we want to isolate the pitcher's ability to prevent runs from the run-preventing capabilities of his defense. To do this, we use a metric called FIP (Fielding Independent Pitching) that translates the allowed home runs, strikeouts, and walks a into a runs-allowed statistic. By limiting ourselves to plays that the defense is NOT involved in, we can do a decent job of isolating the pitcher's run-preventing abilities. You may also want to account for good or bad luck in the number of home runs the pitcher has allowed given their fly ball rate. For this, we use a metric called xFIP, ("expected" FIP). An example of a pitcher whose xFIP was much lower than his FIP was Rich Harden during the first half of 2009. Based on that, you would have expected him to have improved in the 2nd half of the season... just as he did. (Quick preview of next time: Carlos Zambrano's FIP is much higher than his xFIP. Carlos Silva's FIP is much lower than his xFIP. Guess which one we can expect to get worse as we go forward? Hint: THE ONE GETTING MORE INNINGS.)
Got all that? If not, I'll break it down into steps:
1.) pick your metric. Usually, this is FIP or xFIP
2.) calculate your baseline, based on the environment the pitcher is in and their role (reliever or starter)
3.) calculate their runs allowed, subtract the baseline runs allowed, and divide by the number of runs/win for the given run environment. That'll give you the player's WAR. If you want to, this is also the appropriate time to apply leverage for relievers (we'll discuss this next time).
4.) go on the interwebs and complain that the Cubs are severely under-utilizing their 2nd-best pitcher
If you're still not getting it, my next post may help. Next time I will discuss the Cubs' recent, panicked move of Zambrano to the bullpen as an example of how to apply pitching WAR to real-world scenarios. I hope to show you all why many of us flipped out at this, and why I called it "cosmically stupid." I also hope to show how a move of Zambrano to the bullpen could actually make the team better. (Hint: Basically, the opposite of the way they're doing it!) Stay tuned...
16 comments
|
0 recs |
Do you like this story?
Comments
Nice post. Also, it took my mind off last night's game.
I’ll read it more in depth after I have my coffee.
"I'm not much of a chemistry guy, you know. Chemistry to me is a pinch-hit double with the bases loaded"--Jim Frey, Chicago Tribune, 1985.
agreed
I dont know how you wrote that at 4:30 am. I’ll read it again after some coffee. I like how the game can be described and even predicted somewhat accurately with the numbers alone. I guess with a sport as old as baseball with a century of stats to sort through patterns become obvious.
Just because I don't care doesn't mean I don't understand. - Homer J Simpson
Of course, there's always the problem of one's assumptions.
That’s the weakness of all these baseball statistics. For example, what kind of stochastic process will you be assuming for your Monte-Carlo calculations? A Weiner (random-walk) process based on a normal distribution? An Ornstein-Uhlenbeck process based on a drifting random-walk (which would account for a player slowly losing or gaining his talents based on age, for example)? The devil is in the details.
"I'm not much of a chemistry guy, you know. Chemistry to me is a pinch-hit double with the bases loaded"--Jim Frey, Chicago Tribune, 1985.
Well for one...
I wrote it here in Seattle, where it was only 2:30.
For two, I wrote most of this the day Z was moved to the pen. I then struggled for two weeks getting it short enough for one story, and last night gave up and split it in two.
Two things are infinite: the universe and human stupidity; and I'm not sure about the the universe. - Albert Einstein
by Shawn Domagal-Goldman on May 9, 2010 11:18 AM CDT up reply actions
They've used Z 5 times since April 20, for 6 IP.
And only half of those relief innings really mattered at all.
And in the meantime, the rotation has regressed significantly.
And I’d bet a sizeable amount that this team has absolutely no idea what the plan for Z is moving forward.
They would have gotten essentially as much production from letting Z sit at home and eat Funyuns for the last 3 weeks.
And people wonder why we get frustrated when the Cubs repeatedly make decisions that are “unconventional” at best, “cosmically stupid” at worst.
MLBMilestone.com - following the numbers to Cooperstown
Got any numbers to back up that "rotation has regressed significantly" comment?
Except for a couple of games, my personal recollection is that the starters have done quite well. Don’t have time to look it up. Anyone?
"You can observe a lot just by watching." ~ Yogi Berra
I posted it below. Sorry.
Last 2 weeks have been brutal, both in median and in mean production.
MLBMilestone.com - following the numbers to Cooperstown
Right. And the big issue is...
the team should have seen the regression coming from Silva.
Two things are infinite: the universe and human stupidity; and I'm not sure about the the universe. - Albert Einstein
by Shawn Domagal-Goldman on May 9, 2010 11:19 AM CDT up reply actions
Since Z's first bullpen appearance everyone has 3 starts.
Dempster has remained awesome – 2.78 ERA in 3 starts, 22.2 IP, 7 ER. (3 good starts)
Gorzo has been very good – 3.15 ERA in 3 starts, 20 IP, 7 ER. (3 good starts)
Silva has posted a 6.35 ERA in 3 starts, 17 IP, 12 ER. (2 poor starts, 1 good start.)
Wells has posted a 7.80 ERA in 3 starts, 15 IP, 13 ER. (2 poor starts, 1 good start.)
Lilly has posted a 5.29 ERA in 2 starts, 17 IP. (2 poor starts, 1 excellent start.)
More to my point about going from “awesome to adequate to ‘kinda crappy’” since 4/20, Silva, Wells and Lilly have all had 2 straight poor starts.
In the last 15 starts, we have 9 good, 6 poor. 91 IP, averaging 6 IP/start.
But in the last 10 starts, we have 6 poor, and only 2 good (all Demp/Gorzo). The starters’ collective ERA for the last 2 turns is ridiculously bad.
Demp 15 IP, 6 ER, 3.80 ERA
Wells 8 IP, 12 ER, 12.38 ERA
Silva 10 IP, 9 ER, 8.10 ERA
Gorzo 13 IP, 5 ER, 3.46 ERA
Lilly 11 IP, 10 ER, 8.18 ERA
Any way you slice it, it’s been a crappy run for the starters over the last 2 weeks. 57 IP, 42 ER. 6.63 ERA.
MLBMilestone.com - following the numbers to Cooperstown
can you show us that the cause of the decline is due to moving Z to the pen?
I always turn to the sports section first. The sports page records people's accomplishments; the front page has nothing but man's failures.
~Earl Warren
by lookingdeadred on May 9, 2010 11:23 AM CDT up reply actions
Of course not.
I can only show that 3/5 of the rotation has floundered while Z has been in the pen.
MLBMilestone.com - following the numbers to Cooperstown
There doesn't have to be causality.
The point is the rotation would be better if Z were in it.
Two things are infinite: the universe and human stupidity; and I'm not sure about the the universe. - Albert Einstein
by Shawn Domagal-Goldman on May 9, 2010 2:29 PM CDT up reply actions
Quibble with your Wells numbers.
I would call it one good, one mediocre, one poor.
"You can observe a lot just by watching." ~ Yogi Berra
6 IP, 5 ER is not mediocre. It's awful.
That’s a 7.50 ERA.
He had 8 punchouts in that game, but he was getting knocked around pretty brutally – 6 extra base hits in 6 innings. Those 5 ER were earned.
MLBMilestone.com - following the numbers to Cooperstown
i say we put Z back in the rotation. We hardly use him in the pen anyways so it is better off starting him. I think lou is gonna have to do what he doesnt like to do and make a big decision. I really think the zambrono pen was the front office and same with castro being called up. So maybe the rickets or jim hendry need to make the decision and tell pinella that zambrono is back in the rotation and get off there buts and find another avalible pitcher. I say offer grabow and gray or ber and the losser of the gorz silva battle for 2 relievers one lefty one righty although we dont need anohter lefty because we have quit a few so.
by Adam Tarasievich on May 9, 2010 7:12 PM CDT reply actions
Re
One large problem with this approach is that some pitchers pitch to contact, so they need to be evaluated in terms of whether or not they successfully induce ground balls. Eliminating events that the pitcher initiates is hardly a fair methodology; far better would be to adjust the results for errors. That way, a pitcher who consistently causes the batter to hit a ground ball at the defense is rewarded in the analysis for doing so. There’s far more to being a pitcher than not walking batters, not giving up home runs, and striking batters out. In fact, there are times when walking the batter is an acceptable outcome.
Visit The Digital Gazette

by 




















