Gears of WAR, Part I: Linear Weights
Here begins my introductory series on sabermeric concepts, which is going to use WAR (Wins Above Replacement) as a unifying framework. I plan to use WAR for a few reasons. For one, if you're just going to use one stat for both evaluating past player performance and valuing future player production, it should be WAR. Why does WAR have this prominence as the best single number to know about a player? I can answer this by defining the statistic. Simply put, WAR is the number of wins you'd expect a team to have added to its total due to the presence of a particular player, instead of a replacement from the minor league ranks. Thus, WAR is important for the exact same reason why many people are misled into thinking the W-L record of a pitcher it so important: WAR is measured in the same units as the ultimate measure of success... wins! I hope to talk about this later, but the difference between WAR and W-L records is that WAR succeeds in this goal whereas W-L records fail. W-L records represent a very poor attempt to attribute wins to a pitcher. WAR does this, too... only WAR does it much, much, much better, and more precisely than W-L records do and they have they have the tremendous advantage of being capable of awarding "wins" to position players and pitchers on an equal footing. In order to do this properly, WAR need to be all-encompassing; this necessitates the inclusion of many components. Thus, by framing things with WAR, I can also introduce all of you to a variety of tools and concepts currently adopted for use by much of the sabermetric community: linear weights, wOBA, UZR, FIP, EqBRR, replacement-level, positional adjustments, player valuation, forecasting systems, and, error and uncertainty. This article is the first in a series of 10 parts. (For now... maybe the series will end up being 11 parts? 8 parts? I'm honestly not sure yet... We'll see where this goes.) Anyways, this is Part I: Linear Weights. Jump below the fold to for an introduction to a very central gear in the WAR machine...
Originally, I wanted to make this first story one on error and uncertainty, but I quickly realized I first need to introduce statistics like wOBA and UZR... and before I can explain stats like wOBA or UZR I should first tell you what linear weights are... and before I tell you what linear weights are I should explain what a run expectancy matrix is. So let's start there... run expectancy. Take a look at the following table (taken from Tangotiger's site), which tells you the average runs scored in innings that occurred in the years 1999-2002, starting with given base/out states. The base/out states are a combination of the position of runners, here shown in different rows, and the number of outs in the inning, here shown in different columns. Look below the table for an example on how to use it.
RE 99-02
0
1
2
Empty
0.555
0.297
0.117
1st
0.953
0.573
0.251
2nd
1.189
0.725
0.344
3rd
1.482
0.983
0.387
1st_2nd
1.573
0.971
0.466
1st_3rd
1.904
1.243
0.538
2nd_3rd
2.052
1.467
0.634
Loaded
2.417
1.65
0.815
Let's use the game state with a runner on second and nobody out as an example. If you want to know how many runs were scored (on average) from this game state to the end of the inning, go to the column labeled "0" (representing the number of outs) and look down to where it intersects the row labeled "2nd" (representing the presence of a runner on 2nd). There, you'll find the value "1.189." This means that, on average, teams scored 1.189 runs in the rest of an inning where there were 0 outs and a runner on 2nd. This particular calculation was for the years 1999-2002, but in principle one could calculate this for any given year or range of years, or even with models. (Professionally, I'm a modeler so this makes me smile.) If you assume that the run expectancy hasn't changed much since this table was calculated, this information is very useful. Every base/out combination is in here, so you can figure out how many runs a team is expected to score in the rest of an inning at any point in the game, assuming average production from that team the rest of the inning. Note that run expectancy doesn't take into account things like whether Derrek Lee or Neifi Perez is due up in the inning, or whether Carlos Zambrano or Aaron Heilman is on the mound. It just tells you how many runs a team would be expected to score the rest of the inning, based on what teams have done in the past starting from the same base/run state. It's an expectation based on precedent. Run expectancy is an important concept, as it's the foundation of a great deal of sabermetric products these days, including the (super-cool) win expectancy charts at fangraphs, their measurements of "leverage index," and some cool individual statistics such as "clutch." It's also at the heart of linear weights, a system that places run values on every type of play - singles, doubles, errors, wild pitches, walks, etc - and in so doing gives us the ability to translate a player's accumulated singles, doubles, stolen bases, etc. into runs created. Once we know the runs created, we can translate that to wins, and that'll give us WAR. But I'm saving that discussion for next time.
Linear weights uses another, larger table (again taken from Tango's site). This table is similar to the prior one, except the entire game states (including both where runners are on base AND the outs) are in now in the different rows and game events (single, double, strikeout, reached on error, etc.) are now in the different columns. The other difference in this table is that the numbers now represent a change in the average number of runs teams scored. This change is the result of the event in the column occurring, and is measured with respect to the game state in the row. Anyways, here's the table (if it doesn't make sense, look past the table for an example):
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| Empty | 0 | 1.04 | 0.93 | 0.62 | 0.39 | 0.48 | 0.4 | 0.4 | --- | 0.1 | -0.26 | -.27 | --- | --- | --- | --- | --- | --- | --- | --- | ... | -0.22 | --- |
| Empty | 1 | 1 | 0.65 | 0.4 | 0.27 | 0.31 | 0.34 | 0.28 | ... | 0.04 | -0.18 | -.19 | --- | --- | --- | --- | --- | --- | --- | --- | ... | ... | --- |
| Empty | 2 | 1 | 0.29 | 0.21 | 0.13 | 0.15 | 0.17 | 0.13 | ... | 0.06 | -0.12 | -.12 | --- | --- | --- | --- | --- | --- | --- | --- | ... | ... | --- |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 1st | 0 | 1.67 | 1.44 | 1.06 | 0.68 | 0.81 | 0.87 | 0.59 | ... | -0.04 | -0.47 | -.45 | 0.32 | 0.31 | 0.36 | 0.27 | -0.58 | -0.34 | ... | ... | ... | ... | 0.14 |
| 1st | 1 | 1.72 | 1.28 | 0.95 | 0.47 | 0.62 | 0.43 | 0.42 | ... | -0.1 | -0.37 | -.33 | 0.07 | 0.19 | 0 | 0.2 | -0.46 | -0.24 | --- | -0.48 | ... | ... | -0.14 |
| 1st | 2 | 1.87 | 1.06 | 0.75 | 0.26 | 0.36 | 0.29 | 0.23 | ... | -0.09 | -0.25 | -.25 | 0.06 | 0.06 | 0.03 | 0.11 | -0.25 | -0.15 | --- | -0.25 | ... | ... | 0.13 |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 2nd | 0 | 1.3 | 1.4 | 0.97 | 0.72 | 0.65 | 0.38 | 0.48 | -0.32 | -0.02 | -0.36 | -.51 | 0.28 | 0.25 | ... | 0.32 | ... | -0.65 | --- | ... | ... | ... | --- |
| 2nd | 1 | 1.6 | 1.45 | 1.01 | 0.68 | 0.64 | 0.31 | 0.24 | 0.18 | -0.16 | -0.36 | -.39 | 0.25 | 0.16 | 0.28 | 0.38 | -0.6 | -0.21 | --- | ... | ... | ... | ... |
| 2nd | 2 | 1.77 | 1.08 | 1.02 | 0.72 | 0.58 | 0.11 | 0.14 | 0.02 | ... | -0.34 | -.34 | 0.14 | 0.07 | ... | 0.05 | -0.34 | -0.2 | --- | ... | ... | ... | ... |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 3rd | 0 | 1.04 | ... | 0.78 | 0.52 | 0.28 | 0.24 | 0.38 | ... | ... | -0.28 | -.42 | ... | ... | --- | --- | ... | --- | --- | ... | --- | --- | --- |
| 3rd | 1 | 1.3 | 0.83 | 0.69 | 0.56 | 0.55 | 0.15 | 0.24 | 0.21 | 0.12 | -0.25 | -.63 | ... | 0.35 | ... | ... | ... | ... | --- | ... | ... | ... | --- |
| 3rd | 2 | 1.75 | 1.08 | 0.97 | 0.88 | 0.96 | 0.22 | 0.14 | 0.07 | 0.27 | -0.39 | -.38 | ... | 0.75 | ... | ... | ... | ... | --- | ... | ... | ... | --- |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 1st_2nd | 0 | 1.84 | 1.89 | 1.45 | 0.94 | 1.03 | 0.79 | 0.87 | --- | 0.11 | -0.63 | -.66 | 0.31 | 0.48 | ... | 0.66 | -0.75 | -0.39 | --- | ... | ... | ... | ... |
| 1st_2nd | 1 | 2.39 | 1.9 | 1.54 | 0.88 | 0.99 | 0.68 | 0.69 | ... | -0.12 | -0.6 | -.53 | 0.35 | 0.52 | 0.6 | 0.42 | -0.64 | -0.38 | --- | ... | ... | ... | ... |
| 1st_2nd | 2 | 2.65 | 1.89 | 1.52 | 0.86 | 0.67 | 0.29 | 0.4 | ... | ... | -0.46 | -.46 | 0.01 | 0.21 | ... | 0.23 | -0.42 | -0.19 | --- | ... | ... | ... | ... |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 1st_3rd | 0 | 1.65 | ... | 1.29 | 0.71 | 0.54 | 0.55 | 0.52 | ... | -0.17 | -0.47 | -.67 | ... | 0.2 | ... | 0.32 | ... | ... | --- | ... | ... | --- | ... |
| 1st_3rd | 1 | 2.06 | 1.66 | 1.26 | 0.81 | 0.95 | 0.46 | 0.34 | ... | -0.16 | -0.47 | -.74 | 0.54 | 0.47 | ... | 0.28 | -0.95 | -0.46 | --- | ... | ... | ... | ... |
| 1st_3rd | 2 | 2.61 | 1.96 | 1.46 | 0.92 | 1.11 | 0.24 | 0.28 | ... | ... | -0.54 | -.53 | ... | 0.61 | ... | 0.13 | -0.43 | -0.09 | --- | ... | --- | ... | 0.02 |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| 2nd_3rd | 0 | 1.51 | ... | 0.92 | 0.85 | ... | 0.35 | 0.53 | 0.26 | ... | -0.44 | -.66 | ... | ... | --- | --- | --- | ... | --- | --- | --- | --- | --- |
| 2nd_3rd | 1 | 1.83 | ... | 1.36 | 0.92 | 0.76 | 0.36 | 0.23 | 0.14 | ... | -0.41 | -.82 | ... | 0.63 | ... | --- | ... | ... | --- | ... | --- | ... | --- |
| 2nd_3rd | 2 | 2.53 | 1.93 | 1.73 | 1.46 | 1.29 | 0.15 | 0.22 | 0.14 | ... | -0.63 | -.63 | ... | 0.69 | ... | ... | --- | ... | --- | ... | ... | --- | --- |
| Base | Outs | HR | 3B | 2B | 1B | RBOE | HBP | NIBB | IBB | Bunt | Out | KO | PB | WP | BK | SB | CS | PO | POE | OA | INT | FoulE | DefInd |
| Loaded | 0 | 2.12 | ... | 1.55 | 1.11 | 0.82 | 0.65 | 0.9 | --- | ... | -0.63 | -.72 | ... | 0.2 | ... | --- | ... | --- | --- | ... | --- | ... | --- |
| Loaded | 1 | 2.56 | 2.23 | 1.88 | 1.17 | 1.39 | 1.08 | 0.91 | --- | ... | -0.7 | -.89 | ... | 1.03 | ... | --- | ... | ... | --- | ... | --- | ... | --- |
| Loaded | 2 | 3.31 | 2.63 | 2.18 | 1.53 | 1.35 | 0.88 | 1.06 | --- | ... | -0.81 | -.81 | ... | 1 | ... | ... | --- | ... | --- | ... | ... | ... | --- |
Base |
Outs |
HR |
3B |
2B |
1B |
RBOE |
HBP |
NIBB |
IBB |
Bunt |
Outs |
KO |
PB |
WP |
BK |
SB |
CS |
PO |
POE |
OA |
INT |
FoulE |
DefInd |
| Average | 1.409 | 1.063 | 0.764 | 0.474 | 0.546 | 0.385 | 0.33 | 0.102 | -0.001 | -.299 | -.31 | 0.285 | 0.284 | 0.237 | 0.195 | -.456 | -.255 | -.953 | -.572 | 0.429 | 0.026 | 0.063 | |
Let's continue from the previous example (a runner on 2nd and nobody out) and see the change in the expected runs scored if the next thing that happened was a double. Go to the row with "2nd" in the "Base" column and "0" in the "Outs" column. Now head across that row to the "2B" column. The number there is 0.97. This tells us that, on average, doubles that occurred when there was a runner on second and 0 outs increased the runs scored in the inning by 0.97 runs. This makes sense! Think about it with me for a moment. Most of the time, when there's a double hit with a runner on second, the runner scores. Thus, after the double the team will usually be left with the same situation we started with: a runner on second with nobody out. However, there's a huge difference... a run scored! They're still expected to score 1.189 more runs after the double, and have an extra run across the board. Thus, the increase in run expectancy of 0.97 runs (a number very close to 1) makes sense. This allows us to place a value on that double: in this game state, a double is worth 0.97 runs.
Now if you want to find out how many runs a player added to a team's totals, you could take all the doubles they got with runners on 2nd and nobody out, multiplied each double by its value, and summed the products... and then do that again for every game event in every game state, summing the totals. But there are two problems with that: 1.) it's tedious and 2.) it assumes players do better in certain game states than other game states. I'll show in a later post how there's no reason to believe 2.) is true case except for a couple minor exceptions. Besides, it would be nice to be able to go straight from the stats we'd usually find on player's stats page into runs and wins. Linear weights solves these problems by assigning values to each game event, independent of the game state. Basically, it answers the following question: how much is a generic double (or any other game event) worth, in terms of runs? The answer to that question will let us calculate a player's contributions much more easily, as we'll be able to simply sum up their doubles, singles, etc. all multiplied by the run value for each event type, and ignore all the permutations of game type with game state. We don't have to worry about game state, because we've made the doubles (etc.) all context-independent. In other words, linear weights gives us the bridge we need to get from game events that are easily found in a player's stat page (things like singles, doubles, stolen bases, etc.) to runs created. For pitching, defense, or Ryan Theriot's baserunning it gives us a similar bridge to runs prevented. That's pretty cool, if you ask me.
How does linear weights do this? It takes a weighted average of the value of each event over all the game states. The weighted average allows the game states that occur more often to count more often in the average. This is necessary because, for example, the game state "nobody on, nobody out" will occur much more often than the game state "bases loaded, 2 out." The former state is guaranteed to occur at least 17 times a game; any game in which the latter game state occurs 17 times would be long remembered for its excitement, (and perhaps as Carlos Marmol's longest career outing). The weighted average allows us to account for this discrepancy in how often the game states occur, by applying heavier weights to more common game states. Once we do that, we have the value of the average double... and every other game event. Look at the last row in the above table, which has these values for every game event.
If you understand this much, you've grasped a VERY important concept in sabermetrics. If you "get" how all these game events are valued in terms of runs, you'll be able to easily grasp other advanced stats like UZR and wOBA, two of the most important components in WAR. We'll return next week with a look at wOBA, which is the way we incorporate hitting into WAR; we'll also talk about going from the runs calculated in linear weights to wins. If you DON'T understand what I've said so far, then fire your questions away in the comments section!
Before I finish this piece, I'd like one last word and caveat. You can derive all of these numbers and tables using a variety of methods, even (woot!) models. I showed the number/tables above because they were readily available (thanks to Tangotiger), and because they're empirically derived. It would have taken a lot more text to introduce things like Markov chains. So, just realize there are other ways to derive these numbers, and some will result in "better" numbers than the ones here. Indeed, there are a few wonky numbers in the tables above. If you use Markov chains that wonkiness goes away, but the differences aren't huge and mostly show up in the last decimal place. More than anything else, my goal here is to help people understand what a run expectancy matrix is, and how linear weights allow us to put run values on individual events. Finally, if you're really into this stuff I strongly recommend grabbing a copy of "The Book: Playing the Percentages in Baseball" by Tom Tango (tangotiger), Mitchel Lichtman (mgl) and Andrew Dolphin (dolphin). That book is a great place to learn more about all this stuff, and will include many applications I may not get the chance to talk about here. So get the book - it's worth it!
136 comments
|
5 recs |
Do you like this story?
Comments
The 2nd table got cut off on the far right end, but you're not missing much...
just the values of “defensive indifference” which is relatively meaningless. If you want to find the whole table, click the link above it and you can see it on tangotiger’s site.
by Shawn Domagal-Goldman on Jan 18, 2010 4:46 PM CST reply actions
Table error?
If you want to know how many runs were scored (on average) from this game state to the end of the inning, go to the column labeled “0” (representing the number of outs) and look down to where it intersects the row labeled “2nd” (representing the presence of a runner on 2nd). There, you’ll find the value “0.725.”
Shouldn’t that be 1.189?
I got (and read) “The Book” for Christmas. It was quite illuminating.
Yup!
I’ll fix that in a moment. Thanks…
by Shawn Domagal-Goldman on Jan 18, 2010 5:01 PM CST up reply actions
I had originally done this using a different game state.
Runner on 2nd, with 1 out. I forgot to change all the 0.725 values, but that’s fixed now.
Thanks again…
by Shawn Domagal-Goldman on Jan 18, 2010 5:02 PM CST up reply actions
Shawn...this is an outstanding start to the series. Look forward to reading more. Really nice job of breaking down the fundamentals of complex stats into a piece that was easy to read/follow.
www.Tomahawknation.com
Thanks!
I’m hoping to develop this into materials I could use in a course on sabermetrics that I hope to teach… if I ever get a faculty position.
by Shawn Domagal-Goldman on Jan 18, 2010 5:03 PM CST up reply actions
I would be interested in seeing lay over tables and analysis between regular season and playoffs
It might be just me but I think there are two pieces to the puzzle…
this analysis which is good in that it measures things in a large and valid sample size…entire regular seasons….WAR is definable….
But then getting into small sample sizes like playoffs where instead of dealing with basically a round robin of 3-games series either twice or 5 times a year with opponents.
What I am saying is finding the mix between a regular season division winner and players who also are playoff series WAR difference makers in 7-game series.
Piniella: "This is a tougher job than I thought it would be, I'm going to be honest with you."
I think sample size is going to be the issue there.
You could do something like this, but the uncertainty on your conclusions would be so large that they’d render the conclusions essentially meaningless. I’m going to finish with a post on sample size, error, and uncertainty. If that doesn’t help answer your question, let me know and I’ll try addressing it further then. To make matters worse, the regular season has such tremendous sample size (relative to the postseason) that you’ll be much better off using the career numbers than the regular season/playoff splits.
As we get closer to the regular season and start talking about platoons and the like, I may also do a post on platoon splits (R/L). That may also help address this issue.
by Shawn Domagal-Goldman on Jan 18, 2010 5:10 PM CST up reply actions
glad you took me seriously
No matter what, baseball is a team sport, meaning everything else is meaningless outside of wins and losses….in that context a player who provides a statistical difference in wins against another player is by definition superior… regardless of the sidebars as in individual statistics.
But again WAR appears to be useful in regular season contexts and it appears fairly self evident that in playoffs different players react differently.
Piniella: "This is a tougher job than I thought it would be, I'm going to be honest with you."
....
But again WAR appears to be useful in regular season contexts and it appears fairly self evident that in playoffs different players react differently.
I don’t really understand what this means.
by vivaelpujols on Jan 18, 2010 8:30 PM CST up reply actions
Did you catch any errors in what I wrote?
I value your feedback, VeP. Thanks…
by Shawn Domagal-Goldman on Jan 19, 2010 1:00 AM CST up reply actions
I haven't fully read it yet
I’ve been studying for finals. I’ll get to it tomorrow probably.
by vivaelpujols on Jan 19, 2010 1:10 AM CST up reply actions
I'm not sure I agree with the last statement.
It seems evident based on what you see, but what you see isn’t very much. That’s not a knock on you; there’s just not much there to see in terms of playoff production. Again, I’ll come back to this issue at the end. Misunderstandings of sample size and uncertainty it the biggest source of disagreements with statheads like me, from what I can tell.
by Shawn Domagal-Goldman on Jan 19, 2010 12:27 AM CST up reply actions
This last sentence is stupid.
"The riches of the game are in the thrills, not the money." --Ernie Banks
wow, it may be the wine but....I'm lost :(
I’ll try again tomorrow morning with a cup of coffee :) is this something everybody should understand or just the sabermetrics guys??
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
Let's put it this way:
if you want to follow the rest of the “sabermetrics lessons” and know how things like wOBA and WAR are derived, then this would be good to understand.
That said, if you trust the saber-folks to do this stuff right, then feel free to ignore this post. All I’m doing on this post is pulling back the curtain a little bit.
by Shawn Domagal-Goldman on Jan 18, 2010 5:24 PM CST up reply actions
... and I'll also add that I'm on PST...
and going strong on coffee at the moment. ;-)
by Shawn Domagal-Goldman on Jan 18, 2010 5:26 PM CST up reply actions
ok....I'm gonna give it a go, even if I just get a better understanding of what WAR is, I'll consider it time well spent.
I’ll let you know tomorrow. I have to say, this may sound crazy, but it looks beautiful, if that’s possible.
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 18, 2010 5:36 PM CST up reply actions
Thanks!
It’s worth my time to do this, both because I plan to teach this stuff one day, and because I believe that by teaching all of you this stuff I learn it much, much better myself. Feel free to let me know “where” I lost you and I’ll try to help you figure it out.
by Shawn Domagal-Goldman on Jan 18, 2010 5:38 PM CST up reply actions
you're not the only one, coolio - and I'm not even drinking wine
I was fine up until the second table. Then the eyes started glazing over…
I guess my philosophy is… “I don’t understand them. I just catch them.”
I’ll continue to read (and re-read) these as they come out – maybe something will click later – but my main interest/concern/curiosity is more towards how well these different stat thingees actually work. In other words, comparing the projections with the actual results and seeing which ones did the best.
Lou Brown: "My kinda team, Charlie, my kinda team..."
do you think I can absorb this???
I’m not sure I even should. There is something to be said of just enjoying the simplicity of things. I love baseball and if it becomes complicated, I’m afraid it will become tedious to me.
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 18, 2010 11:14 PM CST up reply actions
thats happened to me
i like baseball, but every conversation/article lately makes me feel like i wandered into a calc test i forgot i had.
There is no infinity button for failing in sports. At some point, things turn. They always do. - Bill Simmons
no...we love baseball and that's why we're here.....
there is something to be said for the simplicity of things….If we go into the netherland of numbers will we come out with the same passion we went in with? Will it become so tedious to our fun-loving minds that we will turn away from that which we love? I say step into the waters with caution, and if you start to loose that sensation which brought you here in the first place, step away. It is an adventure. Choose wisely.
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 18, 2010 11:39 PM CST up reply actions
For me? Yes.
The numbers, if anything, make me more passionate about the game. But to each their own…. Some people don’t dig them as much as I do.
by Shawn Domagal-Goldman on Jan 19, 2010 12:32 AM CST up reply actions
call me cg47! :)
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 18, 2010 11:41 PM CST up reply actions
Think of it this way...
Pick a situation – any situation – on the basepaths, and a number of outs in the inning. You could even go to a particular situation in the middle of a game. The first table tells you how many runs a team will score the rest of that inning, on average.
For example, on average, teams will score 0.55 runs in the rest of an inning where the bases are empty and there are 0 outs. This is the same number as the average runs scored per inning: 0.55 runs.
by Shawn Domagal-Goldman on Jan 19, 2010 12:31 AM CST up reply actions
Now this makes my head hurt...
If the average runs per inning is 0.55 and the average runs scored the rest of the inning after there’s nobody on and nobody out is 0.55, what happens to the times where the leadoff man homers? Surely that can’t be an insignificantly rare occurrence.
Perhaps I’m misunderstanding exactly what the numbers are saying. I understand that the expected runs with 0 on and 0 out would have to be the expected runs per inning since that’s how every inning starts, but it also seems that somehow a leadoff HR makes the numbers recursive in a way.
Good pitching beats good hitting, and vice versa.
by tibbelkrunk on Jan 21, 2010 10:30 AM CST up reply actions
Think of it this way:
Teams score, on average, about 5 runs/game. Divide that by 9 and you get 0.5555555555555…
Now as to your example. Before the HR is hit, teams will average 0.55 runs in the rest of the inning. After the HR is hit, the team is still expected to score 0.55 runs in the rest of the inning.
How many runs was the HR valued at? Look at the 2nd table. It says a HR is worth 1 run. This makes sense. Before the HR, the team was expected to score 0.55 runs in the rest of the inning. After the HR, the run expectation from the new batter until the end of the inning same, but a run has scored in the meantime. You get 1 run from the HR and are still expected to score 0.55 more runs after the HR. So the HR increased the expected runs in the inning by 1.
by Shawn Domagal-Goldman on Jan 21, 2010 10:23 PM CST up reply actions
Actually the table says the HR is worth 1.04 runs...
which I don’t want to try explaining because I’m afraid it will confuse matters. Consider it “methodology error.”
by Shawn Domagal-Goldman on Jan 21, 2010 10:33 PM CST up reply actions
There are about 3 HR/100 PA.
So only about 3% of innings have a leadoff HR. (Probably less than that, as leadoff batters with little power will get a few more of those PA’s). So it doesn’t affect the calculations that much. If all of those leadoff HR’s turned into outs, the expected runs/inning would only drop to something like 0.52.
by Shawn Domagal-Goldman on Jan 21, 2010 10:32 PM CST up reply actions
Books to consider
I found these books that I would consider using to teach a course on sabermetrics or learn on your own at a viewpoint that I think is easy to follow:
Understanding Sabermetrics: An Introduction to the Science of Baseball Statistics
Read it in 2 hours without doing the “problems” — very straight forward
Practicing Sabermetrics: Putting the Science of Baseball Statistics to Work
Shawn — Thanks for posting this topic — awesome stuff here! You might want to consider contacting the author of these books as I think he has taught a course on sabermetrics already.
Yeah there are a few great books out there on the stuff...
I may try to avoid textbooks, as there’s so much great stuff on the web. There’s no reason (IMO) to make my students buy something for $40 they can get for free. BUT they’d still be a great resource for me.
by Shawn Domagal-Goldman on Jan 18, 2010 5:49 PM CST up reply actions
Interlibrary loan is a great thing...why buy?
Piniella: "This is a tougher job than I thought it would be, I'm going to be honest with you."
Good point...
I’d argue to support the writers, who did great work in putting the book together.
But yes, interlibrary loan is also a great option.
by Shawn Domagal-Goldman on Jan 19, 2010 12:33 AM CST up reply actions
You had me at Gears of WAR...
Are you on Twitter? Check out the BCB and Cubs Twitter Community! Post your Twitter name and start tweeting with us!
by Schwa on Jan 18, 2010 5:49 PM CST via mobile reply actions
I've never played the game...
… but still loved the play on words.
by Shawn Domagal-Goldman on Jan 18, 2010 5:49 PM CST up reply actions
Great post -- looking forward to more of these
I never took the time to understand a lot of the newer metrics and how they are derived. This is good stuff.
I'm singing, "GO CUBS GO! GO CUBS GO!" -- DrCrawdad on Jun 12, 2009 7:23 AM CDT
Facts are meaningless. You could use facts to prove anything that's even remotely true! -- Homer J. Simpson
by Shanghai Badger on Jan 18, 2010 5:50 PM CST reply actions
This was awesome!
I’m eagerly anticipating the rest of the series. I have only a rudimentary understanding of advanced stats. I tend to trust them and use them to the extent that I am able (within the limits I’ve been instructed to by those who’ve come up with them), but this will help it all make more sense. Thanks!
hope there is no problem with printing this, and using it as a guide
and printing the future ones as well to use in the same manner
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
Print and redistribute as you see fit.
None of this would have been possible if the people that did the real work on it didn’t make their stuff freely available.
by Shawn Domagal-Goldman on Jan 18, 2010 6:09 PM CST up reply actions
By the way, your questions to Tango helped me think about how to frame all this...
so thanks go to you, as well.
by Shawn Domagal-Goldman on Jan 18, 2010 6:09 PM CST up reply actions
i was gald to see I was not ripped apart in the comments to those questions
slightly OT do you think these type of metrics could be done for NHL, NBA, NFL?
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
Yes, broadly speaking.
I don’t know jack about hockey. But basketball has some nice sabermetric-style sites as does the NFL. Football Outsiders is great for the NFL, as is advancednflstats.com. Basketball Prospectus is good for basketball (duh), as is John Hollinger’s work at ESPN, and a couple other sites I can’t remember the names of off the top of my head.
by Shawn Domagal-Goldman on Jan 18, 2010 7:18 PM CST up reply actions
If you mean more specifically, sort of.
In the NFL they try to equate things to yards instead of points. And in the NBA they do a good job of translating the various box score stats into points. The main thing one has to do is get various game events in whatever sport you’re looking at on the same footing. That’s the main part of the battle.
by Shawn Domagal-Goldman on Jan 18, 2010 7:20 PM CST up reply actions
If you haven't read Michael Lewis' article on Shane Battier that is a good place to start.
http://www.nytimes.com/2009/02/15/magazine/15Battier-t.html
Basketball-reference.com is set up the same way as the baseball component, and it has a decent blog and glossary of advanced stat terms. 82games.com and hoopsdata.com are probably the best basketball stats sites for free.
The problem of course with basketball is that it’s far, far more difficult to assign statistical value to every single thing that goes on during the game. I don’t know if NBA is your game, but if you want in-depth writing of the whole league definitely check out Kevin Pelton at Basketball Prospectus, or Kelly Dwyer on Yahoo, easily the two best writers out there.
Taj Gibson is the face of Bulls basketball!
by Trey23 on Jan 5, 2010 6:31 PM EST reply actions 0 recs
by Ozzie Montana on Jan 18, 2010 10:23 PM CST up reply actions
i am more curious about NHL
asking about all three was more or less just out of curiosity
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
A Google search got me this site.
I’m guessing a sport that pays such close attention to +/- has some decent advanced stats analysis out there if you look for it.
Taj Gibson is the face of Bulls basketball!
by Trey23 on Jan 5, 2010 6:31 PM EST reply actions 0 recs
by Ozzie Montana on Jan 18, 2010 10:31 PM CST up reply actions
Yeah that's one of the better ones...
tangotiger puts hockey musings out every once in a while on his blog. You can search there, too.
by Shawn Domagal-Goldman on Jan 19, 2010 12:33 AM CST up reply actions
I'm taking the same approach.
I have a really hard time focusing on posts this long and detailed while they’re on my computer monitor. But I would encourage anyone who’s only recently getting into WAR (like me) to print this out and study it.
Catch my act on Twitter as @dat_cubfan_dave.
All we are saying is...

Another outstanding post, Shawn! Thanks for your contributions.
"Only a mediocre person is always at his best." ~W. Somerset Maugham
LOL. That's awesome. I may have to use that next week. :-)
by Shawn Domagal-Goldman on Jan 18, 2010 6:19 PM CST up reply actions
I'd like to see some serious sabrmetrician
tooling around in a green Prius with that bumper sticker.
That would blow some people’s minds.
"Only a mediocre person is always at his best." ~W. Somerset Maugham
If it's gonne happen, it'll likely be in Seattle...
hope to many Priuses and sabermatricians.
by Shawn Domagal-Goldman on Jan 20, 2010 5:36 AM CST up reply actions
Very nice, Shawn
Glad to see it. Start of what promises to be a very helpful series.
Fontenot (fon-te-no): Cajun for "scrappy"
This was a great article, I had to stop beating up 1st base coaches for a minute to read the whole thing.
Look forward to the rest of the series.
Taj Gibson is the face of Bulls basketball!
by Trey23 on Jan 5, 2010 6:31 PM EST reply actions 0 recs
ROFL
I'm singing, "GO CUBS GO! GO CUBS GO!" -- DrCrawdad on Jun 12, 2009 7:23 AM CDT
Facts are meaningless. You could use facts to prove anything that's even remotely true! -- Homer J. Simpson
by Shanghai Badger on Jan 19, 2010 9:09 AM CST up reply actions
Someone lied to you.
There will always be math. ;-)
by Shawn Domagal-Goldman on Jan 19, 2010 12:34 AM CST up reply actions
Wonderful, interesting, helpful and very very informative
Thanks for the effort. I like stats generally, but never took the time to try and decode all the latest sabermetric mumbo-jumbo. I guess that’s because I lacked the fundamentals—it’s like trying to do algebra if you never learned basic math. Anyway (once I figured out what RBOE stood for—duh), I loved your presentation and generally found it to be clear and concise.
I assume “Bunt” means a sacrifice bunt. Are the numbers only for successful sacrifices or do they include failed attempts—like bunting into a force out? If the numbers only include successful sacrifices, it’s amazing how little this tactic contributes positively.
The other result which I found most interesting is the limited benefit of a SB relative to the cost of being caught.
I seldom post, but felt a need to acknowledge your efforts and express appreciation. Good luck with your future plans for this and I’ll look forward to your upcomiing installments.
by West Coast Diehard on Jan 18, 2010 11:58 PM CST reply actions
Thanks! I really appreciate that...
To answer your questions:
I assume "Bunt" means a sacrifice bunt. Are the numbers only for successful sacrifices or do they include failed attempts—like bunting into a force out? If the numbers only include successful sacrifices, it’s amazing how little this tactic contributes positively.
I think it’s for any bunt. Take a look at the values with nobody on base. If it was just for sacrifices, these boxes would be empty. And it includes all the potential outcomes: successful sacrifices, bunts for a hit, unsuccessful attempts at the prior two things. In general, sacrifices aren’t a good idea. They generally decrease the expected runs scored AND the team’s chances of winning. That said, there are cases when they simultaneously decrease runs scored and increase a team’s chances of winning. One example of this making sense would be a situation with runners 3rd and nobody out in a tie game in the bottom of the 8th. By bunting, you decrease the expected runs scored… BUT you increase the chances of scoring at least 1 run. In this case, that’s a trade you’re probably willing to make.
The other result which I found most interesting is the limited benefit of a SB relative to the cost of being caught.
Surprising, isn’t it? To make stealing bases worth your while, you have to succeed more than 70% of the time. Very few are good enough to do that.
by Shawn Domagal-Goldman on Jan 19, 2010 12:55 AM CST up reply actions
Why not other stats?
I don’t necessarily disagree with your choice of more advanced performance metrics, since I well understand that many exist and picking between statistics of various accuracy and relevancy inevitably becomes a stressful task. However, I would appreciate a few lines on your reasoning behind the use of WAR versus, say, Win shares or WARP.
There are a few reasons...
I like the baseline chosen for WAR: replacement-level play. This is really the right way to go if you’re trying to measure the value players have in terms of things like $. WAR is also open source. We can see into its guts, calculate it ourselves if we so choose, and access free (as in beer) databases with the values in it. Things like WARP/VORP are nice too, but aren’t as open source, which has hurt their use.
If you’re interested in the WAR/Win Shares argument, tangotiger has a whole series of posts on it:
http://www.tangotiger.net/#Winshares
by Shawn Domagal-Goldman on Jan 19, 2010 1:00 AM CST up reply actions
WARP and VORP are plenty open source
They are just not as good as WAR for a few reasons:
1) The run estimators in WARP and VORP (EqA and RC) aren’t as good as wOBA (linear weights)
2) WAR, at least FanGraphs interpretation of it, uses UZR – while WARP uses an inferior metric in FRAA and VORP doesn’t include defense
3) The replacement level in WAR is generally accepted to be closer to reality than in WARP
by vivaelpujols on Jan 19, 2010 1:14 AM CST up reply actions
this is where some of us start to get lost
you use EqA for example, but how many know what that is? Using what is not known to answer the questions only adds to the confusion and misunderstanding.
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
Maybe at the end...
I can go through the various run estimators. It’ll probably be pretty math-heavy, but I could do it.
by Shawn Domagal-Goldman on Jan 19, 2010 10:37 AM CST up reply actions
I don't think you should emphasize the math
It really isn’t important, unless you are planning to do your own work – which most people aren’t.
I would spend much more time emphasizing the concepts – with as much clarity and little math as possible.
by vivaelpujols on Jan 19, 2010 12:15 PM CST up reply actions
agreed
it is more understanding what VEP means than how it is derived, especially at the beginning stages IMHO
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
If the question is "why WAR instead of WARP" the answer has to be math-heavy, right?
I mean, unless you’re really glazing over it. There are some non-math heavy differences between WAR and VORP, or Win Shares, but even there I think the mathy ones are very important.
One point about EqA vs wOBA that’s not math-heavy: EqA is a lot more complicated than Linear Weights, and the calculations, as I’ve seen them explained, don’t make as much intuitive sense. Linear Weights is really elegant and simple as far as these things go, and since estimators like wOBA seem to perform at least as well as EqA, Occam’s Razor prefers them.
for those (including myself)
who have no idea what the hell EqA is
Equivalent Average. A measure of total offensive value per out, with corrections for league offensive level, home park, and team pitching. EQA considers batting as well as baserunning, but not the value of a position player’s defense. The EqA adjusted for all-time also has a correction for league difficulty. The scale is deliberately set to approximate that of batting average. League average EqA is always equal to .260
Just because you talk a lot doesn't mean you're saying anything. dtpollitt 1-7-10
You don't have to use math
Simply emphasize the differences in people talk.
EqA undervalues the walk relative to wOBA. That’s been shown. If they want evidence for that, show them the proof – which is very math heavy. However, they will probably accept it at face value.
by vivaelpujols on Jan 19, 2010 5:28 PM CST up reply actions
Thanks...
somewhere in the back of my brain I knew all that, but couldn’t recall it last night.
by Shawn Domagal-Goldman on Jan 19, 2010 10:37 AM CST up reply actions
I'm also excited to see what Colin Wyers and Pizza Cutter have in store...
for BP. I expect a lot of their products to improve.
by Shawn Domagal-Goldman on Jan 19, 2010 10:38 AM CST up reply actions
I thoroughly enjoyed reading the first instalment,
and while the basic concepts appear to be clear, I’ll be rereading when I’m home this evening to be sure that I’ve understood fully.
Are “Linear Weights” a purely Sabermeric term? They would seem to be a basis for many forms statistical analysis having nothing to do with sports.
"A waist is a terrible thing to mind." - Terry 'Fat Tub of Goo' Forster
@Twitter as @brommmietze
Yes... and no.
Linear weights is a generic way to build a model such as the one above. The “linear” part of it (which i didn’t mention) has to do with the equation used to stitch things back together.
That said, if you google “linear weights” most of the work you’ll find is about baseball. I think this is the best-known application of the tool.
by Shawn Domagal-Goldman on Jan 19, 2010 10:49 AM CST up reply actions
Nicely done Shawn.
I’ve gone from dipping my toe into the pool of sabermetrics, to about mid-calf after reading this!
What is this, sabermetrics 101?
I’m … a little lost….
"People ask me what I do in winter when there’s no baseball. I’ll tell you what I do. I stare out the window and wait for spring."
-Rogers Hornsby-
by Vermont Cubs Fan on Jan 19, 2010 10:18 AM CST reply actions
More or less...
yes. I’ll be doing a whole series of posts on sabermetrics, for those that want to learn the stuff. This is the first post in that series.
by Shawn Domagal-Goldman on Jan 19, 2010 10:39 AM CST up reply actions
"Replacement" is not "Average"
I like the WAR system, I find it to be a nice system that takes a complex data set and boils it down to an “understandable” number.
I have always had one real gripe with the lingo though. I don’t understand the term “replacement” level. The “R” in WAR is based on this:
instead of a replacement from the minor league ranks
and I question that. The averages that you lay out as the basis for the WAR statistic are league averages, including the production of a Bobby Scales and an Albert Pujols.
So, a “replacement player” is not going to achieve league average #s in all likelihood, rather he would be at the low end of the curve. Therefore, the WAR statistic should really be:
WAA: or Wins Above Average
Who cares? Isn’t this just a silly argument based lingo when clearly, at least part of the WAR choice of terms was constructing a good acronym?
Eamus Ursuli!
You have some really good point/questions in here.
And I’m going to address all of them (I hope) in a future post. There will be a whole post dedicated to what “replacement” means, and why it is used instead of average.
To answer in a brief manner: we use replacement level as the baseline instead of average players as the baseline, because if you lost a player to injury you’d likely replace them with someone from the minor league ranks, or by a bench player that is aging and past his prime, young and not quite there yet, or just not good enough to be a starter. Either way, the “replacement” is going to be below league average most times. If they were better than league average, they’d likely find a way to start. Now…. the various components like the hitting and fielding and baserunning are measured with respect to average. The reason we still can make the comparison to WAR is there is a separate calculation done to give you the difference between average and replacement level. This is handy for a number of reasons I’ll get to.
I hope I’ve answered your questions there. If not, a full post on the matter should help.
by Shawn Domagal-Goldman on Jan 19, 2010 10:46 AM CST up reply actions
There are also very good economic reasons to use "replacement" as the baseline.
by Shawn Domagal-Goldman on Jan 19, 2010 10:46 AM CST up reply actions
True enough.
I haven’t read his arguments yet. But the ones for replacement level hav me convinced. I’ll take a look at his stuff before I do the piece on replacement level.
by Shawn Domagal-Goldman on Jan 19, 2010 6:56 PM CST up reply actions
Not sure about the $ argument
I don’t know that I see the economics argument in that if “average” were the baseline, then most replacement level players could have a negative value.
Eamus Ursuli!
Exactly.
You want to set the “baseline” so that the worst players are making league minimum. That’s what setting replacement level at 0 essentially does. Anyone playing below replacement level should be replaced by someone better, and you shouldn’t pay more than league minimum for someone playing below replacement level.
by Shawn Domagal-Goldman on Jan 19, 2010 6:58 PM CST up reply actions
One more note...
for those struggling with the details a little (or a lot), this is probably the “toughest” entry in the series I’ll post here, and the one that is the most math-heavy. The rest will hopefully be easier to digest.
by Shawn Domagal-Goldman on Jan 19, 2010 10:50 AM CST reply actions
I'm just reading through this post.
Care to translate it into English?
On a serious note, I guess I am not cut out to be a sabermetrician.
"People ask me what I do in winter when there’s no baseball. I’ll tell you what I do. I stare out the window and wait for spring."
-Rogers Hornsby-
by Vermont Cubs Fan on Jan 19, 2010 12:09 PM CST up reply actions
I do an awful lot of work with statistics for a living
And as much as I try to avoid it, I still get geeked up about stuff like this when it is explained to me AND I can understand it. I understood the main idea of it before, but this still helps tremendously. Thanks, Shawn.
"This next song... it's about the White Sox. It's called: F*** Em'." - Eddie Vedder
I'm sorry Shawn, it's beyond me.....I've been reading the same paragraph for the last hour. :(
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
Which one?
I’ve been pretty slammed with work today/yesterday (same day to me at this point) but I’ve been in enough hours to catch up a little. If you let me know which paragraph it is I can try to step through it with you. Just realize I usually don’t post often between ~9 AM and 5 PM, PST.
by Shawn Domagal-Goldman on Jan 20, 2010 5:39 AM CST up reply actions
You're not alone.
I don’t get it either. I’m totally with you about loving baseball and being afraid it will become too tedious. So with that thought, I gave up much sooner. Good luck-I wish I could try as hard as you. I quit shortly after the fold. Sorry Shawn. It’s not your fault. I just really HATE numbers. I have a hard enough time just trying to read box scores.
"Fasten those seatbelts"-Pat Hughes
by katie casey on Jan 20, 2010 10:19 AM CST up reply actions
I can't deal with the stuff
Way over my head but thanks for trying to explain it.
"I am not ashamed to say I love Greg Maddux" - Jim Hendry
Me either Jim
Ok, so that's VCF, CG47, Doggie and myself that have flunked out of Stats 101.
When the next class starts up, we’ll be the ones outside smoking in the parking lot…
Lou Brown: "My kinda team, Charlie, my kinda team..."
I'd suggest you all try the next piece...
it should be more accessible. I’m /hoping/ to get it out this weekend. But I’m swamped, so we’ll see.
by Shawn Domagal-Goldman on Jan 19, 2010 6:59 PM CST up reply actions
well the kids will probably go with cigarettes trying to look cool...
I might be puffing away on a cigar, unless you can bring down some of the good stuff – you know, some Anchorage Cold… ;-)
Lou Brown: "My kinda team, Charlie, my kinda team..."
lol :)
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 19, 2010 10:55 PM CST up reply actions
.... and laughing at us...
when we get beat up for out lunch money. Something like that, right?
by Shawn Domagal-Goldman on Jan 20, 2010 5:38 AM CST up reply actions
heck no...we're the ones who keep those guys away from you so you can help us with our homework
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 20, 2010 9:16 AM CST up reply actions
Mind if I join you?
Only can I drink instead of smoke?
"Fasten those seatbelts"-Pat Hughes
by katie casey on Jan 20, 2010 10:21 AM CST up reply actions
No problem
Cooler’s in the back seat of that red (slightly rusted) ‘72 Rally Nova over there. Grab a beer and come lean against the principal’s car with the rest of us. We’re just hanging out, waiting for shop class to start…
Lou Brown: "My kinda team, Charlie, my kinda team..."
I'm with katie, mind if I grab a cold one ballhawk?
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 20, 2010 1:15 PM CST up reply actions
you must mean my profile page
"People ask me what I do in winter when there's no baseball. I'll tell you what I do. I stare out the window and wait for spring."--Rogers Hornsby
by cooliogirl47 on Jan 20, 2010 4:28 PM CST up reply actions
I think my BRAIN JUST EXPLODED
But I did make it through the whole article. It takes a special sort of nerd to do that.
Proud recipient of a hot dog shot from the Iowa Cubs hot dog gun.
My brain hurts.

"Fasten those seatbelts"-Pat Hughes
by katie casey on Jan 20, 2010 10:47 AM CST up reply actions
Mine exploded as soon as I got after the fold.
"People ask me what I do in winter when there’s no baseball. I’ll tell you what I do. I stare out the window and wait for spring."
-Rogers Hornsby-
by Vermont Cubs Fan on Jan 20, 2010 11:34 AM CST up reply actions
Probably wOBA, and if there's room UZR and EQBRR (or some baserunning metric).
Did you have a suggestion?
by Shawn Domagal-Goldman on Jan 20, 2010 7:40 PM CST up reply actions
I would personally have a WAR post first, without the hardcore math
WAR breaks down into offense + defense + position + replacement
If you could go into the theoretical aspect of each of those parts, that would be good I think. Then go into the different ways of modeling each of those parts, without too much detail. So for defense, talk about UZR, +/-, Total Zone. For offense, talk about Linear Weights (which you already did!), park adjustments, league adjustments, quality of batters faced adjustments.
In my opinion, WAR (not FanGraphs WAR, but the idea of WAR), is the biggest part of sabermetrics today. If you can emphasize the concept, and then branch out into the technical details in later articles, I think that would be the best way to go about this series.
by vivaelpujols on Jan 20, 2010 9:12 PM CST up reply actions
I hear you...
but my thinking is to put it together piece by piece. My plan for next time was to go over the runs to wins conversion (which is a big part of how fielding and hitting go into WAR) and then do the batting part through wOBA.
by Shawn Domagal-Goldman on Jan 21, 2010 10:34 PM CST up reply actions
Starting with WAR keeps the audience's interest.
People throw around WAR here too much without understanding its fundamental components. WAR or UZR should be next, IMO.
"The riches of the game are in the thrills, not the money." --Ernie Banks
I think WAR should be next.
And then break down the components of it as follows.
"The riches of the game are in the thrills, not the money." --Ernie Banks
That was interesting reading...
Thanks for posting it. But I’m not so sure I want to know how the sausage is made. ;-)
by bison on Jan 20, 2010 9:56 PM CST via mobile reply actions
Are you sure you really like the Cubs?
I can think of at least one team in the division that wouldn’t mind you joining ranks
Free Melodi Dushane
I really like veb... but my heart is in Chicago.
Besides, if I were to switch allegiances, I’d become a Mariners fan. That front office has been amazing, and I have a standing offer to join a season ticket group.
by Shawn Domagal-Goldman on Jan 21, 2010 10:24 PM CST up reply actions
Well, besides the fact that the Mariners suck balls.
"The riches of the game are in the thrills, not the money." --Ernie Banks
I think everybody should read THE BOOK.
Not to devalue Shawn (I think he’s great), but this post is relatively child’s play after reading THE BOOK. It’s the best introductory text I can suggest.
"The riches of the game are in the thrills, not the money." --Ernie Banks

by 



















