Navigation: Jump to content areas:


Pro Quality. Fan Perspective.
Login-facebook
Around SBN: Miikka Kiprusoff Wins 300th Game, Buffalo Crushes Boston

Best offense in the league?

Baseball Prospectus finally released their depth charts today and it has the Cubs team hitting .275/.337/.460. The depth charts are basically the intelligent look through the team's PECOTA projections and assigning playing time at each position. No more Tyler Colvin or Mark Pawelek's in the projections. For example, BP has Soriano projected to get 90% of the playing time in CF followed by Felix Pie and Angel Pagan each at 5%.

Star-divide

The Dodgers had the highest batting average in the NL in 2006 at .276 (2nd best team was at .270). The .275 mark would rank them 2nd in average.

A .337 OBP would have them tied with St. Louis and Atlanta for 5th in the National League.

Atlanta had the highest slugging in the league last season at .455. BP has the Cubs team projected at .460 (easily the best in the league for 2006 standards).

Philly had the highest OPS in the NL in 2006 at .794. If you can add, you know the projected team OPS for the Cubs is .797. OPS has a very high correlation with run scoring. Philly scored 865 runs in 2006; 16 more than runner-up Atlanta and they were one of only 5 NL teams to score more than 781 runs.

So if these projections prove to be true and 2006 was an average season in the NL, the 2007 Cubs will likely have the best offense in the National League.

But hold on. Their projected starting 5 for the Cubs is Carlos Zambrano, Ted Lilly, Rich Hill, Jason Marquis, and Sean Marshall with Prior, Miller and Guzman as spot starters making 11, 11, and 6 starts respectively. They have a total of 18 pitchers pitching for the Cubs in 2007 (Jae Kuk Ryu included -- obviously these were completed before the trade). These 18 pitchers (8 starters and 10 relievers) combine to go 74-67, which is about right on line with what many of us have been suggesting the Cubs are capable of going (7-10 games over .500).  That's probably enough to win the Central in 2007.

Based on these depth charts and projections, the Cubs are going to have a powerful offense and a very mediocre pitching staff (if not slightly below average). From a quick glance through, though, it does look like the offense will be enough to carry the pitching.

I spent about 5 minutes writing this as I was looking at the depth charts and checking 2006 stats so if I made an error, I apologize.

This is a FanPost and does not necessarily reflect the views of SB Nation or Al Yellon, managing editor (unless it's a FanPost posted by Al). FanPost opinions are valued expressions of opinion by passionate and knowledgeable baseball fans.

Comment 61 comments  |  0 recs  | 

Do you like this story?

Comments

Display:

This summer at Wrigley
Could be quite a show, maybe not always one we're hoping for. I imagine we'll see lots of 8-5, 9-7, etc.

Or maybe the pitching staff will surprise. Crazier things have happened.

I very excite about this upcoming season! Jenqui!

by nextyearcub on Feb 17, 2007 4:34 PM CST reply actions  

Somehow I can see the Cubs figuring out...
...a way to have the highest OPS in the league, and still not score any runs.

But very nice analysis as always maddog. You are a giant amongst us.

PINIELLA!

by theprognosticator on Feb 17, 2007 4:34 PM CST reply actions  

Thanks.
I can see the Cubs having the highest OPS and not scoring many runs as well.  I was thinking about including that, but figured I'd let the facts do the talking...and the projections, of course.  

by Maddog on Feb 17, 2007 4:44 PM CST up reply actions  

you're welcome...
...any time I try the statistical analysis bent and crunch the numbers I get grumpy very very quickly.

You obviously enjoy the numbers and you make it enjoyable and engaging for the rest of us as well.

Thanks again.

PINIELLA!

by theprognosticator on Feb 17, 2007 4:49 PM CST up reply actions  

I enjoy stats.
This one was relatively simple and it was also related to something else I've been working on (though not very much I admit).  I thought I'd take a quick stab at this and have this ready to see how it compares to what i'm doing, which, if I get busy, I may have ready to present on Monday or Tuesday.  Thanks for the compliments.

by Maddog on Feb 17, 2007 4:57 PM CST up reply actions  

I know
that Cub fans are prone to expect the worse, but I see no reason to expect them to score fewer runs than their OPS projects. Any specific reasons you feel otherwise?
Players win awards but teams win championships.

by tharr on Feb 17, 2007 7:09 PM CST up reply actions  

No.
It's just because I'm a Cubs fan.  I expect the worst.  

by Maddog on Feb 17, 2007 8:22 PM CST up reply actions  

Experience
is a great teacher.
Players win awards but teams win championships.

by tharr on Feb 17, 2007 10:30 PM CST up reply actions  

I've learned
that what can go wrong will go wrong with the Cubs.  It's inevitable.  They're the Cubs.

by Maddog on Feb 17, 2007 10:48 PM CST up reply actions  

Is that statement
statistically provable? Personally, I'm going with "This may be our year".
Players win awards but teams win championships.

by tharr on Feb 18, 2007 12:45 AM CST up reply actions  

I dig your optimism...
...I completely sympathize and share maddog's pessimism.

Only the Cubs could figure out a way to lead the league in OPS and be dead last in runs scored.

Probably by setting records for most times picked off, and most times getting thrown out at home plate.

PINIELLA!

by theprognosticator on Feb 18, 2007 3:33 AM CST up reply actions  

See, I am the exact opposite..
I maintain an heir of blissful optimistic ignorance right before the season and want to believe that the best season is the one that is about to happen.......

It has to one day, right?

Ladies and Gentlemen....Mark William Prior is BACK!!!! Yes, I am drinking the Kool-Aid!!!!

by TheBeerBaron on Feb 18, 2007 10:30 AM CST up reply actions  

one would think so...
...nice sig by the way...my hope for Prior grows daily too.
PINIELLA!

by theprognosticator on Feb 18, 2007 3:48 PM CST up reply actions  

See...
... that's where I have a problem with these sorts of projections. Obviously, projecting Prior, Miller and Guzman to "spot" start that many starts is based on last year's performances.

That's not likely to happen this year, wouldn't you agree? Either Prior and Miller are going to be full-rotation starters, or be traded, or be on the DL all year.

Guzman likely spends the year at Iowa.

18 starters? In 2006, a terrible pitching staff, only 15 different pitchers started a game, and seven pitchers started 128 of them. I'd think the Cubs will have FEWER starters in 2007, not more.

"[BCB] is much better than... well, everything." -- gravedigger, January 21, 2007

by Al Yellon on Feb 17, 2007 4:44 PM CST reply actions  

18 pitchers.
I'm sorry (and will change it) if I said 18 starters.  I didn't mean to.  Only 8 starters (the 5 I named and then the 3 spot starters).  10 relievers.  That's quite realistic and maybe even a little low.

I think those spot starts are not only based on last year's performances for the three of them, but the last several years.  Furthermore, today in the Tribune, this was written:  Piniella said he's looking for a long reliever, and the loser of the competition for the fifth spot involving Mark Prior, Wade Miller, Marshall and perhaps Neal Cotts could be that man.

There are obviously more choices to be had than pitchers for that 5th spot and there will be several losers.  It's distinctly possible, Al, that both Prior and Miller fail to claim that 5th spot out of spring training and Guzman has a great spring.  Then, with his injury history, it's not at all unlikely for him to go down after 6 starts leaving the 5th spot open for Prior, Miller, or Marshall.  

Prior's stuff last season was just plain bad.  He was worse than Jason Marquis and Glendon Rusch.  Injuries or not, he was bad.  It takes a leap of faith to believe he's capable of making 20 starts and actually earning that 5th starter's spot at this point.  

I'm just glad to hear that Lou isn't going to give him the job if he doesn't earn it.  I've been afraid that no matter how poorly he pitched, if he was healthy it was his job.  According to that article, Lou has said he's going with the best pitcher.  

5th starters for teams generally make about 20 starts and teams (healthy teams) have 3 to 4 guys who make a handful of spot starts.  

I don't think those projected playing time numbers are that far off.  They may end up being way off, but do you feel confident that either Prior, Miller, or Guzman will be good enough to earn that spot?  I sure don't.  

by Maddog on Feb 17, 2007 4:55 PM CST up reply actions  

18 pitchers....
... that makes a lot more sense, absolutely. Maybe I misread or misinterpreted what you wrote.

Everything else you say here makes sense, too, but generally, teams that are of contending caliber don't have more than perhaps 8 or 9 men start games during the season.

Guzman could have a great spring and Prior and Miller might suck, true enough. That doesn't seem all that likely, though, at least not all THREE of those things happening.

Finally, I agree. Piniella's going to do the right thing, make the guys earn their spots (well, except for the obvious one -- Z -- he'll be in the rotation regardless). I'm liking Piniella more and more with everything he does, and I was one of the ones saying loudest that he shouldn't have been the guy.

I'll be happy to be wrong about that.

"[BCB] is much better than... well, everything." -- gravedigger, January 21, 2007

by Al Yellon on Feb 17, 2007 5:04 PM CST up reply actions  

I've always liked Piniella.
I like those kinds of managers.  They won't give anything to the players (Zambrano probably being the only exception...maybe Ramirez and Soriano as well).  The rest of them are going to have to earn it.  That's how it should be.  

I don't see all 3 of those things happening either, but I also don't see Prior or Miller making more than 11 starts.  And, yeah, I'm basing that off their past performance, which is all I have.  If I had to give my predictions for number of starts it would be something like (this won't add up to 162 so don't bother)...

Zambrano -- 33
Hill -- 32
Lilly -- 30
Marquis -- 30
Prior -- 15 (I'm being generous)
Miller -- 10
Guzman -- 3
Gallagher -- 2 (I think we'll see him in September)

by Maddog on Feb 17, 2007 5:22 PM CST up reply actions  

155, pretty close.
How's this:

Z: 34
Hill: 33
Lilly: 33
Marquis: 30
Prior: 18
Miller: 10
Misc (Not predicting who): 4

"[BCB] is much better than... well, everything." -- gravedigger, January 21, 2007

by Al Yellon on Feb 17, 2007 5:50 PM CST up reply actions  

starts
I hope Zambrano can start 34 or 35 games. His back may get in the way of that, unfortunately. It does seem to flare up on him at some point every year.
"Hello again, everybody. Harry Caray from Wrigley Field on a beautiful day for baseball."

by danimal15 on Feb 17, 2007 8:31 PM CST up reply actions  

Oh, it's Kool-aid time!!!!
Zambrano - 32
Lilly - 31
Hill - 31
Prior - 30
Marquis - 21
Miller - 20
Marshall - 9
Guzman - 4
Ladies and Gentlemen....Mark William Prior is BACK!!!! Yes, I am drinking the Kool-Aid!!!!

by TheBeerBaron on Feb 18, 2007 10:35 AM CST up reply actions  

I just realized that this would be
178 starts.....so, I am just going to ask you to disregard that last post....

Except Prior making 30 starts!!!! It's happening this year is the year!!

Ladies and Gentlemen....Mark William Prior is BACK!!!! Yes, I am drinking the Kool-Aid!!!!

by TheBeerBaron on Feb 18, 2007 10:37 AM CST up reply actions  

i agree
Big Z will be a beast and win the cy young, hill will have anywhere between 18-21 wins, and prior will get 15-19 wins. this will be a fun year people are underestimating this rotation and wasnt our bullpins one of the best in the nl

by tbizzle83 on Feb 18, 2007 12:56 PM CST up reply actions  

i think i read
our bullpen had an ERA of 4.04 last year, i think thatd have to be top 3 in the league, and it should be better this year so im very optimistic about the pitching as well.
DON'T TRADE PIE!

by kylejo on Feb 18, 2007 2:22 PM CST up reply actions  

no
it was only good for fifth or sixth last year IIRC.
Baby, you got a stew goin'

by Thelonious on Feb 18, 2007 7:35 PM CST up reply actions  

I have a question, Al.
When you're taking an analytical look at the Cubs as I'm sure you do as the season approaches (as the offseason moves along), don't you also consider projections in the conclusions you come to?  Maybe not projections from BP or the other numerous projections available, but at least your own?  

I assume we could agree that past performance is the best indicator of future performance.  Obviously you can't predict the future and in only one season there is a wide amount of variance, but that doesn't mean we can't use the tools to come to conclusions.  These projections are basically giving us a realistic idea of what a player is capable of if the 2007 season was played thousands of times...what you may call the player's true ability.  He may out-perform these projections, under-perform them, and he may be dead on.  I don't believe anybody who lists these or does these believes they're going to be accurate for one individual player.  As you know, the variance is too great in just one season.  

I also assume you are basing many of your opinions about possible future production from players on their past performance.  For example, I think you'd agree Juan Pierre isn't going to hit 50 home runs.  We know this for a couple of reasons.  The first being the years of data we have to look at where he's hardly hit any per season and his size, which is a more flawed way to take a look at future production (small guys have power too and some big guys don't).  

When you say that you think Lee is capable of playing quite well (somewhat similar to his 2005 numbers), you are basing that opinion on past performance.  You may not know it when you do it, but when you're talking about how good or how bad this team may be, your conclusions are the result of instantaneous projections based on your opinions, which are certainly based on past production.

So I guess my question is this:  How are these projection systems so flawed, but your own predictions/projections aren't?  You're doing the same thing without the mathematical analysis.  You're even considering age (you've mentioned how Murton may develop power as he ages--people who do these projections consider a natural age progression in their formulas).  You surely believe that as a player ages into his mid-30s that his production will likely continue to decline.  

You're unconsciously doing the same thing that you are saying you dislike.  Is it the relatively complex math being brought forward that you dislike?  I doubt that's it.  Batting average and slugging percentage was once complex math for this game and I'm sure you know that.  

Obviously these are flawed.  I don't think anybody has said they aren't.  I believe I read a study recently on some Angels blog that compared the various projections for hitters who had over 500 plate appearances.  PECOTA turned out to be the most reliable having ONLY a correlation coefficient of .51 (slightly higher I believe).  Not a very strong correlation at all.  But there is one and I'd think you'd agree that any available information that can make the team better is worthwhile information.  

Bill James won't even bother projecting pitchers; says it can't be done.  And the study showed that the correlation was much lower, but there was still a correlation (zips having correlation coefficient the highest with pitching, PECOTA 2nd).  

It's information.  It's flawed, but no more so than looking at a player's batting average and RBI's in one single season.  

by Maddog on Feb 17, 2007 5:16 PM CST up reply actions  

I'm not saying I don't...
... look at such things. Of course I do. And I've been doing it since I read Bill James' first nationally published Abstract in 1982.

I just think that you can't simply boil down projections or predictions to mathematical formulas (as you do above when you say PECOTA has a "correlation coefficient" -- when I hear that sort of thing, my eyes start glazing over).

Sure, I'll look at all of that. I do think that some people -- not you -- wind up being married to these projections, thinking that they are the be-all and end-all, and looking down their noses at those of us who don't bow down to the Great Stats God.

Like I said, this isn't directed at you. There's a middle ground, and I think I try to occupy it. If you think I'm one of those who just looks at the "TV stats", you are quite wrong.

"[BCB] is much better than... well, everything." -- gravedigger, January 21, 2007

by Al Yellon on Feb 17, 2007 5:48 PM CST up reply actions  

Correlation coefficients and
baseball sound goofy, I know, but they do exist.  They exist like run expectancy exists.  A home run correlates to at least one run scored (correlation coefficient of 1.00).  This is an obvious correlation and we all understand it and never refer to it in coefficient terms.  But it can indeed be written as such.  So can other things.  Singles, doubles, triples, walks, errors, and virtually every other thing that's possible in this game.  

A correlation coefficient of 1 implies, as you probably know, a linear relationship between two variables (home runs and at least one run scored in the above example).  If it's 0, there's no relationship whatsoever (home runs to men landing on the moon).  The closer it gets to 1, the greater the correlation between the two variables (OPS to runs scored, if memory serves, is about 0.88--pretty damn high).  -1 is an inverse linear relationship (home runs to opposing team's runs prevented).  

So, when a team like the Phils score 865 runs with an OPS .005 points lower than the CUbs projection for 2007, we have very good reason to believe that if these projections are accurate that the Cubs SHOULD score as many or more runs.  

These kinds of things will become commonplace in baseball in a generation or two.  

by Maddog on Feb 17, 2007 8:21 PM CST up reply actions  

I believe you.
But it still makes my eyes glaze over.
"[BCB] is much better than... well, everything." -- gravedigger, January 21, 2007

by Al Yellon on Feb 17, 2007 8:23 PM CST up reply actions  

It used to do that to my eyes as well.
I was always very good at math (initial major in college was Electrical Engineering) and as I aged and stopped doing it, my math skills slipped and in many cases vanished entirely.  It's been baseball over the last few years that has allowed me to keep the remaining skills I do have.  

by Maddog on Feb 17, 2007 10:23 PM CST up reply actions  

That's what's funny...
When a manager sacrifices someone, or a general manager wants someone who "hits in the clutch" or someone who "provides leadership" they are correlating those things with scoring more runs, or more simply, winning. It's just when statistics are use to either show that or not, they don't want to hear it.

All the stats and correlation coefficients are really doing is confirming what they thought (or denying it).

by tyger1147 on Feb 17, 2007 8:43 PM CST up reply actions  

Exactly.
We all do similar things, whether it's projections or correlating certain things with run scoring or winning, but it scares people away to the point they are ready to dismiss it when you introduce math into the discussion.  

by Maddog on Feb 17, 2007 10:24 PM CST up reply actions  

0.88?
What do you mean the correlation is 0.88? Is that in placement? Meaning, if a team finishes in X spot of the league in OPS that there is a very strong relationship they will finish in that X spot in rankings of runs scored?

by tyger1147 on Feb 17, 2007 8:47 PM CST up reply actions  

A correlation of 0.88
between runs scored and OPS implies that there is a strong linear relationship between OPS and runs scored meaning that the higher your OPS is, the more runs you score.  If the correlation was 1 it would be a perfect linear relationship.  In other words, if the correlation coefficient of OPS to runs scored was 1, then we would know precisely how many runs a team would score that had a certain OPS.  There would be no variance if it was a perfect linear relationship.  

An r (correlation coefficient) is between -1 and 1.  If r is negative, it's a negative slope (inverse relationship...one variable increasing causes the other variable to decrease).  If it's positive, it has a positive sloped meaning that as one variable increases the other one does as well.  

I'm not sure if this image will show up on here, but here's a scatter plot I recently did showing the correlation coefficient between negative VORP and wins:

If the image doesn't show up, check it out in this article.  That image shows a correlation coefficient of 0.70 meaning that negative VORP and wins have a moderate correlation to one another, which is basically common sense.  I wanted to know how much and since the Cubs had a bunch of negative VORP in 2006, this is one of the ways they can improve the most in 2007.  Limit the negative production.

by Maddog on Feb 17, 2007 10:45 PM CST up reply actions  

Sorry it's not bigger.
And more readable.  But you can see the linear progression in that table.  If it was a perfect linear relationship (1), it would be a straight line (all data points would be on one line).

by Maddog on Feb 17, 2007 10:46 PM CST up reply actions  

Hurray for stats class.
I never thought I'd appreciate that class that much.
"I don't talk. I just let what I do talk for myself." -Johan Santana

by sparkles721 on Feb 18, 2007 11:36 AM CST up reply actions  

Something that was engrained into my mind
was that Correlation is never to be used as causation.

In other words, even if it was a perfect 1, OPS could never be a precise indicator for Runs Scored.

Statistics cannot be an end all be all for all of life's numerical questions.  

Statistical Anomalies and Sperrious correlation would still make some of your findings irrelevant and misleading.

Now, as for this article, especially since you are using compound stats, which some consider impure, I ask you can negative VORP and Wins, be sperriously correlated.

by cubsfan2883 on Feb 18, 2007 1:03 PM CST up reply actions  

You're right.
Correlation is NOT causation.  It's a mistake many people make when interpreting data or explaining their findings, but you're very much right.  For the record, I wasn't trying to state that the cause of runs scored is OPS.  Clearly it's not entirely.  

However, OPS (on-base plus slugging) creates runs.  Runs can score without either (errors), but those situations are limited.  You need men on base and the more bases you advance on a hit, the more runs you'll generally score (average situations considered).  

Not only are those situations in which runs score without the help of OPS limited, it's the defense that is responsible; not the offense.  

If the OBP of an entire team is .000, they'll score a minimal number of runs (the only runs that score will be the result of the opposing team's defense).  If an entire team's OBP is 1.000, they'll score an infinite number of runs (ignoring caught stolen bases and pick offs, and other baserunning outs, of course).  

OBP is but one part of OPS (I believe both parts are equal, and in all honesty would probably favor slugging over on-base skills if it were up to me, but as you can clearly see, as OBP rises, so do runs scored.  OBP isn't the cause entirely, but the runs that do score is, generally speaking, are caused after someone gets on base (obp) and numerous bases (slugging) are advanced.

As for the 2nd question (negative VORP and wins), the answer is yes.  There's not a lot of real value in the study that I did (unfortunately, perhaps fortunately, all my data has been destroyed since my hard drive crashed and I purchased a new computer).  Most of it's common sense (limit negative production and you'll win more games).  Most importantly, VORP is a stat with the intentions of comparing the performance of individual players and their impact in scoring runs to the team.  I combined the numbers from the players on the team, which probably should never be done in the first place.  But I did learn several things.

Dusty Baker has been very bad in his career as a manager at giving out playing time to players who are undeserving.  This isn't a revelation as we're Cubs fans and have seen this for years, but the numbers back it up.  Lou Piniella has been above average (very, very good in Seattle, but poor before that, and slightly below average in Tampa though better than the manager last year).  He plays who deserves to be played and we can expect that from him all season long.  If DeRosa isn't performing he's not going to hesitate to put Theriot in there or bring up Patterson or whatever.  

Interestingly, only a few teams pay attention to this kind of thing.  It was clear that Oakland has some understanding of negative production and so does Boston and Milwaukee under Ned Yost, but that's really it.  Ned Yost is a very good manager and he's probably the best at limiting playing time to undeserving players in the National League and probably only bettered by whoever is managing the A's at the time (Beane runs that team...).  

by Maddog on Feb 18, 2007 2:05 PM CST up reply actions  

Okay, so maybe I misinterpreted
VORP then. because VORP to me is basically the amount of positive things that a player brings to the field... or a players upside.

What IS VORP in statistical terms.. or in other words what stats go into VORP.

Same thing with WARP?

I know RC (or Runs Created) is a large part of what WARP is.  but could you go into a little bit of depth on what these two stats ARE for dummies like myself who really haven't delved much into the sabermetrics of baseball.

by cubsfan2883 on Feb 18, 2007 2:23 PM CST up reply actions  

VORP
only considers a player's offense so that's the downside of that stat.  Furthermore, the actual formula for calculating VORP is a bit of a mystery.  I don't know why Baseball Prospectus does that because these people are doing what they're doing largely because of the advances Bill James made in the field of sabermetrics.  James was very upset with the league for not providing various stats that he needed to do some of his work so he and many of his readers began keeping daily logs of games so they could do it without the league's stats.  

So why Baseball Prospectus would turn around and do virtually the same thing that Bill James had complained and complained about 25 years prior is baffling.  I like BP a lot and enjoy reading their site, but this is one thing that irritates me.  

We know how Replacement Player (RP) is calculated (and sorry, but it's rather long--if I have time later tonight, I'll do my best) and we can get a rough idea of a player's value over RP, but we don't know the current formula BP is considering so our attempts at doing so would no be accurate.

Same goes with WARP except that it DOES consider defense.  

I think that's one reason for people's skepticism of such stats and it's warranted.  I've done as much reading as I can into how these are calculated and fell confident they are accurate stats, but I could not defend the method because I do not know what it is.  And that's unfortunate because people who don't want to spend the time like I have reading and reading and reading to find the answers are rightfully going to dismiss such stats.  BP is doing sabermetrics a disservice by not making this information public.  They do so in the interest that it's THEIR stat and they want to be the only wants to publish that information.  I understand that.  They did the research and the hard work involved, but only when the public knows how these are calculated...and understands them can we make further necessary advances.  

BP is better than The Hardball Times, but THT at least makes their stat formulas available.  

VORP, though, is the cumulative effect the player has on run scoring compared to a replacement level player.  I think this is what you were asking.  I think you'd agree that last season, Ronny Cedeno, was worse than the average replacement player.  He could have been replaced by nearly anyone (you, me, Al?) and they could have done better.  His VORP on the season was, if I recall, -17.6 (3rd worst in baseball).  He provided a negative contribution to the team.  The average replacement player could have had a VORPof 0.0.  

So it's the addition of all things positive and all things negative and then comparing it to replacement level.  

WARP is similar except, as I said before, defense is included.  

by Maddog on Feb 18, 2007 5:36 PM CST up reply actions  

Does anyone else feel...
...that they just became about 8 times as smart as they used to be?

Thanks, Maddog, for your usual eloquent and thorough explanation.

I support Julie! :)

by gravedigger on Feb 18, 2007 5:49 PM CST up reply actions  

Not that this COMPLETELY answers the question...
...but I found a very old post that details how VORP was calculated upon its inception. It's a slightly tweaked version of RPOSMLV, which itself is just a version of MLV pegged to replacement level and position. (This tells us how MLV, RMLV and RPOSMLV are calculated.)

The question now simply becomes -- where does BP set replacement level? I'm at work, so I haven't the time or software necessary to toy with it, but given the data at hand, I'm pretty confident I could "reverse-engineer" VORP in an hour or so if I set myself to it.

by cwyers on Feb 18, 2007 6:19 PM CST up reply actions  

RPOSMLV?
Aren't those the letters Pat Sajak gives you for the final round of Wheel of Fortune?

No wonder it was winnowed down to VORP, though that seems like it would be the sound a creature in a swamp would make.

PINIELLA!

by theprognosticator on Feb 18, 2007 7:09 PM CST up reply actions  

So what stats are used stil
Because in that article it says that BA OBP and SLG are used to figure Marginal Lineup Value (hereafter referred to as MLV) and that Replacement level looks at both positional value and replacement value.

However, these stats seem only useful in retrospect, due to the lack of judging how well the replacement will play.  

For example, Ronny Cedeno had a negative VORP.   Everyone knew this.  His primary replacement was Neifi Perez, who also would make a negative contribution.

Also, how does VORP work with Pitchers?  Maddog stated that it only measures their offensive contributions, how do pitchers receive grades on their limiting contributions?

by cubsfan2883 on Feb 18, 2007 7:45 PM CST up reply actions  

Replacement level
is set like this (I'll try to be brief and simple here).  

Over the last 40 to 50 years, the amount of playing time for "regular" players has remained rather constant.  Essentially, 80% of the playing time is your regular players.  The remaining 20% is the replacement level.  

Basically, let's imagine one team.  Let's imagine the Cubs and let's make it simple and imagine that Derrek Lee plays exactly 80% of the time at 1st base next season and Daryle Ward plays there 10% and Henry Blanco plays there's 10%.  Replacement level is done at each position...the replacement level of a first baseman is going to be more productive than the the replacement level of a shortstop or catcher, for example.

Ok, Lee plays 80% and the other 2 make up replacement level.  The average of Ward and Blanco's stats represent replacement level at 1st base.  

If we now expand our imaginary league to what is reality, a simple way to remember replacement level is that the "regulars" (the regular 1st baseman for the Cubs in 2006 was John Mabry due to injury) make up 80% of the playing time.  

If we take all the 1st basemen in the game and place them in order of plate appearances, we add them up down the line until we get to 80% of the playing time.  The average of the remaining 20% is replacement level at 1st base.

Injuries to Lee last season change things somewhat from year to year.  Mabry qualified as a regular 1B last season and Lee's stats were added into the replacement level pile.  

For pitching, a number 1 through 5 starter is determined by who makes the most starts.  Clemens would have been a number 3 or 4 for the Astros.  Again, the top 80% of the pitchers (starters) make up the regulars while the average of the remaining 20% make up replacement level.  The same is done for relievers.

So every year, no matter what...no matter how damn good the players may be, there will be players who are below replacement level.  Stats that measure replacement level are intended to measure the value of an individual player across eras at the same position.  While we can find numerous correlations between VORP and runs or win or whatever else we may be trying to find, replacement level is, in its purest sense, a way to evaluate a single player's contributions above what a typical replacement may be able to do.  Your average replacement (fill-in) is not going to be average...he'll be below average, but hopefully above replacement level.  The stats weren't created to evaluate team performance and certainly aren't the best mark for doing so, but that doesn't mean we can't do some evaluations of teams using them.

by Maddog on Feb 19, 2007 12:10 AM CST up reply actions  

I'm pretty sure...
...that they just take a position-derived adjustment to the league average to determine replacement level. The traditional definition of replacement level that I keep hearing -- and the one that makes sense to me -- is guys that you can find on waivers, minor-league free agents, non-roster invitees; guys who you can pick up cheaply.

For example... Jeff Keppinger had 37 plate appearances playing third base for the Kansas City Royals last year, it would seem. He hit .267/.323/.400 -- a .723 OPS. He is, exactly, a replacement level player for third base. (The idea being that you can find some sad sack to hit the ball and somehow avoid being a total defensive travesty at FIRST base, but you'd better find someone who knows how to hold onto the ball at shortstop, meaning you're going to find... guys like Neifi and Tony Womack.)

But basically VORP is the sabermetric version of RBIs -- it measures run production as a counting stat, not a rate stat. It's really good at answering certain kinds of historical questions. It's not bad at producing reasonable, sane Silver Slugger and MVP nominees. It's alright comparing people at the same position, with some caveats. It's far less valuable for looking at guys like Soriano, who are changing position this year, and guys like Mark DeRosa, who have never HAD a real position.

by cwyers on Feb 19, 2007 2:18 AM CST up reply actions  

One thing I was thinking...
when trying to understand how teams with such a bad negative VORP could still produce well was the following explanation, tell me if I'm on the right track:

It's not just about the negative VORP, as you mentioned, but more about what a team or manager does (or can do, really) to offset that negative VORP. This is common sense, of course, but just something I was thinking. Sure, the Yankees had a large negative VORP (injuries to LF, RF and 2B), but when they have Giambi, Posada, Damon, Jeter and ARod, the overall impact would be less. This would be the opposite of say having an injury to Derek Lee, but then only having it offset by Ramirez, Murton, Pierre, Cedeno, Barrett, etc.

I know you said a lot of data was lost and VORP is an individual anlysis, but I'd be curious to see if you had an "average VORP" for teams. Or if that would even make any sense.

by tyger1147 on Feb 18, 2007 3:10 PM CST up reply actions  

Precisely.
Teams like the Yankees can get away with an unusually high negative VORP because they're producing an unusually high positive VORP due to, in large part, to their payroll.  

I have the average negative VORP for a team since 1996, which is -72 and I believe the average PLAYOFF team in 2006 had a VORP of 420.  The average playoff team had a negative VORP of only -50 (and the Yankees threw this off a bit).

I think the Cubs need to shoot for a negative VORP of -45 or lower in 2007.  If they do that, they should be in good shape.  

One thing to keep in mind about negative VORP is that youth plays a key role in this.  It's one of the things I was going to look into and may still do it, but it's different when a player is 22 and providing a negative VORP than it is when a player is 36 and doing so.  A player at 22 is worth giving some time to develop and the guy at 36 is only going downhill...get him out of there.  

So teams who had a lot of young players (2006 Cubs) will likely be relatively poor in terms of negative VORP.  

I should point out that the 2001 Mariners (Lou Piniella...a billion-win team) had a negative VORP of -3.7 (that wasn't the lowest in the 11-year stretch I looked at, but it was top 2 or 3), but that team, pitching, had only one player provide negative VORP....and he was a position player (John Mabry).

by Maddog on Feb 18, 2007 5:44 PM CST up reply actions  

I have to agree...
Especially about Prior. I wanted to bring this up somewhere, but never saw the place.

What the heck makes anyone think this guy can start 25 games this year?

(adding the needed disclaimer: of course, it can happen)

BUT, this guys hasn't had a full season since 2003. And that was his only one.

He's pitched:
2002 - 116
2003 - 211
2004 - 118
2005 - 166
2006 - 43

We all know the troubles with increasing a rookie's innings pitched by a certain percentage: I don't remember the percentage, but there's one out there where it's pretty ridiculous to EXPECT someone to pitch a certain percentage of innings more than he did the previous year. Anyone know what I'm talking about?

Anyway, he hasn't pitched a lot of innings for four years, and at that it was only one time. It'd be nice to think that if he's healthy out of Spring Training, he could stay healthy all year, but I would put the chances (just a guess) at 10-20% that he can start more than 20 games and pitch more than 165 innings.

by tyger1147 on Feb 17, 2007 5:59 PM CST up reply actions  

i believe
its the rule of 30

if a pitcher exceeds their previous season's total, or maybe its career high, by more than 30 innings there's an increased risk of injury

by DartmouthCubsFan on Feb 17, 2007 6:10 PM CST up reply actions  

Okay...
If it's career high, then it's irrelavent to this. Or if it's a certain age or something, it's slightly irrelavent. But the 30 I think, might be right. What I saw was that there was a well over 50% chance (maybe as high as 75%) that if a guy pitched over that amount, he was going to get hurt.

by tyger1147 on Feb 17, 2007 6:23 PM CST up reply actions  

Now this is why I read BCB
I have nothing to add to this discussion other than to say it is an interesting and well thought out discusiion.  Thanks.

by cubswin on Feb 17, 2007 8:41 PM CST reply actions  

division crown?
look a-ram is in his prime, d.lee is healthy,soriano will do his thing,we will be strong up the middle,barrett is solid,j.jones can = last year,and with floyd backing murton/sounds great to me!!also,perry was the best pinch hitter in the nl last year!wood a closer or middle relief?if healthy?lmao!i do believe our pitching could surprise some.2 above average lefties,big z,a healthy prior,perhaps miller or marquis to round out the rotation.stranger things have happened like st.louis last year!!so i am very optimistic.GO CUBS!!!

by cubz409 on Feb 18, 2007 12:00 AM CST reply actions  

give me a taste
of just a few of the drugs you are currently under the influence of
DON'T TRADE PIE!

by kylejo on Feb 18, 2007 11:55 AM CST up reply actions  

WOW
 i just saw that ronnie belliard signed with washington. i was under the impression he was re-signing with stl. what happened with that? shoot im happy. who is their 2nd baseman?

by tbizzle83 on Feb 18, 2007 1:00 PM CST reply actions  

They signed
Adam Kennedy to play 2nd base.

by Maddog on Feb 18, 2007 2:05 PM CST up reply actions  

For what it's worth...
...I went ahead and totalled up the counting stats (runs scored and runs batted in) on that page.

If you go based on runs scored, that depth chart projects the Cubs to score 1,242 runs. If you go based on runs batted in, the Cubs score 1085 runs. (For reference, the Yankees scored 930 runs last year.)

So obviously something's not quite right here. (Also not quite right? As in not right at all? Daryl Ward getting bats at almost EVERYWHERE but at pinch hitter).

I'm pretty sure that this thing is double-counting some things for people who play more than one position, which is throwing a monkey's wrench in it.

Does this affect the veracity of the averaged averages? No idea.

by cwyers on Feb 18, 2007 2:28 PM CST reply actions  

Bench players
stats are the player's total stats on the season.  So, add those numbers up again, but add Daryle Ward only once...and other players who appear at other positions.  I haven't done so and I'd be interested to see what you come up with.

by Maddog on Feb 18, 2007 5:22 PM CST up reply actions  

Sorry...
...was thrown off by the fact that they kept changing the number of plate appearances listed, while keeping the rest of the line the same.

That said... I get 838 runs from totaling runs scored, 769 runs from totaling RBIs. Note that this doesn't cover offensive contributions from pitching -- and our pitching staff, if nothing else, can hit, as far as the meager standards of their compatriots go.

Now, let's go ahead and do the flip side of this... project out runs allowed, from the ERAs listed. Take the innings pitched, divide by nine, and multiply by the ERAs, and you come out with 716 earned runs allowed. Unfortunately, you can still lose games from unearned runs, so that, again, is an incomplete figure. So let's compound inaccuracies and introduce another level of abstraction, and derive a Pythagorean win expectation. It's late, and I'm tired, so we'll do the quick and dirty version, no fancy Pythagenport exponents.

Average it out to 803 runs scored... and you get a .557 winning percentage, or a 90-72 team.

Yay, cocktail napkin math.

by cwyers on Feb 19, 2007 2:33 AM CST up reply actions  

1,242 runs scored?
Crap, will we even lose a game?

Somehow, we'll figure out a way to lose 90 of them.

PINIELLA!

by theprognosticator on Feb 18, 2007 5:38 PM CST up reply actions  

90?
Sheesh, you're optimistic.
I support Julie! :)

by gravedigger on Feb 18, 2007 5:48 PM CST up reply actions  

Comments For This Post Are Closed


User Tools

Welcome to Bleed Cubbie Blue, the Chicago Cubs blog for the SB Nation, created on February 9, 2005 by Al Yellon

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Img_0001_small
Value of Various Plate Approaches
284_small
Cubs' Fantasy Camp 2012 as seen by a Player's Wife
P7200073_small
Randy Hundley Fantasy Camp 2012

Recent FanPosts

+ New FanPost All FanPosts >

FanShots

Quick hits of video, photos, quotes, chats, links and lists that you find around the web.

Recommended FanShots

Nice article about Ernie Banks
Yankees Hire Jim Hendry
Dale Sveum Meets Early Arrivals At Camp Buss

Recent FanShots

Another prospect watch: Garrett Schlecht
Ranking the Farm Systems
WGN Releases Season Schedule
MLB.com Cubs Top 20 prospect list
A position ranking of the NL central by ESPN.
Draft Pick Currency and the Cubs
Yoenis Cespedes
Cubs designate IF Blake DeWitt for assignment
Cubs Claim Adrian Cardenas
The Story Of Theo Epstein And A Broom

+ New FanShot All FanShots >

Featured Poll

Poll
Andrew Cashner for Anthony Rizzo!
Yea!
1167 votes
Nay!
95 votes
Meh
164 votes

1426 votes | Poll has closed

It Is Only...

It Is Only...

Cubs By The Numbers

Cubs By The Numbers is a history of the ballclub by uniform number, but the biographies help trace the history of our beloved team in a new way. For everyone who's a Cubs fan, anyone who ever wore the uniform is like family. Cubs By The Numbers reintroduces readers to some of their long-lost ancestors, even ones they think they already know.

Click here to order your copy, available now!

Recent Stories in Ticket Exchanges


Managing Editor

Alyellontoppscard_small Al Yellon

Front Page Contributors

Primary_fc_small Josh Timmers

Marvin_the_martian_small Shawn Domagal-Goldman

Other Contributors

Dsc_0139_small David Sameshima

Toonmike_small Mike Bojanowski