ERA+ is a great analytical tool. It permits comparisons of ERAs across different eras and different run environments by adjusting for general league scoring levels and park factors. Its advantages over simple ERA are obvious. It is the single pitching statistic most often regarded as the definitive tool for analyzing pitching careers. Some stat geeks have become so enamored of ERA+ and its derivatives that they deny certain baseball truisms that might call into question the validity of judging pitchers primarily on the basis of ERA+. They tend to deny the concept of clutch pitching, despite the fact that certain pitchers evince a tendency to pitch measurably better or worse in high leverage situations (see this post for a discussion of Leveraged ERA+, or LevERA+, which weights runs allowed (and runs prevented) based on the impact on win expectancy). They also tend to discount the theory that most pitchers "pitch to the score" by changing their pitching approach depending on the game situation.
A host of statistics confirm that most pitchers do indeed pitch to the score. Pitchers as a group adhere to the theory that when granted a big lead it is better to put the ball over the plate and make the opposition hit their way back into the game rather than risking a rally fueled by putting runners on base via bases on balls. Virtually all successful pitchers walk fewer batters when working with a significant lead. Virtually all pitchers, successful or not, surrender more runs when working with tremendous run support from their teammates. Baseball-Reference.com recently added pitching splits based on team run support, showing a pitcher's performance in games in which they received between 0 and 2 runs of support, 3 to 5 runs of support and 6 or more runs of support. The vast majority of pitchers will surrender more runs on average when working with 6 or more runs than they do when working with 5 or fewer runs. The run-support splits further confirm that variations in ERA in high run-support scenarios have little or no impact on a pitcher's winning percentage in these scenarios, with good pitchers winning between 90% and 95% of these decisions regardless of how much their ERAs increase with great run support.
Wednesday, March 31
Thursday, March 25
Great Pennant Race Performances of the '20s and '30s
I've discussed the great pennant race performances of pitchers over the last 50 years. It's time to look at some of the legendary pennant race performances from long ago. It helps to explain why certain pitchers with conspicuously thin career qualifications for the Hall were nonetheless inducted into the Hall of Fame. Some names will be very familiar, others less so. But each of these pitchers put together performances in the heat of pennant races that lifted their teams to glory.
Wednesday, March 24
A Recipe For Catfish
Catfish Hunter is frequently cited by the stat geeks as a prime example of an unworthy HOF inductee. He doesn't have a plaque at the Baseball Think Factory's Hall of Merit, where Dave Stieb, Bret Saberhagen and Wes Ferrell are enshrinees. Hunter's ERA+ appears to be the problem the HOM balloters with Hunter. It can't be the 224 career wins, since Stieb, Saberhagen and Ferrell each have far less. I've offered my explanation for Hunter's induction into the HOF, an induction I believe was more than worthy. I thought I'd look at Catfish's Team Relative performance.
During his ten-year prime from '67 to '76 Catfish outperformed his team by 11.3%. That's not a very good figure for a Hall of Famer, and I wasn't particularly surprised by it. What I was surprised by was Hunter's Team Relative index for his five-year prime of '71 to '75, which covers the A's World Series years and his first season with the Yankees. I apparently had assimilated the argument of the stat geeks that Hunter's record during that period was purely a function of pitching for a great team and getting huge run support. Not true, as it turns out. Hunter's Team Relative index for that five-year period is 28%. If you remove the '75 season, Catfish outperformed his A's teams by 29.3%. And if you limit the analysis to just the three World Series championship years with the A's, Catfish's Team Relative index was 34.6%.
To be clear, I'm not arguing that Catfish didn't benefit from great run support. He did. And I'm not arguing that Catfish would've had five consecutive 20-win seasons if he'd played for Blyleven's Twins teams in the '70s. What I am arguing, however, is that the claim that Hunter's great record during this period was just a function of great run support from a great team is demonstrably untrue. Take away the great run support and Hunter was still outperforming his team by 28% over a five-year period and a robust 34.6% during the A's championship years. Those are Hall of Famer-type numbers, albeit for a relatively brief period. It is simply a myth to argue that any pitcher with a Team Relative index like Hunter's was merely a product of great run support and great teams.
Let's look at another pitcher generally dismissed by the stat geeks as a mere product of great run support: Jack Morris. Morris's Team Relative index during his peak nine-year period of '79- '87 was 15.8%, not particularly good by HOF standards but right there with Bunning's 16% index for his 11-year peak. That means if Morris had played for an average hitting team with a .500 record he still would have posted a .579 winning percentage over those nine years. I think it's fair to conclude therefore that Morris's actual winning percentage of .615 during his peak was perhaps 30% attributable to his run support; the bulk of the credit for that outstanding winning percentage, approximately 70%, has to go Morris. If I'm not mistaken, Morris detractors would look at his 105 ERA+ and conclude that Morris's .577 career winning percentage was attributable 95% to his superior run support. This is plainly not the case. The Team Relative analysis demonstrates that Morris was able to perform far above the standard a career ERA+ of 105 would typically indicate.
It's no mystery why Catfish is in the Hall. He's in for the same reason Waite Hoyt, Jesse Haines, Lefty Gomez, and Red Ruffing are in the Hall despite falling well short of 300 wins, and for the reason Curt Schilling will make the Hall. They excelled on the big stage and made a huge impact for great teams. They put their imprint on legendary pennant races and World Series contests. That counts for a lot in HOF balloting, and it should.
Tuesday, March 23
The Celebrated Mr. K
His blazing five-year stretch from '62-'66 has become the standard by which all other great pitchers are measured. The Gold Standard. The definition of pitching dominance. Anyone who considers a new mode of analyzing pitching greatness has to insert his five peak seasons into the formulas and see what comes out. If you plug into your formulas his stats from these five seasons, during which he won five straight ERA titles, three pitching triple crowns and three 25+ win seasons in four years, and a historic result doesn't come out the other end, then maybe you need to double check your methods and formulas.
From '62 to '66 Sandy Koufax outperformed his team by 41%. If you exclude the '62 season, where Koufax's injury and the Dodger's decision to rush him back into the rotation in late September significantly skew the numbers, then Koufax outperformed his team by 49.5% from '63 to '66*. That's Randy Johnson territory. A 50% Team Relative performance over a period of years could be known as the Sandy-Randy Standard.
Monday, March 22
Lost In Translation
Not surprisingly, an analysis of Bert's prime years of '70 to '79 demonstrates that despite his superlative ERAs he didn't significantly improve his team when he was on the mound. Yes, Bert didn't get good run support from his teams, who scored .35 runs/game fewer for Bert than they did for other starting pitchers. It is also true that in measuring Bert's performance against his teams' Bert was competing against some pretty good pitchers. For the entire decade, Bert pitched on staffs that were slightly above average even without Bert's contribution, and the staffs on his '70, '72, '77 and '79 teams were among the very best in their leagues. But the Team Relative analysis controls for these factors, of course.
Even after increasing Bert's run support to team average, and adjusting his team's W-L record downward to reflect what it would have been with an average pitching staff, Bert still only outperformed his team's W-L record by 10.2%. That's down in Drysdale territory. As I've previously noted, Bert hugely underperformed his Pythagorean projection during those ten years, compiling a .536 winning percentage as compared to a .599 PythPro. If Bert had been able to perform to his PythPro he wouldn't be such a hot topic today because he would have been inducted into the Hall years ago.
Even after increasing Bert's run support to team average, and adjusting his team's W-L record downward to reflect what it would have been with an average pitching staff, Bert still only outperformed his team's W-L record by 10.2%. That's down in Drysdale territory. As I've previously noted, Bert hugely underperformed his Pythagorean projection during those ten years, compiling a .536 winning percentage as compared to a .599 PythPro. If Bert had been able to perform to his PythPro he wouldn't be such a hot topic today because he would have been inducted into the Hall years ago.
Saturday, March 20
Big Unit, Indeed
It's an amazing sight when you're watching great athletes compete at the highest levels in their sport and one competitor is so great that the opposition is just overmatched. I mean dominated; not just beat, not just bested, but dominated, almost completely helpless. In the realm of baseball, the greatest pitchers, at their best, will do this. Major league hitters, the best in the world, men with preternatural reflexes and freakish hand-eye coordination, are left to wave futilely at pitches or are so flummoxed they can't even swing.
I remember watching Clemens pitch against the Yanks in '97 and wondering how in the hell anyone ever hit the guy. I remember watching Pedro against the Yanks in September '99, the game he struck out 17, and feeling sorry for Yankee batters. Jorge Posada couldn't even get the bat off his shoulder. He had no idea what was coming - 96 mph fastball, or slider, or change-up or curveball. Yankee after Yankee left the plate after striking out shaking their head on the way back to the dugout, no doubt feeling the way Mickey Mantle felt after facing Koufax for the first time in the '63 World Series, when he said to the umpire as he was turning to leave the plate after striking out, "now how in the hell am I supposed to hit that shit."
But for sheer dominance, the ability to induce not only helplessness in big league batters but terror, there has perhaps never been anyone like Randy Johnson. It was sometimes like watching little league baseball, where the big kid is on the mound, the one that seemed to mature about two years ahead of the rest of the kids, and the ball is blowing by the batter before they can even think about swinging. One kid gets smoked and the next batter approaches the batters box looking like they're going to the gallows. They have no chance. When he was at his best, that was Randy Johnson on the mound. Too big, too nasty, too fast. And that slider - Christ, you pitied lefthanded hitters who had to face Randy Johnson.
Friday, March 19
More Team Relative Analyses: Lefty Grove
Lefty Grove weighs in at 36%. I didn't realize what tremendous run support Grove got from the Athletics in the early 30's.
It may be a shade behind Seaver and Maddux, but it doesn't change my opinion that Lefty was The Man among post-1920 pitchers. What it does do is make me appreciate how great Seaver and Maddux were. Man, that 35%+ improvement over team performance is one upscale neighborhood.
It may be a shade behind Seaver and Maddux, but it doesn't change my opinion that Lefty was The Man among post-1920 pitchers. What it does do is make me appreciate how great Seaver and Maddux were. Man, that 35%+ improvement over team performance is one upscale neighborhood.
Reappraisals of Palmer, Bunning and Drysdale
The analyses of pitcher performance relative to his team have yielded some interesting results.
As I described in the "The Theory of Relativity" post, it is possible to compare a pitcher's W-L record and winning percentage to his team's and adjust for factors that distort the comparison. These adjustments involve adjusting the pitcher's run support to equalize it with the run support the team provided to the other pitchers on the team and normalizing the ERAs and runs allowed by the rest of the staff to league average. These two adjustments assure that a pitcher won't benefit or suffer by virtue of run support that deviated from team average, or by virtue of the fact that the rest of the pitching staff, to whom the pitcher is effectively being compared, were either better than league average or worse than league average. A pitcher may be a great pitcher but his W-L record relative to his team's won't be very impressive if the rest of the team's pitching staff is comprised of great pitchers. In comparing Greg Maddux's W-L record to his team's it is obviously necessary to adjust for the fact that the Braves' pitching staffs were great and produced tremendous winning percentages because of the presence of guys like Glavine and Smoltz.
Here are the new results. Curt Schilling's ten-year peak from 1997 to 2006 was pretty impressive, as his excellent winning percentage and ERAs would suggest. Schilling outperformed his team by approximately 27% over that decade. Bob Gibson outperformed his team by approximately 28% over his nine-year peak of '64 to '72; not sure whether people will find that disappointing or impressive. Both these results obviously cast Ron Guidry in a very good light, because Schilling's and Gibson's team relative performance figures are right in Guidry territory. Ron is in good company.
Here are the results I found surprising. I ran the numbers on Bunning, Drysdale and Palmer. I've always thought of Bunning and Drysdale as being very similar, and I've conceived of Palmer as an American League version of Tom Seaver, although a shade behind Tom Terrific. My team relative analyses have fundamentally changed my appraisals of these pitchers.
Thursday, March 18
The Theory of Relativity
I love a lot of the new pitching stats. They're great analytical tools. Take FIP, for example ("fielding independent pitching"). It's based on the proposition that what happens on a ball put in play is frequently a function of random chance and team fielding. Bill James recognized its utility and cited Wally Bunker's 1964 season as an example of a pitcher apparently benefiting from some good luck insofar as his BAbip that year was .216. It turns out that Bunker in fact had a pretty good facility for generating low BAbip's in his career, presumably because, like Maddux in his prime, he was adept at keeping the ball away from the fat part of the bat and inducing batters to hit pitches outside the hitter's sweet spots in the strike zone. But Bunker never again came close to posting the .216 BAbip he posted in '64, despite being backed by the legendary team defense of the '60s Orioles.
FIP tends to understate the effectiveness of a pitcher with a demonstrated ability to consistently generate very low BAbips. Take The Great Rivera, for example. I was skeptical of Mariano's decision in '97 to move almost exclusively to the cutter because it seemed to sharply cut into his strikeouts. "Throw the high fastball!", I would shout, longing for Mariano's incredible strikeout ratio in '96 when he K's 130 batters in 107 innings. Still, I had to admit that batters seemed almost incapable of getting good wood on the cutter, but bloops and dribblers can and do become hits, while strikeouts can't and don't. Bloops and dribblers that found holes in the defense became, in my mind, "Mariano Specials." Obviously Mariano's decision to go with the cutter has been thoroughly vindicated and my early concerns were unfounded. But Mariano never fares too well in the FIP stat, and that's misleading because Mariano has demonstrated an ability to consistently generate low BAbips (Mariano's career BAbip is .265, as compared to a major league average of .299).
As Bill James has noted regarding FIP and various other new and sophisticated measures of pitching performance, they have a tendency to throw out a lot of information in an effort to isolate and identify a pitcher's performance independent of non-pitching factors. Bill is a little unsettled by this, and so am I. As he's argued, W-L records are the antipode to FIP and similar stats, incorporating all information, including unfortunately things that have nothing to do with a pitcher's performance, like offensive support and team fielding. However, the inclination of the stat geeks to summarily dismiss W-L records is extremely misguided. It is possible to start with W-L records and make appropriate adjustments, and that's what I'm about to propose.
The Theory of Relativity, in contrast to FIP, throws out nothing but attempts to adjust for everything (or at least most things) that happens outside of the pitcher's performance. Simply put, it compares a pitcher's W-L record to his teams record in games where the pitcher was not the pitcher of record (i.e., it subtracts the pitcher's W-L record from the team's), adjusting for factors that effect the pitcher's and team's W-L records but are largely unrelated to the pitcher's own performance. If a pitcher received run support better or worse than the run support a team generally provided its pitchers, the pitcher's W-L record is adjusted (via the Pythagorean theorem) to reflect what his W-L record would have been had he received run support equal to his team's average. It also adjusts for the performance of the rest of the team's pitching staff, because even a good pitcher who receives excellent run support will appear to fare poorly relative to his team's W-L record if the rest of the starting pitching staff is comprised of Walter Johnson, Pete Alexander, Tom Seaver and Randy Johnson, with Gossage, Eckersley and Rivera coming out of the bullpen.
FIP tends to understate the effectiveness of a pitcher with a demonstrated ability to consistently generate very low BAbips. Take The Great Rivera, for example. I was skeptical of Mariano's decision in '97 to move almost exclusively to the cutter because it seemed to sharply cut into his strikeouts. "Throw the high fastball!", I would shout, longing for Mariano's incredible strikeout ratio in '96 when he K's 130 batters in 107 innings. Still, I had to admit that batters seemed almost incapable of getting good wood on the cutter, but bloops and dribblers can and do become hits, while strikeouts can't and don't. Bloops and dribblers that found holes in the defense became, in my mind, "Mariano Specials." Obviously Mariano's decision to go with the cutter has been thoroughly vindicated and my early concerns were unfounded. But Mariano never fares too well in the FIP stat, and that's misleading because Mariano has demonstrated an ability to consistently generate low BAbips (Mariano's career BAbip is .265, as compared to a major league average of .299).
As Bill James has noted regarding FIP and various other new and sophisticated measures of pitching performance, they have a tendency to throw out a lot of information in an effort to isolate and identify a pitcher's performance independent of non-pitching factors. Bill is a little unsettled by this, and so am I. As he's argued, W-L records are the antipode to FIP and similar stats, incorporating all information, including unfortunately things that have nothing to do with a pitcher's performance, like offensive support and team fielding. However, the inclination of the stat geeks to summarily dismiss W-L records is extremely misguided. It is possible to start with W-L records and make appropriate adjustments, and that's what I'm about to propose.
The Theory of Relativity, in contrast to FIP, throws out nothing but attempts to adjust for everything (or at least most things) that happens outside of the pitcher's performance. Simply put, it compares a pitcher's W-L record to his teams record in games where the pitcher was not the pitcher of record (i.e., it subtracts the pitcher's W-L record from the team's), adjusting for factors that effect the pitcher's and team's W-L records but are largely unrelated to the pitcher's own performance. If a pitcher received run support better or worse than the run support a team generally provided its pitchers, the pitcher's W-L record is adjusted (via the Pythagorean theorem) to reflect what his W-L record would have been had he received run support equal to his team's average. It also adjusts for the performance of the rest of the team's pitching staff, because even a good pitcher who receives excellent run support will appear to fare poorly relative to his team's W-L record if the rest of the starting pitching staff is comprised of Walter Johnson, Pete Alexander, Tom Seaver and Randy Johnson, with Gossage, Eckersley and Rivera coming out of the bullpen.
Monday, March 15
The Guidry Decade
I've noted before the fact that Ron Guidry is the only pitcher in baseball history to lead the major leagues in wins and lead his own league in ERA and SO over a ten-year period and yet be rejected by the Hall. He averaged nearly 17 wins per season in the decade between '77 and '86 and had a 3.23 ERA (121 ERA+). When apprised of this achievement by Guidry, my fellow baseball fans have had remarkably similar reactions, initially expressing some surprise at Guidry's accomplishment but then arguing that Guidry's statistics during this period, while impressive, were pre-eminent during his ten-year peak only because this decade happened to occur at an odd interregnum in baseball, when greats like Seaver, Palmer and Carlton had just passed their prime and before the rise of Clemens, Maddux, Johnson and Martinez. They suggest that Guidry's performance really wouldn't have been that exceptional in any other era in baseball.
I must admit that I was inclined to give some credence to this argument. I assumed the win total wouldn't be that impressive when compared to all the titans who pitched during the eras of four-man rotations that prevailed in baseball until the '80s. I believed it was probably true that averaging about 17 wins a season over a decade while posting an ERA+ of 120 or greater was not all that unusual during many other eras in modern baseball history, and so I decided to check the record book. It turns out I was wrong. Averaging nearly 17 wins a season over a decade while compiling an ERA 20% better than the park-adjusted ERAs of your contemporaries has always been an achievement only the greats have attained. It turns out that this level of excellence over a decade gives a pitcher an almost automatic entree into Cooperstown. By my count, there have been 27 pitchers who accomplished this since 1920. All but four have already been inducted into the Hall of Fame or are almost certain to be inducted upon eligibility. And it further turns out that Guidry's accomplishment is becoming exceedingly rare in the age of the five-man rotation and seven inning starts.
Friday, March 12
The Sheer Improbability
My last post on pitchers who won two-thirds of their starts in a season got me to thinking again about Gator's amazing record in September division title races. I noted Lefty Grove's incredible '31 season in which he won 27 of 30 starts, a feat unmatched in baseball history. I noted that Bob Welch won a higher percentage of his starts in 1990 than any pitcher since 1954, and he won only 24 games during his best 30-start stretch.
I've noted before the sheer improbability of winning 26 of any 30 starts selected on the basis of any unbiased criterion. It's highly improbable that a pitcher would win 26 of 30 weekend starts, or starts in day games at home, or starts against teams in your own division. But to get an even clearer idea of how difficult it is to win 26 of any 30 starts, even 30 starts selected by a manifestly biased criteria, consider the following.
I've noted before the sheer improbability of winning 26 of any 30 starts selected on the basis of any unbiased criterion. It's highly improbable that a pitcher would win 26 of 30 weekend starts, or starts in day games at home, or starts against teams in your own division. But to get an even clearer idea of how difficult it is to win 26 of any 30 starts, even 30 starts selected by a manifestly biased criteria, consider the following.
Wednesday, March 10
Start The Game, Win The Game
Trivia question: since 1920 there have been only two pitchers to win two-thirds or more of their starts in a season more than twice (minimum 30 starts). Who are they? While you think about that I'll give you some idea of how special this achievement is.
Since 1954 a pitcher has started 30 or more games in a major league season more than 3000 times. Only 30 times has a pitcher started 30 or more games and won two-thirds of his starts. That's less than 1% of the 30-start seasons since 1954.
Since 1954 only four pitchers have accomplished this feat more than once. Can you name them? Here are some hints. Sandy Koufax never did it, narrowly missing in '66. Greg Maddux never did it, either, although he came close in '95 when he won 19 of 29 starts. Whitey Ford came close in '56 and '63, but never did it. Steve Carlton never did it, although he won 27 of his 41 starts in '72.
The following all-time greats did it once: Bob Gibson, Randy Johnson, and Tom Seaver. What year do you think Gibson did it? No, it wasn't '68, it was '70. Doc Gooden did it in his great '85 season. Denny McLain did it in '68 when he won 31 games.
Here are three of the four post-1954 pitchers who did it more than once: Roger Clemens did it twice, in '86 and '90. Juan Marichal did it twice, in '66 and '68. Pedro Martinez did it in '02 and would have done it '99, when he won 22 of his 29 starts. Let's give Pedro credit for that season, however, because he would have done it even if he hadn't won that 30th start.
The answer to our trivia question? Here's another hint first: both pitchers to have won 2/3s of their starts in a season more than twice are lefties. The answer? Lefty Grove did it four times, in '28, '30, '31 and '32. And Ron Guidry did it three times, in '78, '83 and '85. They are the only two pitchers since 1920 to have won 2/3s of their starts in a season more than twice.*
Since 1954 a pitcher has started 30 or more games in a major league season more than 3000 times. Only 30 times has a pitcher started 30 or more games and won two-thirds of his starts. That's less than 1% of the 30-start seasons since 1954.
Since 1954 only four pitchers have accomplished this feat more than once. Can you name them? Here are some hints. Sandy Koufax never did it, narrowly missing in '66. Greg Maddux never did it, either, although he came close in '95 when he won 19 of 29 starts. Whitey Ford came close in '56 and '63, but never did it. Steve Carlton never did it, although he won 27 of his 41 starts in '72.
The following all-time greats did it once: Bob Gibson, Randy Johnson, and Tom Seaver. What year do you think Gibson did it? No, it wasn't '68, it was '70. Doc Gooden did it in his great '85 season. Denny McLain did it in '68 when he won 31 games.
Here are three of the four post-1954 pitchers who did it more than once: Roger Clemens did it twice, in '86 and '90. Juan Marichal did it twice, in '66 and '68. Pedro Martinez did it in '02 and would have done it '99, when he won 22 of his 29 starts. Let's give Pedro credit for that season, however, because he would have done it even if he hadn't won that 30th start.
The answer to our trivia question? Here's another hint first: both pitchers to have won 2/3s of their starts in a season more than twice are lefties. The answer? Lefty Grove did it four times, in '28, '30, '31 and '32. And Ron Guidry did it three times, in '78, '83 and '85. They are the only two pitchers since 1920 to have won 2/3s of their starts in a season more than twice.*
Tuesday, March 9
Leverage Adjusted ERA (Or "Not All Runs Are Equal")
It's been surprising to me, given the profusion of new pitching statistics (FIP, VORP, Component ERA), that we haven't seen an expression of ERA or ERA+ that adjusts for leverage, weighing runs allowed in high-leverage situations more and runs allowed in low-leverage situations less. The data is available in the game logs at Baseball-Reference.com, but poring through and aggregating the data would be a tedious exercise. Fangraphs.com aggregates the data on a seasonal basis in the WPA, WPA/LI and Clutch statistics, but expresses the statistics in terms of incremental games won or lost rather than adjusted ERA.
Fangraphs calculates "Clutch" by subtracting WPA/LI, which aggregates the unleveraged increase or decrease in win probabilities associated with each plate appearance against a pitcher, from WPA, which also aggregates the win probabilities but assigns a leverage factor to each event based on the game situation (score, inning, base and out situation). Generally speaking, a pitcher with a positive Clutch factor performed better in high-leverage situations relative to his overall seasonal performance, or declined in performance in low-leverage situations relative to his overall seasonal performance, or some combination of the two. A better performance in high-leverage situations means that the incremental outs the pitcher got in high-leverage situations count for more than an average out (i.e., an out obtained in a game situation with a leverage factor of 1.0). A worse performance in low-leverage situations means that the incremental runs the pitcher allowed in low-leverage situations count for less than the average run (i.e., a run scored in a game situation with a leverage factor of 1.0).
The significance of the Clutch statistic should be obvious: not all runs allowed (and runs prevented) are equal. For example, the run surrendered in the bottom of the ninth of a tie game should be counted differently than the run surrendered in the bottom of the first inning after the visiting took a six run lead in the top half of the inning. ERA and ERA+ count each run the same, notwithstanding that the two runs I used as examples are likely to have had hugely disparate impacts on the outcome of the game. The advantage of expressing the number of leverage-weighted runs allowed as a variation on ERA should also be obvious: most fans will not know whether a Clutch factor of 0.74 is merely above average, or very good, or a spectacular achievement, but fans know how to compare a 116 ERA+ to a 135 ERA+.
Fangraphs calculates "Clutch" by subtracting WPA/LI, which aggregates the unleveraged increase or decrease in win probabilities associated with each plate appearance against a pitcher, from WPA, which also aggregates the win probabilities but assigns a leverage factor to each event based on the game situation (score, inning, base and out situation). Generally speaking, a pitcher with a positive Clutch factor performed better in high-leverage situations relative to his overall seasonal performance, or declined in performance in low-leverage situations relative to his overall seasonal performance, or some combination of the two. A better performance in high-leverage situations means that the incremental outs the pitcher got in high-leverage situations count for more than an average out (i.e., an out obtained in a game situation with a leverage factor of 1.0). A worse performance in low-leverage situations means that the incremental runs the pitcher allowed in low-leverage situations count for less than the average run (i.e., a run scored in a game situation with a leverage factor of 1.0).
The significance of the Clutch statistic should be obvious: not all runs allowed (and runs prevented) are equal. For example, the run surrendered in the bottom of the ninth of a tie game should be counted differently than the run surrendered in the bottom of the first inning after the visiting took a six run lead in the top half of the inning. ERA and ERA+ count each run the same, notwithstanding that the two runs I used as examples are likely to have had hugely disparate impacts on the outcome of the game. The advantage of expressing the number of leverage-weighted runs allowed as a variation on ERA should also be obvious: most fans will not know whether a Clutch factor of 0.74 is merely above average, or very good, or a spectacular achievement, but fans know how to compare a 116 ERA+ to a 135 ERA+.
Wednesday, March 3
Did You Know...
There are eleven pitchers since 1954 who won at least 50% of their starts. Here they are:
Pretty heady company for Gator. I count nine pitchers who are already in the Hall or are mortal locks, and one pitcher - Roy Halladay - who's making a pretty strong case. And then there's Ron Guidry.
Spahn's figure includes just his starts from and after 1954. It is likely that many pitcher's from the pre-1954 era would make this list because pitchers completed a far higher percentage of their starts in those days and accordingly tended to have fewer no-decisions. Pitchers like Walter Johnson, Lefty Grove and Christy Matthewson would probably top this list.
The presence of Guidry, Martinez, Halladay, Mussina and Clemens is impressive because they pitched in an era when pitchers generally incurred no-decisions in more than 20% of their starts. Spahn, Gibson and Marichal, by comparison, had no-decisions in only 12%, 14% and 17% of their starts, respectively.
Pretty heady company for Gator. I count nine pitchers who are already in the Hall or are mortal locks, and one pitcher - Roy Halladay - who's making a pretty strong case. And then there's Ron Guidry.
Spahn's figure includes just his starts from and after 1954. It is likely that many pitcher's from the pre-1954 era would make this list because pitchers completed a far higher percentage of their starts in those days and accordingly tended to have fewer no-decisions. Pitchers like Walter Johnson, Lefty Grove and Christy Matthewson would probably top this list.
The presence of Guidry, Martinez, Halladay, Mussina and Clemens is impressive because they pitched in an era when pitchers generally incurred no-decisions in more than 20% of their starts. Spahn, Gibson and Marichal, by comparison, had no-decisions in only 12%, 14% and 17% of their starts, respectively.
Sunday, February 28
BlyLeverage
Bill James did a piece a few years ago on Bert Blyleven in which he addressed the great mystery surrounding Blyleven's conspicuously mediocre W-L record. While conceding that Bert's critics make some good points - "Blyleven did not do an A+ job of matching his effort to the runs he had to work with" - he ultimately concluded that Bert's biggest problem was his lack of run support, not his failure to pitch better in critical situations. Bill attributed roughly two-thirds of Bert's relatively poor record to lack of run support and one-third to Bert's tendency to pitch relatively poorly in tight games.
Bill's analysis was disappointing in certain respects, however. First, he didn't note that Bert's relatively poor career W-L record is almost purely a function of his performance in the first nine years of his career ('70 to '78). Had Bert compiled a W-L record commensurate with his ERAs and run support in the '70s Bert would already be in the Hall and Bill James and I wouldn't be writing about him. Second, Bill didn't discuss Bert's pertinent statistics from this period that likely explain the disparity between Bert's excellent ERAs during that period and his pedestrian W-L record. As I've previously noted, Bert had terrible record in "late and close situations" in that period, far worse than any premier pitcher of that era that I've examined, and lost a disproportionate number of close games. While it strikes me as reasonable and logical to infer that a pitcher who performs poorly in the late innings of tight games will lose a disproportionate number of close games, I thought I'd look at the records of various pitchers in one-run games and attempt to determine if there is any significant correlation between a pitcher's performance in close games and his record in one-run games.
Bill's analysis was disappointing in certain respects, however. First, he didn't note that Bert's relatively poor career W-L record is almost purely a function of his performance in the first nine years of his career ('70 to '78). Had Bert compiled a W-L record commensurate with his ERAs and run support in the '70s Bert would already be in the Hall and Bill James and I wouldn't be writing about him. Second, Bill didn't discuss Bert's pertinent statistics from this period that likely explain the disparity between Bert's excellent ERAs during that period and his pedestrian W-L record. As I've previously noted, Bert had terrible record in "late and close situations" in that period, far worse than any premier pitcher of that era that I've examined, and lost a disproportionate number of close games. While it strikes me as reasonable and logical to infer that a pitcher who performs poorly in the late innings of tight games will lose a disproportionate number of close games, I thought I'd look at the records of various pitchers in one-run games and attempt to determine if there is any significant correlation between a pitcher's performance in close games and his record in one-run games.
Welcome To The Club, Bill
Bill James is coming out shortly with his Bill James Gold Mine 2010. He has a chapter in the book entitled "Comparing Starting Pitchers Across History." The chapter has been pre-released online and you can read it here.
In this chapter, Bill returns to one of his favorite subjects: Hall of Fame standards for starting pitchers. He's noted many times in the past that Hall of Fame voting in recent years appears to reflect a movement away from traditional HOF standards for starting pitchers toward an emphasis on longer careers and the accumulation of huge career statistics (high career win totals, strikeouts, etc.). If it were up to today's HOF voters would pitchers like Drysdale, Lemon, Newhouser, Bunning, Hunter, Gomez and Dean be in the Hall of Fame? It's not at all clear.
In this chapter, Bill returns to one of his favorite subjects: Hall of Fame standards for starting pitchers. He's noted many times in the past that Hall of Fame voting in recent years appears to reflect a movement away from traditional HOF standards for starting pitchers toward an emphasis on longer careers and the accumulation of huge career statistics (high career win totals, strikeouts, etc.). If it were up to today's HOF voters would pitchers like Drysdale, Lemon, Newhouser, Bunning, Hunter, Gomez and Dean be in the Hall of Fame? It's not at all clear.
Sunday, February 21
On The Subject of "Clutch"
It's a word you hear a lot about in discussions of athletics. It's a given among most sports fans and commentators that some performers are clutch and some aren't. Does anyone dispute that Michael Jordan was clutch? Does anyone dispute that John Elway was clutch? We all remember those game-winning shots and game-winning 4th quarter drives. Those were clutch, right? Ron Guidry's 26 wins in 30 September pennant race starts? That's gotta be clutch, doesn't it? And does anyone really dispute that Derek Jeter is clutch?
Well, yes, some people do dispute that Derek Jeter is clutch. And, frankly, they make some pretty good points. They correctly caution us that we should be careful about placing too much emphasis on "the flip" in '01 in the ALDS against the A's, or the walk-off home run in '01 against the D'backs in game 4 of the 2001 World Series. And they're right about relying on anecdotalism, or isolated instances of "clutch plays", or, more generally, very small sample sets. Those may have been clutch plays, but do they necessarily make Derek Jeter a clutch player? They point out that Derek Jeter in the post-season is pretty much like Derek Jeter in the regular season - almost identical batting average, OPS, and just a little bit more HR power in Oct/Nov than in April to September. Jeter's not being "clutch", they argue; he's merely being Jeter.
Reggie Jackson? Surely a .755 slugging average across five World Series establishes beyond question Reggie's clutch bona fides, right? Well, what about those 11 ALCS series, the skeptics ask. Those were big games, too, and Reggie slugged .380 and had an OBP under .300.
Well, yes, some people do dispute that Derek Jeter is clutch. And, frankly, they make some pretty good points. They correctly caution us that we should be careful about placing too much emphasis on "the flip" in '01 in the ALDS against the A's, or the walk-off home run in '01 against the D'backs in game 4 of the 2001 World Series. And they're right about relying on anecdotalism, or isolated instances of "clutch plays", or, more generally, very small sample sets. Those may have been clutch plays, but do they necessarily make Derek Jeter a clutch player? They point out that Derek Jeter in the post-season is pretty much like Derek Jeter in the regular season - almost identical batting average, OPS, and just a little bit more HR power in Oct/Nov than in April to September. Jeter's not being "clutch", they argue; he's merely being Jeter.
Reggie Jackson? Surely a .755 slugging average across five World Series establishes beyond question Reggie's clutch bona fides, right? Well, what about those 11 ALCS series, the skeptics ask. Those were big games, too, and Reggie slugged .380 and had an OBP under .300.
Saturday, February 20
Bill James Ranks The Lefties
Bill James ranked the 100 greatest pitchers in baseball history in The New Bill James Historical Baseball Abstract. The book was written after the 1999 season (and ultimately published in 2001) and consequently greats like Roger Clemens, Greg Maddux, Randy Johnson, Pedro Martinez and Mariano Rivera are not as elevated in James's rankings as they would be today (ranking 11th, 14th, 29th, 49th and unranked, respectively). Based on more recent commentary by James I think it's pretty clear that Randy Johnson, in particular, would make a huge jump in James's ranking.
Here is James's list of the top 20 southpaws of all time through 1999 (my list is here; open it up in a new window and do a side-by-side comparison of the lists):
A pretty good list, Mr. James. Bill's list is based on his Win Shares analyses, which combine career Win Shares, top three Win Share seasons, Win Shares over the pitcher's best five consecutive seasons and career Win Shares per season. Accordingly, Bill has Sandy Koufax ranked ahead of Steve Carlton despite having barely half Carlton's career Win Shares.
Bill's top 10 would look pretty much the same today, I believe, with two exceptions: Randy Johnson would jump ahead of Koufax and probably even Spahn, taking the two spot behind Grove; and Glavine would likely crack the top 10 given his superior 2000 to 2002 seasons (55-27, 133 ERA+).
It's harder to speculate about what Bill's second 10 would look like today. Bill and his buddy Rob Neyer seem to think fairly highly of Andy Pettitte (Bill has said Andy will likely make the Hall of Fame, and Neyer has said that Andy is "qualitatively" better than Jack Morris). And Johan Santana is probably already on the cusp of Bill's second 10. However, the presence of Wilbur Cooper and Eppa Rixey on Bill's list suggests to me that he perhaps adhered too slavishly to his Wins Shares index, and I don't think either Pettitte or Santana fare too well on that basis (although Santana will certainly get there with another four or five good years).
Bill has seven pitchers ranked ahead of Guidry who I'd ranked behind: Plank, Newhouser, Waddell, Cooper, Pierce, John and Kaat. Conversely, Bill has Gomez ranked one spot behind Guidry whereas I had ranked Gomez one spot ahead of Guidry. The difference between our approaches to Plank and Waddell is easily explainable: I copped out and argued that it was simply to difficult to compare "deadball era" pitchers to post-1920 era pitchers. As for the rest, this is my take on Bill's take:
Here is James's list of the top 20 southpaws of all time through 1999 (my list is here; open it up in a new window and do a side-by-side comparison of the lists):
A pretty good list, Mr. James. Bill's list is based on his Win Shares analyses, which combine career Win Shares, top three Win Share seasons, Win Shares over the pitcher's best five consecutive seasons and career Win Shares per season. Accordingly, Bill has Sandy Koufax ranked ahead of Steve Carlton despite having barely half Carlton's career Win Shares.
Bill's top 10 would look pretty much the same today, I believe, with two exceptions: Randy Johnson would jump ahead of Koufax and probably even Spahn, taking the two spot behind Grove; and Glavine would likely crack the top 10 given his superior 2000 to 2002 seasons (55-27, 133 ERA+).
It's harder to speculate about what Bill's second 10 would look like today. Bill and his buddy Rob Neyer seem to think fairly highly of Andy Pettitte (Bill has said Andy will likely make the Hall of Fame, and Neyer has said that Andy is "qualitatively" better than Jack Morris). And Johan Santana is probably already on the cusp of Bill's second 10. However, the presence of Wilbur Cooper and Eppa Rixey on Bill's list suggests to me that he perhaps adhered too slavishly to his Wins Shares index, and I don't think either Pettitte or Santana fare too well on that basis (although Santana will certainly get there with another four or five good years).
Bill has seven pitchers ranked ahead of Guidry who I'd ranked behind: Plank, Newhouser, Waddell, Cooper, Pierce, John and Kaat. Conversely, Bill has Gomez ranked one spot behind Guidry whereas I had ranked Gomez one spot ahead of Guidry. The difference between our approaches to Plank and Waddell is easily explainable: I copped out and argued that it was simply to difficult to compare "deadball era" pitchers to post-1920 era pitchers. As for the rest, this is my take on Bill's take:
Thursday, February 18
If You Had To Win One Game...
If you had to choose one pitcher to start a critical, late September game in a tight division race, who would you choose? Sabathia? Halladay? Santana? Carpenter?
I know who I would choose, and you know who I'd choose, too, because his picture is to the right. I'd choose Roy Oswalt, the Astros ace, hands down. Year after year Roy has put up Guidry-like numbers in September with the Astros in contention for a division title or wild-card spot. He's as close to infallible in a battle for the post-season as any pitcher of his generation.
Thanks to the three division, wild-card format, the Astros have been in contention for a post-season berth in every year of Oswalt's career other than 2007 and 2009. Oswalt has made 40 September starts in the seven tight races in which he's participated and his record is 28-7 with a 2.49 ERA in 267.1 innings pitched. However, two of his losses came in late September 2002 after the Astros had been eliminated (the only two starts of his 40 September starts that occurred after the Astros had either clinched or been eliminated). Take away those starts and Oswalt is 28-5 with a 2.39 ERA in 38 September starts while the Astros were still in contention.
I know who I would choose, and you know who I'd choose, too, because his picture is to the right. I'd choose Roy Oswalt, the Astros ace, hands down. Year after year Roy has put up Guidry-like numbers in September with the Astros in contention for a division title or wild-card spot. He's as close to infallible in a battle for the post-season as any pitcher of his generation.
Thanks to the three division, wild-card format, the Astros have been in contention for a post-season berth in every year of Oswalt's career other than 2007 and 2009. Oswalt has made 40 September starts in the seven tight races in which he's participated and his record is 28-7 with a 2.49 ERA in 267.1 innings pitched. However, two of his losses came in late September 2002 after the Astros had been eliminated (the only two starts of his 40 September starts that occurred after the Astros had either clinched or been eliminated). Take away those starts and Oswalt is 28-5 with a 2.39 ERA in 38 September starts while the Astros were still in contention.
Saturday, January 30
Demythologizing Bert's Famous "Bad Luck"
The June 1976 SI article was published shortly after Bert's first appearance with the Rangers, in which Bert and Mark "The Bird" Fidrych each went 11 innings, with the Tigers prevailing, 3-2. Including Bert's last two starts with the Twins, this made the third consecutive start where Bert had pitched well and been tied going into the late innings but lost. But the SI article wasn't a product of Bert's disappointing results in tight games over the preceding few weeks. The SI article was prompted by two conspicuous aspects of Bert's record that had persisted for years. First, Bert's W-L record never seemed to match the rest of his record - the superior ERAs, the shutouts, the complete games and the strikeouts. Second, Bert had a propensity to lose a lot of close games in the late innings.
Bert's fanatical supporters always have two deceptively simple explanations at the ready for Bert's mediocre W-L records in the '70s: he received poor run support, and he was unlucky. Each of these two rationalizations offered by Bert's backers fail completely to explain Bert's relatively poor W-L records, and each are particularly absurd for having been offered by people who purport to possess some degree of sophistication in statistical analysis. Each can be dismissed quickly and definitively.
Bert's fanatical supporters always have two deceptively simple explanations at the ready for Bert's mediocre W-L records in the '70s: he received poor run support, and he was unlucky. Each of these two rationalizations offered by Bert's backers fail completely to explain Bert's relatively poor W-L records, and each are particularly absurd for having been offered by people who purport to possess some degree of sophistication in statistical analysis. Each can be dismissed quickly and definitively.
Monday, January 25
Boswell on Blyleven (or, "Bert Backers Bash Boswell")
Of all the commentary in the aftermath of the HOF voting results I was most struck by the following comments by Thomas Boswell during the course of an online chat at the Washington Post website:
"The push for Blyleven drives me crazy. I follwed his whole career. His reputation was that, more than any other top stuff pitcher, he would find a way to lose or not to win. He's just not a HOFer, in my book. He only won 20 games one time and more than 17 only twice! And he pitched in the era when top starters got 4-5 more starts a year and 20 wins was easier. BB had nine seasons with 36-to-40 starts and averaged 38 in those years. When Chuck Tanner got him in Pittsburgh the word went around that Chuck had decided, over BB's protestations, to take him out of late-and-close games because he'd never had the stomach for it. 'Take him out before he can lose.' Tanner never said it in public. But BB's winning opercentage gets better."
Bert Backers immediately denounced Boswell and charged him with trafficking in baseless rumors and innuendo. Sites like this one slammed Boswell, saying "Boswell will die with that 'word went around' crap in his head."
Well, we'll never know what was in Chuck Tanner's head, and Chuck is a classy guy and he ain't sayin'. But we do know the following: Boswell is absolutely correct regarding Blyleven's reputation, and Chuck Tanner did indeed resort to a quick hook with Blyleven beginning in the 1979 season, a strategy that succeeded wildly and was a critical part of the Bucs' march to the World Series that year.
I get the impression a lot of Bert Backers are too young to have closely followed the game back in the '70s, but Bert's reputation as a guy who lost the close ones and stumbled in the late innings of tight games is simply a fact, and also a matter of record. That was Bert's reputation; Boswell remembers it correctly. I remember it, too, and anyone else who followed the game back then would also remember it.
Well, we'll never know what was in Chuck Tanner's head, and Chuck is a classy guy and he ain't sayin'. But we do know the following: Boswell is absolutely correct regarding Blyleven's reputation, and Chuck Tanner did indeed resort to a quick hook with Blyleven beginning in the 1979 season, a strategy that succeeded wildly and was a critical part of the Bucs' march to the World Series that year.
I get the impression a lot of Bert Backers are too young to have closely followed the game back in the '70s, but Bert's reputation as a guy who lost the close ones and stumbled in the late innings of tight games is simply a fact, and also a matter of record. That was Bert's reputation; Boswell remembers it correctly. I remember it, too, and anyone else who followed the game back then would also remember it.
Subscribe to:
Comments (Atom)












