The Predictive Value of Streaks
J.J. Hardy is a pretty streaky hitter. It is a generally accepted consensus, and one I do not dispute. However, what we can look at is the predictive value of streaks, and how to determine if a streak will continue or if a good or bad streak is starting. So, here we will dive into some information.
This graph shows that yes, J.J. has had pronounced changes in his batting wOBA over the course of the past three years-- a significantly up and down pattern, compared to most hitters.
Take everything I present here with a small sample disclaimer. The Book clearly proves that a large group of "hot hitters" perform, on average, just as you'd expect them to in the next game following a week of hot hitting. I am more interested in how Hardy specifically performs in this situation, to provide a real-life example of a player most of us consider to be very streaky.
For Hardy, I'll make the best of the data available and define a "hot streak" as consecutive games reaching base 2 or more times. In his career, J.J. has one such streak of 7 games, 2 of 6 games, and 3 or 5 games. This seems like a good place to start. For each of these streaks, I will take the next week of data and compare the lines.
During the streak, and then over the next 7 games:
Streak 1
.467 .543 .833
.310 .394 .655
Streak 2:
.500 .533 .679
.120 .120 .280
Streak 3:
.583 .615 .958
.276 .290 .724
Streak 4:
.652 .667 1.348
.280 .379 .680
Streak 5:
.368 .455 .526
.148 .179 .185
After the samples of some hot hitting, J.J. performed over his career averages twice, and well under them twice. Seems that those weeks following the hot weeks are a pretty random group, something you'd expect if you pulled miscellaneous week samples out of his season.
Let's switch to the cold streaks. Do exceptionally bad stretches by J.J. have any predictive value? He has streaks of 12, 9, 7, 6, and 6 games with 1 or fewer times on base. Again, the format is the slash line during the streak and then the 7 games directly after the streak ended.
Streak 1
.154 .154 .346
.481 .533 .630
Streak 2
.079 .103 .184
.212 .278 .394
Streak 3
.000 .143 .000
.217 .308 .261
Streak 4
.087 .125 .087
.200 .259 .240
Streak 5
.115 .179 .115
.346 .433 .500
There seems to be a bit more possibility here. Two of the samples had J.J. hit well above his career averages, and the other three saw him improve from the cold streak but still stay below his career averages.
The first example is most interesting to me. For 52 at-bats last September, Hardy couldn't hit anything. .154/.154/.346 is a pathetic line, so should he have been benched (in reality or fantasy)? Well, a funny thing happened next: he went 3/5 in the very next game and went on to get on base more than 50% of the time in the next week.
These findings are pretty much what I would have expected to find, even with the problems in the data collection-- the nature of this type of data is that the very next game in the sequence is going to be a 0 or 1 times on base game for the hot streaks or a 1 or more times on base game after the cold streaks. Still, the point remains. There is almost no way to predict streaks for hitters. A player who is hot can go cold without warning. On the final day of every one of these streaks, a manager-- in fantasy or reality-- might have asked themselves if J.J. was going to snap out of the slump, or stay hot. And both types of managers would usually be best served by letting the superior player play and not trying to guess who has the "hot hand".
In general, I would highly recommend evaluating players based on their true skill level and awarding playing time on that basis. In some rare scenarios (Corey Hart's second half of 2008 might qualify), it might be justifiable to play a worse player due to a prolonged slump. But in general, even one of the streakiest players cannot give you a good idea of what will happen next. Adhere to the bigger picture.
Is J.J. Hardy a streaky hitter? Yes. I would argue that streakiness is not a bad thing, and may even be a good thing. My main point is that streakiness does not help us predict what will happen immediately following a streak-- and for that reason, there is little value in considering streaks. The hot kind are fun to watch, but there is little reason to form baseball philosophy based on them.
(Resources: Baseball-Reference Play Index, The Book)
24 comments
|
0 recs |
Do you like this story?
Comments
Interesting stuff
Sometimes the most illuminating research is done when you set out looking for something and can’t find it.
"The reports are that he is getting better. The definition of better is nebulous."
Good stuff
I thought this post was going to be about Wil Ferrell in Old School.
September 15: Not a bad little Monday
You have been pulled over

Also Hyatt— I think you’re looking for “you’re”, not “your”.
Seriously guys, don’t take it personally. High school English students just are this way.
The artist formerly known as jihad.
I wasn't in high school...
and apparently most of my college classmates aren’t either. Ironically, two of my professors this year are very big sticklers with grammar. I don’t think they follow my line of thinking: that’s what an editor is for. :)
I untuck my shirt!
exactly
As long as your content is good, pass it on to the editor. Without our bad grammar, editors become unemployed. I’d go so far to say, without bad grammar, the economy would be completely collapsed by now.
one of my professors told us today, in a joking way
that if we improve our writing grammar we’ll make a million more dollars in our lifetimes (we’re english/creative writing majors). But for me the most important thing is the ideas that are getting across. Yeah, some say bad grammar leads to a problem is getting ideas out accurately or it distracts…I guess I feel it shouldn’t unless it’s so blatantly bad.
I untuck my shirt!
No, I agree
I was really just joking. I don’t really care about grammar at all in comments. And I have Firefox to save me, too.
The artist formerly known as jihad.
Luck
Until BatterFX exists, and the data for things like speed of ball off bat is available, is it possible to use PBP data to determine when J.J. was simply unlucky or lucky for a streak?
Or even looking at videotape to identify those rocketing liners hit straight at someone and the dribblers hit to Randy Johnson?
I would think that almost all of the "hot streaks" would have been aided or moslty caused by luck
And cold streaks bad luck. That point would tend to agree with my point that streaks don’t have much, if any, predictive value.
You could just look at BABIP over those streaks. I wish I would have saved and shared the lists so you guys could look them over. But aren’t hot streaks by nature just more balls falling in and cold streaks having those same balls find gloves? Unless you can predict the day that the luck changes, trying to play the hot hand seems to be a pretty poor strategy.
HitFX will be aweseome if it ever gets going, I’d imagine it’s one of the missing links in BABIP (initial velocity, of course). And it’s good to see I brought one commenter back into the fray :).
The artist formerly known as jihad.
Eh
I’m reluctant to say that putting together a string of similar events is caused by luck. Slightly aided? Yes. But isn’t everything slightly aided by luck? Stretches of good play are caused by playing well, and I’m guessing that BABIP will show that. Even so, I’m not sure that BABIP would be a completely effective measure. When a player is hitting at his best, he is generally hitting the ball to all fields. On a hot streak, he should be able to place the ball where he wants to, raising his BABIP.
If the Brewers win 10 games out of 12, why did they win those games? Were they lucky, or did they play really well?
IIRC
This topic originally came from the ol’ fantasy baseball mailbag. I suppose it’s worth pointing out that part of the risk in jumping on a JJ cold streak is whom you’re replacing him with. In real life, you’re saying “I’m so sure JJ is on a cold streak that I’m going to start Craig Counsell for a week.” Yikes.
On your fantasy team, though, your backup might be Rafael Furcal, in which case the risk/reward isn’t so severe.
Personally, I hate streaky hitters on my fantasy baseball team, although it is fun to sell high on a streaky hitter and watch him sink the other team when the cold snap hits. :D
"I will agree that the attitude [at BCB] is ridiculous and they have done so much to instigate animosity and then block us from responding. Real mature!"
Can you combine the numbers?
What was Hardy’s overall line in all hot streaks and all weeks after hot streaks? Same for cold streaks. How do his week after stats compare to his overall hitting line? How about his overall hitting line with the recent streak removed?
One issue you’ll have is selective sampling. Once a hot streak ends, the next game is, by definition, a game with 0 or 1 times reaching base, which will bring down the week after numbers just a touch.
Beyond the Boxscore // Calling BJ Upton lazy is lazy.
I'm going to do this when I get a chance to
I wrote this above:
These findings are pretty much what I would have expected to find, even with the problems in the data collection— the nature of this type of data is that the very next game in the sequence is going to be a 0 or 1 times on base game for the hot streaks or a 1 or more times on base game after the cold streaks.
I could go a lot more in-depth with this at some point. We had a discussion in a fantasy baseball thread about if you should have a “consistent” backup for a streaky hitter like Hardy. My point was that you can’t predict when the streak is starting or stopping, and I was looking for an excuse to play with BR-PI. I will try to get the total lines, though.
The artist formerly known as jihad.
I believe that was me saying that you should keep a valuable backup for a streaky player
But I was talking about hot stretches of play that last about a month. When Hardy gets hot, he is among the elite in the majors. However, when he’s cold, he’s nothing to write home about — and sometimes it can get downright ugly. It had nothing to do with an actual streak.
That said, I don’ t see why this data would change much by adjusting the type or length of excellent play. I also don’t know how we could accurately quantify a hot stretch. Perhaps if we looked at 2 weeks of good play and how the next 2 weeks went. Or maybe take those same hot weeks and see how the next month was.
Also, the hot streaks we showed: Were those home stands followed by road trips? That could be a factor that could affect his production.
The more I think about this
I’ll do some more work on this subject, and report back to you guys either Friday, this weekend, or early next week.
The artist formerly known as jihad.
No mention of illness, injury or position in the lineup. I guess mentioning batting in front of the pitcher wouldn’t be politically correct.
I personally think Jordan should send his articles to you before submitting. So many good suggestions, all offered so constructively!
Also, cheese.
by Jeff Sackmann on Mar 6, 2009 11:18 AM CST up reply actions
Here we go again
First, if any player is injured or ill, you’d probably want a backup to play, wouldn’t you think? So I’m just going to look at your lineup position claim.
7th
.196 .288 .261
2nd, 5 games
.250 .250 .375
Then he got moved back to 6 and 7. His week after being moved up to 2:
.286 .333 .500
I have no idea what you are arguing, or why it is relevant. My thesis is: You cannot predict when streaks will start or end, therefore it is a poor strategy to “play the hot hand” with a backup when the starter is a superior player by far.
I don’t even know why I respond to your cryptic little comments anymore, because you just ignore my responses. I have no idea what your claim is here, (if a guy is injured, ill, or moved in the lineup, you should consider playing a backup?) and we probably never will. So how about accepting a quick post on whether Hardy’s streaks have predictive value for what it is, instead of leaving these critical, cryptic comments?
If you have a genuine objection to the method of research, go ahead and bring it up. But I’m not getting your references about political correctness or what you think about the factors you mentioned, and this is a trend. Think it was poorly written, too?
The artist formerly known as jihad.

by 





























