Tag Archives: Ken Phelps

December 26, 2023

Hot and Cold

For a while I thought it was possible to make an improvement to the Beat the Streak Picks by determining the current probability of the hotness or coldness of a hitter. It strikes me as a hidden Markov model (HMM). The idea would be to assign one of three states to a player at a given time; cold, normal, hot. Since it is difficult to know the current state of the player, these hidden states make up the hidden part of the model. On each sequence (a game), the model emits an observation, in this case, the quality of the game.

For example, if you are in the normal state and have a normal game, one would postulate that the player stayed in the normal state. But a high quality game or a low quality game might signify a transition to one of the other states.

Efficient algorithms exist to learn the probabilities of transitioning from one state to another, and also the probabilities of a particular observation given the current state. To train these models, however, we need games that are tagged with a good representation of the state.

Tagging can be difficult, however. One way would be to hire people to look at a game log for a player and find the hot, not, and normal areas of the sequence. I tried that myself, and found it rather difficult to eyeball.

Instead, I used my version of an offensive game score to do the tagging. I looked at a six day sequence going forward from a particular game and took the average game score. Average game scores less than 46 were considered Cold. Average game scores greater than 56 were considered Hot. Those in between were considered normal.

I ran the tagger for six players and generated a series of averages based on the state to see if the tagger worked decently. Here are the results:

PlayerStateGamesABHitsWalksDoublesTriplesHRKBA%GWH
Luis ArraezCold823.280.660.240.040.0000.010.300.20152.4
Luis ArraezNormal4013.781.220.330.200.0220.050.320.32475.6
Luis ArraezHot533.851.920.400.450.0380.080.190.50088.7
Jackie Bradley, Jr.Cold5472.760.380.210.100.0090.030.900.13932.5
Jackie Bradley, Jr.Normal5933.490.950.380.230.0300.130.920.27165.9
Jackie Bradley, Jr.Hot423.671.640.570.330.0950.360.670.44888.1
Freddie FreemanCold1573.210.490.190.090.0000.030.850.15338.2
Freddie FreemanNormal14253.751.080.520.240.0140.150.850.28869.4
Freddie FreemanHot3033.851.640.550.410.0300.340.640.42686.1
Spike OwenCold6432.820.450.270.080.0170.010.360.15837.8
Spike OwenNormal8663.430.990.440.180.0520.040.320.28968.5
Spike OwenHot354.031.860.540.340.0860.170.260.46185.7
Ken PhelpsCold3801.970.320.340.040.0050.060.530.16327.1
Ken PhelpsNormal3442.880.780.680.110.0090.240.670.27157.8
Ken PhelpsHot373.111.430.730.300.0540.430.430.46183.8
Mike TroutCold663.090.390.290.080.0000.080.980.12734.8
Mike TroutNormal10903.671.000.620.190.0330.211.070.27267.5
Mike TroutHot3333.601.530.800.300.0480.410.690.42583.8
Averages per game by State

There are four types of hitters represented here. Freeman and Trout are superstar batters. Owen and Bradley were known more for their defense, and they are seldom hot. They are in fact cold as much as they are normal. Ken Phelps is the power hitter who walks a lot. His profile is similar to the defensive specialist, seldom hot but very often cold. Finally, Arraez is the great hitter with little power, and the lack of power keeps him from being hot often. He is seldom cold, however.

Note that for each player, the hot state is associated with a low strikeout rate, while K rates don’t vary that much between normal and cold. It would seem that in hot states players are seeing the ball well and really driving it.

The column I’m most interested in would be the last one, percentage games with a hit. Note how even the weak hitters are well over 80% when they are hot. If one is trying to choose between Arraez and Freeman on a particular day, knowing if one is hot might make a big difference.

So the idea would be to used this automated tagging to train an HMM. Use the HMM to figure out the probability of a player being hot at a particular part of time, then train a new neural network to take advantage of that parameter.

Wish me luck.

August 31, 2022

Best Batter Today

Tuesday’s game produced no change to the top five in the Baseball Musings Batter Rankings. Aaron Judge of the Yankees goes three for five with his 51st home run to help beat the Angels 7-4. His current home run pace stands at 64 for the season with his most likely total rising to 59. Note that his pace and his most likely total are stating to converge, and the average of the two would at least tie the AL record of Roger Maris. The probability of Judge hitting at least 62 home runs rises to 0.16, or about 1 in 6.

The Heisenberg Uncertainty Principle seems to apply to Paul Goldschmidt of the Cardinals, who seemed to go into a slump once people noticed he had a shot at the triple crown. Since his five RBI game put him in tie with Pete Alonso for the NL lead, Goldschmidt is 3 for 17 with no extra base hits and no RBI. His 0 for 3 with a walk on Tuesday leaves him in second place behind Judge, but Goldschmidt’s probability of winning the triple crown is down to 0.015. His teammate, Nolan Arenado goes one for four in the 5-1 Reds win. Arenado sits in fourth place.

Alex Bregman of the Astros stands third as he draws two walks in a 4-2 win over the Rangers. Freddie Freeman of the Dodgers posts a one for four game for fifth place, Los Angeles beating the Mets 4-3. Freeman closed a gap a tiny bit on Goldschmidt in the battle for the NL batting title.

The best game score of the day belongs to Nick Pratto of the Royals, an 86. The rookie first baseman goes four for five with two home runs and a double in a 9-7 win over the White Sox. Pratto has not delivered many hits this season with a .209 BA, but sixteen of his twenty four hits went for extra bases for a .461 slugging percentage. He’s an all or nothing player, with 46 K in 115 AB. He has the makings of Ken Phelps, his minor league numbers showing lots of walks and power to go with few hits. Note that the upside of Ken Phelps is Mark McGwire.

May 4, 2019

Homer Hitters

Gary Sanchez hit two home runs Friday night, bringing his season total to ten.


It marked the 12th multi-homer game of Sanchez’s career, in just 284 games. Only Ralph Kiner reached 12 multi-homer games faster, in 282 games.

NYPost.com

Sanchez collected 18 hits so far this season, 10 of them home runs. That’s a remarkable ratio of home runs to hits. For players with 1000 plate appearances (PA), here are the top home run to hit ratios in the low mound era (1969 on):

Player Plate App. Hits Home Runs HR to Hit Ratio
Joey Gallo 1380 245 99 0.404
Mark McGwire 7660 1626 583 0.359
Gary Sanchez 1205 269 81 0.301
Chris Carter 2852 536 158 0.295
Rhys Hoskins 1006 213 62 0.291
Aaron Judge 1360 305 88 0.289
Khris Davis 3198 704 203 0.288
Russell Branyan 3398 682 194 0.284
Adam Dunn 8328 1631 462 0.283
Kyle Schwarber 1374 269 76 0.283
Dave Kingman 7429 1575 442 0.281
Ken Phelps 2287 443 123 0.278
Ron Kittle 3013 648 176 0.272
Giancarlo Stanton 4839 1126 305 0.271
Rob Deer 4512 853 230 0.270

We are clearly in an era with a plethora of this type of hitter. Bill James named one type of hitter in this list “Ken Phelps.” They tended to be first-basemen/DH types that hit a lot of home runs, walked quite a bit, but had low batting averages as they struck out often. They were useful, but not stars. Joey Gallo, at the top of the list, fits this description. Judge and Stanton don’t and we’ll see how Gary Sanchez continues to develop. A Ken Phelps who can actually hit (Barry Bonds, for example) is an extremely valuable player.

December 20, 2012

Being Ken Phelps

A few days ago, I wrote about evaluating hitters with three metrics, Bayesian versions of BABIP. One was the classic batting average on balls in play, one for home runs, and one for strikeouts. It then struck me that this might be a good way to find ballplayers similar to each other. Note that the latter two measures, batting average on balls not in play, and strikeouts on balls not in play imply the player’s walk+hbp rate, since those are the only other balls not in play outcomes. Along with the BABIP prior, we have a very good four dimensional description of the batter:

  • How often he puts the ball in play.
  • How often he gets a hit on balls in play.
  • How often he homers.
  • How often he strikes out.

I thought these would well to construct a mean squared difference similarity score. I collected all players with at least 1500 career plate appearances since 1969, the year the mound reached its current height, division play began, and I started watching baseball. To get the score, I square the difference of the four measures with the player of interest, and divide by four. I did not account for playing time, a huge component of value. I was more interested in players with similar skills.

Over the next few days I’ll post various players of interest, starting with Ken Phelps.

Bill James once wrote that you could build a pretty good team with the Ken Phelps of the world. Then the Yankees traded Jay Buhner for him Ken, and no one ever forgot it. Phelps was a low BA, high OBP, high power, high strikeout slugger. In the 1980s, when the game was more concerned about batting average, Ken, and players like him, would constitute a market inefficiency. I tend to think of players like Rob Deer and Adam Dunn in the Ken Phelps class. but that’s not the case:

Ten players most like Ken Phelps, careers starting 1969 or later.
Batter BIPPrior BABIP BABnIP KBnIP MeanSqDist
Ken Phelps 0.72235 0.24540 0.12513 0.45677 0.0000000
Mike Schmidt 0.75470 0.27891 0.13642 0.46876 0.0004647
Carlos Quentin 0.77280 0.25285 0.14747 0.46071 0.0001900
Jose Bautista 0.77154 0.26742 0.12500 0.49454 0.0006372
Ken Griffey Jr. 0.77346 0.28672 0.16570 0.46791 0.0011591
Jason Giambi 0.78201 0.29710 0.12464 0.43696 0.0010219
Prince Fielder 0.77929 0.30287 0.13911 0.46174 0.0011743
Bob Hamelin 0.78594 0.26423 0.11612 0.50780 0.0010133
Carlos Delgado 0.76791 0.30341 0.13518 0.49871 0.0017417
Mark Teixeira 0.78608 0.29189 0.14677 0.48762 0.0011938
Cliff Johnson 0.80709 0.26710 0.12785 0.46902 0.0002095

As you can see, Ken was a fairly unique player, and many of the players in this don’t strike me as Ken Phelps types. It’s tough to match Ken well, due to his extremely low BABIP. I suspect he was an extreme fly ball hitter. When those don’t go out of the park, they often come down in someone’s glove.

Many of the ten closest to Ken or are good hitters. There’s nothing wrong with being in the same class as Mike Schmidt, Ken Griffey, Jr. and Carlos Delgado. Maybe Phelps was more undervalued than even Bill James imagined.

June 26, 2012

The Ken Phelps of Cananda

Ken Phelps played for Seattle in the 1980s. Ken performed well as a hitter in two dimensions; he drew walks and hit for power. He was a late bloomer, and during the height of his career hit .248/.388/.523. In that time he collected 373 hits, 179 extra-base hits, and 336 walks. Nearly half his hits went for extra bases, and he reached by a walk almost as often as a hit. At the time, this type of hitter was under appreciated due to the low batting average, and Bill James called them the Ken Phelps hitters. Rob Deer was in this group, and in some seasons Mark McGwire would fit this bill as well.

Jose Bautista falls into the Phelps category this season. With his 1 for 5 with a home run in the Blue Jays 9-6 win over the Red Sox Monday night, he now owns 63 hits, 32 of them for extra bases, and 48 walks. Looking at his career as a whole, 2011 was the outlier. Even when he languished in Pittsburgh, Jose produced a high number of walks and good power for his batting average. Toronto helped bring out his full Phelps, and now he’s considered one of the dangerous hitters in the game.

April 16, 2010

The Near Perfecto

Larry Stone remembers the near perfect game thrown by Brian Holman nearly 20 years ago, and how former Mariners first baseman Ken Phelps broke it up. He also recounts the trial and tribulations of Holman’s life since leaving baseball.

On the fateful pitch:

Phelps was known to work the count, rarely swinging at the first pitch. But he had decided upon a different game plan.

“Tony said to go up swinging,” he said. “There was no sense taking pitches off him. The way he had been throwing all night, I might get only one good pitch, and it might be the first.

“That was the game plan, and lo and behold, the first pitch was a pretty good fastball, up in the strike zone. I usually don’t offer at that pitch; I’m usually looking for something down. But if it was anything close, I was going to swing.”

Holman: “I threw the pitch in the wrong place. It was supposed to be down and away, but it was up in he middle of the zone, exactly where I shouldn’t have thrown it. It was the only mistake I made.”

I remember that game very well. I had started working with ESPN, and one of my jobs was supporting whomever was anchoring highlights in the studio. It was the second Friday of the season, and ESPN had a deal that if a big event was happening, they could cut to it. This was the first big event. John Saunders was in the studio, and we were watching the game intently. When Holman recorded the last out in the eighth, I turned to John and said, “Phelps will break it up in the ninth.” Sure enough he comes up with two outs and hits the homer, my crowning moment of prognostication.

Thanks to WJ Duffy for the link.