Machine Learning, NBA

Predicting 50+; part 2 of n

Inspired by a recent conversation about machine learning and decision trees, I built my own tree for predicting 50+ PTS. Using the same dataset as in the previous post, I built the decision tree model below in R. It is ~84% accurate with a sensitivity of 64% (of the games that actually reached 50+, the model caught 64%). This is an OK model, and it continues to back up what the earlier regression model found.
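To make the two numbers concrete, here is a minimal sketch of how accuracy and sensitivity fall out of a confusion matrix. It is in Python rather than the R used for the model, and the counts are made up so that they land on 84% and 64%; they are not the actual confusion matrix from this model. Sensitivity is the share of actual 50+ games the model caught, while precision is the share of predicted 50+ games that were right.

```python
def classification_metrics(tp, fp, fn, tn):
    """Summarize a binary confusion matrix.

    tp: predicted 50+, actually 50+
    fp: predicted 50+, actually under 50
    fn: predicted under 50, actually 50+
    tn: predicted under 50, actually under 50
    """
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total,   # share of all games called correctly
        "sensitivity": tp / (tp + fn),   # share of actual 50+ games caught
        "precision": tp / (tp + fp),     # share of predicted 50+ games that were right
    }


# Hypothetical counts for 100 games, chosen only to illustrate the arithmetic.
metrics = classification_metrics(tp=16, fp=7, fn=9, tn=68)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With these illustrative counts, sensitivity and precision come out different, which is why it matters which of the two a reported 64% refers to.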

This is a very simple model. One of the most impactful settings is that I required at least 10 examples in a node before a split could be made. For example, the model looks at the 50+ results in the data, finds the variable with the most impact, and can only split on it if the node holds at least 10 examples. That keeps the model fairly general, but hitting 50+ also isn’t a common occurrence. I ran the data through another algorithm and got back nothing!
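The 10-example rule can be sketched as a guard in front of the split search. This is a minimal illustration in Python, not the R code behind the model: the function names, the Gini criterion, and the toy games are all assumptions. The post does not say which R package was used; if it was rpart, this setting corresponds to `minsplit`.

```python
def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)


def best_split(rows, labels, min_split=10):
    """Return (stat, threshold) for the split that lowers impurity the most.

    Returns None when the node holds fewer than min_split examples,
    or when no split improves on the unsplit node.
    """
    n = len(rows)
    if n < min_split:
        return None  # too few examples: the node stays a leaf
    best, best_score = None, gini(labels)
    for stat in rows[0]:
        for threshold in sorted({r[stat] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[stat] < threshold]
            right = [y for r, y in zip(rows, labels) if r[stat] >= threshold]
            if not left or not right:
                continue
            # Weighted impurity of the two children
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if score < best_score:
                best, best_score = (stat, threshold), score
    return best


# Made-up games for illustration only, not the dataset from the post.
games = [{"FGM": fgm, "FTM": ftm} for fgm, ftm in
         [(12, 5), (13, 9), (14, 4), (15, 8), (16, 6), (18, 10),
          (19, 12), (20, 7), (21, 11), (22, 9), (17, 14), (11, 3)]]
scored_50 = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 0]

print(best_split(games, scored_50))          # 12 examples: a split is allowed
print(best_split(games[:9], scored_50[:9]))  # 9 examples: below the minimum
```

With a rare outcome like 50+, a floor like this stops the tree from carving out a branch for one or two unusual games.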

FGM, FG3M, and FTM are all that matter… plus FG3A in some cases

Let’s test this decision tree against a recent 56-point performance from Trae Young.

  1. He had 17 FGM, so we go left with a yes
  2. FGM < 17? NO! We go right
  3. FTM < 11? NO! He had 15, so we go right
  4. FG3A < 7? NO! He had 12, so we go right and end with a YES
espn.com – ignore Capela
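The walkthrough above can be sketched as a small tree traversal. This is a hedged illustration in Python, not the fitted R model: it encodes only the three splits named in steps 2 to 4 (the first split's threshold isn't stated, so it is left out), and any branch the walkthrough never visits is marked as unknown. The `node` and `predict` helpers are hypothetical names.

```python
def node(stat, threshold, yes=None, no=None):
    """One split: 'yes' is the branch taken when stat < threshold (left),
    'no' is the branch taken otherwise (right). None means unknown here."""
    return {"stat": stat, "threshold": threshold, "yes": yes, "no": no}


# Partial tree: only the path the Trae Young walkthrough describes.
partial_tree = node("FGM", 17,
                    no=node("FTM", 11,
                            no=node("FG3A", 7, no="YES")))


def predict(tree, box_score):
    """Walk the tree and return (path, leaf); leaf is None on an unknown branch."""
    path = []
    current = tree
    while isinstance(current, dict):
        stat, threshold = current["stat"], current["threshold"]
        answer = "yes" if box_score[stat] < threshold else "no"
        path.append(f"{stat} < {threshold}? {answer}")
        current = current[answer]
    return path, current


trae = {"FGM": 17, "FTM": 15, "FG3A": 12}  # values quoted in the walkthrough
path, prediction = predict(partial_tree, trae)
for step in path:
    print(step)
print("Prediction:", prediction)
```

Each box score answers one question per node until it lands on a leaf, which is all the tree is doing in these examples.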

Cool! Let’s try another example!

Next, the 50+ performance of Jaylen Brown:

espn.com
  1. He had FGM = 19
  2. So we go to the right after this first node, to FG3M
  3. He had 5 FG3M, so again we go to the right, this time to FTM < 5
  4. He had FTM = 7
  5. The model would have predicted a yes

Steph Curry had a 50 PT game earlier this year:

espn.com
  1. Steph had 14 FGM, so we go left this time, twice, to FTM < 18
  2. He had 13 FTM, so we go left again to FG3M < 10
  3. He had 9 FG3M… LEFT AGAIN to FGM < 16
  4. We know he had fewer than 16 FGM, so that is a YES, which makes us go RIGHT to FTM < 11
  5. Steph had more than 11 FTM, so that is a YES and goes RIGHT to FG3A < 8
  6. FG3A was 19, so NO! The model would have predicted Steph not to hit 50+
