This analysis is based on scores and stats from individual rounds in the last ten World Wide Technology Championships: 4,100 rounds in total. Note: this event was played at El Camaleon Golf Course until 2022.
Key Points:
Differences in 2023 at El Cardonal at Diamante Cabo San Lucas: The 2023 event saw a decrease in correlation for Scrambling and Driving Accuracy, reflecting course layout differences from El Camaleon.
Key Points:
Differences in 2023 at El Cardonal at Diamante Cabo San Lucas: The 2023 event showed a slight decline in the Par-4 correlation, likely due to layout adjustments, with an increase in the Par-5 correlation indicating more impactful scoring on these holes.
Random Forest Regressor is an ensemble learning method that constructs multiple decision trees during training and outputs the average prediction. It combines the predictions of several models to improve accuracy and robustness.
Feature importance is a technique used to interpret a machine learning model. It refers to the score that quantifies the contribution of each feature to the prediction made by the model.
In a Random Forest, the importance of a feature is computed by looking at how much the feature decreases the impurity (e.g., variance for regression tasks) across all the trees in the forest. The more a feature decreases the impurity, the more important it is considered.
The calculated importance scores for all features are then normalized to give relative importance as a percentage. This shows the relative contribution of each feature to the prediction task.
Features with high relative importance percentages have a strong impact on the model's predictions. They are crucial for accurate predictions and indicate key areas where performance matters most.
Features with low relative importance have a minimal impact on the model's predictions. While they can still contribute, they are less critical.
Key Points:
Comparison with PGA Tour Averages: The traditional metric importances differ slightly from the PGA averages (PPGIR: 30.13%, Scrambling: 27.02%, GIR: 29.77%) with PPGIR and Scrambling showing higher importance. The data suggests an increased emphasis on short game and recovery for the analysed tournaments.
Key Points:
Comparison with PGA Tour Averages: The importance levels align closely with PGA averages (Par-3: 17.32%, Par-4: 67.12%, Par-5: 15.56%), confirming that Par-4 performance is critical, while Par-5 offers scoring opportunities.
The table below shows the top-5 ranked players and their average estimated scores from the different Random Forest models above.
Rank | Surname | Firstname | Average Predicted Score |
---|---|---|---|
1 | Shipley | Neal | 68.93 |
2 | Noh | Seung-Yul | 69.08 |
3 | Smotherman | Austin | 69.09 |
4 | Vegas | Jhonattan | 69.12 |
5 | Hoey | Rico | 69.18 |
Estimated scores for all players can be found here.