College Basketball Analytics, Insights, and Trends
Note: The steady decline in win percentage early in the season is typical. Non-conference games often involve high-major teams playing "buy games" (or "body bag games") against small conference opponents at home. These mismatches are easier to predict, inflating early-season accuracy.
| Model | Games | Win Accuracy ℹ️ | Score MAE ℹ️ | Spread MAE ℹ️ | Total MAE ℹ️ | RMSE ℹ️ |
|---|
Our general philosophy is to start as simple as possible and meticulously and methodically investigate features, engineered features, and feature importances one-by-one.
The Progression:
Pace * Efficiency term. This validates that a learned coefficient on the fundamental scoring equation performs at least as well as the simple multiplication.Phase 2: We have accomplished our Phase 1 goal: a framework and pipeline for researching, validating, and deploying ML models. We are now deep into Phase 2, involving data-driven feature engineering, new objective functions, and more advanced model architectures and ensembling.