2025 / UC Berkeley (IEOR 242A)

NBA Career Trajectory Prediction

Predict career length, survival, and awards from early-career data

Overview

Built a reproducible pipeline to forecast NBA career outcomes using rookie + sophomore season data, draft context, and advanced stats, delivered as a report, codebase, and Streamlit app.

Problem

Front offices must evaluate long-term player trajectories using limited early-career signals. The goal was to predict career length, survival probability, and award likelihood with transparent, reproducible models.

Role

Team project (IEOR 242A final project)

Timeline

Fall 2025

Tools

Python / pandas / scikit-learn / Streamlit / joblib / LaTeX

Data

- Historical player tables (draft, combine, per-game, advanced stats)
- Coverage through 2025/26 labels per report; processed into a modeling table
- Targets: career length, survival threshold, All-Star, MVP

Approach

- Built a unified modeling table from raw CSVs and standardized features
- Trained L1-regularized regression for career length and L1-logistic classifiers
- Calibrated sparse award probabilities and blended predictions with nearest analogs