The Components of Offense: Turning the Lurk into a Feature

One of the challenges in data science within the NBA is diagnosis. Diagnosis is the process of collecting and dissecting data in an effort to reveal and understand the components of such data such that we can extrapolate intelligence. Within many NBA analytics circles, this is done heuristically: discussion of a question, hypothesizing new variables,…

Introduction to Pandas Using Play-By-Play

By popular demand, attached you will find basic course materials that I developed for a sports analytics course taught at UW-Madison. The goal is simple: introduce Pandas and show how column manipulations, groupings, and report building could be accomplished. This was a working document at the time; and has not been updated since the course.…

Stochastic Tracking

In the era of tracking data, a need for a new style of analysis has emerged. Long gone are the regularized regression models and the simple counting techniques. Instead, we require leveraging shot-noise distributed systems such as Dan Cervone’s competing risks model, or Matthias Kempe’s self-organizing maps, or Peter Carr’s Imitation Learning. The list is…