Client: PlaylistPro
Executive Summary: PlaylistPro, a subscription-based music streaming service, faced a critical business challenge: nearly 50% of subscribers were churning, directly threatening recurring revenue and long-term profitability. This project delivered a comprehensive analytics solution combining descriptive analytics, predictive modeling (XGBoost, Random Forest), and prescriptive optimization (Mixed-Integer Linear Programming) to identify at-risk customers, quantify churn risk, and optimize retention resource allocationâachieving a 2,319% ROI on targeted interventions.
Technical Skills Demonstrated: Python ⢠R ⢠Machine Learning (XGBoost, Random Forest, Logistic Regression) ⢠Optimization (Gurobi MILP) ⢠Statistical Analysis ⢠Data Visualization ⢠Business Strategy
What happened?
Analyzed 125K customers across 19 variables. Discovered behavioral patterns (not demographics) drive churn: low listening hours, high skip rates, subscription pauses.
Who will churn?
Built 4 ML models. XGBoost achieved 94% AUC, ranking customers by churn risk. Top predictors: subscription type, service inquiries, listening hours.
How to act optimally?
MILP optimization under budget constraints. Achieved 2,319% ROI by targeting 75 high-value customers, balanced across tiers with fairness constraints.
The numbers told a stark story: 51.3% of PlaylistPro's 125,000 customers had churned. This wasn't a small segment or niche issueâit was a systemic retention crisis threatening the company's survival. With churn and retention nearly equal, every customer decision mattered.
The business question was urgent: How can we identify customers at highest risk of churning before they leave, and deploy retention resources optimally to protect revenue?
The company's reactive approach wasn't working. Marketing dollars were wasted on loyal customers who would have stayed anyway, while truly at-risk users received insufficient attention. PlaylistPro needed a data-driven system to predict churn risk and guide intervention strategies.
This analysis directly addresses the needs of multiple organizational stakeholders:
Analyzing 125,000 customer records across 19 behavioral and demographic variables, we discovered that no single factor alone explained churn. Instead, churn emerged from patterns of behavior that, when combined, signaled customer disengagement. The business needed a sophisticated model to detect these patterns early.
Correlation analysis reveals behavioral factors cluster together, driving churn patterns
We built and tested four machine learning models to predict which customers would churn: Logistic Regression (baseline), Stepwise Logistic Regression (feature selection), Random Forest (ensemble), and XGBoost (gradient boosting).
XGBoost emerged as the winner with 84.79% accuracy and an AUC of 0.94âmeaning it could reliably rank customers by churn risk. But accuracy alone wasn't the goal. We needed to understand why the model made its predictions so we could guide business strategy.
Across all models, three factors consistently ranked as the most powerful churn predictors:
This cross-model consistency gave leadership confidence: these weren't statistical artifactsâthey were real, actionable business levers.
Prediction alone doesn't drive business valueâoptimization determines how to act on those predictions. I developed a Mixed-Integer Linear Programming (MILP) model using Gurobi to optimize retention spending under PlaylistPro's budget and operational constraints.
This wasn't a simple ranking problem. The optimization had to balance competing objectives:
The MILP model optimizes binary decision variables (contact/no contact for each customer) subject to linear constraints. The objective function maximizes: ÎŁ (churn_probability Ă CLV Ă retention_effectiveness - intervention_cost) for selected customers.
Key constraints implemented:
At $150 Weekly Budget:
Sensitivity Analysis: Testing budget scenarios from $50 to $1,000 per week revealed that marginal returns diminish after $400-$500, defining the optimal spending range. This allows leadership to make data-driven budget allocation decisions with clear ROI expectations.
The optimization doesn't just maximize ROIâit ensures equitable treatment across customer segments while intelligently targeting based on churn risk and customer lifetime value.
Technical depth: Formulated a constrained optimization problem, implemented it in Gurobi (industry-standard solver), and validated the solution with sensitivity testing across multiple parameter ranges.
Business acumen: Translated abstract model outputs into actionable metrics (ROI, budget efficiency curves, fairness guarantees) that non-technical stakeholders can use to make strategic decisions. The model doesn't just find optimal solutionsâit explains why those solutions matter financially.
The optimization framework guides a real-time, risk-based intervention system that personalizes retention offers:
This tiered approach optimizes every dollar spent on retention. Low-risk customers don't get unnecessary discounts. High-risk customers receive compelling offers calibrated to their likelihood of churning.
1. Launch Engagement Campaigns Immediately
Weekly listening hours is the dominant predictor. Marketing should deploy personalized email nudges featuring new releases in users' favorite genres, push notifications for followed artists, and weekly listening summaries that gamify usage. Monitor engagement as a leading indicator and trigger retention offers when usage drops.
2. Redesign Free and Student Tier Strategy
These tiers exhibit dramatically higher churn. Leadership must decide: Are they effective acquisition funnels worth the churn cost, or should resources focus on converting and retaining paying customers? Consider tier-specific campaigns emphasizing upgrade paths and transition discounts.
3. Proactively Address Service Friction
High inquiry frequency predicts churn because it signals unresolved issues. Flag customers who contact support multiple times within a short window for retention follow-up. Analyze inquiry themes (billing issues, technical bugs, feature confusion) to identify and fix systemic product problems.
4. Improve Content Recommendation Quality
Higher skip rates correlate with churn. Invest in recommendation algorithms that dynamically adjust based on skip behavior. Introduce genre exploration features and test whether giving users more control over recommendations reduces skip rates and improves retention.
5. Segment Strategies by Age
The U-shaped age-churn relationship requires targeted approaches. For younger users, emphasize social features and gamification. For older users, simplify the interface and provide curated playlists. Middle-aged users benefit from family plans and loyalty rewards.
Explore the complete analytical methodology, model diagnostics, optimization implementation, and statistical validation: