Is Uber a Substitute or Complement for Public Transit: A Machine Learning Approach

Code Author: Shuhan Yang

Project Overview

This project investigates whether Uber acts as a substitute for or a complement to public transit across metropolitan areas. Using double machine learning on panel data, we estimate the causal effect of Uber's presence on public transit ridership.

Key Finding: Uber's presence is associated with a statistically significant 4.28% increase in public transit ridership, suggesting a complementary rather than substitutive relationship.

Data

Dataset Size: 76,000+ panel observations
Coverage: Multiple Metropolitan Statistical Areas (MSAs)
Time Period: Multi-year panel data
Key Variables: Transit ridership, Uber market presence, demographic controls, economic indicators

Methodology

Core Approach

Double Machine Learning (DML): Applied to estimate causal effects while controlling for high-dimensional confounders
Panel Data Analysis: Exploited temporal and cross-sectional variation
Robustness Checks: Multiple model specifications and sample restrictions
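
The partialling-out form of double machine learning can be sketched as follows. This is a minimal illustration under stated assumptions, not the notebook's actual code: the data are synthetic, the names `dml_partial_out` and `ols_fit_predict` are hypothetical, and ordinary least squares stands in for the nuisance learner so the example needs only NumPy. Any learner with the same `fit_predict(X_train, y_train, X_test)` shape (for example a Lasso or a random forest) could be passed in instead.

```python
import numpy as np

def ols_fit_predict(X_train, y_train, X_test):
    """Stand-in nuisance learner (assumption): OLS with an intercept."""
    A = np.column_stack([np.ones(len(X_train)), X_train])
    coef, *_ = np.linalg.lstsq(A, y_train, rcond=None)
    return np.column_stack([np.ones(len(X_test)), X_test]) @ coef

def dml_partial_out(y, d, X, fit_predict=ols_fit_predict, n_folds=5, seed=0):
    """Cross-fitted estimate of theta in y = theta * d + g(X) + noise."""
    n = len(y)
    folds = np.array_split(np.random.default_rng(seed).permutation(n), n_folds)
    y_res = np.empty(n)
    d_res = np.empty(n)
    for test_idx in folds:
        train = np.ones(n, dtype=bool)
        train[test_idx] = False
        # Residualize outcome and treatment using models fit on the other folds.
        y_res[test_idx] = y[test_idx] - fit_predict(X[train], y[train], X[test_idx])
        d_res[test_idx] = d[test_idx] - fit_predict(X[train], d[train], X[test_idx])
    # Final stage: regress outcome residuals on treatment residuals.
    theta = (d_res @ y_res) / (d_res @ d_res)
    eps = y_res - theta * d_res
    # Heteroskedasticity-robust standard error of the final-stage slope.
    se = np.sqrt(np.sum((d_res * eps) ** 2)) / (d_res @ d_res)
    return theta, se

# Synthetic example: the true effect is 0.05 and X confounds both d and y.
rng = np.random.default_rng(42)
n, p = 2000, 10
X = rng.normal(size=(n, p))
d = X[:, 0] - 0.5 * X[:, 1] + rng.normal(size=n)
y = 0.05 * d + X[:, 0] + 0.5 * X[:, 2] + rng.normal(size=n)

theta_hat, se_hat = dml_partial_out(y, d, X)
print(f"theta_hat = {theta_hat:.3f} (se {se_hat:.3f})")
```

Cross-fitting keeps each observation's residual out of the sample used to fit its nuisance models, which is what allows flexible learners in the first stage without biasing the final estimate.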

Machine Learning Models Implemented

Lasso Regression: For feature selection and regularization
Random Forest: For non-linear pattern detection
Cross-Validation: For model selection and hyperparameter tuning
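
As an illustration of how cross-validation can choose the Lasso penalty, the sketch below implements a small coordinate-descent Lasso and a K-fold search over a penalty grid. It is a NumPy-only stand-in written for this README, not the project's implementation (which may well rely on a library such as scikit-learn); the function names, the penalty grid, and the synthetic data are all assumptions.

```python
import numpy as np

def lasso_cd(X, y, alpha, n_iter=50):
    """Coordinate descent for (1/2n)*||y - X b||^2 + alpha*||b||_1, no intercept."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    resid = y.astype(float).copy()
    for _ in range(n_iter):
        for j in range(p):
            resid += X[:, j] * beta[j]            # drop feature j from the fit
            rho = X[:, j] @ resid / n
            # Soft-thresholding update for coefficient j.
            beta[j] = np.sign(rho) * max(abs(rho) - alpha, 0.0) / col_sq[j]
            resid -= X[:, j] * beta[j]            # add feature j back
    return beta

def cv_select_alpha(X, y, alphas, n_folds=5, seed=0):
    """Return the penalty with the lowest mean out-of-fold squared error."""
    n = len(y)
    folds = np.array_split(np.random.default_rng(seed).permutation(n), n_folds)
    cv_mse = []
    for alpha in alphas:
        fold_mse = []
        for test_idx in folds:
            train = np.ones(n, dtype=bool)
            train[test_idx] = False
            beta = lasso_cd(X[train], y[train], alpha)
            fold_mse.append(np.mean((y[test_idx] - X[test_idx] @ beta) ** 2))
        cv_mse.append(np.mean(fold_mse))
    return alphas[int(np.argmin(cv_mse))], np.array(cv_mse)

# Synthetic data: only the first three of ten features matter.
rng = np.random.default_rng(0)
n, p = 300, 10
X = rng.normal(size=(n, p))
true_beta = np.zeros(p)
true_beta[:3] = [2.0, -1.5, 1.0]
y = X @ true_beta + rng.normal(size=n)

alphas = [0.01, 0.05, 0.1, 0.5, 1.0]
best_alpha, cv_mse = cv_select_alpha(X, y, alphas)
beta_hat = lasso_cd(X, y, best_alpha)
print("selected alpha:", best_alpha)
print("nonzero coefficients:", np.flatnonzero(beta_hat))
```

The same fold structure applies to tuning a random forest; only the model fit inside the loop and the grid being searched would change.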

Key Features

Advanced Econometric Methods: Double machine learning implementation for causal inference
Comprehensive Robustness Checks: Multiple model specifications and sample restrictions
Heterogeneity Analysis: Treatment effect variation by agency and MSA characteristics
Scalable Data Processing: Efficient handling of 76,000+ observations
Statistical Validation: Rigorous testing of model assumptions and results
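
One simple way to look at heterogeneity is to repeat the final-stage regression within subgroups of already-residualized data. The sketch below assumes outcome and treatment residuals are available from a first stage; the group labels `large_msa` and `small_msa`, the function name, and the simulated effects are hypothetical and are not results from this project.

```python
import numpy as np

def effects_by_group(y_res, d_res, groups):
    """Residual-on-residual slope and robust standard error within each group."""
    results = {}
    for g in np.unique(groups):
        m = groups == g
        theta = (d_res[m] @ y_res[m]) / (d_res[m] @ d_res[m])
        eps = y_res[m] - theta * d_res[m]
        se = np.sqrt(np.sum((d_res[m] * eps) ** 2)) / (d_res[m] @ d_res[m])
        results[str(g)] = (theta, se)
    return results

# Synthetic residuals: effect of 0.08 in one group and 0.0 in the other.
rng = np.random.default_rng(1)
n = 4000
groups = np.where(rng.random(n) < 0.5, "large_msa", "small_msa")
d_res = rng.normal(size=n)
true_effect = np.where(groups == "large_msa", 0.08, 0.0)
y_res = true_effect * d_res + rng.normal(scale=0.5, size=n)

for name, (theta, se) in effects_by_group(y_res, d_res, groups).items():
    print(f"{name}: theta = {theta:.3f} (se {se:.3f})")
```

Comparing the group estimates against their standard errors gives a first indication of whether the effect differs across agency or MSA characteristics.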

Core Analysis Pipeline

Data Preprocessing: Panel data cleaning and variable construction
Feature Engineering: Creation of interaction terms and control variables
Model Training: Implementation of ML algorithms with cross-validation
Causal Estimation: Double machine learning for treatment effect estimation
Robustness Testing: Multiple specifications and sample restrictions
Heterogeneity Analysis: Subgroup analysis by MSA characteristics
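
For the preprocessing step, a common way to exploit within-unit and within-period variation in a panel is the two-way within transformation, which removes unit and time means before estimation. The sketch below is an assumption about how such a step could look, not the notebook's code: the panel is synthetic and balanced, and the names `group_means` and `two_way_demean` are hypothetical.

```python
import numpy as np

def group_means(values, ids):
    """Mean of `values` within each id, broadcast back to every row."""
    _, inv = np.unique(ids, return_inverse=True)
    return (np.bincount(inv, weights=values) / np.bincount(inv))[inv]

def two_way_demean(values, unit_ids, time_ids):
    """Two-way within transformation; exact for balanced panels."""
    return (values - group_means(values, unit_ids)
            - group_means(values, time_ids) + values.mean())

# Synthetic balanced panel: 50 units observed over 24 periods.
rng = np.random.default_rng(7)
n_units, n_periods = 50, 24
unit_ids = np.repeat(np.arange(n_units), n_periods)
time_ids = np.tile(np.arange(n_periods), n_units)

# Staggered treatment: a unit is treated from its entry period onward;
# entry periods of 24 or later mean the unit is never treated in the sample.
entry = rng.integers(6, 30, size=n_units)
d = (time_ids >= entry[unit_ids]).astype(float)

unit_fe = rng.normal(size=n_units)[unit_ids]
time_fe = rng.normal(size=n_periods)[time_ids]
y = unit_fe + time_fe + 0.05 * d + rng.normal(scale=0.1, size=len(d))

y_w = two_way_demean(y, unit_ids, time_ids)
d_w = two_way_demean(d, unit_ids, time_ids)
theta_fe = (d_w @ y_w) / (d_w @ d_w)
print(f"within estimate of the treatment effect: {theta_fe:.3f}")
```

The demeaned variables can then feed the model-training and causal-estimation steps, so that the learners work with variation left over after unit and period effects are removed.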
