BikeSaferPA
A project in which I build BikeSaferPA, a gradient boosted decision tree classifier designed to predict severity of bicycle crashes in PA based on crash input data.
- BikeSaferPA is trained on a PENNDOT dataset of over 26,000 bicycle crashes in PA from 2002-2021.
- The project involved data procurement and cleaning, visualizations, a feature engineering pipeline, and a rigorous model selection process culminating in the BikeSaferPA model.
- I investigated the importance of various features in explaining the model's predictions via SHAP value analysis, and used these results to make policy recommendations for improving safety outcomes for cyclists.
- I used Streamlit to design and built an easy-to-use BikeSaferPA web app, a suite of tools which enable the user to visualize the data and experiment with the BikeSaferPA model.
Try out the BikeSaferPA web app
See the GitHub repository, or view the Jupyter notebooks in HTML format:




