Project 1: Integration and Evaluation of Unintentional Substance Release Incidents
Unintentional substance release (USR) incidents occur far more frequently than commonly perceived and pose significant environmental threats. These include not only high-profile incidents like the Deepwater Horizon spill but also numerous smaller incidents that collectively result in substantial cumulative impacts. However, historical hazardous incidents are recorded in various formats across different datasets, creating challenges for the comprehensive evaluation of USR impacts.
To address this, I developed a series of data tools for acquisition and integration, including:
A web crawler to automatically collect incident reports.
A Natural Language Processing (NLP)-based model to identify actual leakage events from incident reports.
A data pipeline to integrate USR incidents from multiple sources.
Using these tools, I built a consolidated database comprising over 300,000 USR incidents, including oil spills, natural gas leaks, and chemical releases. I evaluated USR risks by analyzing the temporal, spatial, and statistical distribution of these incidents.