Over the past year, I’ve worked on a few different projects/areas of research:
- Ride or Drive?: A Visual Comparison of Trends in Transportation Between the MTA and Uber utilizing variables such as time, weather, and socioeconomic conditions.
- Predicting Patient Outcomes Using EHR Data and NLP Techniques: For my final presentation in my NLP class, I used mock EHR data of a cohort of cardiology patients and applied various NLP techniques to classify different outcomes. For example, patients who took aspirin vs those who do not, using features extracted from the text data in their EHR. I created training and testing datasets and ran a few different classifiers on my data (Logistic Regression, Naive Bayes and Random Forest) and compared the results of each model.
- Crisis Mapping: Using Geographic Information System Programs to Help Create a Hazard History and Consequence Tool for the NYC Office of Emergency Management.
- Measuring Impact through NLP: Using Twitter and Social Media Engagement Data to Measure the Impact of the Panama Papers on NGO Campaigns Against Tax Evasion and Avoidance.
- Quantifying Historical Variables: Understanding How Colonialism May Factor into a Country’s World Happiness Score as determined by the United Nation’s Sustainable Development Solutions Network.
- Data Dive on Extractive Industries: Exploratory Data Analysis: How do corporations/industries leverage the economic/tax policies (or lack thereof) of a nation in order to participate in the business of extracting natural resources from that nation?
- Centers for Medicare & Medicaid Services Online Chartbook: During my time at the Yale Center for Outcomes Research Evaluation, I contributed to the research and design of the CMS Online Chartbook, which allows stakeholders to explore hospital quality and outcome measurement through data visualizations.
Check out my GitHub page for more info/ some of the code for my projects.