Portfolio

A selection of past projects/case studies completed* 

California: The State of Incarceration Data Hub

This data hub was created in collaboration with my colleagues at the Vera Institute, and brings together various governmental data sources on incarceration and spending at the state and county level to paint a picture of the current state of incarceration in California. This project was last updated in August of 2024 using the most recently available full-year data, calendar year 2022. 

Skills Highlighted: Person-centered writing, data analysis, data visualization, technical writing, criminal-justice writing, literary review, R, Python, budget analysis

California: The State of Incarceration, Explained

An executive summary I produced for the California: The State of Incarceration Data Hub, with summary statistics, talking points and key findings for the data hub I helped to produce. 

Skills Highlighted: Person-centered writing, technical writing, summarization, communication

Care First L.A.: Tracking Jail Decarceration

A dashboard that updates daily, using Los Angeles County Sheriff's Department data. This dashboard cleans, analyzes and visualizes data on incarcation, demographics and deaths in the custody of L.A. county's sheriff's department, including tracking the county's progress towards it's committment to close the deadly Men's Central Jail. I maintain the web scraper that feeds this dashboard, as well as led the development of the in-custody deaths tracker. 

Skills Highlighted: Web-scraping, R, data analysis, Datawrapper, Tableau, person-centered writing, literary review, translation, data visualization, CRON, criminal justice data, data cleaning

Case Study 1: How casual and annual members use a bike sharing service differently

In this case study, we are working as a junior data analyst for a bike sharing company, Cyclistic, located in Chicago, IL.  We are tasked to investigate how casual riders and annual riders use the service differently, if at all. Using twelve months of first party data**, I investigate differences in time, ride counts, duration and weekly trends. I used Microsoft Excel and R programming for this case study, with a medium sized dataset (~4.7 million observations).

Skills Highlighted: Spreadsheets, Pivot Tables, Data Cleaning, ETL, Time Variables, RegEx, ETL, R Programming, SMART methodology, Data Visualization

*Click on image for link to HTML version of RMD file (available upon request) 

* *Names have been changed or redacted for privacy