Overview: As part of my 'Blueprint for Modernity' coursework, I conducted an original research project to contribute to work on the history of engineering.
βTHE RESEARCH QUESTION
What can we learn about the world of the early engineering faculty at Cornell?
π THE PROCESS:
Identified historical source data from The Cornellian, Cornell's yearbook, from 1875-1915
Converted documents to usable text data
Cleaned the data for usability and relevancy using R/RStudio
Visualized the data in 5 multi-panel visualizations using ggplotly
Analyzed the data with historical context
Presented the findings in a 30-page final research report
π THE DATA
I found my primary source data on the online HathiTrust digital library. Its original format was scanned document pages, which I downloaded and converted to text files after isolating the lists of faculty.
Original data, 1895
Using R, I cleaned these text files and merged them into a usable, formatted database.
A snapshot of some of the cleaning code
Part of the cleaned consolidated database
π THE VISUALIZATION
Using my newly cleaned database, I was able to create 5 multi-panel interactive visualizations providing insight into the world of the early engineering faculty at Cornell, including student-to-faculty ratio, turnover, distribution of specializations, alumni professors, and more. A few examples are shown below:
The final report included an appendix containing these visualizations, a discussion of the insights from each, and full historical context to enrich the understanding of the visualizations.