EXPERIENCE
Last Updated: November 2024
Summary
Sijia Li (Nancy)
Aspiring data scientist with a strong foundation in machine learning, software engineering, statistical analysis, and data visualization, as well as experience in technology, pharmaceuticals, and private capital research. Excels in transforming complex datasets into actionable insights and innovative solutions. Passionate about solving real-world problems, storytelling through data, and driving key decisions.
- Cambridge, MA
- [email protected]
Education
M.S. in Data Science
08/2022 - 05/2024
Harvard University, Cambridge, MA
- Relevant Coursework: Machine Learning Operations (MLOps), Natural Language Processing (NLP), Computer Vision (CV), Systems Development, Visualization, Causal Inference
- Teaching Fellow Appointments: Visualization, Critical Thinking in Data Science
BASc. in Industrial Engineering, Minors in Engineering Business & AI Engineering
09/2017 - 06/2022
University of Toronto, Toronto, Canada
- Relevant Coursework: Probability, Statistics, Data Modelling, Operations Research, Object Oriented Programming, Artificial Intelligence, Algorithms & Numerical Methods, Optimization in Machine Learning
- Extracurriculars: University of Toronto Consulting Association (Development Co-Director, Consulting Group Associate)
Professional Experience
Research Associate
07/2024 - Present
Harvard Business School, Boston, MA
- Enhance code efficiency and resolve critical bugs in the data transformation and consolidation processes for the PCRI database, improving data accuracy across multiple vendors.
- Design and conduct a GPT-4o-mini experiment to assess biases in VC investment decisions, analyzing 12 profiles with racially identifiable names and pictures across fund types and team experiences.
Data Engineer Intern
05/2023 - 08/2023
Merck KGaA, EMD Digital, Cambridge, MA
- Optimized ETL pipelines with PySpark in Palantir Foundry, automating and parallelizing data ingestion for more than 35M records and reducing manual effort and errors by 90%.
- Integrated multiple data sources into a centralized warehouse and built interactive dashboards to deliver real-time insights.
- Leveraged Agile methodologies and Scrum practices with Jira to drive efficient collaboration and on-time project delivery.
Business Analyst Intern
08/2020 - 05/2021
Sanofi Pasteur, Toronto, Canada
- Analyzed project financial data ($200M+ CapEx and $60M+ OpEx) to identify root causes of budget-to-actual variances.
- Supported metrics creation in the Project Portfolio Prioritization and Optimization initiative to evaluate, score, and rank projects from various functional areas and to identify effective project selection and prioritization strategies.