EXPERIENCE

Last Updated: November 2024

Summary

Sijia Li (Nancy)

Aspiring data scientist with a strong foundation in machine learning, software engineering, statistical analysis, and data visualization, as well as experience in technology, pharmaceuticals, and private capital research. Excels in transforming complex datasets into actionable insights and innovative solutions. Passionate about solving real-world problems, storytelling through data, and driving key decisions.

Education

M.S. in Data Science

08/2022 - 05/2024

Harvard University, Cambridge, MA

  • Relevant Coursework: Machine Learning Operations (MLOps), Natural Language Processing (NLP), Computer Vision (CV), Systems Development, Visualization, Causal Inference
  • Teaching Fellow Appointments: Visualization, Critical Thinking in Data Science

BASc. in Industrial Engineering, Minors in Engineering Business & AI Engineering

09/2017 - 06/2022

University of Toronto, Toronto, Canada

  • Relevant Coursework: Probability, Statistics, Data Modelling, Operations Research, Object-Oriented Programming, Artificial Intelligence, Algorithms & Numerical Methods, Optimization in Machine Learning
  • Extracurriculars: University of Toronto Consulting Association (Development Co-Director, Consulting Group Associate)

Professional Experience

Research Associate

07/2024 - Present

Harvard Business School, Boston, MA

  • Enhance code efficiency and resolve critical bugs in the data transformation and consolidation processes for the PCRI database, improving data accuracy across multiple vendors.
  • Design and conduct a GPT-4o-mini experiment to assess biases in VC investment decisions, analyzing 12 profiles with racially identifiable names and pictures across fund types and team experiences.

Data Engineer Intern

05/2023 - 08/2023

Merck KGaA, EMD Digital, Cambridge, MA

  • Optimized ETL pipelines with PySpark in Palantir Foundry, automating and parallelizing data ingestion for 35M+ records and reducing manual effort and errors by 90%.
  • Integrated multiple data sources into a centralized warehouse and built interactive dashboards to deliver real-time insights.
  • Leveraged Agile methodologies and Scrum practices with Jira to drive efficient collaboration and on-time project delivery.

Business Analyst Intern

08/2020 - 05/2021

Sanofi Pasteur, Toronto, Canada

  • Analyzed project financial data ($200M+ CapEx and $60M+ OpEx) to identify root causes of budget-to-actual variances.
  • Supported metrics creation for the Project Portfolio Prioritization and Optimization initiative to evaluate, score, and rank projects from various functional areas and to identify effective project selection and prioritization strategies.