Hi, welcome to my portfolio!

I am a data science and engineering professional with prior experience in analytics, project management and team leadership. I hold a master's degree in Information and Data Science at UC Berkeley (MIDS), and have a passion for applying data science and analytical skills in solving complex business problems.

Most of the following projects were originated from the courses that I took during MIDS. My goals when building these projects were deriving insights from the data, and level up essential skills in programing, statistics and machine learning to be successful in Tech.

I hope you find them interesting, and any recommendations are very welcome.

Project Overview


Skills Tools Topics
Full Stack Data Science (NLP, MLOps, DE) Pytorch
Transformers
PySpark
HyperOpt
Kubernetes
Docker
FastAPI
WebScraping
DD.ai
A Platform for Drug-Drug Interaction (DDI) Prediction
Natural Language Processing (NLP) Pytorch
Transformers
Text Detoxification in Online Communications
Machine Learning Operations (MLOps) Kubernetes
Docker
FastAPI
K6
AKS
Full End-to-End Machine Learning API
Machine Learning with Big Data Spark
Python
Databricks
US Domestic Flight Delays Prediction at Scale
Deep Learning, Computer Vision Pytorch
Python
Docker
American Sign Language Detection On Edge Device
Machine Learning Scikit-Learn
Python
Annual Compensation Prediction for Data Scientist
Statistics: Causal Inference R US Tourism and COVID-19 Vaccinations
Statistics: Hypothesis Testing R 2020 US Election: Voters’ Demographics & Perspective
Assemble Data Pipeline Docker
Kafka
Flask
Spark
Python
Apache Bench
Presto
Hadoop
Understanding User Behavior: Game API Streaming
ETL Data Pipeline Docker
Kafka
Spark
Python
Hadoop
Tracking User Activity: Exams Taking
Data Analysis SQL
BigQuery
Python
San Francisco Bike Share: Business Recommendations
Data Analysis Python Real Estate and Academic Performance in Los Angeles
Object Oriented Programing Python Game Design: Kwaii Lottery


Recent posts