Data-Science-Portfolio

A portfolio of my data science projects from academic work, self-learning, and hobbies.

More information about me: LinkedIn

Experience

Graduate Thesis Research, Beck's Research Lab, University of Washington, Seattle, WA

Developing a Visualization Tool for Unsupervised Machine Learning Analysis on Genomics Data

  • Applied K-means clustering and PCA to RNA-seq data using scikit-learn.
  • Created a SQL Server database and wrote queries for data loading and extraction.
  • Built an interactive web application and visualized analysis results with Tableau and Plotly Dash.

Keywords: K-Means Clustering / Principal Component Analysis / Data Visualization / Genomics / RNA-seq

Working Repository: DashOmics

Learning Repositories: Learning-Tableau; Learning-Dash

------------------------------------------------------------------------------------------------------------------------------------------------

DIRECT Data Science Trainee, Clean Energy Institute, University of Washington, Seattle, WA

  • Analysis and Optimization of Lignin Pyrolysis Kinetic Model (Capstone Project): Developed an open-source package built on SciPy to analyze a chemical kinetic model of lignin pyrolysis (involving 93 species and 406 reactions) and predict the temporal evolution of molecules and functional groups during the reaction.
  • Electricity Analysis and Suggestion System (Coursework Project): Implemented a random forest and statistical analysis for an electricity generation suggestion model, and built a Tkinter GUI that presents prioritized resources and revenue plots drawn with matplotlib.

Keywords: Data Mining / Python (pandas, matplotlib) / Random Forest / P-Value / GUI

Projects