• Designed and implemented an information retrieval and classification system for sentiment analysis on Twitter
• Crawled tweets on user’s timeline from Twitter API, and extract JSON responses using Requests module
• Cleaned, parsed and segmented tweets content; counted most frequent words and hashtags associated with each emotion
Seattle Home Value Data Analysis & Visualization (R)
Machine Learning & Data Analysis
• Delivered statistical analysis on over 100,000 tuples of data using R packages dplyr, tidyr, reshape2, splines and lubridate
• Performed time series analysis with machine learning to prove patterns like seasonality trends using zoo, xts packages with SVM and ARIMA forecasting models; Performed multiple linear regression on two datasets to find correlations
• Plotted choropleth map, facet plots and quantile plots to perform interactive visualizations using ggplot2 and lattice packages
Business Intelligence Systems Implementation Project
ETL & Data Warehousing
• Built a SQL data warehouse that contains staging tables, dimensional model and SQL views that support data visualizations, and utilized functions like grouping, filtering, specialized calculations
• Developed SSIS (SQL Server Integration Services) packages to perform ETL tasks using SQL multiple joins, identity column insertion, CASE, SUBSTRING, CAST and ISNULL function
• Utilized SQL subqueries and analytical functions to create computed columns, add encryptions and create views
• Used Tableau to build interactive visualizations which help to answer business questions
Data Management System Full Stack Development
Database Design & Development
• Developed a working prototype of multi-layer data management system that allows users to search product, place and cancel
orders on a travel booking platform
• Completed the database application using Spring framework, including Java, JavaScript and HTML
Education
MSc in Information Management | Data Science Specialization
University of WashingtonSeattle, US
Relevant Courses: INFX574 Machine Learning, CSE414 Introduction to Database Systems, CSE373 Data Structures and Algorithm, IMT 577 Business Intelligence Systems, INFX543 Relational Database Management Systems, INFX598 Programming for Data Science
BS in Information Systems and E-Commerce
Wuhan UniversityWuhan, CHINA
Programming Skills
Data Analytics & Software Engineering
Python, SQL, R
Java, Git, NoSQL
HTML5/CSS & Javascript
Work Experience
Summer Technical Intern
Tencent Inc., Wuhan, China
• Participated in the e-business website maintenance, development and update using front-end technologies
• Reported, analyzed and visualized website’s PV, UV and relevant user data using R, SQL Server and Excel
• Worked closely with developers, product managers and UI designers and provided analytics support
• Optimized the targeted push notification content and brought in 30% increase of revenue with higher conversion rate
Owner of Taobao Online Shop
Alibaba.com, Beijing, China
• Co-founded “MEKO” online shop, sells women’s dress with a turnover of $75000/month
• Responsible for Taobao webpage maintenance, customer data analysis and sales trend prediction