Hello

I am JasonA Data Scientist

I create short-term values and long-term strategic initiatives for companies
I work as a data scientist and marketing analyst to solve business problems
I lead data science and business analytics projects from beginning to end
I communicate my findings to deeply technical and non-technical audiences
I use Python, SQL, and Tableau to analyze big data and visualize my results

About Me

My Skills

Hi! My name is Jason Fang, a data scientist who graduated from the University of Maryland, College Park. With a technical (Master of Science in Marketing Analytics) and non-technical (Bachelor of Science in Marketing) academic background, I always focus on creating values rather than creating models.

Here are several tools that I usually use to solve business problems

Python: Create machine learning, deep learning and statistical models

SQL: Find key metrics, analyze business problems, join tables

Tableau: Visualize findings, build dashboard, create calculated field

Excel: Analyze small dataset, create pivot table, generate reports

PowerPoint: Present my results to technical and non-technical audiences

PySpark: Analyze big dataset, utilize Spark ML Library and Spark Streaming

AWS: Use S3, Redshift, QuickSight, EC2, IAM, Route 53, CloudFront

Python
100%
SQL
90%
Tableau
95%
Excel
85%
PowerPoint
90%
PySpark
80%
AWS
75%





Data Science Models

I use many models to create predictive models. Deep learning sounds more advanced, but sometimes machine learning models, statistical models, and ensemble learning are better. It always depends on the cases that I need to analyze.

Here are some models that I usually use to solve business problems

Regression: Simple, Multiple, Polynomial Linear Regression, Logistic Regression

Classification: KNN, SVM, Naive Bayes, Decision Tree Classification

Clustering: K-means, Hierarchical Clustering, Self-Organizing Maps

Ensemble Learning: Random Forest,Gradient Boosting, XGBoost

Dimensionality Reduction: PCA, EFA, SOM, Multidimensional Scaling

Deep Learning: ANN, RNN, CNN

Time Series: ARIMA, RNN, Facebook Prophet

Statistical Analysis: T-test, Chi-squared Test, ANOVA

A/B Testing: Thompson Sampling, Upper Confidence Bound




Regression
100%
Classification
95%
Clustering
90%
Ensemble Learning
85%
Dimensionality Reduction
70%
Deep Learning
95%
Time Series
90%
Statistical Analysis
85%
A/B Testing
95%


My Works

  • All
  • Machine Learning
  • Deep Learning
  • Statistical Analysis
  • Visualization
  • SQL
  • NLP
  • AWS

Tableau Showcase

Machine Learning, Visualization

Website SQL Analysis

SQL

Fintech App

Machine Learning, Deep Learning

Time Series ARIMA Models

Statistical Analysis

Spark Spam Filter

Machine Learning, NLP

A/B Testing

Statistical Analysis, Visualization

Self-Organizing Maps

Deep Learning

Fraud Detection

Machine Learning, Deep Learning

Yammer Case

SQL, AWS

My Projects

Let's work together on your next project

Contact Me