This morning, I came across Wakana’s post, where she shared a record of her study plan and progress in data science. The project inspired me a lot, so I decided to do the same thing: keep a record of what I’ve been learning :)
My plan for 2021 is to continue learning, mostly from Coursera and the free online materials that I posted previously. And no more DataCamp!
Here are the courses and hands-on projects I’ve done so far:
1. Learn Foundations
| Course | Source | Status | Date of completion |
|---|---|---|---|
| Python Syntax | Codecademy | Completed | 04.Oct.2017 |
| Introduction to Data Science in Python | DataCamp | Completed | 06.Feb.2020 |
| Intermediate Python | DataCamp | Completed | 13.Feb.2020 |
| Python Data Science Toolbox (Part 1) | DataCamp | Completed | 19.Feb.2020 |
| Introduction to SQL | DataCamp | Completed | 26.Feb.2020 |
| Joining Data in SQL | DataCamp | Completed | 10.Mar.2020 |
| Introduction to Git | DataCamp | Completed | 17.Mar.2020 |
| Introduction to Shell | DataCamp | Completed | 24.Mar.2020 |
| Python Data Science Toolbox (Part 2) | DataCamp | Completed | 22.Apr.2020 |
| pandas Foundations | DataCamp | Completed | 25.Apr.2020 |
| Introduction to Relational Databases in SQL | DataCamp | Completed | 06.May.2020 |
| Data Manipulation with pandas | DataCamp | Completed | 11.May.2020 |
| Merging DataFrames with pandas | DataCamp | Completed | 16.May.2020 |
| Introduction to Data Visualization with Matplotlib | DataCamp | Completed | 17.May.2020 |
| Introduction to Data Visualization with Seaborn | DataCamp | Completed | 18.May.2020 |
| Exploratory Data Analysis in Python | DataCamp | Completed | 03.Jun.2020 |
| Introduction to Importing Data in Python | DataCamp | Completed | 23.Jun.2020 |
| Intermediate Importing Data in Python | DataCamp | Completed | 25.Jun.2020 |
| Cleaning Data in Python | DataCamp | Completed | 06.Jul.2020 |
| Analyzing Police Activity with pandas | DataCamp | Completed | 07.Jul.2020 |
| Streamlined Data Ingestion with pandas | DataCamp | Completed | 08.Jul.2020 |
| Introduction to Databases in Python | DataCamp | Completed | 08.Jul.2020 |
| 26-week (550h) Data Analytics Bootcamp | CodeOp | Completed | 12.Sep.2020 |
| Introduction to Data Science in Python by University of Michigan | Coursera | Completed | 03.Oct.2020 |
| Time Series Analysis in Python | DataCamp | Completed | 11.Oct.2020 |
| Working with Geospatial Data in Python | DataCamp | Completed | 03.Nov.2020 |
| Basic Statistics by University of Amsterdam | Coursera | 50% | |
| Applied Machine Learning in Python by University of Michigan | Coursera | 68% | |
| TensorFlow, Keras and deep learning, without a PhD | Link | 30% | |
| Case Studies in Neural Data Analysis | Link | 20% | |
| Hands-On Programming with R | Link | 60% | |
| R for Data Science | Link | ‘Ready to start’ | |
| A Gentle Introduction to Tidy Statistics in R | Link | ‘Ready to start’ | |
2. Practice - Hands-on projects
| Name | Notebook | Description | Status | Date |
|---|---|---|---|---|
| Research Trend Analysis | GitHub | NLP: text data mining and cleaning | Completed | Aug.2020 |
| Fraud Detection Paysim in Python | GitHub | Predictive analysis, Supervised Learning, Classification | Completed | Aug.2020 |
| In-app chat message analysis | Private | SQL: Create tables in a database and query data | Completed | Oct.2020 |
| EDA on e-commerce data | Private | Product (delivery app) analytics in Python | Completed | Oct.2020 |
| Daily Sales Forecast | Private | Time Series: Predictive modeling using Prophet | Completed | Oct.2020 |
| NYC taxi pickups with demographic and socio-economic data by census block group | GitHub | Docker, Geospatial data analysis: ETL, EDA, baseline modeling | Completed | Nov.2020 |
| Scientific Data Analysis | GitHub | Numeric data processing in pandas, Matplotlib, SciPy, NumPy | Completed | Nov.2020 |
| In-app chat message analysis 2 | Private | SQL | Completed | Dec.2020 |
| EDA on e-commerce data 2 | Private | Product (marketplace app) analytics in Python | Completed | Dec.2020 |
| WiDS Datathon 2021 | GitHub | Datathon | Completed | Feb.2021 |
| Fraud Detection Paysim in R | GitHub | Translating a Python project into R for hands-on practice | Ongoing | Aug.2020 |
| BCN Traffic Forecast | GitHub | Geospatial data analysis: ETL, EDA, Multiclass classification | Ongoing | |
| Flu Shot Learning: Predict H1N1 and Seasonal Flu Vaccines | | A competition hosted by DrivenData | Up Next | |