Data Science Fundamentals: Project Summaries

This site gives a simple overview of each project folder in this repository, based on the notebooks and README files.

At a Glance

Project Main Analysis Used Technologies / Methods
Profiling Academic Trajectories Student-performance profiling, clustering, classification, interpretability pandas, numpy, scikit-learn, scipy, plotly, matplotlib, shap, ucimlrepo
Life Expectancy Analysis WHO data cleaning, missingness checks, clustering, PCA, map-based visualization pandas, numpy, scikit-learn, plotly, missingno, pycountry, matplotlib
Data Analysis and k-NN Supervised classification on Palmer penguins pandas, numpy, scikit-learn, matplotlib, plotly, scipy
Data Analysis and PCA Breast-cancer data preprocessing, scaling, PCA, k-NN baseline pandas, numpy, scikit-learn, seaborn, matplotlib, plotly
Gradient Descent 1D and higher-dimensional gradient-descent experiments numpy, plotly, IPython
Data Ethics Paper Written ethics review of three readings Comparative analysis, policy/ethics critique
Lecture Notes Course demonstrations and experiments across core topics pandas, numpy, scikit-learn, matplotlib, plotly, missingno

Project Pages

Use the navigation menu to open a page for each project.