← Back to Projects

Machine Learning Coursework

SMU MSDS β€” Applied Machine Learning

Overview

Four-part coursework progressing from Python data structure mastery through applied deep learning. The final assignment explored AI-assisted learning via Cursor’s Model Context Protocol (MCP) β€” turning the ML assignments themselves into an object of study for understanding code with AI.

Assignments

HW1 β€” Python Data Structures

Comprehensive review of Python built-in data structures: lists (append, extend, index, insert, remove, pop, sort, reverse), dictionaries, sets, and their associated operations. Demonstrated through annotated examples covering time complexity tradeoffs.

HW2 β€” Hyperparameter Optimization

Designed a multi-classifier evaluation framework comparing Logistic Regression, Random Forest, and SVC across three different hyperparameter configurations each. Used KFold cross-validation for reliable performance estimates and GridSearchCV for systematic search. Extended to Optuna (Bayesian optimization) for neural network hyperparameter tuning β€” significantly faster than grid search for continuous parameter spaces.

Key results: Evaluated accuracy, precision, recall, and F1 across all classifiers. Generated matplotlib comparison plots for model selection visualization.

HW3 β€” Transfer Learning with PyTorch

Built a SimpleCNN architecture from scratch:

This achieved faster convergence and better accuracy than training from scratch, demonstrating the practical value of transfer learning for limited-data scenarios.

HW4 β€” AI-Assisted Learning with Cursor + MCP

Explored how AI tools (Cursor with MCP) transform the learning experience for ML coursework. Loaded HW2 into a Cursor workspace and used the AI to explain each code block, visualize what each algorithm is doing conceptually, and answer targeted questions without needing to search documentation.

Key insight: MCP allows the AI to have direct context about the running codebase, making it qualitatively different from pasting code into a chat interface. The debugging and explanation loop is orders of magnitude faster.

Technology Stack

Category Tools
Deep Learning PyTorch, torchvision, Conv2d, DataLoader
ML Framework scikit-learn, GridSearchCV, KFold
Optimization Optuna (TPE sampler)
Data numpy, pandas, matplotlib
Computer Vision OpenCV, MNIST dataset
Dev Tools Cursor IDE, MCP

Key Takeaways


GitHub Repository ← Back to Projects