Cheese Image Classification with Machine Learning

Weeks 1–3 · Foundations

Started by setting up the environment with VS Code, Python, Git, and Markdown, then reviewed core concepts in machine learning (regression vs. classification) and major frameworks like TensorFlow, Keras, and PyTorch. MNIST served as the “hello world” dataset for image classification, using fully connected networks and CNNs to understand training dynamics.

Weeks 4–5 · Cheese Datasets & Cleaning

Shifted to cheese-specific datasets from Kaggle, learning what defines a high-quality dataset in practice: no mislabeled samples, consistent image sizes, minimal blur, and no duplicates. Applied a mix of manual review and automated checks to bring the datasets closer to that standard and prepared unified train/validation/test splits.

Week 5 · Automated Quality Checks

Experimented with tools such as CleanVision for detecting low-quality and potentially mislabeled images, and used Google Gemini examples to flag images that did not actually contain cheese. After cleaning, all images were resized to a consistent resolution suitable for model training.

Weeks 6–11 · Model Design & Training

Studied state-of-the-art image classification approaches, beginning with CNNs and conceptually exploring transformer-based models. Helped design a PyTorch model sized to fit the university HPC GPU server, and participated in training and refinement cycles, including hyperparameter adjustments and monitoring performance across epochs.

Weeks 12–14 · Evaluation & Documentation

Supported efforts to evaluate the trained model using a small collected cheese dataset and contributed to documentation of the full pipeline: from tool onboarding and dataset cleaning through model design, training, and evaluation. This documentation is intended to help future students quickly understand and extend the project.

Cheese Image Classification with Machine Learning

14 Weeks

PyTorch

My Responsibilities

Weeks 1–3 · Foundations

Weeks 4–5 · Cheese Datasets & Cleaning

Week 5 · Automated Quality Checks

Weeks 6–11 · Model Design & Training

Weeks 12–14 · Evaluation & Documentation

Languages

Frameworks

Libraries

AI APIs

Platforms

Version Control

Data Sources

Documentation

What Was Accomplished

Key Learnings

Project Wrap-Up & Lab Directions