This project trains a convolutional neural network to classify real-world traffic signs under noisy conditions, demonstrating applied computer vision fundamentals and disciplined model evaluation.
As a capstone to Harvard’s CS50AI, this project revisited a classic computer vision task using a more challenging dataset than MNIST’s handwritten digits. It marks a shift from tinkering toward building models thoughtfully, learning from mistakes, and understanding how they handle real-world challenges. It was also a fitting note on which to bid a fond farewell to a series of pedagogically perfect courses that helped build my computer science foundation.
Project Overview
This project explores convolutional neural networks (CNNs) and applied computer vision under realistic, noisy conditions. I built a complete pipeline: preprocessing images, visualizing class distributions and errors, and designing CNN architectures with convolutional/pooling layers, dropout, and batch normalization. Training curves, confusion matrices, and error analyses were generated automatically and exported for web presentation. This was a controlled technical exercise focused on understanding model behavior, not a deployed, operational system.
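The preprocessing step above can be sketched in a few lines. This is a minimal, illustrative version (the function name, the [0, 1] scaling, and the toy image shapes are my assumptions, not the project's exact code): scale raw pixel values and one-hot encode the integer class labels.

```python
import numpy as np

def preprocess(images, labels, num_classes):
    """Scale pixel values to [0, 1] and one-hot encode integer labels."""
    x = np.asarray(images, dtype=np.float32) / 255.0
    y = np.eye(num_classes, dtype=np.float32)[np.asarray(labels)]
    return x, y

# Toy example: two fake 30x30 RGB "images" with labels 0 and 2 (of 3 classes).
imgs = np.random.randint(0, 256, size=(2, 30, 30, 3))
x, y = preprocess(imgs, [0, 2], num_classes=3)
print(x.shape)  # (2, 30, 30, 3)
print(y)        # one-hot rows for classes 0 and 2
```

The same pattern scales directly to the GTSRB's 43 sign classes; only `num_classes` and the image arrays change.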
I experimented with class weighting, additional convolutional layers, batch normalization, and dense layers. The best results came from a streamlined two-layer CNN with batch normalization, which outperformed both the baseline and more complex, over-dense models. In fact, extra dense layers reduced accuracy, underscoring the value of simplicity and regularization.
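Since batch normalization was the ingredient that most improved the streamlined model, here is a minimal numpy sketch of what a batch-norm layer computes at training time (the function name, default `gamma`/`beta`, and `eps` value are illustrative, not taken from the project): normalize each feature over the batch to zero mean and unit variance, then apply a learnable scale and shift.

```python
import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    """Normalize activations per feature over the batch, then scale and shift.

    gamma and beta are the layer's learnable parameters; eps guards
    against division by zero for near-constant features."""
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)
    return gamma * x_hat + beta

batch = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
out = batch_norm(batch)
print(out.mean(axis=0))  # ~[0, 0]
print(out.std(axis=0))   # ~[1, 1]
```

Keeping activations in a stable range like this is what lets a small two-layer network train quickly without needing extra depth.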
The final model achieved over 99% accuracy on the test set, with most misclassifications concentrated among visually ambiguous speed limit signs. This project highlights my strengths in deep learning, systematic model experimentation, and data storytelling, taking a classic AI challenge from concept to polished, portfolio-ready finish.
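The error analysis behind those numbers rests on a confusion matrix. A small numpy sketch of how one is built from true and predicted labels (the toy labels here are illustrative, not the project's results):

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Rows index the true class, columns the predicted class."""
    cm = np.zeros((num_classes, num_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        cm[t, p] += 1
    return cm

y_true = [0, 0, 1, 2, 2, 2]
y_pred = [0, 1, 1, 2, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
accuracy = np.trace(cm) / cm.sum()  # correct predictions sit on the diagonal
print(cm)
print(accuracy)  # 4/6 ≈ 0.667
```

Off-diagonal cells pinpoint which sign classes get confused with which, which is exactly how the ambiguous speed limit pairs surfaced.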
Gallery
CNN architecture: mimics the human eye by combining simple features into complex object recognition.¹
Class imbalance: label distribution is highly skewed across classes.
Dataset diversity: representative image for each label.
Validation and training curves: best model vs. baseline.
Confusion matrices: best model noticeably reduces off-diagonal errors.
Misclassification confidence histogram: overkill model’s errors are less confident and more frequent.
References
Dataset provided by the German Traffic Sign Recognition Benchmark (GTSRB).
¹ Inspired by “Deep learning for wireless capsule endoscopy: a systematic review and meta-analysis,” Soffer, Shelly, et al., Gastrointestinal Endoscopy, © 2020 American Society for Gastrointestinal Endoscopy. Image generated with AI and edited by Bryan Johns. Used for educational purposes only; if this use does not qualify as fair use, I am happy to remove the image upon request.