AI Image Recognition System
Overview
Developed a model forecasting sales for 6 restaurants (10k+ records) with a 98.41% accuracy R2 and outperforming an already high naive baseline of 97.7%. Used GridSearch for hyperparameters and testing algorithms (RNNs, classical algorithms, Neural Networks), and XGBoost was chosen based on RMSE, R2, and naive baseline comparison.
Key Features
- Custom CNN architecture with 5 convolutional layers optimized for multi-class classification
- Advanced data augmentation pipeline (rotation, flipping, scaling) to improve model generalization
- Real-time image classification with sub-second inference times (avg. 0.3s per image)
- Interactive web interface built with Flask for easy testing and demonstration
- Model versioning and experiment tracking with TensorBoard
Technical Stack
Challenges & Solutions
The main challenge was dealing with class imbalance in the dataset, where some categories had 3x more samples than others. I addressed this by implementing weighted loss functions and applying SMOTE (Synthetic Minority Over-sampling Technique) to balance the training data. This improved minority class accuracy by 18%.
Results & Impact
The final model achieved 94% overall accuracy with F1-scores above 0.90 for all classes. The system processes images 5x faster than the baseline approach and has been deployed as a demonstration project showcasing practical computer vision applications.