🎲 Rubik's Cube Recognition with TensorFlow RetinaNet

This is a deep learning model that can recognize and analyze Rubik's cubes in images. The model is built using TensorFlow and RetinaNet architecture for object detection.

🚀 Features

Real-time Detection: Upload images of Rubik's cubes for instant analysis
Multi-class Recognition: Detect cube faces and 6 different color tiles
Interactive Interface: Simple and intuitive Gradio web interface
Advanced AI: Powered by RetinaNet with SpineNet-49 backbone

🎯 How to Use

Upload: Click "Upload Rubik's Cube Image" and select your image
Analyze: Click "🔍 Analyze Image" or wait for automatic processing
Results: View detection results and visualization with bounding boxes

🔬 Technical Details

Framework: TensorFlow 2.15+ with Gradio interface
Architecture: RetinaNet with SpineNet-49 backbone
Input Size: 640×640 pixels
Classes: 7 total (1 face + 6 color tiles)
Colors Detected: Red, White, Blue, Orange, Green, Yellow

🌟 Model Architecture

RetinaNet-SpineNet-49

Base Model: RetinaNet for object detection
Backbone: SpineNet-49 for feature extraction
Input Resolution: 640×640×3
Output: Bounding boxes with class predictions and confidence scores

Detection Classes

face - Rubik's cube face
red_tile - Red color tile
white_tile - White color tile
blue_tile - Blue color tile
orange_tile - Orange color tile
green_tile - Green color tile
yellow_tile - Yellow color tile

📊 Performance Metrics

Metric	Target	Status
[email protected]	>0.85	In Training
Inference Speed	<100ms	Optimized
Accuracy	>90%	Evaluating

🛠️ Local Development

# Clone the repository
git clone https://huggingface.co/spaces/itsyuimorii/rubiks-cube-recognition
cd rubiks-cube-recognition

# Install dependencies
pip install -r requirements.txt

# Run the application
python app.py

📁 Project Structure

rubiks-cube-recognition/
├── app.py                    # Main Gradio application
├── requirements.txt          # Python dependencies
├── README.md                # This file
├── configs/                 # Model configurations
│   └── retinanet_config.py
├── src/                     # Source code
│   ├── data/               # Data processing utilities
│   └── model/              # Model training and inference
└── images/                 # Training and test datasets
    ├── train/              # Training images and annotations
    ├── test/               # Test images and annotations
    └── valid/              # Validation images and annotations

🎮 Demo Status

⚠️ Note: This is a demo version. The complete trained model is currently being developed. The interface will show a preview of the detection capabilities.

📝 Dataset Information

Format: COCO annotation format
Image Size: 640×640 pixels
Training Images: 50+ annotated cube images
Classes: 7 object classes (face + 6 colors)
Annotation Tool: LabelMe

🔧 Training Pipeline

# Training command
python src/model/trainer.py --config configs/retinanet_config.py

# Inference command  
python src/model/visualize.py --image path/to/cube_image.jpg

🤝 Contributing

Contributions are welcome! Areas for improvement:

Additional training data
Model optimization
UI/UX enhancements
Performance improvements

📄 License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.

🙏 Acknowledgements

TensorFlow Model Garden - RetinaNet implementation
SpineNet - Backbone architecture
Gradio - Web interface framework
Hugging Face - Model hosting and deployment

📧 Contact

GitHub: @itsyuimorii
Hugging Face: @itsyuimorii

🔗 References

🎲 Ready to solve your Rubik's cube detection challenges!

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support