Troubleshooting

🚑 Quick Fixes for Common Issues

Tip

💬 Still stuck? Open a GitHub Issue with your error message!

🔧 Installation Issues

🖼️ Processing Issues

🎁 Quick Help Commands

# Check if everything is working
python scripts/health_check.py

# Test with a single image
python scripts/processing/analysis.py -i path/to/images_dir -o test_output/

# Get help for any script
python scripts/processing/detection.py --help

📞 Still Need Help?

💬 GitHub Issues: Report bugs or ask questions
📖 Documentation: Check the full User Guide for detailed instructions
🤝 Contributing: See Contributing if you want to help improve the project
🔧 API Reference: Check API Reference for technical details

Low Detection Accuracy

Symptoms: Few or no labels detected in images

Solutions: - Check image quality (resolution, lighting, focus) - Lower confidence threshold: --confidence 0.5 - Verify image format (JPEG, PNG supported) - Ensure labels are clearly visible in images

OCR Returns Gibberish

Symptoms: Extracted text is unreadable or incorrect

Solutions: - Try Google Vision API instead of Tesseract - Check image rotation and orientation - Verify text language settings - Improve image preprocessing (contrast, denoising)

Memory Errors

Error: MemoryError or CUDA out of memory

Solutions: - Reduce batch size in processing scripts - Close other applications to free memory - Process images sequentially instead of in batches - Use CPU processing if GPU memory is insufficient

Slow Processing

Symptoms: Very long processing times

Solutions: - Enable GPU acceleration if available - Reduce image resolution if extremely high - Process in smaller batches - Use Docker for optimized resource usage - Monitor CPU and memory usage

Specific Error Messages

ImportError: No module named ‘cv2’

Solution: - Activate the conda environment: conda activate ELIE - Reinstall OpenCV: pip install opencv-python

FileNotFoundError: [Errno 2] No such file or directory

Solution: - Check file paths are correct and absolute - Ensure input directories exist and contain images - Verify output directories are writable

ModuleNotFoundError: No module named ‘tensorflow’

Solution: - Check TensorFlow installation: pip list | grep tensorflow - Reinstall if needed: pip install tensorflow>=2.16.0 - For Apple Silicon Macs: pip install tensorflow-macos

CUDA Error: device not found

Solution: - Install NVIDIA drivers and CUDA toolkit - Verify GPU compatibility - Use CPU-only versions if no compatible GPU

Quality Issues

Poor Classification Results

Symptoms: Labels misclassified (empty vs handwritten vs printed)

Solutions: - Review training data quality - Adjust classification thresholds - Manual review of borderline cases - Retrain models with domain-specific data

Inconsistent OCR Results

Symptoms: Same text extracted differently across runs

Solutions: - Use deterministic processing settings - Ensure consistent image preprocessing - Document processing parameters used - Consider ensemble methods for critical text

Missing Labels in Detection

Symptoms: Some labels not detected in multi-label images

Solutions: - Lower detection confidence threshold - Check for label overlap or occlusion - Verify image resolution is sufficient - Manual annotation of missed cases

Performance Optimization

Speed Improvements

Use SSD storage for image directories
Increase available RAM
Enable GPU processing for compatible operations
Process images in parallel where possible
Optimize image sizes for processing pipeline

Resource Management

Monitor system resources during processing
Set memory limits for Docker containers
Use batch processing for large datasets
Clean up temporary files regularly

Quality vs Speed Trade-offs

Higher confidence thresholds = faster but less complete
Lower resolution = faster but less accurate
Tesseract = faster, Google Vision = more accurate
Batch size affects memory usage and speed

Development Issues

Code Changes Not Reflected

Solution: - Reinstall package in development mode: pip install -e . - Restart Python interpreter - Clear Python cache: find . -name "*.pyc" -delete

Testing Failures

Solutions: - Ensure test data is available - Check all dependencies are installed - Run individual test modules to isolate issues - Update test expectations if API changed

Documentation Build Fails

Solutions: - Install documentation dependencies: pip install -e .[docs] - Clear Sphinx build cache: make clean in docs directory - Check for syntax errors in RST files - Verify all module imports work correctly

Getting More Help

Log File Analysis

Important log locations: - Application logs: logs/ directory - System logs: /var/log/ (Linux), Console app (macOS) - Docker logs: docker logs <container_name>

Diagnostic Information

When reporting issues, include: - Operating system and version - Python version and environment details - Complete error messages and stack traces - Input data characteristics (size, format, content) - Processing parameters used - System resource availability (RAM, GPU)

Community Resources

GitHub Issues: Report bugs and feature requests
Documentation: Check latest version online
Stack Overflow: Search for similar problems
Research Papers: Understanding algorithm limitations

Professional Support

For enterprise users: - Contact maintainers for dedicated support - Consider consulting services for custom implementations - Training workshops available for teams - Priority bug fixes and feature development

Prevention Tips

Regular Maintenance

Keep dependencies updated
Monitor disk space usage
Regular backup of configuration files
Test with sample data before large processing runs

Best Practices

Document processing workflows and parameters
Version control for custom configurations
Regular validation of output quality
Maintain separate environments for development and production

Monitoring

Set up automated health checks
Monitor processing success rates
Track performance metrics over time
Alert systems for critical failures