label_processing.utils
Utility functions for the entomological label processing pipeline.
Provides image validation, filename generation, JSON/CSV I/O, NURI format checking, and model integrity verification helpers used across all pipeline variants.
Functions
|
Checks whether the directory contains valid JPG files, validating the integrity of each file. |
|
Check the NURI format in the OCR transcription "text". |
|
Takes the path to a file or directory as input and returns it with an appendix added to the end. |
|
Loads the CSV file using Pandas. |
|
Loads the JPG files using the OpenCV module. |
|
Load JSON data from a file and deserialize it. |
|
Read a CSV file containing vocabulary and convert it to a dictionary. |
|
Correct NURI format in OCR transcription JSON output. |
|
Saves a JSON file in a human-readable format. |
|
Validate image file integrity with strict memory safety limits. |
|
SECURITY: Mandatory model file integrity verification using SHA256 checksums. |
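
As an illustration of the SHA256-based model integrity check summarized above, the following is a minimal sketch using only the Python standard library. The names `sha256_of_file` and `verify_file_checksum` are hypothetical and are not the functions exported by this module. The behavior shown (returning `False` for a missing file or a mismatched digest) is an assumption, and the actual helper may raise an exception instead.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large model files are never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file_checksum(path, expected_sha256):
    """Hypothetical helper: True only if the file exists and its digest matches.

    Assumption: a missing file or a mismatch yields False rather than an error.
    """
    path = Path(path)
    if not path.is_file():
        return False
    actual = sha256_of_file(path)
    # compare_digest performs a constant-time comparison of the two hex strings.
    return hmac.compare_digest(actual, expected_sha256.strip().lower())
```

A caller would typically run such a check before loading model weights and refuse to continue when it fails. The expected digest would come from a trusted source, such as a checksum file shipped with the release.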