label_evaluation.redundancy
Functions
|
Preprocess the dataset by converting text to lowercase, removing punctuation and whitespace, and excluding entries containing 'http'. |
|
Calculate the percentage of transcription redundancy in a dataset. |
|
Identify duplicate entries in a preprocessed dataset. |