[FEATURE REQUEST] Functionality for analyzing the differences between two Annotation objects. #147

Open
opened 2025-10-14 16:27:20 -06:00 by navan · 0 comments
Owner

Originally created by @AndriyMulyar on 12/28/2018

What problem does your feature solve?
A way to analyze annotations, primarily to examine the differences between gold and predicted annotations.

Describe the solution you'd like
The Annotation class should be given static methods such as Annotation.diff(ann_object_1, ann_object_2), which would output the differences between two annotation objects. It could also take a leniency parameter to handle fuzzy annotation matching.
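
A rough sketch of what such a diff could look like, to make the request concrete. It does not use medaCy's actual Annotation class: annotations are assumed to be simple (label, start, end) spans, and the names Span, spans_match, and diff, as well as the meaning of leniency (maximum character offset difference per boundary), are hypothetical.

```python
from typing import NamedTuple


class Span(NamedTuple):
    """Hypothetical stand-in for one annotated entity."""
    label: str
    start: int
    end: int


def spans_match(a, b, leniency=0):
    """Two spans match if the labels are equal and each boundary
    differs by at most `leniency` characters (0 means exact match)."""
    return (
        a.label == b.label
        and abs(a.start - b.start) <= leniency
        and abs(a.end - b.end) <= leniency
    )


def diff(gold, predicted, leniency=0):
    """Greedily pair each gold span with the first unused predicted span
    that matches it, then report what is left over on either side."""
    unmatched_predicted = list(predicted)
    matched, missing = [], []
    for gold_span in gold:
        hit = next(
            (p for p in unmatched_predicted if spans_match(gold_span, p, leniency)),
            None,
        )
        if hit is None:
            missing.append(gold_span)  # in gold, not predicted
        else:
            matched.append((gold_span, hit))
            unmatched_predicted.remove(hit)
    return {
        "matched": matched,
        "missing": missing,
        "spurious": unmatched_predicted,  # predicted, not in gold
    }


gold = [Span("Drug", 0, 7), Span("Dose", 10, 15)]
predicted = [Span("Drug", 0, 8), Span("Route", 20, 24)]

strict = diff(gold, predicted)               # exact boundaries only
lenient = diff(gold, predicted, leniency=1)  # tolerate off-by-one boundaries
```

The greedy pairing is the simplest choice and may not give the best pairing when several predicted spans fall within the leniency window of the same gold span; a real implementation might want an optimal assignment instead.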

Interface with sklearn to compute various evaluation metrics between two annotation files (assuming one is gold and one is predicted).
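
One possible way to hand annotations to sklearn, sketched under assumptions: annotations are again taken to be (label, start, end) tuples rather than medaCy objects, only exact span matches count, and the helper to_label_vectors and the "NONE" placeholder label are hypothetical. Each distinct span becomes one sample, so a span missing from one side shows up as "NONE" there.

```python
from sklearn.metrics import precision_recall_fscore_support


def to_label_vectors(gold, predicted, none_label="NONE"):
    """Align gold and predicted (label, start, end) tuples by exact span.

    Returns two parallel label lists, one entry per distinct span;
    a span absent from one side gets `none_label` on that side.
    """
    gold_by_span = {(start, end): label for label, start, end in gold}
    pred_by_span = {(start, end): label for label, start, end in predicted}
    spans = sorted(set(gold_by_span) | set(pred_by_span))
    y_true = [gold_by_span.get(span, none_label) for span in spans]
    y_pred = [pred_by_span.get(span, none_label) for span in spans]
    return y_true, y_pred


gold = [("Drug", 0, 7), ("Dose", 10, 15), ("Drug", 30, 37)]
predicted = [("Drug", 0, 7), ("Drug", 30, 37), ("Dose", 40, 45)]

y_true, y_pred = to_label_vectors(gold, predicted)

# Score only real entity labels; "NONE" is bookkeeping, not a class.
labels = sorted({label for label in y_true + y_pred if label != "NONE"})

precision, recall, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, labels=labels, average="micro", zero_division=0
)
```

Passing the same y_true and y_pred to sklearn.metrics.classification_report would give a per-label breakdown instead of a single micro-averaged score.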

Additional context
This would be very useful for analyzing results and for guiding how pipelines are built.

Reference: github/medaCy#147