Macro F1 Score calculation in your code. #3

Hi @ChristophAlt,

Thank you for the wonderful codebase. 

I see that to report the macro-averaged F1 score, you first compute macro-averaged precision and macro-averaged recall, and then combine them with `f1_measure = 2.0 * ((precision * recall) / (precision + recall + 1e-13))` ([here](https://github.com/DFKI-NLP/RelEx/blob/master/relex/metrics/f1_measure.py#L146)).

In standard implementations of the macro F-score (such as [scikit-learn](https://scikit-learn.org/stable/modules/generated/sklearn.metrics.f1_score.html) with `average='macro'`), the F1 score is first computed for each class, and the macro-averaged F1 is then the unweighted mean of these per-class F1 scores.
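To make the difference concrete, here is a small sketch in plain Python. It is not taken from RelEx or scikit-learn: the helper names (`per_class_precision_recall`, `f1_of_macro_averages`, `mean_of_per_class_f1`) and the toy labels are made up for illustration, and the first function only mirrors the formula as I read it from the linked line.

```python
from collections import Counter

EPS = 1e-13  # same smoothing term as in the formula quoted above


def per_class_precision_recall(y_true, y_pred):
    """Return two dicts mapping each class to its precision and recall."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    precision = {c: tp[c] / (tp[c] + fp[c] + EPS) for c in labels}
    recall = {c: tp[c] / (tp[c] + fn[c] + EPS) for c in labels}
    return precision, recall


def f1(precision, recall):
    return 2.0 * ((precision * recall) / (precision + recall + EPS))


def f1_of_macro_averages(y_true, y_pred):
    """Average precision and recall over classes first, then take F1 once."""
    precision, recall = per_class_precision_recall(y_true, y_pred)
    macro_p = sum(precision.values()) / len(precision)
    macro_r = sum(recall.values()) / len(recall)
    return f1(macro_p, macro_r)


def mean_of_per_class_f1(y_true, y_pred):
    """Take F1 for each class first, then average the F1 scores."""
    precision, recall = per_class_precision_recall(y_true, y_pred)
    scores = [f1(precision[c], recall[c]) for c in precision]
    return sum(scores) / len(scores)


# Hypothetical toy data: class "a" has P=1.0, R=0.5; class "b" has P=0.5, R=1.0.
y_true = ["a", "a", "a", "a", "b", "b"]
y_pred = ["a", "a", "b", "b", "b", "b"]

score_from_averages = f1_of_macro_averages(y_true, y_pred)  # about 0.75
score_per_class = mean_of_per_class_f1(y_true, y_pred)      # about 0.667
print(score_from_averages, score_per_class)
```

On this toy input both classes have an F1 of about 0.667, so the per-class average is about 0.667, while macro precision and macro recall are both 0.75, which gives 0.75 when F1 is taken from the averages.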

The two definitions generally give different values, so you might want to mention this somewhere in your documentation to avoid confusion when comparing results.

Let me know what you think. 

Thanks,
Ashim

