diff --git a/README.md b/README.md
index 7812ed8..272596d 100644
--- a/README.md
+++ b/README.md
@@ -1,9 +1,25 @@
 # SEFLAG
+
 **S**ystematic **E**valuation **F**ramework for NLP models and datasets in **L**atin and **A**ncient **G**reek
 
 ## Evaluation Results
+
 ### Lemmatization on UD test data
-#### Ancient Greek: greCy 
+
+#### Ancient Greek: greCy
+
 {'accuracy': 0.8942049121548943}
+
 #### Latin: LatinCy
+
 {'accuracy': 0.8843245653143111}
+
+### NER with average metrics (weighted -- micro -- macro)
+
+|               |                    |                |
+|---------------|--------------------|----------------|
+| **Model**     | flair_grc_bert_ner | LatinCy        |
+| **Dataset**   | Yousef et al.      | Herodotos      |
+| **Precision** | 96 -- 94 -- 66     | 96 -- 96 -- 59 |
+| **Recall**    | 94 -- 94 -- 76     | 96 -- 96 -- 60 |
+| **F1**        | 95 -- 94 -- 64     | 96 -- 96 -- 58 |