diff --git a/evals/evaluation/HELMET/README.md b/evals/evaluation/HELMET/README.md
index 245cf0b2..4cb23e49 100644
--- a/evals/evaluation/HELMET/README.md
+++ b/evals/evaluation/HELMET/README.md
@@ -1,6 +1,5 @@
# HELMET: How to Evaluate Long-context Language Models Effectively and Thoroughly
-
[[Paper](https://arxiv.org/abs/2410.02694)]
HELMET (How to Evaluate Long-context Models Effectively and Thoroughly) is a comprehensive benchmark for long-context language models covering seven diverse categories of tasks.