update helmet formatting #2817

Merged 2 commits on Apr 16, 2025
40 changes: 24 additions & 16 deletions helmet.md
@@ -1,31 +1,39 @@
---
title: "Introducing HELMET: Holistically Evaluating Long-context Language Models"
title: "Introducing HELMET: Holistically Evaluating Long-context Language Models"
thumbnail: /blog/assets/helmet/thumbnail.png
authors:
- user: hyen
guest: true
org: princeton-nlp
- user: gaotianyu1350
guest: true
org: princeton-nlp
- user: houminmin
guest: true
org: Intel
- user: kding1
guest: true
org: Intel
- user: danf
guest: true
org: Intel
- user: moshew
guest: true
org: Intel
- user: cdq10131
guest: true
org: princeton-nlp

---

# Introducing <img src="./assets/helmet/logo.jpeg" alt="logo" width="30"/><span style="color: #E77500"><b>HELMET</b></span>: Holistically Evaluating Long-context Language Models

By Howard Yen<sup><span style="color: #E77500">&spades;</span></sup>,
Tianyu Gao<sup><span style="color: #E77500">&spades;</span></sup>,
Minmin Hou<sup><span style="color: #00C7FD">&clubs;</span></sup>,
Ke Ding<sup><span style="color: #00C7FD">&clubs;</span></sup>,
Daniel Fleischer<sup><span style="color: #00C7FD">&clubs;</span></sup>,
Peter Izsak<sup><span style="color: #00C7FD">&clubs;</span></sup>,
Moshe Wasserblat<sup><span style="color: #00C7FD">&clubs;</span></sup>,
and Danqi Chen<sup><span style="color: #E77500">&spades;</span></sup>\
<sup><span style="color: #E77500">&spades;</span></sup>Princeton Language and Intelligence (PLI), Princeton University\
<sup><span style="color: #00C7FD">&clubs;</span></sup>Intel\
2025-02-29
<h1 class="subtitle is-3 publication-subtitle">
<span>Introducing <span style="color: #E77500"><b>HELMET</b></span>: Holistically Evaluating Long-context Language Models</span>
</h1>

Contact: [email protected] \
Paper: https://arxiv.org/abs/2410.02694 \
Code & Data: https://github.com/princeton-nlp/HELMET \
Website: https://princeton-nlp.github.io/HELMET
Website: https://princeton-nlp.github.io/HELMET \
Code & Data: https://github.com/princeton-nlp/HELMET

Since we first released HELMET last October, there has been more development on long-context language models than ever before, and we are thrilled to see the adoption of HELMET by the community, such as [Microsoft's Phi-4](https://arxiv.org/abs/2412.08905) and [AI21's Jamba 1.6](https://www.ai21.com/blog/introducing-jamba-1-6/).
After the initial release, we have added more models to our evaluation suite and conducted additional analyses. We are excited to share our new results and present HELMET at ICLR 2025!