|
1 | 1 | ---
|
2 |
| -title: "Introducing HELMET: Holistically Evaluating Long-context Language Models" |
| 2 | +title: "Introducing HELMET: Holistically Evaluating Long-context Language Models" |
3 | 3 | thumbnail: /blog/assets/helmet/thumbnail.png
|
4 | 4 | authors:
|
5 | 5 | - user: hyen
|
6 | 6 | guest: true
|
7 | 7 | org: princeton-nlp
|
| 8 | +- user: gaotianyu1350 |
| 9 | + guest: true |
| 10 | + org: princeton-nlp |
| 11 | +- user: houminmin |
| 12 | + guest: true |
| 13 | + org: Intel |
| 14 | +- user: kding1 |
| 15 | + guest: true |
| 16 | + org: Intel |
| 17 | +- user: danf |
| 18 | + guest: true |
| 19 | + org: Intel |
| 20 | +- user: moshew |
| 21 | + guest: true |
| 22 | + org: Intel |
| 23 | +- user: cdq10131 |
| 24 | + guest: true |
| 25 | + org: princeton-nlp |
8 | 26 |
|
9 | 27 | ---
|
10 | 28 |
|
11 |
| -# Introducing <img src="./assets/helmet/logo.jpeg" alt="logo" width="30"/><span style="color: #E77500"><b>HELMET</b></span>: Holistically Evaluating Long-context Language Models |
12 |
| - |
13 |
| -By Howard Yen<sup><span style="color: #E77500">♠</span></sup>, |
14 |
| -Tianyu Gao<sup><span style="color: #E77500">♠</span></sup>, |
15 |
| -Minmin Hou<sup><span style="color: #00C7FD">♣</span></sup>, |
16 |
| -Ke Ding<sup><span style="color: #00C7FD">♣</span></sup>, |
17 |
| -Daniel Fleischer<sup><span style="color: #00C7FD">♣</span></sup>, |
18 |
| -Peter Izsak<sup><span style="color: #00C7FD">♣</span></sup>, |
19 |
| -Moshe Wasserblat<sup><span style="color: #00C7FD">♣</span></sup>, |
20 |
| -and Danqi Chen<sup><span style="color: #E77500">♠</span></sup>\ |
21 |
| -<sup><span style="color: #E77500">♠</span></sup>Princeton Language and Intelligence (PLI), Princeton University\ |
22 |
| -<sup><span style="color: #00C7FD">♣</span></sup>Intel\ |
23 |
| -2025-02-29 |
| 29 | +<h1 class="subtitle is-3 publication-subtitle"> |
| 30 | + <span>Introducing <span style="color: #E77500"><b>HELMET</b></span>: Holistically Evaluating Long-context Language Models</span> |
| 31 | +</h1> |
24 | 32 |
|
25 | 33 |
|
26 | 34 | Paper: https://arxiv.org/abs/2410.02694 \
|
27 |
| -Code & Data: https://github.com/princeton-nlp/HELMET \ |
28 |
| -Website: https://princeton-nlp.github.io/HELMET |
| 35 | +Website: https://princeton-nlp.github.io/HELMET \ |
| 36 | +Code & Data: https://github.com/princeton-nlp/HELMET |
29 | 37 |
|
30 | 38 | Since we first released HELMET last October, long-context language models have developed faster than ever, and we are thrilled to see the community adopt HELMET, for example in [Microsoft's Phi-4](https://arxiv.org/abs/2412.08905) and [AI21's Jamba 1.6](https://www.ai21.com/blog/introducing-jamba-1-6/).
|
31 | 39 | Since the initial release, we have added more models to our evaluation suite and conducted additional analyses. We are excited to share our new results and present HELMET at ICLR 2025!
|
|