You're correct; at the moment, Archon only supports arena_hard_auto and mt_bench for sampling. Most of the work is getting Power Ranker to support new benchmarks, since ITAS relies on Power Ranker to decide which configurations work best. We don't have a timeline for supporting other benchmarks, but it is on the agenda. If you implement this for your own use case and put up a PR, that would definitely help us get the integration working sooner.
[Note: edited for clarification]
Dear authors,

I was trying to run the ITAS algorithm for the GSM8K benchmark to get a task-specific ARCHON architecture. Unfortunately, I'm a bit stuck with unsupported benchmark issues.

I can see that the provided scripts under benchmarks/ and benchmarks/gsm8k can generate and evaluate answers.

Unfortunately, it seems like the itas_algorithm script in the current released version supports only "mt_bench" and "arena_hard_auto":

Archon/src/archon/itas_algorithms/itas_algorithm.py
Line 150 in d45892c

Please let me know if I'm wrong and what steps are necessary to get a task-specific ARCHON architecture.

My intuition is that I need to add a question map to use in power_ranker:

Archon/src/archon/itas_algorithms/power_ranker.py
Lines 24 to 27 in d45892c

as well as add some logic to compare the generated answer against the correct one. Is my intuition correct? Do you plan to update the code with this logic by any chance?

Thanks in advance!
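
For anyone attempting this integration, the answer-comparison step mentioned above could look roughly like the following. This is a minimal sketch, not code from the Archon repository: the function names (`extract_reference`, `extract_prediction`, `is_correct`) are hypothetical, and it assumes only that GSM8K reference solutions end with `#### <number>` and that the model's final answer is the last number in its output. How the result would be fed into Power Ranker is left open.

```python
import re
from typing import Optional

# Signed integers or decimals, optionally with thousands separators (e.g. "1,234.5").
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def _normalize(num: str) -> str:
    """Drop commas and render whole-valued numbers without a decimal part."""
    value = float(num.replace(",", ""))
    return str(int(value)) if value.is_integer() else str(value)


def extract_reference(reference: str) -> Optional[str]:
    """Return the number after the last '####' marker in a GSM8K reference solution."""
    _, marker, tail = reference.rpartition("####")
    if not marker:
        return None
    match = _NUMBER.search(tail)
    return _normalize(match.group()) if match else None


def extract_prediction(generated: str) -> Optional[str]:
    """Heuristic (an assumption): treat the last number in the output as the answer."""
    matches = _NUMBER.findall(generated)
    return _normalize(matches[-1]) if matches else None


def is_correct(generated: str, reference: str) -> bool:
    """Exact match on the normalized final numbers."""
    ref = extract_reference(reference)
    return ref is not None and extract_prediction(generated) == ref
```

A call such as `is_correct("9 * 2 = $18.", "... #### 18")` returns `True`. The last-number heuristic is deliberately simple; outputs that mention further numbers after the final answer would need a stricter extraction rule.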