This change adds support for Intel Gaudi HPUs. #7275

emascarenhas · 2025-03-12T17:29:06Z

Several configuration files are provided in the examples directory for use with Gaudi.

LLaMA-Factory features and optimizations including inferencing, training (sft, dpo, etc.), LoRA fine-tuning, distributed training with DeepSpeed and DDP are working. Please see README for details. Co-authored-by: Yaser Afshar [email protected] Co-authored-by: Edward Mascarenhas [email protected] Co-authored-by: Jianhong-Zhang [email protected] Co-authored-by: Wenbin Chen [email protected]
Co-authored-by: Voas, Tanner [email protected]

Before submitting

[y] Did you read the contributor guideline?
[y] Did you write any new necessary tests?

Several configuration files are provided in the examples directory for use with Gaudi. LLaMA-Factory features and optimizations including inferencing, training (sft, dpo, etc.), LoRA fine-tuning, distributed training with DeepSpeed and DDP are working. Please see README for details. Co-authored-by: Yaser Afshar [email protected] Co-authored-by: Edward Mascarenhas [email protected] Co-authored-by: Jianhong-Zhang [email protected] Co-authored-by: Wenbin Chen [email protected] Co-authored-by: Voas, Tanner [email protected]

hiyouga

Thanks for your contribution, please view the comments

hiyouga · 2025-03-12T18:00:27Z

examples/train_lora/qwen2vl_lora_sft_gaudi.yaml

Could we avoid add too many examples? I think one for fine-tuning and another one for inference is sufficient

Thanks for your comment. Yes we can. I will remove following files from examples.
deleted: examples/inference/llama3_gaudi.yaml
deleted: examples/train_full/llama3_full_sft_ds3_gaudi.yaml
deleted: examples/train_full/qwen2_full_sft_ds1_gaudi.yaml
deleted: examples/train_full/qwen2_full_sft_ds3_gaudi.yaml
deleted: examples/train_lora/llama3_lora_dpo_gaudi.yaml
deleted: examples/train_lora/llama3_lora_eval_gaudi.yaml
deleted: examples/train_lora/llama3_lora_ppo_gaudi.yaml
deleted: examples/train_lora/llama3_lora_pretrain_gaudi.yaml
deleted: examples/train_lora/llama3_lora_reward_gaudi.yaml

Only remaining will be 1 file in inference, and 3 in train_lora for llama3 with deepspeed and without deepspeed, and qwen2vl.

hiyouga · 2025-03-12T18:03:10Z

requirements.txt

@@ -1,7 +1,6 @@
-transformers>=4.41.2,<=4.49.0,!=4.46.*,!=4.47.*,!=4.48.*;python_version<'3.10'


Transformers 4.49 is required. We should not downgrade the version

Ok. I reverted to existing requirements.txt file and fixed the requirements-gaudi.txt file. We expect in about a months time to support transformers 4.49.2 with Gaudi as well at which point we may not need requirements-gaudi.txt. a0f1661 and a16e3d4

…or gaudi in separate file.

hiyouga requested changes Mar 12, 2025

View reviewed changes

hiyouga added the pending This problem is yet to be addressed label Mar 12, 2025

emascarenhas added 3 commits March 12, 2025 15:28

Delete examples of Gaudi yaml files

3074690

Revert to original requirements.txt and capture transformer version f…

a0f1661

…or gaudi in separate file.

Revert transformers and other versioning

a16e3d4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This change adds support for Intel Gaudi HPUs. #7275

This change adds support for Intel Gaudi HPUs. #7275

emascarenhas commented Mar 12, 2025

hiyouga left a comment

hiyouga Mar 12, 2025

emascarenhas Mar 12, 2025

emascarenhas Mar 12, 2025

hiyouga Mar 12, 2025

emascarenhas Mar 12, 2025 •

edited

Loading

		@@ -1,7 +1,6 @@
		transformers>=4.41.2,<=4.49.0,!=4.46.,!=4.47.,!=4.48.*;python_version<'3.10'

This change adds support for Intel Gaudi HPUs. #7275

Are you sure you want to change the base?

This change adds support for Intel Gaudi HPUs. #7275

Conversation

emascarenhas commented Mar 12, 2025

Before submitting

hiyouga left a comment

Choose a reason for hiding this comment

hiyouga Mar 12, 2025

Choose a reason for hiding this comment

emascarenhas Mar 12, 2025

Choose a reason for hiding this comment

emascarenhas Mar 12, 2025

Choose a reason for hiding this comment

hiyouga Mar 12, 2025

Choose a reason for hiding this comment

emascarenhas Mar 12, 2025 • edited Loading

Choose a reason for hiding this comment

emascarenhas Mar 12, 2025 •

edited

Loading