Llama3.1 with torchtune #1123
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1123
Note: links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit 1599c2b with merge base 964d437. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
This PR makes torchchat support multi-modality model definition and construction. To demonstrate this capability, we integrate the Flamingo component into the system. Note that this is only bare-minimum support for model definition. Please check the openai_api_multimodal branch for e2e, and #1123 (comment) for a better structure and Llama 3.1 support.
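To illustrate the kind of multi-modality construction the PR describes, here is a minimal, self-contained sketch of building per-component modules from a config. The `ModelConfig` dataclass and `build_modules` helper are hypothetical stand-ins (not torchchat's actual classes), and the `nn.Module` plumbing is stripped out so the sketch has no torch dependency; it only mirrors the `transformer_args` dispatch pattern visible in the diff below.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class ModelConfig:
    # Hypothetical stand-in for torchchat's model config: maps a component
    # name ("text", "encoder", ...) to either a kwargs dict or a single value.
    transformer_args: Dict[str, Any] = field(default_factory=dict)


def build_modules(config: ModelConfig, registry: Dict[str, type]) -> Dict[str, Any]:
    """Instantiate each registered component class from its config entry."""
    modules = {}
    for name, module_class in registry.items():
        args = config.transformer_args[name]
        if isinstance(args, dict):
            modules[name] = module_class(**args)  # kwargs-style entry
        else:
            modules[name] = module_class(args)  # single positional config
    return modules
```

In torchchat itself the result would be wrapped in an `nn.ModuleDict`; the dict return here is purely to keep the sketch dependency-free.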
The PR itself looks fine to me, with some minor nits.
torchchat/cli/builder.py
Outdated
@@ -35,6 +35,14 @@
from torchchat.utils.measure_time import measure_time
from torchchat.utils.quantize import quantize_model

# bypass the import issue before torchao is ready on macos
try:
    from torchtune.training import set_default_dtype
Unused import?
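The guarded import in the diff above is truncated before its `except` clause. A complete sketch of the pattern might look like the following; the fallback branch is an assumption for illustration, not the PR's actual code.

```python
# bypass the import issue before torchao is ready on macos
try:
    from torchtune.training import set_default_dtype
except ImportError:
    # Hypothetical fallback: a no-op context manager so callers can still
    # write `with set_default_dtype(dtype): ...` when torchtune is absent.
    from contextlib import contextmanager

    @contextmanager
    def set_default_dtype(dtype):
        yield
```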
@@ -11,6 +11,7 @@
from enum import Enum
from pathlib import Path
from typing import Callable, Dict, Optional, Union
from abc import ABC, abstractmethod
ABC is unused?
ABC should be one of Model's parents. Fixed it.
torchchat/model.py
Outdated
if isinstance(self.config.transformer_args[name], dict):
    modules[name] = module_class(**self.config.transformer_args[name])
else:
    modules[name] = module_class(self.config.transformer_args[name])
Suggested change:
-if isinstance(self.config.transformer_args[name], dict):
-    modules[name] = module_class(**self.config.transformer_args[name])
-else:
-    modules[name] = module_class(self.config.transformer_args[name])
+if isinstance(config_args := self.config.transformer_args[name], dict):
+    modules[name] = module_class(**config_args)
+else:
+    modules[name] = module_class(config_args)
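The suggestion above uses an assignment expression (the walrus operator, Python 3.8+) so the dictionary lookup is written, and performed, only once. A minimal standalone illustration of the same pattern, with hypothetical names:

```python
transformer_args = {"text": {"dim": 8}, "vision": 16}


def describe(name: str) -> str:
    # `:=` binds the looked-up value, avoiding a second dict access.
    if isinstance(config_args := transformer_args[name], dict):
        return f"{name}: kwargs {sorted(config_args)}"
    return f"{name}: scalar {config_args}"
```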
torchchat/model.py
Outdated
class FlamingoModel(Model):
    def forward(self, tokens: Tensor, encoder_input: Optional[Dict[str, Tensor]] = None, encoder_mask: Optional[Tensor] = None) -> Tensor:
Lint: this line exceeds the line-length limit.
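One common way to address the long-line lint warning is to wrap the signature across lines. A self-contained sketch (with `Tensor` and `Model` as stand-in classes, since the real `torch.Tensor` and torchchat `Model` are not needed to show the formatting):

```python
from typing import Dict, Optional


class Tensor:  # stand-in for torch.Tensor, to keep the sketch dependency-free
    pass


class Model:  # stand-in for torchchat's Model base class
    pass


class FlamingoModel(Model):
    # Same signature as the flagged line, wrapped to satisfy line-length lint.
    def forward(
        self,
        tokens: Tensor,
        encoder_input: Optional[Dict[str, Tensor]] = None,
        encoder_mask: Optional[Tensor] = None,
    ) -> Tensor:
        ...
```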
This PR aims to add torchtune Llama 3.1 support while keeping the original torchchat Llama 3.1 for reference.
To play with it:
The command for the original model should be unchanged:
If you want to try the torchtune Llama 3.1 model, consider using: