Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Skipping verses and sound noise and volume. #73

Open
AshD opened this issue Feb 20, 2025 · 2 comments
Open

Skipping verses and sound noise and volume. #73

AshD opened this issue Feb 20, 2025 · 2 comments

Comments

@AshD
Copy link

AshD commented Feb 20, 2025

When the number of verses increase to 10+, the model starts skipping verses. No Errors. Each verse is around 130 to 180 characters.

Also, some times the song has a lot of noise only for certain words and the volume is much higher. Data corruption?

I am running with --new tokens at 1500 or 2000 to make sure the model does not run out of context size. Is there a way to increase the context?

I am running the YuE-s1-7B-anneal-en-icl model with audio prompt on a H200 NVL (144GB VRAM) with python 3.10 and torch 2.6.0+cu124

Thanks,
Ash

@a43992899
Copy link
Collaborator

a43992899 commented Feb 22, 2025

May I look at your txts? It Could be too many verses.

@AshD
Copy link
Author

AshD commented Feb 25, 2025

Sorry, I missed this notification earlier.

In the sample below it misses the entire Outro section and the lines (in italics below)
Please, take the flight AND I'll hold it within
See attached zip file for music generated.

Here is the cmd line

python infer.py \
    --cuda_idx 0 \
    --stage1_model m-a-p/YuE-s1-7B-anneal-en-icl \
    --stage2_model m-a-p/YuE-s2-1B-general \
    --genre_txt ../prompt_egs/genre.txt \
    --lyrics_txt ../lyrics2.txt \
    --run_n_segments 7 \
    --stage2_batch_size 4 \
    --output_dir ../output \
    --max_new_tokens 2500 \
    --repetition_penalty 1.1 \
    --use_audio_prompt \
    --audio_prompt_path ../prompt_egs/pop.00001.mp3 \
    --prompt_start_time 0 \
    --prompt_end_time 30 

genre.txt
inspiring female uplifting pop airy vocal electronic bright vocal vocal

inspiring-female-uplifting-pop-airy-vocal-electronic-bright-vocal-vocal_tp0@93_T1@0_rp1@1_maxtk2500_6cdfcd63-463f-4779-abab-9d05b0c3fcf6_mixed.zip

lyrics2.txt

[verse]
I've got to hold you, anyhow
Not later, this hour

[verse]
I know it's dark, whoa I can't mark
So come on and take my hand
Please, take my hand
Now, don't start fretting, trying to understand

[chorus]
The stars are shining, let's not miss this prime
I know it's night, whoa I feel the hype
So come on and take the flight
_Please, take the flight_

[verse]
I know it's hard making this request
There's no other path I can take to be your quest

[chorus]
If only your kin would consent
Situations like this we could prevent
No, no
I won't reveal to anyone
_I'll hold it within_

[chorus]
I know it's dark, whoa I can't mark
So come on and take my heart
Please, take my heart


[outro]
_'Cause I desire you, truly I do
Oh, stay by my side
Yes, oh, darling I require you
I want you, I want you to light my way_  

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants