Fix LTX-2 Inference when num_videos_per_prompt > 1 and CFG is Enabled#13121
Fix LTX-2 Inference when num_videos_per_prompt > 1 and CFG is Enabled#13121
num_videos_per_prompt > 1 and CFG is Enabled#13121Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
sayakpaul
left a comment
There was a problem hiding this comment.
Thanks! Should we use a guidance scale of > 1 instead in the tests then?
I think it would be reasonable to test with both |
|
Merging as the CI failures should be unrelated. |
|
@dg845 would you like to open a PR with those modifications? |
What does this PR do?
This PR fixes LTX-2 inference when
num_videos_per_prompt > 1and CFG is enabled by duplicating the video and audio position ids for CFG. Usingnum_videos_per_prompt > 1worked before this PR when CFG was not used (guidance_scale=1.0), but now it should work when CFG is used as well. An example script that should work after this PR isWho can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@sayakpaul
@yiyixuxu