社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX! #182

StarCycle · 2024-08-27T13:31:21Z

Feature request / 功能建议

希望能看到CogVideoX的图生视频版本！！！

Motivation / 动机

社区盼开源图生视频模型如久旱盼甘霖，只生成6s短视频的文生视频模型在生产中用处比较有限，用6s短视频表达清楚创作者的意思是困难的。

只有具备图生视频能力，才能在拼接多段短视频时保持人物和场景的一致性，才能创作出长视频。从文生视频到图生视频，在技术上只是小小一步，但对创作者而言是能用和不能用的区别。

至于训练生成更长短视频的模型（比如8秒，10秒）相对而言不是那么重要，一般一个镜头时长不会超过6秒。

Your contribution / 您的贡献

您发布初始版本以后，我可以试用或者提PR改进网络结构（如果您这边愿意提供微调数据）。

比如OpenSoraPlan为了实现根据前后帧inpaint中间帧，采用了如下的方式训练I2V模型：

bendanzzc · 2024-08-27T13:42:32Z

Your work is very helpful. The earth and the sky will praise your generosity, and countless algorithm engineers will praise you, the selfless devotee, the great architect.

trouble-maker007 · 2024-08-27T17:18:48Z

I am also trying to port the train inpaint solution from the Open Sora Plan to CogVideoX. This solution is similar to EasyAnimate, but I hope the official team can provide a fine-tuning code for Diffusers, as modifying the code based on SAT is not very intuitive. The forward process involves different modules, which are all in different .py files.

Nurburgring-Zhang · 2024-08-27T17:29:29Z

期待I2V，首尾帧，视频延长，我觉得我能做出一部大片。。。。。

Maikauer · 2024-08-28T01:13:47Z

2B版本不是已经可以图生视频了吗，我没有本地部署，但是我有使用官方的智谱APP试用了一下，由于效果不是太理想就没再用了。难道说开源的版本不能使用图生视频功能吗，只有API才可以用吗？我不是太清楚。另外发行介绍说3060就可以用5B版本，用试过的朋友了吗，效果如何

StarCycle · 2024-08-28T01:36:49Z

@Maikauer

Yes, the open-source version does not support image2video at this moment. If there is an open-source strong I2V model, there will be a community finetuning it (like Stable Diffusion & LLaMA).

I also tried OpenSoraPlan 1.2 and OpenSora 1.2 for image2video. Limited by resources, their models are not as strong as CogVideoX, though they are very good developers.

zRzRzRzRzRzRzR · 2024-08-28T04:26:23Z

我们收到了这个意见，我们会继续调研一下，感谢你们的支持

codingcn · 2024-08-28T06:50:10Z

2B版本不是已经可以图生视频了吗，我没有本地部署，但是我有使用官方的智谱APP试用了一下，由于效果不是太理想就没再用了。难道说开源的版本不能使用图生视频功能吗，只有API才可以用吗？我不是太清楚。另外发行介绍说3060就可以用5B版本，用试过的朋友了吗，效果如何

图生视频并没有开源，目前只是开源了t2v，但大部分人都是需要i2v保持视频的统一性和可控性，所以文生视频显得非常鸡肋，用处不大。
目前开源社区只看到了阿里的EasyAnimate做的比较好，官方还对ComfyUI进行了支持。

非常期待清影能开源i2v。

zRzRzRzRzRzRzR · 2024-08-28T12:48:19Z

comfyui我们很快会支持

zRzRzRzRzRzRzR self-assigned this Aug 27, 2024

zRzRzRzRzRzRzR mentioned this issue Aug 28, 2024

API支持图像和文本一起输入，CogVideoX-5B什么时候可以支持呢？ #191

Open

zRzRzRzRzRzRzR pinned this issue Aug 28, 2024

zRzRzRzRzRzRzR unpinned this issue Aug 28, 2024

zRzRzRzRzRzRzR added enhancement New feature or request help wanted Extra attention is needed labels Aug 28, 2024

zRzRzRzRzRzRzR mentioned this issue Aug 28, 2024

Work plan and enhancement / 工作计划和用户诉求 #194

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX! #182

社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX! #182

StarCycle commented Aug 27, 2024

bendanzzc commented Aug 27, 2024

trouble-maker007 commented Aug 27, 2024

Nurburgring-Zhang commented Aug 27, 2024

Maikauer commented Aug 28, 2024

StarCycle commented Aug 28, 2024

zRzRzRzRzRzRzR commented Aug 28, 2024

codingcn commented Aug 28, 2024

zRzRzRzRzRzRzR commented Aug 28, 2024

社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX! #182

社区盼开源图生视频模型如久旱盼甘霖 | We really need an image2video CogVideoX! #182

Comments

StarCycle commented Aug 27, 2024

Feature request / 功能建议

Motivation / 动机

Your contribution / 您的贡献

bendanzzc commented Aug 27, 2024

trouble-maker007 commented Aug 27, 2024

Nurburgring-Zhang commented Aug 27, 2024

Maikauer commented Aug 28, 2024

StarCycle commented Aug 28, 2024

zRzRzRzRzRzRzR commented Aug 28, 2024

codingcn commented Aug 28, 2024

zRzRzRzRzRzRzR commented Aug 28, 2024