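《李宏毅深度学习教程》 (Hung-yi Lee Deep Learning Tutorial; recommended by Prof. Hung-yi Lee 👍, also known as the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases Topics: machine-learning, tutorial, reinforcement-learning, deep-learning, cnn, transformer, gan, rnn, pruning, transfer-learning, bert, diffusion, self-attention, network-compression, chatgpt, leedl-tutorial ...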
(robodiff)[diffusion_policy]$ cat data/pusht_eval_output/eval_log.json
{
  "test/mean_score": 0.9150393806777066,
  "test/sim_max_reward_4300000": 1.0,
  "test/sim_max_reward_4300001": 0.9872969750774386,
  ...
  "train/sim_video_1": "data/pusht_eval_output//media/2fo4btlf.mp4"
} ...
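A log like the one above can be summarized with a few lines of Python. This is a minimal sketch, not part of the diffusion_policy repository: the helper name `summarize_eval_log` is hypothetical, and reading the numeric suffix of each `test/sim_max_reward_*` key as a per-rollout seed is an assumption. The embedded excerpt contains only the entries visible in the output; the elided ones are left out, so the mean computed here is not expected to match the reported `test/mean_score`.

```python
import json

# Excerpt of eval_log.json containing only the values visible in the snippet.
# The real file has more entries (shown as "..." in the output).
log_text = """
{
  "test/mean_score": 0.9150393806777066,
  "test/sim_max_reward_4300000": 1.0,
  "test/sim_max_reward_4300001": 0.9872969750774386,
  "train/sim_video_1": "data/pusht_eval_output//media/2fo4btlf.mp4"
}
"""


def summarize_eval_log(log, prefix="test/sim_max_reward_"):
    """Hypothetical helper: collect rewards whose key starts with `prefix`
    and return them (keyed by the key suffix) together with their mean."""
    rewards = {
        key[len(prefix):]: value
        for key, value in log.items()
        if key.startswith(prefix)
    }
    mean = sum(rewards.values()) / len(rewards) if rewards else None
    return rewards, mean


log = json.loads(log_text)
rewards, mean_of_visible = summarize_eval_log(log)

print("reported test/mean_score:", log["test/mean_score"])
print("visible key suffixes:", sorted(rewards))
print("mean over visible entries:", mean_of_visible)
```

To run this against a real file, replace `json.loads(log_text)` with `json.load(open(path))`, where `path` points at your own `eval_log.json`.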
2024-08-25: Added related works published before 25/08/2024 to this GitHub repository. 📌 About new works: if you want your studies on diffusion model-based image processing (e.g., a link to the paper or project) included in this repository, you are welcome to raise an issue or email us. We will in...
"train/sim_video_1": "data/pusht_eval_output//media/2fo4btlf.mp4" }
🦾 Demo, Training and Eval on a Real Robot
Make sure your UR5 robot is running and accepting commands from its network interface (emergency stop button within reach at all times), your RealSense cameras plugged in to yo...
[Paper] [Github] 17 Jun 2022
A Flexible Diffusion Model. Weitao Du, Tao Yang, He Zhang, Yuanqi Du. arXiv 2022. [Paper] 17 Jun 2022
Lossy Compression with Gaussian Diffusion. Lucas Theis, Tim Salimans, Matthew D. Hoffman, Fabian Mentzer. arXiv 2022. [Paper] 17 Jun 2022 ...
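Code URL: https://github.com/lucidrains/magvit2-pytorch TL;DR: A 2023 paper from Google and CMU introducing the video tokenizer MAGVIT-v2, which aims to produce concise yet expressive tokens for both videos and images using a shared vocabulary. As the paper's title indicates, this work is the first to show LLM-based generation outperforming diffusion models on the ImageNet dataset.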
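Video generation needs little introduction: Sora, released by OpenAI at the start of the year, is bound to make video generation a hot area and drive a series of studies in this direction. On the other hand, based on my own survey, there is still not much work on video generation, leaving considerable room to explore. For top-conference papers on video generation, see the paper list collected in my GitHub repo: Video Generation Paper List (https://github.com/AlonzoLeeeooo/aw...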
# Clone this repository.
git clone https://github.com/alibaba/Tora.git
cd Tora
# Install PyTorch (we use PyTorch 2.4.0) and torchvision following the official instructions: https://pytorch.org/get-started/previous-versions/. For example:
conda create -n tora python==3.10
conda activate tora
conda inst...
git clone https://github.com/ali-vilab/VGen.git
cd VGen
Getting Started with VGen
(1) Train your text-to-video model
Enabling distributed training is as easy as executing the following command:
python train_net.py --cfg configs/t2v_train.yaml
In the t2v_train.yaml configuratio...
notebooks/stable-video-diffusion/stable-video-diffusion.ipynb
@@ -9,7 +9,7 @@
"# Image to Video Generation with Stable Video Diffusion\n",
Collaborator nikita-savelyevv (May 28, 2024): How about "That's why we use weight compression" -> "That's why we use only weight ...