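Paper: "DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models". Paper link: https://arxiv.org/abs/2402.12289 Project link: https://tsinghua-mars-lab.github.io/DriveVLM/ The overall DriveVLM pipeline is shown in Figure 1: consecutive image frames are encoded and passed through a feature alignment module to interact with the LMM; starting from scene description...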
@@ -168,7 +179,7 @@ DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Mo <!-- --> <!-- --> <iframe width="900" height="506" src="https://www.youtube.com/embed/mt-SdHTTZzA?si=ZnbL5B_FNtdumFlE" title="YouTube video player" frameborder="0" ...
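DriveVLM is built on a large vision-language model and paired with an end-to-end model to form a dual system. It performs well in complex driving scenarios and is the first large autonomous-driving model to be deployed on a vehicle. The paper was recently accepted to CoRL 2024. Paper link: https://arxiv.org/abs/2402.12289 Project link: https://tsinghua-m...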
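Project homepage: https://tsinghua-mars-lab.github.io/DriveVLM/ Author: 自动驾驶专栏 | Original source: WeChat public account 【自动驾驶专栏】 Abstract: This article introduces DriveVLM: the convergence of autonomous driving and large vision-language models. A major obstacle to autonomous driving in urban environments is understanding complex, long-tail scenarios, such as challenging road conditions and subtle human behaviors. To address this, the paper introduces DriveVLM, which leverages...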
Commit e394cda: 1 changed file with 0 additions and 0 deletions. Binary file DriveVLM.pdf removed (8.6 MB).
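Project link: https://tsinghua-mars-lab.github.io/DriveVLM/ The overall DriveVLM pipeline is shown in Figure 1: consecutive image frames are encoded and passed through a feature alignment module to interact with the LMM; the VLM's reasoning is guided starting from scene description, first through the static scene (time, scene, lane environment), then through the key obstacles that influence the driving decision; ...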
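The guided ordering described above (static scene first, then key obstacles) can be illustrated with a minimal Python sketch. This is not the DriveVLM implementation: the names `PromptStage`, `build_staged_prompts`, `run_stages`, and `answer_fn`, along with all prompt wording, are hypothetical and chosen only for illustration. Image encoding and feature alignment are left out entirely; `answer_fn` stands in for whatever model call would answer each prompt.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Static-scene items named in the text; the English labels are our own rendering.
STATIC_SCENE_FIELDS = ("time", "scene", "lane environment")


@dataclass
class PromptStage:
    """One step of the guided reasoning sequence (hypothetical structure)."""
    name: str
    prompt: str


def build_staged_prompts(static_fields=STATIC_SCENE_FIELDS) -> List[PromptStage]:
    """Order the prompts as described: static scene first, key obstacles last."""
    stages = [
        PromptStage(name=f"static:{f}",
                    prompt=f"Describe the {f} of the current driving scene.")
        for f in static_fields
    ]
    stages.append(PromptStage(
        name="critical_objects",
        prompt="List the key obstacles that influence the driving decision."))
    return stages


def run_stages(stages: List[PromptStage],
               answer_fn: Callable[[str, List[Tuple[str, str]]], str]
               ) -> List[Tuple[str, str]]:
    """Ask each stage in order, passing earlier answers along as context."""
    context: List[Tuple[str, str]] = []
    for stage in stages:
        # answer_fn receives a copy so it cannot mutate the running context.
        reply = answer_fn(stage.prompt, list(context))
        context.append((stage.name, reply))
    return context
```

In use, `answer_fn` would wrap a real vision-language model; a stub that returns a fixed string is enough to check that the stages run in the intended order and that each stage sees the answers produced before it.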
@@ -137,7 +137,11 @@ DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Mo <!-- <!-- class="icon brands style2 fa-github">Github--> <ahref="https://arxiv.org/abs/2402.12289"class="icon style2 fa-file-pdf" <!-- <!-- target="_blank"> <...