Title: Loong: Generating Minute-level Long Videos with Autoregressive Language Models
Presenter: WANG Yuqing
Presentation Date: November 21, 2024 (Thursday)
Research Areas: Computer vision, video generation
DOI: https://doi.org/10.48550/arXiv.2410.02757
Title: Visual Instruction Tuning
Presenter: ZHU Chenming
Research Areas: Computer vision, multi-modal LLM
DOI: https://doi.org/10.48550/arXiv.2304.08485
Title: LLaVA-OneVision: Easy Visual Task Transfer
DOI: https://doi.org/10.48550/arXiv.2408.03326
Title: LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
DOI: https://doi.org/10.48550/arXiv.2409.18125