Nov 21, 2024

Research Works

Article 1

Title: Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Presenter: WANG Yuqing

Presentation Date: November 21, 2024 (Thursday)

Research Areas: Computer vision, video generation

DOI: https://doi.org/10.48550/arXiv.2410.02757

Article 2

Title: Visual Instruction Tuning

Presenter: ZHU Chenming

Presentation Date: November 21, 2024 (Thursday)

Research Areas: Computer vision, multi-modal LLM

DOI: https://doi.org/10.48550/arXiv.2304.08485

Article 3

Title: LLaVA-OneVision: Easy Visual Task Transfer

Presenter: ZHU Chenming

Presentation Date: November 21, 2024 (Thursday)

Research Areas: Computer vision, multi-modal LLM

DOI: https://doi.org/10.48550/arXiv.2408.03326

Article 4

Title: LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Presenter: ZHU Chenming

Presentation Date: November 21, 2024 (Thursday)

Research Areas: Computer vision, multi-modal LLM

DOI: https://doi.org/10.48550/arXiv.2409.18125
