LMMS Tutorial - Search News

3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer

3D-LLaVA (CVPR 2025) is a 3D Large Multimodal Model that takes point clouds and text instructions as input to perform VQA, Dense Captioning, and 3D Referring Segmentation. At the core of 3D-LLaVA is a new ...
