Organizations

@VITA-MLLM

Pinned

  1. Awesome-Multimodal-Large-Language-Models (Public)

    ✨✨Latest Advances on Multimodal Large Language Models

    14.2k stars, 918 forks

  2. Video-MME (Public)

    ✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    480 stars, 20 forks

  3. VITA-MLLM/VITA (Public)

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python, 2.2k stars, 164 forks

  4. VITA-MLLM/Long-VITA (Public)

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python, 251 stars, 28 forks

  5. Woodpecker (Public)

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Python, 632 stars, 31 forks

  6. shenyunhang/APE (Public)

    [CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

    Python, 552 stars, 42 forks