Skip to content
@360CVGroup

360 AI Research

360人工智能研究院

👋 Who We Are

This is the 360 AI Research, our mission is to lead in tech innovations and deliver real-world values.
We focus on "multimodal + cross-modal learning" and "large model + zero/few shot learning",
conducting research in

  • 🔎 multi-modal comprehension

    • FG-CLIP: ICML2025, new generation of CLIP with strong fine grained discrimination capability
    • LMM-Det: ICCV2025, make large multimodal models excel in object detection
    • IAA: AAAI2025, LMM with plugin mechanism solving catastrophic forgetting
    • 360VL: Large multimodal model, 2nd-gen
    • SEEChat: Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM
    • OVD: KDD2023, open-world object detection, we also co-hosted open vocabulary detection contest 2023 with CSIG(中国图象图形学学会)
    • Zero: ACM MM2023, large scale open-sourced Chinese cross-modal data and benchmark
  • 🎨 multi-modal generation

    • PlanGen: ICCV2025, unified layout planning and image generation
    • Qihoo-T2X: ICLR2025, efficient DiT architecture for text2any tasks
    • BDM: AAAI2025, Chinese-native image generation while compatible with SD eco-system, 1st-gen
    • HiCo: NeurIPS2024, layout controlled image generation
    • FancyVideo: Video generation from text&image, 1st-gen

🛒 Business & API

Check research.360.cn for contact and API portal

🔥 Hiring

Internship: we're hiring research interns in fileds of AIGC, LMM, and inference optimization, check 👉 JD here

Pinned Loading

  1. FG-CLIP FG-CLIP Public

    New generation of CLIP with fine grained discrimination capability, ICML2025

    Python 288 14

  2. Qihoo-T2X Qihoo-T2X Public

    Efficient DiT architecture for text2any tasks, ICLR2025

    450 22

  3. LMM-Det LMM-Det Public

    Make Large Multimodal Models excel in object detection, ICCV 2025

    Python 41 2

  4. PlanGen PlanGen Public

    Unified layout planning and image generation, ICCV2025

    Python 30 1

  5. Inner-Adaptor-Architecture Inner-Adaptor-Architecture Public

    LMM solved catastrophic forgetting, AAAI2025

    Python 44 4

  6. HiCo_T2I HiCo_T2I Public

    Layout Conditioned Image Generation, NeurIPS2024

    Python 61 3

Repositories

Showing 10 of 14 repositories
  • HiCo_T2I Public

    Layout Conditioned Image Generation, NeurIPS2024

    360CVGroup/HiCo_T2I’s past year of commit activity
    Python 61 3 12 0 Updated Sep 3, 2025
  • .github Public

    Introduction to 360 AI Research

    360CVGroup/.github’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Aug 4, 2025
  • LMM-Det Public

    Make Large Multimodal Models excel in object detection, ICCV 2025

    360CVGroup/LMM-Det’s past year of commit activity
    Python 41 Apache-2.0 2 1 0 Updated Aug 1, 2025
  • FG-CLIP Public

    New generation of CLIP with fine grained discrimination capability, ICML2025

    360CVGroup/FG-CLIP’s past year of commit activity
    Python 288 Apache-2.0 14 26 0 Updated Jul 29, 2025
  • WISA Public

    World Simulator Assistant for Physics-Aware Text-to-Video Generation

    360CVGroup/WISA’s past year of commit activity
    Python 240 Apache-2.0 42 6 0 Updated May 22, 2025
  • FancyVideo Public

    Video generation from text&image, 1st-gen

    360CVGroup/FancyVideo’s past year of commit activity
    Python 922 53 17 0 Updated May 10, 2025
  • Qihoo-T2X Public

    Efficient DiT architecture for text2any tasks, ICLR2025

    360CVGroup/Qihoo-T2X’s past year of commit activity
    450 22 2 0 Updated May 10, 2025
  • RelaCtrl Public

    Efficient controlnet for DiTs

    360CVGroup/RelaCtrl’s past year of commit activity
    Python 380 34 22 0 Updated May 10, 2025
  • Inner-Adaptor-Architecture Public

    LMM solved catastrophic forgetting, AAAI2025

    360CVGroup/Inner-Adaptor-Architecture’s past year of commit activity
    Python 44 Apache-2.0 4 1 0 Updated Apr 15, 2025
  • PlanGen Public

    Unified layout planning and image generation, ICCV2025

    360CVGroup/PlanGen’s past year of commit activity
    Python 30 Apache-2.0 1 4 0 Updated Apr 14, 2025

Top languages

Loading…

Most used topics

Loading…