Xiongkun Linghu

I am currently a research engineer at the Beijing Institute for General Artificial Intelligence (BIGAI). I obtained my M.S. from Tsinghua University in July 2023. Previously, I received my B.S. from Beijing Institute of Technology in July 2020.

I am broadly interested in multimodal foundation models and embodied AI. My long-term goal is to build powerful, reliable, and safe embodied agents for both the digital and physical worlds.

Publications

  1. SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
    ICLR 2026
  2. Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
    CVPR 2025
  3. Multi-modal Situated Reasoning in 3D Scenes
    NeurIPS 2024, Datasets and Benchmarks Track
  4. An Embodied Generalist Agent in 3D World
    ICML 2024
  5. Switchable Representation Learning Framework with Self-Compatibility
    CVPR 2023

Preprint

  1. Bayesian Evidential Learning for Few-Shot Classification
    arXiv, 2022

Service

    • Reviewer: I serve as a reviewer for NeurIPS, ICML, ICLR, CVPR, and ECCV

Experience

    • 2021.12 - 2022.7, Intern, Huawei, Few-shot learning and uncertainty modeling
    • 2021.5 - 2021.11, Intern, Megvii, Few-shot learning