Xiongkun Linghu

I was a senior research engineer at Beijing Institute for General Artificial Intelligence(BIGAI) from July 2023 to March 2026. In BIGAI, I was advised by Dr. Siyuan Huang and Dr. Baoxiong Jia. I obtained my M.S. from Tsinghua University in July 2023. Previously, I received my B.S. from Beijing Institute of Technology in July 2020.

I am generally interested in multimodal foundation models and embodied AI. My long-term goal is building powerful, reliable and safe embodied agents in the digital and physical world.

News

  • May 2026: 3D-RFT is accepted by ICML 2026.
  • January 2026: SceneCOT is accepted by ICLR 2026.

Publications

  1. 3D-RFT.png
    3D-RFT: Reinforcement Fine-Tuning for Video-based 3D Scene Understanding
    Xiongkun Linghu*, Jiangyong Huang*, Baoxiong Jia, and Siyuan Huang
    ICML 2026
  2. scenecot.png
    SceneCOT: Eliciting Grounded Chain-of-Thought Reasoning in 3D Scenes
    ICLR 2026
  3. beacon3d.png
    Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
    CVPR 2025
  4. msr3d.png
    Multi-modal Situated Reasoning in 3D Scenes
    NeurIPS, Datasets and Benchmarks Track, 2024
  5. LEO.png
    An Embodied Generalist Agent in 3D World
    ICML, 2024
  6. SFSC.png
    Switchable representation learning framework with self-compatibility
    CVPR, 2023

Preprint

  1. bel.jpg
    Bayesian Evidential Learning for Few-Shot Classification
    arxiv, 2022

Service

    • Reviewers: I serve as the reviewer for NeurIPS, ICML, ICLR, CVPR, and ECCV

Experience

    • 2023.7 - 2026.3, Research Engineer, BIGAI, Multimodal LLM, Embodied AI, 3D Vision
    • 2021.12 - 2022.7, Intern, Huawei, Few-shot Learning and Uncertainty Modeling
    • 2021.5 - 2021.11, Intern, Megvii, Few-shot Learning

Education

    • 2020.8 - 2023.6, M.S., Department of Electronic Engineering, Tsinghua University
    • 2016.8 - 2020.6, B.Eng, School of Information and Electronics, Beijing Institute of Technology