Longxin Kou - 寇龙馨 | AI Researcher

About Me

I am a third-year master's student at the Deep Reinforcement Learning (DRL) Lab, Tianjin University, advised by Prof. Jianye Hao and co-advised by Prof. Yan Zheng. I received my Bachelor's degree in Internet of Things Engineering from the School of Information Science and Engineering at Hunan University in June 2023.

Research Interests

Embodied AI, Video Understanding, Multimodal Large Language Models, Video Large Language Models, Robot Manipulation

Educational Experience

Master - Tianjin University (2023.9 ~ Present)

Department: College of Intelligence and Computing

Major: Software Engineering

Lab: Deep Reinforcement Learning (DRL) Lab

Bachelor - Hunan University (2019.9 ~ 2023.7)

Department: School of Information Science and Engineering

Major: Internet of Things Engineering

Scholarships: National Scholarship (2022.10), Aiwan Medical Scholarship (2023.04)

Honors: University Merit Student ("Three Good Student") Honor, University Outstanding Graduate

Latest News

June 26, 2025

🎉 Our paper "RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration" has been accepted at ICCV 2025!

September 26, 2024

🎉 Our paper "PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation" has been accepted at NeurIPS 2024!

May 2, 2024

🎉 Our paper "KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations" has been accepted at ICML 2024!

February 27, 2024

🎉 Our paper "Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts" has been accepted at CVPR 2024!

Publications

RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration

Longxin Kou, Fei Ni, Yan Zheng, Peilong Han, Jinyi Liu, Haiqin Cui, Rui Liu, Jianye Hao

International Conference on Computer Vision (ICCV), 2025. CCF-A

PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation

Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Yifu Yuan, Zibin Dong, Jinyi Liu, Mingzhi Li, Yuzheng Zhuang, Yan Zheng

The Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS), 2024. CCF-A

KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations

Longxin Kou, Fei Ni, Yan Zheng, Jinyi Liu, Yifu Yuan, Zibin Dong, Jianye Hao

The Forty-first International Conference on Machine Learning (ICML), 2024. CCF-A

Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts

Fei Ni, Jianye Hao, Shiguang Wu, Longxin Kou, Jiashun Liu, Yan Zheng, Bin Wang, Yuzheng Zhuang

The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024. CCF-A