Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile
Kun Shao@ICLR2025

@shaokun1991

Project Manager of London AI Agent. Leading the AI Agent team. Principal Research Scientist at Noah’s Ark Lab, Huawei. AI Agents, LLMs, RL, MAS, Robot Learning.

ID: 4714109953

calendar_today05-01-2016 14:53:31

54 Tweet

244 Followers

490 Following

Haitham Bou Ammar (@hbouammar) 's Twitter Profile Photo

As you know, we are organising a #Neurips2023 competition on robotics, where we are putting all those #MachineLearning claims to the test! The real-test people! I wrote a blog on the competition and the results so far we have on the leaderboard! Check it out:

As you know, we are organising a #Neurips2023 competition on robotics, where we are putting all those #MachineLearning claims to the test! The real-test people! I wrote a blog on the competition and the results so far we have on the leaderboard! Check it out:
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

#ICCV2023 Traj-MAE: Masked Autoencoders for Trajectory Prediction. Joined work with my co-supervised PhD students at CUHK. jiazewang.com/projects/trajm…

#ICCV2023 Traj-MAE: Masked Autoencoders for Trajectory Prediction. Joined work with my co-supervised PhD students at CUHK. 
jiazewang.com/projects/trajm…
Xidong Feng (@xidong_feng) 's Twitter Profile Photo

#NeurIPS2023 Happy to share our new NeurIPS 2023 paper: ChessGPT: Bridging Policy Learning and Language Modeling. We present how we conduct policy learning and language modeling in GPT model simultaneously in Chess! Paper: openreview.net/forum?id=pvdm4… Code: github.com/waterhorse1/Ch…

#NeurIPS2023 Happy to share our new NeurIPS 2023 paper: ChessGPT: Bridging Policy Learning and Language Modeling. We present how we conduct policy learning and language modeling in GPT model simultaneously in Chess!

Paper: openreview.net/forum?id=pvdm4…
Code: github.com/waterhorse1/Ch…
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

[1/3] Happy to share our new GUI Agent paper: SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation. Key contributions: (1) A diverse set of tasks; (2) A plug-and-play Agent framework; (3) A novel evaluation pipeline! Project page: ai-agents-2030.github.io/SPA-Bench/

[1/3] Happy to share our new GUI Agent paper: SPA-Bench: A Comprehensive Benchmark for Smartphone Agent Evaluation. Key contributions: (1) A diverse set of tasks; (2) A plug-and-play Agent framework; (3) A novel evaluation pipeline! 
Project page: ai-agents-2030.github.io/SPA-Bench/
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

[2/3] Happy to share our new GUI Agent paper: Lightweight Neural App Control. We propose LiMAC, an architecture that balances efficiency and natural language understanding by combining a lightweight transformer with a fine-tuned VLM. Project page: arxiv.org/abs/2410.17883

[2/3] Happy to share our new GUI Agent paper: Lightweight Neural App Control. We propose LiMAC, an architecture that balances efficiency and natural language understanding by combining a lightweight transformer with a fine-tuned VLM. 
Project page: arxiv.org/abs/2410.17883
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

We are #hiring for AI Agent research scientist, research engineer, research intern at Huawei Noah’s Ark Lab. Message me if you're interested in joining our team. We are attending Conference on NeurIPS 2024 if you would like to meet! #NeurIPSConference

We are #hiring for AI Agent research scientist, research engineer, research intern at Huawei Noah’s Ark Lab. Message me if you're interested in joining our team. We are attending Conference on NeurIPS 2024 if you would like to meet! #NeurIPSConference
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

#ICLR2025 Kicking off with GUI Agents' real-world needs, we've explored action model design, reinforcement fine-tuning, and benchmarking. If you're into GUI Agents, let's collaborate! Full-time and internship positions are open. #GUIAgents #AIAgent

#ICLR2025 Kicking off with GUI Agents' real-world needs, we've explored action model design, reinforcement fine-tuning, and benchmarking. If you're into GUI Agents, let's collaborate! Full-time and internship positions are open. #GUIAgents #AIAgent
Filippos Christianos (@f_christianos) 's Twitter Profile Photo

Excited to share that our paper "Lightweight Neural App Control (LiMAC)" is accepted to #ICLR2025! 🎉 LiMAC enables app control on mobile devices using small architectures (<2B params), offering a lightweight alternative to large frameworks like OpenAI's Operator. 🧵👇 (1/3)

Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

I'll attend ICLR 2025 in Singapore! Three papers (2 spotlight) at the main conference. If you want to chat about RL/Agent/LLM, pls DM me! #ICLR2025

I'll attend ICLR 2025 in Singapore!
Three papers (2 spotlight) at the main conference.
If you want to chat about RL/Agent/LLM, pls DM me! #ICLR2025
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

🚀 ViMo – The First Visual World Model for App Agents 📍 Project page: ai-agents-2030.github.io/ViMo/ 🎯 ViMo fills a critical gap by bringing vision-language understanding into mobile interaction, enabling agents to simulate, reason, and plan in mobile environments.

🚀 ViMo – The First Visual World Model for App Agents
📍 Project page: ai-agents-2030.github.io/ViMo/
🎯 ViMo fills a critical gap by bringing vision-language understanding into mobile interaction, enabling agents to simulate, reason, and plan in mobile environments.
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

🚀 VSC-RL, New VLM agents 🤖 for resolving mobile device 📱 and web 🖥️ control tasks. 🔗Website: ai-agents-2030.github.io/VSC-RL/ VSC-RL autonomously decomposes complex goals into feasible subgoals, boosting learning efficiency and performance.

🚀 VSC-RL, New VLM agents 🤖 for resolving mobile device 📱 and web 🖥️ control tasks.
🔗Website: ai-agents-2030.github.io/VSC-RL/
VSC-RL autonomously decomposes complex goals into feasible subgoals, boosting learning efficiency and performance.
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

Delighted to announce that three of our papers have been accepted at #NeurIPS2025 ! This was a true team effort, and I want to thank all of our collaborators for their invaluable contributions. Onward and upward in the world of AI Agents and Agentic RL!

Delighted to announce that three of our papers have been accepted at #NeurIPS2025 ! 
This was a true team effort, and I want to thank all of our collaborators for their invaluable contributions.
Onward and upward in the world of AI Agents and Agentic RL!
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

Happy to share that we've had 3 papers accepted at #AAAI2026! A huge thank you to all our collaborators—this success wouldn't be possible without your incredible contributions!

Happy to share that we've had 3 papers accepted at #AAAI2026! 
A huge thank you to all our collaborators—this success wouldn't be possible without your incredible contributions!
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

Darwin Mobile Agent: the 1st FULLY OPEN-SOURCE, end-to-end pipeline for large-scale online RL and inference of mobile GUI agents. ai-agents-2030.github.io/Darwin-Mobile-… #ReinforcementLearning #GUIAgents #DarwinAgent

Darwin Mobile Agent: the 1st FULLY OPEN-SOURCE, end-to-end pipeline for large-scale online RL and inference of mobile GUI agents.
ai-agents-2030.github.io/Darwin-Mobile-… 
#ReinforcementLearning #GUIAgents #DarwinAgent
Kun Shao@ICLR2025 (@shaokun1991) 's Twitter Profile Photo

#ICLR+3 🎉 Thrilled to announce that ViMo has been accepted! As the first multimodal world model in the GUI Agent domain, it further validates the emerging paradigm for future personal AI agents: GUI Agent + GenUI.

#ICLR+3 🎉 Thrilled to announce that ViMo has been accepted! As the first multimodal world model in the GUI Agent domain, it further validates the emerging paradigm for future personal AI agents: GUI Agent + GenUI.