Selected Robotics Papers from arXiv – November 1, 2025

Selected robotics papers from arXiv! 👏👏👏

#Robotics

RoboOS-NeXT: A Unified Memory-Based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration

Date: October 30, 2025

Authors: Huajie Tan, Cheng Chi, Xiansheng Chen, Yuheng Ji, Zhongxia Zhao, Xiaoshuai Hao, Yaoxu Lyu, Mingyu Cao, Junkai Zhao, Huaihai Lyu, Enshen Zhou, Ning Chen, Yankai Fu, Cheng Peng, Wei Guo, Dong Liang, Zhuo Chen, Mengsi Lyu, Chenrui He, Yulong Ao, Yonghua Lin, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang

Link: http://arxiv.org/abs/2510.26536v1The proliferation of collaborative robots across various tasks and forms presents a core challenge: achieving lifelong adaptability, scalable coordination, and robust scheduling in multi-agent systems. Existing approaches, from vision-language-action (VLA) models to hierarchical frameworks, fall short due to reliance on limited or single-agent memory. This fundamentally restricts their ability to learn over the long term, scale to heterogeneous teams, or recover from failures, highlighting the need for a unified memory representation. To address these limitations, we introduce RoboOS-NeXT, a unified memory-based framework for lifelong, scalable, and robust multi-robot collaboration. The core of RoboOS-NeXT is a novel Space-Time-Form (STEM) memory that integrates spatial scene geometry, temporal event history, and morphological profiles into a shared representation. This memory-centric design is integrated into a brain-cerebellum framework, where high-level brain models perform global planning by retrieving and updating STEM, while low-level controllers execute local actions. This closed-loop of cognition, memory, and execution enables dynamic task allocation, fault-tolerant collaboration, and consistent state synchronization. We conducted extensive experiments on complex coordination tasks in restaurants, supermarkets, and homes. Our results demonstrate that RoboOS-NeXT achieves superior performance across heterogeneous forms, validating its effectiveness in realizing lifelong, scalable, and robust multi-robot collaboration. Project website: https://flagopen.github.io/RoboOS/.

Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Operations

Date: October 30, 2025

Authors: Qianyou Zhao, Yuliang Shen, Xuanran Zhai, Ce Hao, Duidi Wu, Jin Qi, Jie Hu, Qiaojun Yu

Link: http://arxiv.org/abs/2510.26670v1In visual motion policy learning, diffusion-based imitation learning has gained widespread application due to its ability to capture diverse behaviors. However, schemes based on ordinary and stochastic denoising processes struggle to achieve rapid sampling and robust multi-modality simultaneously. To address these challenges, we propose the Hybrid Consistency Policy (HCP). HCP runs a short random prefix until an adaptive switching time, then applies a one-step consistency jump to generate the final action. To maintain consistency with this jump generation, HCP performs temporal consistency distillation, which combines trajectory consistency objectives to maintain coherence among adjacent predictions and denoising matching objectives to enhance local fidelity. On both simulated and real robots, HCP achieves near 80-step DDPM teacher accuracy and mode coverage using 25 SDE steps plus one jump, while significantly reducing latency. These results indicate that multi-modality does not require slow inference, and switching time decouples mode retention from speed, providing practical accuracy and efficiency trade-offs for robotic policies.

Running Variable Length Arrays at Real-Time Speeds

Date: October 30, 2025

Authors: Yunchao Ma, Yizhuang Zhou, Yunhuan Yang, Tiancai Wang, Haoqiang Fan

Link: http://arxiv.org/abs/2510.26742v1In this paper, we demonstrate how to run pi0-level multi-view VLA at 30Hz frame rate and up to 480Hz trajectory frequency using a single consumer-grade GPU. This enables dynamic and real-time tasks that were previously thought unachievable with large VLA models. To achieve this, we introduce a series of strategies to eliminate overhead in model inference. Practical experiments show that the pi0 policy using our strategies achieves a 100% success rate in the task of grasping a dropped pen. Based on these results, we further propose a fully streaming inference framework for real-time robotic control of VLA. The code can be found at https://github.com/Dexmal/realtime-vla.

Thor: Human-Level Whole-Body Response Technology for High-Intensity Contact Environments

Date: October 30, 2025

Authors: Gangyang Li, Qing Shi, Youhao Hu, Jincheng Hu, Zhongyuan Wang, Xinlong Wang, Shaqi Luo

Link: http://arxiv.org/abs/2510.26280v1Humanoid robots have tremendous potential in service, industrial, and rescue applications, where they must maintain whole-body stability during dense, contact-rich interactions with the environment. However, generating human-like, adaptive responses in humanoid robots under such conditions remains a major challenge. To address this issue, we propose Thor, a humanoid robot framework for achieving human-level whole-body responses in contact-rich environments. Based on force analysis of the robot, we design a Force-Adaptive Trunk Tilt (FAT2) reward function to encourage humanoid robots to exhibit human-like responses in force interaction tasks. To alleviate the high-dimensional challenges in humanoid control, Thor introduces a reinforcement learning architecture that decouples the upper body, waist, and lower body. Each component shares global observations of the whole body and jointly updates its parameters. Finally, we deploy Thor on the Unitree G1, which significantly outperforms the baseline in force interaction tasks. Specifically, when moving backward, the robot achieved a maximum pulling force of 167.7 N (approximately 48% of G1’s weight), and 145.5 N when moving forward, representing improvements of 68.9% and 74.7% over the best-performing baseline, respectively. Additionally, Thor is capable of pulling a shelf loaded with heavy objects (130 N) and opening a fire door with one hand (60 N). These results highlight Thor’s effectiveness in enhancing the force interaction capabilities of humanoid robots.

Heuristically Adjusting Potentially Mis-Specified Domain Supports for Likelihood-Free Inference in Stochastic Dynamical Systems

Date: October 30, 2025

Authors: Georgios Kamaras, Craig Innes, Subramanian Ramamoorthy

Link: http://arxiv.org/abs/2510.26656v1In robotics, likelihood-free inference (LFI) can provide parameterized deployment conditional domain distributions adapted for learning agents. LFI assumes an arbitrary sampling support that remains unchanged while iteratively refining an initial generic prior to obtain a more descriptive posterior. However, a potentially mis-specified support can lead to suboptimal but falsely confident posteriors. To address this issue, we propose three heuristic LFI variants: EDGE, MODE, and CENTRE. Each variant interprets posterior mode shifts in its own way and adjusts the support alongside posterior inference when integrated into the LFI step. We first reveal the support mis-specification problem and evaluate our heuristic methods using stochastic dynamics benchmarks. We then assess the impact of heuristic support adaptation on parameter inference and policy learning in dynamic deformable linear object (DLO) manipulation tasks. The inference results provide finer classifications of length and stiffness for a set of parameterized DLOs. When the obtained posterior is used as the domain distribution for simulation-based policy learning, they lead to more robust object-centric agent performance.

REALMS2 – Resilient Exploration and Lunar Mapping System 2 – A Comprehensive Approach

Date: October 30, 2025

Authors: Dave van der Meer, Loïck P. Chovet, Gabriel M. Garcia, Abhishek Bera, Miguel A. Olivares-Mendez

Link: http://arxiv.org/abs/2510.26638v1The European Space Agency (ESA) and the European Space Resources Innovation Centre (ESRIC) jointly launched the Space Resources Challenge, inviting researchers and companies to propose innovative solutions for multi-robot systems (MRS) in space exploration. This paper presents a framework called Resilient Exploration And Lunar Mapping System 2 (REALMS2) for planetary exploration and mapping. The system is based on Robot Operating System version 2 (ROS 2) and enhances visual simultaneous localization and mapping (vSLAM) capabilities to generate maps. REALMS2 employs a mesh network to achieve robust ad-hoc networking. A single graphical user interface (GUI) controls all rovers, providing a simple overview of robotic tasks. The system is designed for heterogeneous multi-robot exploration tasks, addressing the challenges posed by extraterrestrial environments. REALMS2 was utilized in the second round of field tests for the ESA-ESRIC challenge, allowing three homogeneous rovers to map approximately 60% of the area while handling communication delays and interruptions.

Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments

Date: October 30, 2025

Authors: Xiaoyi He, Danggui Chen, Zhenshuo Zhang, Zimeng Bai

Link: http://arxiv.org/abs/2510.26646v1This paper presents a hierarchical path planning and control framework that combines a low-level deep Q-network (DQN) for discrete sub-goal selection and a low-level twin delayed deep deterministic policy gradient (TD3) controller for continuous actions. The high-level module selects behaviors and sub-goals; the low-level module executes smooth velocity commands. We design a practical reward shaping scheme (direction, distance, obstacle avoidance, action smoothness, collision penalties, time penalties, and progress) and a laser-based safety gate to prevent unsafe actions. The system is implemented on ROS + Gazebo (TurtleBot3) and evaluated using PathBench metrics, including success rate, collision rate, path efficiency, and replanning efficiency in dynamic and partially observable environments. Experiments show improved success rates and sample efficiency compared to single algorithm baselines (DQN or TD3 alone) and rule-based planners, with better generalization to unseen obstacle configurations and smoother control variations. Code and evaluation scripts can be found in the project repository.

Sliding Window Filter for Online Continuous-Time State Estimation of Continuum Robots

Date: October 30, 2025

Authors: Spencer Teetaert, Sven Lilge, Jessica Burgner-Kahrs, Timothy D. Barfoot

Link: http://arxiv.org/abs/2510.26623v1Stochastic state estimation methods for continuum robots (CRs) often struggle to balance accuracy and computational efficiency. While recent studies have explored sliding window formulations for CRs, these methods are limited to simplified discrete-time approximations and do not provide stochastic representations. In contrast, current stochastic filtering methods must run at measurement speeds, limiting their full potential. Recent research on continuous-time estimation techniques for CRs has shown a principled approach to addressing this runtime constraint, but is currently limited to offline operation. In this work, we propose a Sliding Window Filter (SWF) for continuous-time state estimation of CRs that improves filter accuracy while enabling continuous-time methods to run online and at speeds faster than real-time. This represents the first stochastic SWF designed for CRs, providing a promising direction for future research in the field.

FLYINGTRUST: A Quadrotor Navigation Benchmark Across Scenes and Aircraft

Date: October 30, 2025

Authors: Gang Li, Chunlei Zhai, Teng Wang, Shaun Li, Shangsong Jiang, Xiangwei Zhu

Link: http://arxiv.org/abs/2510.26588v1Visual navigation algorithms often exhibit significant variations in performance on quadrotor drones when transferred to different platforms and scene geometries, increasing the cost and risk of field deployment. To support systematic early evaluation, we introduce FLYINGTRUST, a high-fidelity, configurable benchmarking framework for measuring how platform dynamics and scene structure jointly affect navigation robustness. FLYINGTRUST uses two compact, physically interpretable metrics to simulate vehicle capabilities: maximum thrust-to-weight ratio and maximum axial angular acceleration. The benchmark matches a diverse scene library with a set of heterogeneous entities and virtual platforms, specifying standardized evaluation protocols and a comprehensive scoring method that balances scene importance, platform importance, and performance stability. We used FLYINGTRUST to compare mainstream navigation methods based on optimization and learning under the same conditions, conducting repeated trials for each platform-scene combination and reporting uncertainty-aware metrics. Results indicate systematic patterns: navigation success can reasonably depend on platform capabilities and scene geometries, with different algorithms exhibiting varying preferences and failure modes under evaluation conditions. These observations highlight the practical necessity of incorporating platform capabilities and scene structures into algorithm design, evaluation, and selection, motivating future research on methods that maintain robustness across different platforms and scenes.

Adaptive Inverse Kinematics Framework for Learning Variable-Length Tool Manipulation in Robots

Date: October 30, 2025

Authors: Prathamesh Kothavale, Sravani Boddepalli

Link: http://arxiv.org/abs/2510.26551v1Traditional robots have limited understanding of their kinematics and are constrained to pre-programmed tasks, hindering their ability to effectively utilize tools. Driven by the necessary components of tool use—grasping expected outcomes, selecting the most suitable tool, determining optimal tool orientation, and executing precise operations—we introduce a groundbreaking framework. Our novel approach extends the capabilities of robotic inverse kinematics solvers to acquire a range of continuous actions using tools of varying lengths. By integrating simulated learned action trajectories with tools, we demonstrate the practicality of transferring acquired skills from simulation to real-world scenarios through comprehensive experiments. Impressively, our extended inverse kinematics solver shows an error rate of less than 1 cm. Notably, our training strategy achieves an average error of 8 cm in simulation. Remarkably, when using two different lengths of tools, our model achieves nearly indistinguishable performance. This research provides potential advancements in exploring the four fundamental aspects of tool use, enabling robots to master the complex art of manipulating tools across multiple tasks.

Leave a Comment