Paper Reading: Robot learning 2 • Xiaohei's Blog

Rating Criteria

Mediocre paper

Regular paper

Widely recognized or eye-catching paper

Foundational paper or one I really love

Undoubtedly the best paper

RL-BASED DRONE PAPER READING

Planar Pose Graph#

Arxiv ID 2209.08543

幻觉翻译 2209.08543

A stability framework for planar pose optimization

A stability framework for planar pose optimization. By decoupling the problem into two subproblems and analyzing them, it converts the original matrix-form problem into a linear one (simpler than the matrix formulation), and finally optimizes the target by solving the linear problem.

推荐指数：

Whole-Body Scale Optimization#

Arxiv ID 2208.06331

幻觉翻译 2208.06331

Accurate whole-body collision trajectory prediction with linear scaling

show It proposes an accurate whole-body collision formulation with linear scaling, and derives analytic gradients for trajectory optimization.

推荐指数：

UAV Payload Transportation#

Arxiv ID 2310.15050

幻觉翻译 2310.15050

Planning and control for complex structures

show1 Control of the UAV attitude and the payload attitude during heavy-load transportation, enabling automatic swing suppression.

推荐指数：

Robotic Relative Localization#

Arxiv ID 2210.08265

幻觉翻译 2210.08265

Swarm localization

show2 Relative localization for a robot swarm with mutual observations. It addresses recovering relative poses for partially mutually observed robot groups and proposes a robust, scalable algorithm.

推荐指数：

Path Planning#

Arxiv ID 2209.13159

幻觉翻译 2209.13159

Path planning

show3 Topic: improve view-path planning efficiency for autonomous implicit reconstruction and enhance reconstruction quality of target images. Methods: (1) approximate information-gain fields (2) hybrid representations (3) a new path-planning strategy.

推荐指数：

GS-Planner#

Arxiv ID 2405.10142

幻觉翻译 2405.10142

Gaussian-Splatting-based Planning Framework

show4 3D Gaussian reconstruction (mapping + path planning, with emphasis on mapping): it proposes complete and mapping-quality evaluation metrics for 3D Gaussian mapping, and designs a sampling-based active view-planning algorithm to guide reconstruction of unobserved regions and improve mapping quality.

推荐指数：

Back to Newton’s Laws#

Arxiv ID 2407.10648

幻觉翻译 2407.10648

Vision-based optimal control

show5 This is not a paper in the reinforcement learning scope. As its title suggests, it goes back to Newton’s laws and uses physical approaches to achieve high-speed UAV flight, still based on vision algorithms.

推荐指数：

ARiADNE#

Arxiv ID 2301.11575

幻觉翻译 2301.11575

Attention-based reinforcement learning method

show6 ARiADNE is an attention-based RL method for autonomous exploration in unknown robot environments, addressing short-sighted planning. Key contribution: use an attention network to learn multi-scale spatial dependencies in maps, combined with SAC to achieve non-myopic path decisions. It outperforms frontier-based methods, sampling-based methods, and CNN-DRL baselines in exploration efficiency, and is validated for practicality in ROS simulation.

推荐指数：

CTSAC#

Arxiv ID 2503.14254

幻觉翻译 2503.14254

Transformer + SAC with periodic-review curriculum learning and LiDAR clustering optimization

show7 To address weak environment reasoning, slow convergence, and hard sim-to-real transfer in goal-directed autonomous robot exploration, it proposes CTSAC: it embeds a Transformer into the SAC perception network to use history information and improve foresight, designs a periodic-review curriculum learning scheme to mitigate catastrophic forgetting and accelerate training, and optimizes LiDAR clustering to narrow the sim-to-real gap. ROS-Gazebo simulation and real-robot experiments show higher exploration success rate and efficiency than traditional non-learning methods and mainstream learning-based algorithms.

推荐指数：

DARE#

Arxiv ID 2410.16687

幻觉翻译 2410.16687

Diffusion policy + attention map encoder to generate exploration paths from optimal expert demonstrations

show8 It proposes DARE, a generative method for autonomous robot exploration. It uses an attention encoder to extract environment-map features, and a diffusion-policy network to learn exploration patterns from optimal expert demonstrations. It can infer the structure of unknown regions based on local environment beliefs and generate explicit long-horizon planned paths. Simulation and real-world deployment show exploration efficiency comparable to mainstream traditional and learning-based planners, with strong generalization and sim-to-real transfer.

推荐指数：