Abstract:
This paper presents a Soft Actor-Critic (SAC) algorithm enhanced with an Attraction-Repulsion Model (ARM) guidance mechanism and a Long Short-Term Memory (LSTM) network to address the challenges of low-quality initial samples and unstable training in learning-based industrial assembly, particularly for irregular peg-in-hole tasks. First, to improve early-stage exploration efficiency, an ARM-based guidance strategy is introduced that uses target pose information to steer the robotic arm and accelerate convergence. Second, the policy and value networks of SAC are augmented with LSTM layers, enabling effective use of sequential interaction history to enhance policy learning and stabilize training. Simulations show that the proposed method achieves a 99.4% success rate in assembling a planetary reducer center shaft, reducing the average maximum contact force and torque by 68.8% and 79.2%, respectively, compared with standard SAC. Physical experiments further yield a success rate exceeding 95%, with maximum contact force and torque remaining below 10 N and 1.5 N·m, demonstrating the algorithm's effectiveness and robustness.
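To illustrate the flavor of the ARM guidance described above, the following is a minimal sketch of a classic attraction-repulsion potential-field force computation. All names, gains (`k_att`, `k_rep`), and the influence radius `rho0` are hypothetical placeholders, not the paper's actual formulation: the attractive term pulls the end-effector toward the target pose while the repulsive term pushes it away from nearby obstacles.

```python
import numpy as np

def arm_guidance_force(pos, target, obstacles, k_att=1.0, k_rep=0.5, rho0=0.2):
    """Attraction-Repulsion Model sketch (hypothetical gains):
    attractive pull toward the target plus repulsive push away
    from obstacles inside the influence radius rho0."""
    # Attractive term: linear spring toward the target position
    f = k_att * (np.asarray(target) - np.asarray(pos))
    for obs in obstacles:
        d = np.asarray(pos) - np.asarray(obs)
        rho = np.linalg.norm(d)
        if 0.0 < rho < rho0:
            # Repulsive term is active only within the influence radius
            f += k_rep * (1.0 / rho - 1.0 / rho0) * (1.0 / rho**2) * (d / rho)
    return f
```

In an ARM-guided exploration scheme, such a force vector could bias early actions toward the target pose, which is the role the abstract attributes to the guidance strategy.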