
Learning Virtual Impedance for Control of a Human-Coupled Lower Exoskeleton

Rui HUANG Hong CHENG Hong-liang GUO

黄瑞, 程洪, 郭宏亮. 基于增强学习的下肢助力外骨骼虚阻抗控制[J]. 电子科技大学学报, 2018, 47(3): 321-329. doi: 10.3969/j.issn.1001-0548.2018.03.001
Citation: HUANG Rui, CHENG Hong, GUO Hong-liang. Learning Virtual Impedance for Control of a Human-Coupled Lower Exoskeleton[J]. Journal of University of Electronic Science and Technology of China, 2018, 47(3): 321-329. doi: 10.3969/j.issn.1001-0548.2018.03.001

基于增强学习的下肢助力外骨骼虚阻抗控制

doi: 10.3969/j.issn.1001-0548.2018.03.001

Funds: National Natural Science Foundation of China 61503060

Author Bio: HUANG Rui was born in 1989. He is a Ph.D. candidate whose research interests include exoskeleton robots, reinforcement learning, and deep reinforcement learning.

CLC number: TP273

• Abstract: This paper proposes a variable virtual impedance control algorithm based on reinforcement learning, in which the controller is designed as a virtual impedance controller incorporating a human-exoskeleton interaction model. To adapt to the interaction forces produced by different pilots, the PI2 reinforcement learning algorithm is employed to learn the controller parameters online. The proposed control strategy is validated experimentally on a single-DOF exoskeleton platform and on the HUALEX lower-limb assistive exoskeleton, demonstrating the effectiveness of the proposed control algorithm.
References

• [1] KAZEROONI H, CHU A, STEGER R. That which does not stabilize, will only make us stronger[J]. International Journal of Robotics Research, 2007, 26(1): 75-89. doi: 10.1177/0278364907073779
    [2] SANKAI Y. HAL:Hybrid assistive limb based on cybernics[J]. Robotics Research, 2010:25-34. doi:  10.1007/978-3-642-14743-2_3
    [3] WALSH C J, PALUSKA D, PASCH K, et al. Development of a lightweight, under-actuated exoskeleton for load- carrying augmentation[C]//IEEE International Conference on Robotics and Automation (ICRA). Florida, USA: IEEE, 2006: 3485-3491.
[4] STRAUSSER K A, KAZEROONI H. The development and testing of a human machine interface for a mobile medical exoskeleton[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). California, USA: IEEE, 2011: 4911-4916.
[5] ESQUENAZI A, TALATY M, PACKEL A, et al. The ReWalk powered exoskeleton to restore ambulatory function to individuals with thoracic-level motor-complete spinal cord injury[J]. American Journal of Physical Medicine and Rehabilitation, 2012, 91(11): 911-921. doi: 10.1097/PHM.0b013e318269d9a3
[6] HUANG R, CHENG H, CHEN Q, et al. Interactive learning for sensitivity factors of a human-powered augmentation lower exoskeleton[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). Hamburg, Germany: IEEE, 2015: 6409-6415.
    [7] WALSH C J, PASCH K, HERR H. An autonomous, under-actuated exoskeleton for load-carrying augmentation[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). Hamburg, Germany: IEEE, 2006: 1410-1415.
[8] ZOSS A, KAZEROONI H, CHU A. On the mechanical design of the Berkeley lower extremity exoskeleton (BLEEX)[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). Edmonton, Canada: IEEE, 2005: 3132-3139.
[9] TRAN H T, CHENG H, LIN X, et al. The relationship between physical human-exoskeleton interaction and dynamic factors: using a learning approach for control applications[J]. Science China Information Sciences, 2014, 57(12): 1-13.
[10] KAZEROONI H, STEGER R, HUANG L, et al. Hybrid control of the Berkeley lower extremity exoskeleton (BLEEX)[J]. International Journal of Robotics Research, 2006, 25(6): 561-573.
[11] KAWAMOTO H, SANKAI Y. Power assist method based on phase sequence and muscle force condition for HAL[J]. Advanced Robotics, 2005, 19(7): 717-734. doi: 10.1163/1568553054455103
[12] LEE S, SANKAI Y. Power assist control for walking aid with HAL-3 based on EMG and impedance adjustment around knee joint[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). Lausanne, Switzerland: IEEE, 2002: 1499-1504.
[13] HAYASHI T, KAWAMOTO H, SANKAI Y. Control method of robot suit HAL working as operator's muscle using biological and dynamical information[C]//IEEE International Conference on Intelligent Robots and Systems (IROS). Edmonton, Canada: IEEE, 2005: 3063-3068.
[14] AGUIRRE-OLLINGER G, COLGATE J E, PESHKIN M A, et al. Active-impedance control of a lower-limb assistive exoskeleton[C]//IEEE International Conference on Rehabilitation Robotics. Noordwijk, Netherlands: IEEE, 2007: 188-195.
[15] TRAN H T, CHENG H, DUONG M K, et al. Fuzzy-based impedance regulation for control of the coupled human-exoskeleton system[C]//IEEE International Conference on Robotics and Biomimetics. Bali, Indonesia: IEEE, 2014: 986-992.
[16] KAZEROONI H, RACINE J L, HUANG L, et al. On the control of the Berkeley lower extremity exoskeleton (BLEEX)[C]//IEEE International Conference on Robotics and Automation (ICRA). Barcelona, Spain: IEEE, 2005: 4353-4360.
[17] GHAN J, STEGER R, KAZEROONI H. Control and system identification for the Berkeley lower extremity exoskeleton[J]. Advanced Robotics, 2006, 20(9): 989-1014. doi: 10.1163/156855306778394012
    [18] RACINE J L. Control of a lower extremity exoskeleton for human performance amplification[D]. California, USA: University of California, Berkeley, 2003.
[19] GHAN J, KAZEROONI H. System identification for the Berkeley lower extremity exoskeleton (BLEEX)[C]//IEEE International Conference on Robotics and Automation (ICRA). Florida, USA: IEEE, 2006: 3477-3484.
[20] THEODOROU E A, BUCHLI J, SCHAAL S. A generalized path integral control approach to reinforcement learning[J]. Journal of Machine Learning Research, 2010, 11: 3137-3181.
    [21] BUCHLI J, STULP F, THEODOROU E A, et al. Learning variable impedance control[J]. International Journal of Robotics Research, 2011, 30(7):820-833. doi:  10.1177/0278364911402527
Publication history
  • Received: 2018-03-06
  • Revised: 2018-03-19
  • Published: 2018-05-01

• Many lower exoskeletons have been developed for strength augmentation and walking assistance over the past few decades[1-5]. For strength augmentation applications, lower exoskeletons are designed to track the pilot's motion with little interaction force between the exoskeleton and the pilot[6-8]. The controllers of these lower exoskeletons can be roughly divided into two categories, namely sensor-based controllers and model-based controllers.

For sensor-based controllers, additional sensors are employed to measure the pilot's motion and/or the interaction force between the pilot and the exoskeleton[9-11]. With the measured sensory information, many variants of control strategies can be employed to control the lower exoskeleton, e.g., impedance control strategies. For example, the hybrid assistive limb (HAL) exoskeleton adopts an impedance control strategy, proposed by Sankai, based on measuring electromyography (EMG) signals of the pilot[12]. In the impedance control strategy of the HAL system, EMG signals are utilized to calculate reference patterns of the pilot, which aim at estimating the human-exoskeleton interaction (HEI) between the pilot and the exoskeleton[13]. Furthermore, based on measuring the pilot's motion with acceleration sensors, the active-impedance control strategy[14] and the fuzzy-based impedance control strategy[15] have been proposed to adapt to the changing interaction dynamics among different pilots. However, sensor-based controllers rely heavily on complex sensory systems, which are sometimes unreliable and mostly costly. Hence, sensor-based controllers are, to some extent, limited in most strength augmentation scenarios.

On the other hand, model-based controllers are designed to simplify the sensory system of the exoskeleton and rely only on information from the exoskeleton itself. Sensitivity amplification control (SAC) is a model-based controller proposed for the Berkeley lower extremity exoskeleton (BLEEX)[16]. With a sensitivity factor in the model-based controller, SAC can estimate the output joint torques based on the current states (joint angle, angular velocity and angular acceleration) of the lower exoskeleton. The SAC strategy is able to reduce the interaction force between the pilot and the exoskeleton without measuring it directly, which also reduces the complexity of the exoskeleton sensory system. However, the SAC strategy requires an accurate dynamic model of the lower exoskeleton (it is sensitive to model imperfections and to different pilots), which makes the system identification process quite complicated[17].

In this paper, we propose a novel variable virtual impedance control (VVIC) strategy which inherits the advantages of both sensor-based and model-based controllers. On the one hand, it is a model-based control strategy, which reduces the complexity of the exoskeleton sensory system. On the other hand, we apply a reinforcement learning method based on policy improvement with path integrals (PI2) to learn the parameters of the virtual impedance model, which circumvents the complicated system identification process. The main contributions of this paper can be summarized as follows:

1) A novel VVIC strategy with a model-based controller named the virtual impedance controller (VIC) is proposed, which reduces the requirements on the exoskeleton sensory system;

2) To avoid the complicated system identification process, a reinforcement learning method is utilized to learn/optimize the parameters of the virtual impedance controller in the VVIC strategy;

3) The proposed VVIC strategy is verified on both a single DOF platform and the HUALEX system.

The proposed VVIC strategy is firstly validated on a single DOF exoskeleton platform, and then tested on the HUALEX system. Experimental results show that the proposed VVIC strategy adapts to the different HEI of different pilots better than canonical model-based control strategies.

• This section presents the proposed virtual impedance control strategy. We firstly introduce the design details of the virtual impedance controller in Subsection 1.1. Then, in Subsection 1.2, we analyze the stability of the proposed model-based controller.

• For the control of lower exoskeletons in strength augmentation applications, the pilot always plays the master role in the human-coupled system, which means that the exoskeleton should follow/track the pilot's motion. A general control system block diagram with the model-based controller for the single degree of freedom (DOF) case is depicted in Fig. 1, where G represents the transfer function of the lower exoskeleton, C is the designed model-based controller of the lower exoskeleton, Khm is the impedance between the pilot and the exoskeleton, qe and qh indicate the joint angles of the lower exoskeleton and the pilot, respectively, Thm is the resulting interaction torque applied by the pilot, and Tact is the output torque applied by the lower exoskeleton actuator.

      Figure 1.  A general control system diagram with the model-based controller for the single DOF case

As shown in Fig. 1, the input torque of the lower exoskeleton is the combination of the actuator output torque Tact and the pilot's resulting interaction torque Thm. The design goal of the lower exoskeleton controller is to reduce the interaction torque, which also means that the exoskeleton should track the pilot's motion as quickly as possible.

The traditional impedance controller is typically designed as in Eq. (1), where the pilot's joint angles are taken as inputs to the controller:

      $$ {T_{{\rm{act}}}} = \hat G({\ddot q_e}, {\dot q_e}, {q_e}) - [k({q_h} - {q_e}) + d({\dot q_h} - {\dot q_e})] $$ (1)

where $\hat G $ is the estimated dynamics of the lower exoskeleton, and k and d are the stiffness and damping parameters of the designed impedance model, respectively. However, for the design of model-based controllers, we do not measure sensory information from the pilot; in strength augmentation scenarios, the exoskeleton receives the pilot's joint states only after several control cycles. Therefore, a virtual impedance model is presented for the model-based exoskeleton controller:

      $${q_h} - {q_e} = {k_h}{\dot q_e}$$ (2)
      $${\dot q_h} - {\dot q_e} = {d_h}{\ddot q_e}$$ (3)

      where ${k_h}$ and ${d_h}$ are positive parameters of the virtual impedance model. Hence, the proposed virtual impedance controller can be represented as Eq. (4), which is a model-based controller only based on the states of the lower exoskeleton:

      $${T_{{\rm{act}}}} = \hat G({\ddot q_e}, {\dot q_e}, {q_e}) - ({K_h}{\dot q_e} + {D_h}{\ddot q_e})$$ (4)

      where ${K_h} = k{k_h}$ and ${D_h} = d{d_h}$ are virtual impedance factors of the proposed virtual impedance controller.
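As a minimal sketch of Eq. (4) (in Python, with the estimated dynamics $\hat G$ passed in as a caller-supplied function, which is our assumption rather than the paper's implementation), the control law reads:

```python
def virtual_impedance_torque(q_e, dq_e, ddq_e, K_h, D_h, G_hat):
    """Eq. (4): model-based control law using only exoskeleton states.

    `G_hat(ddq_e, dq_e, q_e)` returns the torque predicted by the
    estimated exoskeleton dynamics; K_h and D_h are the virtual
    impedance factors.
    """
    return G_hat(ddq_e, dq_e, q_e) - (K_h * dq_e + D_h * ddq_e)

# example with a hypothetical gravity-free second-order model
T_act = virtual_impedance_torque(
    q_e=0.3, dq_e=1.2, ddq_e=0.5, K_h=0.5, D_h=0.02,
    G_hat=lambda ddq, dq, q: 0.16 * ddq + 0.45 * dq,
)
```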

• Since the design goal of the lower exoskeleton controller is to reduce the interaction torque, Thm approaches zero, and the stability of the system can be guaranteed by the stability of $ {q_e}/{q_h}$ [18].

      From Fig. 1, the open loop system equation can be represented as:

      $$ {{{q}}_e} = G({T_{{\rm{act}}}} + {T_{{\rm{hm}}}}) $$ (5)

      The proposed virtual impedance controller can be rewritten as:

      $${T_{{\rm{act}}}} = (\hat G - {V_h}){q_e}$$ (6)

      where ${V_h} = {D_h}{s^2} + {K_h}s$. Through the model-based controller in Eq. (6) and the system equation described in Eq. (5), we have:

      $${q_e} = G[(\hat G - {V_h}){q_e} + {K_{{\rm{hm}}}}({q_h} - {q_e})]$$ (7)

      then the equation of ${q_e}/{q_h}$ can be obtained as:

      $$\frac{{{q_e}}}{{{q_h}}} = \frac{{{K_{{\rm{hm}}}}G}}{{1 - G\hat G + ({V_h} + {K_{{\rm{hm}}}})G}}$$ (8)

If the dynamics of the lower exoskeleton are estimated accurately ($G\hat G = 1$), then Eq. (8) can be simplified to:

      $$\frac{{{q_e}}}{{{q_h}}} = \frac{{{K_{{\rm{hm}}}}}}{{{V_h} + {K_{{\rm{hm}}}}}} = \frac{{{K_{{\rm{hm}}}}}}{{{D_h}{s^2} + {K_h}s + {K_{{\rm{hm}}}}}}$$ (9)

Since the virtual impedance parameters ${D_h}$ and ${K_h}$ and the impedance ${K_{{\rm{hm}}}}$ all take positive values, the control system is always stable when the dynamics of the lower exoskeleton are accurately estimated.

The other situation is that an accurate dynamic model of the lower exoskeleton is not available. In this case, we consider a single DOF exoskeleton with second order dynamics and ignore the gravity component, which indicates that:

      $$G = \frac{1}{{J{s^2} + Bs}}$$ (10)

      where J and B represent the inertial moment and viscous friction of the lower exoskeleton, respectively. The estimated exoskeleton dynamics $\hat G$ can be represented as:

      $$\hat G = \hat J{s^2} + \hat Bs$$ (11)

where $\hat J$ and $\hat B$ represent the estimated inertial moment and viscous friction parameters. From Eq. (10) and Eq. (11), the equation of $ {q_e}/{q_h}$ can be represented as:

      $$\frac{{{q_e}}}{{{q_h}}} = \frac{{{K_{{\rm{hm}}}}}}{{({D_h} + J - \hat J){s^2} + ({K_h} + B - \hat B)s + {K_{{\rm{hm}}}}}}$$ (12)

According to Eq. (12), since the virtual impedance parameters ${D_h}$ and ${K_h}$ are always positive, all coefficients of the characteristic polynomial remain positive as long as the dynamic model of the lower exoskeleton is not over-estimated ($\hat J < J$, $\hat B < B$). Hence, the system is always stable when the dynamic model of the lower exoskeleton is not over-estimated.
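As a quick numerical illustration of this condition (all parameter values below are assumptions chosen for illustration, with the model under-estimated as in the later simulation setup), the poles of Eq. (12) can be checked directly:

```python
import numpy as np

# Poles of q_e/q_h in Eq. (12) for the second-order single DOF model.
# All values are illustrative; the estimates are set to 0.9 of the true
# parameters, i.e. the model is not over-estimated.
J, B = 1.0, 0.5                      # true inertia and viscous friction
J_hat, B_hat = 0.9 * J, 0.9 * B      # estimated dynamics
D_h, K_h = 0.2, 0.3                  # positive virtual impedance parameters
K_hm = 50.0                          # human-exoskeleton coupling impedance

# characteristic polynomial (D_h + J - J_hat)s^2 + (K_h + B - B_hat)s + K_hm
poles = np.roots([D_h + J - J_hat, K_h + B - B_hat, K_hm])
print(poles)                         # both poles in the left half plane
```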

• For the implementation of traditional model-based controllers, e.g., SAC in the BLEEX system, a system identification process is usually employed to obtain the system dynamics and the human-related parameters of the designed controller (the sensitivity factors in SAC)[19]. However, the lower exoskeleton is a human-coupled system, which requires the controller to be recalibrated for each pilot.

In this paper, a model-free reinforcement learning method is employed to learn the optimal virtual impedance parameters of the VIC, which aims at adapting to the different HEI of different pilots. Combining the learning process with the model-based controller, named the VVIC strategy, reduces the system sensor requirements as well as the system identification effort. In the reinforcement learning process, a model-free reinforcement learning method named policy improvement with path integrals (PI2)[20-21] is employed to learn the parameters ${K_h}$ and ${D_h}$ of the VIC.

      The parameterized policy of PI2 is defined as:

      $${\mathit{\boldsymbol{p}}_t} = \mathit{\boldsymbol{W}}_t^T(\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} + {\epsilon }_{t})$$ (13)

where $\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} $ is a vector of the virtual impedance parameters ${[{K_h}, {D_h}]^T}$ and ${{\epsilon }_{t}}$ indicates the exploration noise. $\mathit{\boldsymbol{W}}_t^T$ is the basis function with Gaussian kernels $\omega $:

$${[{\mathit{\boldsymbol{W}}_t}]_j} = \frac{{{\omega _j}}}{{\sum\limits_{i = 1}^n {{\omega _i}} }}$$ (14)

Eq. (14) calculates the jth normalized weight, where n is the number of parameters to be learned (n = 2 in our case).
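As an illustrative sketch only (the paper does not specify how the Gaussian kernels are placed, so the phase variable, kernel centers and width below are our assumptions), the weights of Eq. (14) and the policy evaluation of Eq. (13) can be written as:

```python
import numpy as np

def basis_weights(phase, centers, width=0.1):
    """Normalized Gaussian kernel weights of Eq. (14) at one time step.

    `phase` is the position within the gait cycle, in [0, 1]; `centers`
    holds one kernel center per learned parameter (n = 2 here).
    """
    omega = np.exp(-0.5 * ((phase - centers) / width) ** 2)
    return omega / omega.sum()

# Eq. (13): noisy policy evaluation at one time step
Theta = np.array([300.0, 20.0])                # [K_h, D_h], illustrative
eps_t = np.random.normal(scale=[10.0, 1.0])    # exploration noise
W_t = basis_weights(0.3, centers=np.array([0.25, 0.75]))
p_t = W_t @ (Theta + eps_t)                    # W_t^T (Theta + eps_t)
```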

In the implementation of the VVIC strategy, we define the immediate cost based on the measured sensory information of the pilot. For the single DOF case, the immediate cost function is defined as follows:

      $${r_t} = {\alpha _1}{[{q_h}(t) - {q_e}(t)]^2} + {\alpha _2}{[{\dot q_h}(t) - {\dot q_e}(t)]^2}$$ (15)

where ${\alpha _1}$ and ${\alpha _2}$ are positive scale factors. In order to obtain the pilot's motion information during the learning process, inclinometers are utilized to measure the pilot's joint angle ${q_h}$ and angular velocity ${\dot q_h}$.
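For reference, the immediate cost of Eq. (15) is a one-liner once the inclinometer readings are available; the default weights below are the values used later in the single DOF experiments:

```python
def immediate_cost(q_h, dq_h, q_e, dq_e, alpha1=1500.0, alpha2=1500.0):
    """Eq. (15): penalize angle and velocity tracking errors."""
    return alpha1 * (q_h - q_e) ** 2 + alpha2 * (dq_h - dq_e) ** 2
```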

With the defined policy and cost function, the learning process of the virtual impedance parameters based on PI2 for the single DOF case is described as follows:

      1) Initialize the virtual impedance parameter vector ${\mathit{\boldsymbol{ \boldsymbol{\varTheta} }}} $.

2) Initialize the basis function ${\mathit{\boldsymbol{W}}_{{t_i}}}$ according to Eq. (14).

      3) Repeat.

      4) Run K gait cycles of the exoskeleton using stochastic parameters ${\mathit{\boldsymbol{ \boldsymbol{\varTheta} }}} +{{\epsilon }_{t}}$ at every time step.

      5) For all gait cycles $k \in [1, K]$:

      6) Compute the projection matrix M through Eq. (16).

      7) Compute the stochastic cost S through Eq. (17).

      8) Compute the probability P through Eq. (18).

      9) For all time steps $i \in [1, T]$:

      10) Compute $\Delta {\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} _{{t_i}}}$ for each time step through Eq. (19).

11) Normalize $\Delta \mathit{\boldsymbol{ \boldsymbol{\varTheta} }} $ according to Eq. (20).

      12) Update $\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} \leftarrow \mathit{\boldsymbol{ \boldsymbol{\varTheta} }} + \Delta \mathit{\boldsymbol{ \boldsymbol{\varTheta} }} $.

      13) Run one noiseless gait cycle to compute the trajectory cost R through Eq. (21).

14) Until the trajectory cost R converges.

As described in the algorithm above, the virtual impedance parameters of the VIC are updated every K+1 gait cycles. The updating rule follows Eq. (16) to Eq. (20):

      $${\mathit{\boldsymbol{M}}_{{t_i}, k}} = \frac{{{H^{ - 1}}{\mathit{\boldsymbol{W}}_{{t_i}, k}}\mathit{\boldsymbol{W}}_{{t_i}, k}^T}}{{\mathit{\boldsymbol{W}}_{{t_i}, k}^T{H^{ - 1}}{\mathit{\boldsymbol{W}}_{{t_i}, k}}}}$$ (16)
      $${\mathit{\boldsymbol{S}}_{i, k}} = \sum\limits_{j = i}^{N - 1} {{r_{{t_j}, k}}} + \frac{1}{2}\sum\limits_{j = i + 1}^{N - 1} ( \mathit{\boldsymbol{ \boldsymbol{\varTheta} }} + {\mathit{\boldsymbol{M}}_{{t_j}, k}}{\epsilon _{{t_j}, k}}{)^{\rm{T}}}H(\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} + {\mathit{\boldsymbol{M}}_{{t_j}, k}}{\epsilon _{{t_j}, k}})$$ (17)
      $${\mathit{\boldsymbol{P}}_{i, k}} = \frac{{{{\rm{e}}^{ - \frac{1}{\lambda }{\mathit{\boldsymbol{S}}_{i, k}}}}}}{{\sum\limits_{k = 1}^K {\left[ {{{\rm{e}}^{ - \frac{1}{\lambda }{\mathit{\boldsymbol{S}}_{i, k}}}}} \right]} }} $$ (18)
      $$\Delta {\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} _{{t_i}}} = \sum\limits_{k = 1}^K {[{\mathit{\boldsymbol{P}}_{i, k}}{\mathit{\boldsymbol{M}}_{{t_i}, k}}{\epsilon _{{t_i}, k}}]} $$ (19)
      $${[\Delta \mathit{\boldsymbol{ \boldsymbol{\varTheta} }} ]_j} = \frac{{\sum\limits_{i = 1}^T ( T - i){\omega _{j, {t_i}}}{{[\Delta {\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} _{{t_i}}}]}_j}}}{{\sum\limits_{i = 1}^T {{\omega _{j, {t_i}}}} (T - i)}}$$ (20)

Table 1.  Comparison of VVIC strategy and SAC algorithm for three different simulated pilots in single DOF case

NMSE/rad   Simulated pilot A   Simulated pilot B   Simulated pilot C
VVIC       0.024               0.032               0.038
SAC        0.069               0.094               0.124

The matrix H in Eq. (16) and Eq. (17) is a positive semi-definite weight matrix, and the scale factor λ in Eq. (18) is set within (0, 1]. With the updated parameter vector $\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} $, a noiseless gait cycle (without exploration noise ${{\epsilon }_{t}}$) is executed to determine whether the learning process should be terminated, by calculating the trajectory cost R:

      $$R = \rho \cdot \sum\limits_{i = 1}^T {{r_{{t_i}}}} $$ (21)

where $\rho = 1/{t_d}$ (${t_d}$ indicates the time duration of the gait cycle) is a normalization factor, since the durations of gait cycles always differ in real-time applications.
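To make the updating rule concrete, here is a minimal NumPy sketch of one PI2 parameter update and of the trajectory cost of Eq. (21). The array shapes, and the assumption that the basis weights are shared across rollouts, are ours, not the authors' implementation:

```python
import numpy as np

def pi2_update(Theta, W, eps, r, H, lam):
    """One PI2 parameter update following Eqs. (16)-(20).

    Theta : (n,) current parameters [K_h, D_h]
    W     : (T, n) normalized basis weights per time step, Eq. (14)
    eps   : (K, T, n) exploration noise of the K noisy gait cycles
    r     : (K, T) immediate costs, Eq. (15)
    H     : (n, n) weight matrix; lam is the scale factor in (0, 1]
    """
    K, T, n = eps.shape
    H_inv = np.linalg.inv(H)

    # Eq. (16): projection matrices M_{t_i}; identical across rollouts
    # here because the basis weights do not depend on the rollout
    M = np.empty((T, n, n))
    for i in range(T):
        w = W[i][:, None]
        M[i] = (H_inv @ w @ w.T) / (w.T @ H_inv @ w).item()

    # Eq. (17): cost-to-go S_{i,k} of each rollout from each time step
    S = np.zeros((K, T))
    for k in range(K):
        for i in range(T):
            S[k, i] = r[k, i:].sum()
            for j in range(i + 1, T):
                p = Theta + M[j] @ eps[k, j]
                S[k, i] += 0.5 * p @ H @ p

    # Eq. (18): probability of each rollout, a softmax over the K rollouts
    P = np.exp(-S / lam)
    P /= P.sum(axis=0, keepdims=True)

    # Eq. (19): probability-weighted parameter change per time step
    dTheta_t = np.zeros((T, n))
    for i in range(T):
        for k in range(K):
            dTheta_t[i] += P[k, i] * (M[i] @ eps[k, i])

    # Eq. (20): temporal averaging weighted by (T - i) and the kernels
    weights = (T - np.arange(T))[:, None] * W        # shape (T, n)
    dTheta = (weights * dTheta_t).sum(axis=0) / weights.sum(axis=0)

    return Theta + dTheta                            # step 12 of the algorithm

def trajectory_cost(r_noiseless, t_d):
    """Eq. (21): duration-normalized cost of one noiseless gait cycle."""
    return r_noiseless.sum() / t_d
```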

In the implementation of the VVIC strategy, the reinforcement learning process is carried out for each pilot, learning the optimal virtual impedance parameters that adapt to that pilot's HEI. Afterwards, with the learned optimal model-based controller, the lower exoskeleton is able to track the pilot's motion closely based only on the joint information of the lower exoskeleton.

• In this section, the proposed VVIC strategy is validated both on a single DOF case in a simulation environment and on the HUALEX system. Experimental results and discussion are presented in the next two subsections.

• Fig. 2 illustrates the model of the single DOF exoskeleton coupled with the pilot at the knee joint. As a human-coupled system, the resultant torque on the exoskeleton knee joint is composed of two parts: one is Tact, provided by the exoskeleton actuator, and the other is Thm, applied by the pilot through the compliant connection between the exoskeleton and the pilot.

      Figure 2.  Model of single DOF exoskeleton coupling with the pilot in knee joint

The dynamics of the single DOF exoskeleton coupled with the pilot are defined in the simulation environment as Eq. (22):

      $$J{\ddot q_e} + B{\dot q_e} + mgl{\rm{sin}}{q_e} = {T_{{\rm{act}}}} + {T_{{\rm{hm}}}}$$ (22)

where the last term $mgl \cdot {\rm{sin}}{q_e}$ is the gravity component. Hence, according to the control law of the proposed VIC in Eq. (4), the controller of the single DOF exoskeleton is designed as follows:

      $${T_{{\rm{act}}}} = (\hat J - {D_h}){\ddot q_e} + (\hat B - {K_h}){\dot q_e} + mgl{\rm{sin}}{q_e}$$ (23)

where $\hat J$ and $\hat B$ are the estimated inertial moment and viscous friction parameters, respectively. ${K_h}$ and ${D_h}$ are the virtual impedance parameters of the proposed VVIC strategy, which are learned to adapt to the HEI of different pilots.
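The following self-contained sketch shows how Eq. (22) and the controller of Eq. (23) interact in simulation; the spring-damper interaction model matches the simulator description below, but every numerical value is an illustrative assumption:

```python
import numpy as np

m, g, l = 2.0, 9.81, 0.3            # assumed link mass, gravity, CoM distance
J, B = m * l ** 2, 0.5              # true inertia and viscous friction
J_hat, B_hat = 0.9 * J, 0.9 * B     # under-estimated dynamics (stable case)
K_h, D_h = 0.5, 0.02                # virtual impedance parameters to be learned
k_hm, d_hm = 80.0, 2.0              # spring-damper model of the compliant cuff

dt = 0.001
t = np.arange(0.0, 4.0, dt)
q_h = 0.4 * np.sin(2 * np.pi * 0.8 * t)     # pilot's knee trajectory (sine)
dq_h = np.gradient(q_h, dt)

q_e = dq_e = ddq_e = 0.0
err = []
for i in range(len(t)):
    # interaction torque T_hm through the compliant connection
    T_hm = k_hm * (q_h[i] - q_e) + d_hm * (dq_h[i] - dq_e)
    # Eq. (23): the controller uses only the exoskeleton's own states
    # (acceleration taken from the previous step)
    T_act = (J_hat - D_h) * ddq_e + (B_hat - K_h) * dq_e + m * g * l * np.sin(q_e)
    # Eq. (22): forward dynamics of the coupled knee joint, explicit Euler
    ddq_e = (T_act + T_hm - B * dq_e - m * g * l * np.sin(q_e)) / J
    dq_e += ddq_e * dt
    q_e += dq_e * dt
    err.append(q_h[i] - q_e)

print("mean |q_h - q_e| =", np.mean(np.abs(err)))   # rough tracking measure
```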

• In the experiments with the simulated single DOF exoskeleton, different values of the impedance ${K_{{\rm{hm}}}}$ (described in Fig. 1) are used to simulate the different HEIs of different pilots; here we choose three different impedance values. The estimated dynamic parameters of the model-based controller are set as $\hat J $ = 0.9 J and $\hat B$ = 0.9 B, which satisfy the stability condition derived in Subsection 1.2. The pilot's joint angles are simulated as periodic sine waves with different frequencies and amplitudes.

In the learning process of the proposed VVIC strategy, the exoskeleton takes several gait cycles to obtain the optimal virtual impedance parameters of the controller. The exoskeleton updates the parameters every four (K = 4) gait cycles and spends one further gait cycle calculating the trajectory cost R, so the parameter vector $\mathit{\boldsymbol{ \boldsymbol{\varTheta} }} $ is updated every 5 gait cycles. The weight parameters ${\alpha _1}$ and ${\alpha _2}$ of the immediate cost function (described in Eq. (15)) are both chosen as 1 500. Fig. 3 illustrates the learning curves of the reinforcement learning process for the different simulated pilots (the impedance values ${K_{{\rm{hm}}}}$ satisfy C > B > A). As shown in Fig. 3, the learning process takes almost 120 gait cycles (24 updates) to obtain the optimal virtual impedance parameters (the trajectory cost R converges).

      Figure 3.  Learning curves of reinforcement learning process for simulated different pilots

After obtaining the optimal parameters of the VVIC strategy, comparative experiments are carried out against the traditional SAC algorithm. Fig. 4 shows the control performances of the proposed VVIC strategy and the SAC algorithm with pilot A. In the comparison experiments, we choose 11 of the total 50 gait cycles, with different motion patterns, to compare the control performances of the VVIC strategy and the SAC algorithm. The black curves in Fig. 4 represent the interaction force between the pilot and the exoskeleton, which is calculated by a spring-damper model in the simulator. As shown in Fig. 4, the proposed VVIC strategy achieves better performance (less interaction force) than the traditional SAC algorithm.

      Figure 4.  Control performances of the proposed VVIC strategy and SAC algorithm

Tab. 1 shows the normalized mean square error (nMSE) of the VVIC strategy and the SAC algorithm over the total 50 gait cycles with three different simulated pilots. The results show that the proposed VVIC strategy achieves better performance when dealing with the different HEI of different pilots; e.g., with simulated pilot C, the nMSE of the SAC algorithm is almost three times that of the proposed VVIC strategy (0.124 rad compared with 0.038 rad).

• The HUALEX system is designed for strength augmentation applications. Fig. 5 shows the complete HUALEX system with a pilot. In Fig. 5: 1) the pilot; 2) the load backpack with the power unit and main controller (rigidly connected to the HUALEX spine); 3) semi-rigid connections between HUALEX and the pilot (waist, thighs, shanks and feet); 4) active joints with DC servo motors (hip and knee joints); 5) node controllers for the active joints; 6) smart shoes with plantar sensors.

      Figure 5.  HUALEX with the pilot

As shown in Fig. 5, four active joints (hips and knees) are designed to provide active torques for strength augmentation. The ankle joints of the HUALEX system are energy-storage mechanisms which store energy in the stance phase and release it in the swing phase. Many compliant connections are utilized to attach the HUALEX system to the pilot in a semi-rigid way.

The control system of HUALEX consists of one main controller and four node controllers, one for each active joint. The control algorithm runs on the main controller, while the node controllers collect sensory information and execute control commands. Three kinds of sensors are utilized in the sensory system: 1) encoders embedded in each active joint measure the motion information of HUALEX; 2) IMU sensors measure the motion information of the pilot when necessary; 3) plantar sensors in the smart shoes detect the walking phases of HUALEX.

• In the experiments on the HUALEX system, three different pilots (A: 172 cm/76 kg, B: 176 cm/80 kg, C: 180 cm/96 kg) are chosen to operate the HUALEX system in sequence, which means that during the learning process of the VVIC strategy, the optimal parameters learned with pilot A are used as the initial values for pilot B (note that the VVIC parameters of each joint of the HUALEX system are learned independently). During the learning process, IMU sensors are utilized to measure the pilot's motion information for obtaining the optimal virtual impedance parameters. Besides the virtual impedance parameters of the VVIC, the parameters of the HUALEX dynamics are identified with SolidWorks software. After the optimal parameters of the VVIC are obtained, the IMU sensors are retained to capture the pilot's motion information (not used for control) in order to validate the control performance of the proposed VVIC strategy.

• Fig. 6 shows the learning curves of the VVICs in the HUALEX system with different pilots (left hip and knee joints). As discussed in the experimental setup, pilot A operates the HUALEX system first, so the learning process of the VVIC strategy needs more training gait cycles (almost 140). With the better initial values transferred from pilot A, the learning processes for pilots B and C are reduced to almost 80 gait cycles.

Figure 6.  Learning curves of VVICs in the HUALEX system with different pilots (left hip and knee joints)

After obtaining the optimal virtual impedance parameters of the VVIC strategy through the reinforcement learning process, we validate the control performance of the proposed VVIC strategy in comparison with the traditional SAC algorithm. The results show that the proposed VVIC strategy achieves good control performance. Moreover, Tab. 2 gives the comparison of the VVIC strategy and the SAC algorithm with different pilots (100 gait cycles for each pilot). As shown in Tab. 2, the proposed VVIC strategy achieves better performance on the HUALEX system with different pilots; e.g., in the right knee joint of pilot C, the nMSE of the SAC algorithm is almost three times that of the VVIC strategy (0.094 rad compared with 0.032 rad).

Table 2.  Comparison of SAC and VVIC strategy in HUALEX with different pilots in total 100 gait cycles

NMSE/rad (VVIC | SAC)   Pilot A         Pilot B         Pilot C
Left hip                0.026 | 0.078   0.028 | 0.085   0.025 | 0.086
Left knee               0.036 | 0.086   0.038 | 0.092   0.032 | 0.090
Right hip               0.024 | 0.065   0.023 | 0.068   0.026 | 0.075
Right knee              0.028 | 0.087   0.031 | 0.079   0.032 | 0.094
• This paper has proposed a novel VVIC strategy for the control of a human-coupled lower exoskeleton (HUALEX), which aims at adapting to the different HEI of different pilots. The proposed VVIC strategy is based on a novel VIC, a model-based controller with a virtual impedance model. In order to adapt to the different HEI of different pilots, the PI2 reinforcement learning algorithm is employed to obtain the optimal parameters of the virtual impedance model in the VIC. The control performance of the proposed VVIC strategy is validated in a single DOF exoskeleton simulation environment as well as on the HUALEX system. Experimental results indicate that the proposed VVIC achieves better performance than the traditional SAC algorithm and can deal with the varying HEI of different pilots.

In the future, we will investigate methods that can learn/update the parameters of the VVIC online, so that HUALEX will be able to 'get used to' its pilot during operation. Moreover, accurate estimation of the dynamic model of HUALEX is also important, since accurate dynamic models always achieve better performance for model-based controllers in strength augmentation lower exoskeletons.
