針對單一和多智能體人形機器人之創新雙演員近端策略優化算法

dc.contributor包傑奇zh_TW
dc.contributor薩義德zh_TW
dc.contributorJacky Baltesen_US
dc.contributorSaeed Saeedvanden_US
dc.contributor.author李安民zh_TW
dc.contributor.authorAkbar, Ilhamen_US
dc.date.accessioned2024-12-17T03:22:25Z
dc.date.available2024-07-16
dc.date.issued2024
dc.description.abstractnonezh_TW
dc.description.abstractSingle-agent and multi-agent systems are integral to the dynamic environmental processes of reinforcement learning in advanced humanoid robotic applications. This thesis introduces the Dual Proximal Policy Optimization (DA-PPO) algorithm and its extension, Independent Dual Actor Proximal Policy Optimization (IDA-PPO),designed for robotic navigation and cooperative tasks using the ROBOTIS-OP3 humanoid robot. The study validates the effectiveness of DA-PPO and IDA-PPO cross various scenarios, demonstrating significant improvements in both single-agent and multi-agent environments. DA-PPO excels in robotic navigation and movement tasks, outperforming established reinforcement learning methods in complex environments and basic walking tasks. This success is attributed to its innovative architecture, efficient utilization of hardware resources like the NVIDIA GeForce RTX 3050, and an effective reward function strategy. IDA-PPO, with its decentralized training and dual actor policy network, achieves higher mean rewards and faster learning compared to IPPO and MAPPO. IDA-PPO is 5.49 times faster than MAPPO and 8.22 times faster than IPPO, highlighting its superior efficiency and adaptability in multi-agent tasks. These findings underscore the importance of algorithmic innovation and hardware capabilities in advancing robotic performance, positioning DA-PPO and IDA-PPO as significant advancements in robotic learningen_US
dc.description.sponsorship電機工程學系zh_TW
dc.identifier61175063H-45428
dc.identifier.urihttps://etds.lib.ntnu.edu.tw/thesis/detail/606e3490d24d6e8c7c34ed518ab0df64/
dc.identifier.urihttp://rportal.lib.ntnu.edu.tw/handle/20.500.12235/122945
dc.language英文
dc.subjectNonezh_TW
dc.subjectDA-PPOen_US
dc.subjectIDA-PPOen_US
dc.subjectSingle Agenten_US
dc.subjectMulti Agenten_US
dc.subjectreinforcement learningen_US
dc.subjectcooperative tasksen_US
dc.subjecthumanoid robotsen_US
dc.subjectrobotic navigationen_US
dc.title針對單一和多智能體人形機器人之創新雙演員近端策略優化算法zh_TW
dc.titleA Novel Dual-Actor Proximal Policy Optimization Algorithm for Single and Multi-Agent Humanoid Roboten_US
dc.type學術論文

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
202400045428-107754.pdf
Size:
12.81 MB
Format:
Adobe Portable Document Format
Description:
學術論文

Collections