Given the real-time demands of manipulator control, the Sarsa(λ) algorithm, combined with the K-means clustering algorithm, was selected for its on-policy nature and efficiency.
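A reinforcement learning method based on K-means clustering is adopted as the basic control strategy, and the concrete implementation procedure of the system algorithm is given.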
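As a rough illustration of how K-means clustering and Sarsa(λ) might fit together, the sketch below clusters sampled continuous states into a small set of discrete states and then runs tabular Sarsa(λ) with accumulating eligibility traces over them. The one-dimensional toy task, the `step` function, and every hyperparameter are assumptions made for this example and do not come from the source; a real manipulator would have a higher-dimensional state and its own dynamics.

```python
import random

def nearest(centroids, x):
    """Index of the centroid closest to scalar state x."""
    return min(range(len(centroids)), key=lambda i: abs(centroids[i] - x))

def kmeans_1d(points, k, iters=20, seed=0):
    """Plain Lloyd's K-means on scalar points; returns sorted centroids."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest(centroids, p)].append(p)
        # An empty cluster keeps its previous centroid.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return sorted(centroids)

def step(x, action):
    """Hypothetical toy task on [0, 1]: action 1 moves right, 0 moves left.
    The episode ends once x >= 0.95; every other step costs -1."""
    x = min(1.0, max(0.0, x + (0.1 if action == 1 else -0.1)))
    done = x >= 0.95
    return x, (0.0 if done else -1.0), done

def epsilon_greedy(q, s, eps, rng):
    """Random action with probability eps, otherwise the greedy one."""
    if rng.random() < eps:
        return rng.randrange(2)
    return 0 if q[s][0] > q[s][1] else 1

def sarsa_lambda(centroids, episodes=200, alpha=0.2, gamma=0.95,
                 lam=0.8, eps=0.1, max_steps=200, seed=1):
    """Tabular Sarsa(lambda) over cluster indices (assumed hyperparameters)."""
    rng = random.Random(seed)
    k = len(centroids)
    q = [[0.0, 0.0] for _ in range(k)]
    for _ in range(episodes):
        e = [[0.0, 0.0] for _ in range(k)]  # traces reset each episode
        x = 0.0
        s = nearest(centroids, x)
        a = epsilon_greedy(q, s, eps, rng)
        for _ in range(max_steps):
            x, r, done = step(x, a)
            s2 = nearest(centroids, x)
            a2 = epsilon_greedy(q, s2, eps, rng)
            # On-policy target: uses the action actually chosen next.
            target = r if done else r + gamma * q[s2][a2]
            delta = target - q[s][a]
            e[s][a] += 1.0  # accumulating trace
            for i in range(k):
                for j in range(2):
                    q[i][j] += alpha * delta * e[i][j]
                    e[i][j] *= gamma * lam
            if done:
                break
            s, a = s2, a2
    return q

if __name__ == "__main__":
    samples = [i / 99 for i in range(100)]   # assumed state samples
    cents = kmeans_1d(samples, 5)
    print(cents)
    print(sarsa_lambda(cents))
```

Clustering first keeps the Q-table small, which is one plausible reading of why the combination suits a real-time setting; the number of clusters trades resolution against table size.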