多特征融合的目标物体导航方法

毕盛; 林华伟; 董敏

doi:10.12178/1001-0548.2024122

多特征融合的目标物体导航方法

Target object navigation method based on multi-feature fusion

摘要

摘要: 目标物体导航是在未知的环境中根据视觉观察到达预期的目标物体。其中，如何从视觉观察中找到目标物体的方向是至关重要的。针对这一问题，提出一种基于多特征融合的目标物体导航方法。该方法通过特征融合模块融合包含导航环境整体信息、局部信息的视觉特征和指代目标物体语义的文本特征，得到表征导航方向的方向特征和导航环境的环境特征，将视觉表示与导航方向相关联，从而指导导航动作的生成，约束代理朝目标物体方向导航，提高模型的导航成功率和效率。AI2-Thor 数据集上的实验表明，和基准模型对比，导航成功率SR提升11.7%、导航成功路径长度加权比率SPL提升0.093；和目前先进的方法对比，SR提升2.1%、SPL提升0.008。实验结果证明了该方法的准确性和高效性。

Abstract: Target object navigation is the process of reaching the expected target object based on visual observation in an unknown environment. In this process, it is crucial to find the direction of the target object from visual observation. A target object navigation method based on multi-feature fusion is proposed to address this issue. Using a feature fusion module to fuse visual features that contain the overall and local information of the navigation environment and the text features that refer to the semantics of the target object, the method obtains the directional features that represent the navigation direction and environmental features of the navigation environment. And then this method associates the visual representation with the navigation direction, thereby guiding the generation of navigation actions, constraining the agent to navigate towards the direction of the target object, and improving the success rate and efficiency of the model's navigation. Experiments on the AI2 Thor dataset show that compared to the benchmark model, the navigation success rate SR has increased by 11.7 percentage points, and the navigation success path length weighted ratio SPL has increased by 0.093; Compared with current advanced methods, SR has increased by 2.1 percentage points and SPL has increased by 0.008. The experimental results have demonstrated the accuracy and efficiency of this method.

HTML全文

参考文献(31)

施引文献

资源附件(0)

多特征融合的目标物体导航方法

Target object navigation method based on multi-feature fusion

期刊在线

编辑办公

友情链接