多特征融合的目标物体导航方法

Target object navigation method based on multi feature fusion

  • 摘要: 目标物体导航是在未知的环境中根据视觉观察到达预期的目标物体。其中,如何从视觉观察中找到目标物体的方向是至关重要的。针对这一问题,提出一种基于多特征融合的目标物体导航方法。该方法通过特征融合模块融合包含导航环境整体信息、局部信息的视觉特征和指代目标物体语义的文本特征,得到表征导航方向的方向特征和导航环境的环境特征,将视觉表示与导航方向相关联,从而指导导航动作的生成,约束代理朝目标物体方向导航,提高模型的导航成功率和效率。AI2-Thor 数据集上的实验表明,和基准模型对比,导航成功率SR提升11.7%、导航成功路径长度加权比率SPL提升0.093;和目前先进的方法对比,SR提升2.1%、SPL提升0.008。实验结果证明了该方法的准确性和高效性。

     

    Abstract: Target object navigation is the process of reaching the expected target object based on visual observation in an unknown environment. Among them, it is crucial to find the direction of the target object from visual observation. A target object navigation method based on multi feature fusion is proposed to address this issue. This method uses a feature fusion module to fuse visual features that contain overall and local information of the navigation environment, as well as text features that refer to the semantics of the target object, to obtain directional features that represent the navigation direction and environmental features of the navigation environment. The visual representation is associated with the navigation direction to guide the generation of navigation actions, constrain the agent to navigate towards the direction of the target object, and improve the success rate and efficiency of the model's navigation. Experiments on the AI2 Thor dataset show that compared to the benchmark model, the navigation success rate SR has increased by 11.7 percentage points, and the navigation success path length weighted ratio SPL has increased by 0.093; Compared with current advanced methods, SR has increased by 2.1 percentage points and SPL has increased by 0.008. The experimental results have demonstrated the accuracy and efficiency of this method.

     

/

返回文章
返回