非对称深度在线哈希

Asymmetric deep online Hashing

  • 摘要: 海量图像以流数据的形式实时涌入网络,使得在线图像检索需求越来越迫切。为了保证在线图像检索性能,研究人员利用在线哈希算法实时更新哈希函数,并重新学习新、旧数据集的哈希码。然而,随着旧数据集的日益积累,在线更新旧数据集的哈希码会严重影响在线检索效率。为此,提出非对称深度在线哈希(asymmetric deep online Hashing, ADOH),以非对称的方式深度学习在线哈希网络,并且仅生成新数据集的哈希码,无须更新旧数据集的哈希码,能够有效地提升在线检索效率。ADOH算法通过最小化哈希码内积与相似度矩阵之间的差异,保持样本对之间的语义相似性关系。另外,ADOH算法建立分类损失项和标签嵌入模块学习样本的语义信息,使生成的哈希码更具备语义鉴别性。在3个广泛使用的数据集cifar-10、mnist和Places205上设置在线近邻检索对比实验,结果表明ADOH算法的在线近邻检索性能优于目前8种较先进的在线哈希算法。

     

    Abstract: Massive images are flooding into the internet in real-time in the form of streaming data, which makes the need of online image retrieval more and more urgent. To guarantee the online image retrieval performance, online Hashing algorithms are utilized to re-learn the Hash functions and re-generate the Hash codes of the new and old samples in real time. As time went by, the amount of old dataset is very large, and the time complexity of re-generating Hash codes become unacceptable. To avoid the above problems, we propose a novel asymmetric deep online Hashing (ADOH) which trains a deep online Hashing network in an asymmetric manner. To improve the online retrieval efficiency, ADOH only generates the Hash code of the new samples and do not update the old samples’ Hash codes. During the training process, ADOH minimizes the difference between the Hash codes’ inner product and the similarity matrix, which can preserve the pair-wise semantic similarity relationship. Moreover, ADOH establishes the classification loss and the label embedding model to learn the samples’ semantic information, which makes the generated Hash codes more semantically discriminative. We conduct the approximate nearest neighbor retrieval comparative experiments on three widely used datasets including the cifar-10 dataset, the mnist dataset and the Places205 dataset. The results show that the online approximate nearest neighbor retrieval performance of ADOH outperforms the other 8 existing online Hashing methods.

     

/

返回文章
返回