广西科学

引用本文：

何沛,王萌,王卓,卢光云.基于特征增强的对抗哈希跨模态检索[J].广西科学,2022,29(4):691-699. [点击复制]
HE Pei,WANG Meng,WANG Zhuo,LU Guangyun.Feature Boosting Adversarial Hashing for Cross-Modal Retrieval[J].Guangxi Sciences,2022,29(4):691-699. [点击复制]

【打印本页】【在线阅读全文】【下载PDF全文】【查看/发表评论】【下载PDF阅读器】【关闭】

本文已被：浏览 363次下载 293次	码上扫一扫！
基于特征增强的对抗哈希跨模态检索
何沛¹, 王萌², 王卓¹, 卢光云³
0 字体:加大+\|默认\|缩小-
(1.广西科技大学理学院, 广西柳州 545000;2.广西科技大学数字启迪学院, 广西柳州 545000;3.柳州工学院, 广西柳州 545616)

摘要:

在跨模态检索任务中，哈希方法由于其检索效率高效、储存成本低廉而被广泛应用。但是，这些方法很少关注如何去弥补主体网络将高维特征转换为哈希码的过程中所丢失的特征信息。为解决这些问题，本文提出了一种特征增强对抗跨模态哈希（Feature Boosting Adversarial Hashing for Cross-Modal，FBAH）方法。FBAH方法将子空间学习与对抗学习相结合，来减少不同模态数据的差异性。另外，构造一种类残差模块，它可以将筛选出具有区别性的特征绕过主体网络直接输入到哈希空间进行特征增强。这样，生成的哈希码能够具有更多的原始特征信息。最后，通过带有分支网络的线性分类器在标签空间进行两种方式的预测，并最小化与真实标签的差距来保证语义的不变性。本文选择两个跨模态检索任务中常用的大型数据集进行大量实验，结果表明FBAH方法的性能优于目前7种较为先进的跨模态哈希方法。

关键词: 特征增强跨模态检索稀疏矩阵哈希子空间学习对抗学习

DOI：10.13656/j.cnki.gxkx.20220919.009

投稿时间：2022-04-04

基金项目:广西中青年教师基础能力提升项目“基于语义的跨媒体检索方法研究（2019KY1095）”资助。

Feature Boosting Adversarial Hashing for Cross-Modal Retrieval

HE Pei¹, WANG Meng², WANG Zhuo¹, LU Guangyun³

(1.College of Science, Guangxi University of Science and Technology, Liuzhou, Guangxi, 545000, China;2.Tus College of Digit, Guangxi University of Science and Technology, Liuzhou, Guangxi, 545000, China;3.Liuzhou Institute of Technology, Liuzhou, Guangxi, 545616, China)

Abstract:

In cross-modal retrieval tasks, hashing method is widely used because of its high retrieval efficiency and low storage cost.However, these methods pay little attention on how to compensate for the loss of feature information in the process of transforming high-dimensional features into hash codes.To solve these problems, this article proposes a Feature Boosting Adversarial Hashing for Cross-Modal (FBAH).FBAH combines subspace learning with adversarial learning to reduce the difference of different modal data.In addition, a module similar to residual structure is constructed, which can bypass the main network and directly input the selected distinguishing features into the hash space for feature boosting.In this way, the generated hash code can have more original feature information.Finally, the linear classifier with branch network is used to make two kinds of prediction in label space, and the gap with the real label is minimized to ensure the invariance of semantics.In this article, two large datasets commonly used in cross-modal retrieval tasks are selected for a large number of experiments.The results show that the performance of FBAH is superior to seven existing advanced cross-modal hashing methods.

Key words: feature boosting cross-modal retrieval sparse matrix hashing subspace learning adversarial learning

用微信扫一扫