Cross-modal information retrieval refers to the process of linking and querying data across distinct modalities, such as images, text, audio, and video. This field addresses the inherent semantic gap ...
Beijing Zhongke Journal Publising Co. Ltd. With the popularization of social networks, different modalities of data such as images, text, and audio aregrowing rapidly on the Internet. Subsequently, ...