首页 > 最新文献

IEEE Transactions on Multimedia最新文献

英文 中文
Improving Pre-trained Model-based Speech Emotion Recognition from a Low-level Speech Feature Perspective 从低级语音特征角度改进基于预训练模型的语音情感识别
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-05 DOI: 10.1109/tmm.2024.3410133
Ke Liu, Jiwei Wei, Jie Zou, Peng Wang, Yang Yang, Heng Tao Shen
{"title":"Improving Pre-trained Model-based Speech Emotion Recognition from a Low-level Speech Feature Perspective","authors":"Ke Liu, Jiwei Wei, Jie Zou, Peng Wang, Yang Yang, Heng Tao Shen","doi":"10.1109/tmm.2024.3410133","DOIUrl":"https://doi.org/10.1109/tmm.2024.3410133","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"12 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning Semantic Polymorphic Mapping for Text-Based Person Retrieval 学习语义多态映射,实现基于文本的人员检索
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-05 DOI: 10.1109/tmm.2024.3410129
Jiayi Li, Min Jiang, Jun Kong, Xuefeng Tao, Xi Luo
{"title":"Learning Semantic Polymorphic Mapping for Text-Based Person Retrieval","authors":"Jiayi Li, Min Jiang, Jun Kong, Xuefeng Tao, Xi Luo","doi":"10.1109/tmm.2024.3410129","DOIUrl":"https://doi.org/10.1109/tmm.2024.3410129","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"29 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Triple Consistency for Transparent Cheating Problem in Light Field Depth Estimation 光场深度估计中透明作弊问题的三重一致性
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-05 DOI: 10.1109/tmm.2024.3410139
Zhenglong Cui, Da Yang, Hao Sheng, Sizhe Wang, Rongshan Chen, Ruixuan Cong, Wei Ke
{"title":"Triple Consistency for Transparent Cheating Problem in Light Field Depth Estimation","authors":"Zhenglong Cui, Da Yang, Hao Sheng, Sizhe Wang, Rongshan Chen, Ruixuan Cong, Wei Ke","doi":"10.1109/tmm.2024.3410139","DOIUrl":"https://doi.org/10.1109/tmm.2024.3410139","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"13 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Width-Adaptive CNN: Fast CU Partition Prediction for VVC Screen Content Coding 宽度自适应 CNN:针对 VVC 屏幕内容编码的快速 CU 分区预测
IF 8.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-05 DOI: 10.1109/TMM.2024.3410116
Chao Jiao;Huanqiang Zeng;Jing Chen;Chih-Hsien Hsia;Tianlei Wang;Kai-Kuang Ma
Screen content coding (SCC) in Versatile Video Coding (VVC) improves the coding efficiency of screen content videos (SCVs) significantly but results in high computational complexity due to the quad-tree plus multi-type tree (QTMT) structure of the coding unit (CU) partitioning. Therefore, we make the first attempt to reduce the encoding complexity from the perspective of CU partitioning for SCC in VVC. To this end, a fast CU partition prediction method is technically developed for VVC-SCC. First, to solve the problem of lacking sufficient SCC training data, SCVs are collected to establish a database containing CUs of various sizes and corresponding partition labels. Second, to determine the partition decision in advance, a novel WA-CNN model is proposed, which is capable of predicting two large CUs for VVC-SCC by adjusting the feature channels based on the size of input CU blocks. Finally, considering the imbalanced proportion of diverse partition decisions, a loss function with the weight that equalizes the contribution of imbalanced data is formulated to train the proposed WA-CNN model. Experimental results show that the proposed model reduces the SCC intra-encoding time by 35.65%${sim }$38.31% with an average of 1.84%${sim }$2.42% BDBR increase.
多功能视频编码(VVC)中的屏幕内容编码(SCC)大大提高了屏幕内容视频(SCV)的编码效率,但由于编码单元(CU)分区的四叉树加多类型树(QTMT)结构,导致了较高的计算复杂度。因此,我们首次尝试从 CU 分区的角度降低 VVC 中 SCC 的编码复杂度。为此,我们从技术上为 VVC-SCC 开发了一种快速 CU 分区预测方法。首先,为了解决缺乏足够 SCC 训练数据的问题,收集了 SCV,建立了一个包含不同大小 CU 和相应分区标签的数据库。其次,为了提前确定分区决策,提出了一种新颖的 WA-CNN 模型,该模型能够根据输入 CU 块的大小调整特征通道,从而预测 VVC-SCC 的两个大 CU。最后,考虑到不同分区决策的不平衡比例,制定了一个具有均衡不平衡数据贡献权重的损失函数来训练所提出的 WA-CNN 模型。实验结果表明,所提出的模型减少了 35.65%${sim }$38.31% 的 SCC 内部编码时间,平均增加了 1.84%${sim }$2.42% 的 BDBR。
{"title":"Width-Adaptive CNN: Fast CU Partition Prediction for VVC Screen Content Coding","authors":"Chao Jiao;Huanqiang Zeng;Jing Chen;Chih-Hsien Hsia;Tianlei Wang;Kai-Kuang Ma","doi":"10.1109/TMM.2024.3410116","DOIUrl":"10.1109/TMM.2024.3410116","url":null,"abstract":"Screen content coding (SCC) in Versatile Video Coding (VVC) improves the coding efficiency of screen content videos (SCVs) significantly but results in high computational complexity due to the quad-tree plus multi-type tree (QTMT) structure of the coding unit (CU) partitioning. Therefore, we make the first attempt to reduce the encoding complexity from the perspective of CU partitioning for SCC in VVC. To this end, a fast CU partition prediction method is technically developed for VVC-SCC. First, to solve the problem of lacking sufficient SCC training data, SCVs are collected to establish a database containing CUs of various sizes and corresponding partition labels. Second, to determine the partition decision in advance, a novel WA-CNN model is proposed, which is capable of predicting two large CUs for VVC-SCC by adjusting the feature channels based on the size of input CU blocks. Finally, considering the imbalanced proportion of diverse partition decisions, a loss function with the weight that equalizes the contribution of imbalanced data is formulated to train the proposed WA-CNN model. Experimental results show that the proposed model reduces the SCC intra-encoding time by 35.65%\u0000<inline-formula><tex-math>${sim }$</tex-math></inline-formula>\u000038.31% with an average of 1.84%\u0000<inline-formula><tex-math>${sim }$</tex-math></inline-formula>\u00002.42% BDBR increase.","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"26 ","pages":"9372-9382"},"PeriodicalIF":8.4,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CDKM: Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning CDKM:用于密集字幕的具有内容交互性的共性和特性知识挖掘网络
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-04 DOI: 10.1109/tmm.2024.3407695
Hongyu Deng, Yushan Xie, Qi Wang, Jianjun Wang, Weijian Ruan, Wu Liu, Yong-Jin Liu
{"title":"CDKM: Common and Distinct Knowledge Mining Network with Content Interaction for Dense Captioning","authors":"Hongyu Deng, Yushan Xie, Qi Wang, Jianjun Wang, Weijian Ruan, Wu Liu, Yong-Jin Liu","doi":"10.1109/tmm.2024.3407695","DOIUrl":"https://doi.org/10.1109/tmm.2024.3407695","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"91 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimating the Semantics via Sector Embedding for Image-Text Retrieval 通过扇形嵌入估计语义,实现图像文本检索
IF 8.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-04 DOI: 10.1109/TMM.2024.3407664
Zheng Wang;Zhenwei Gao;Mengqun Han;Yang Yang;Heng Tao Shen
Based on deterministic single-point embedding, most extant image-text retrieval methods only focus on the match of ground truth while suffering from one-to-many correspondence, where besides annotated positives, many similar instances of another modality should be retrieved by a given query. Recent solutions of probabilistic embedding and rectangle mapping still encounter some drawbacks, albeit their promising effectiveness at multiple matches. Meanwhile, the exploration of one-to-many correspondence is still insufficient. Therefore, this paper proposes a novel geometric representation to Estimate the Semantics of heterogeneous data via Sector Embedding (dubbed ESSE). Specifically, a given image/text can be projected as a sector, where its symmetric axis represents mean semantics and the aperture estimates uncertainty. Further, a sector matching loss is introduced to better handle the multiplicity by considering the sine of included angles as distance calculation, which encourages candidates to be contained by the apertures of a query sector. The experimental results on three widely used benchmarks CUB, Flickr30 K and MS-COCO reveal that sector embedding can achieve competitive performance on multiple matches and also improve the traditional ground-truth matching of the baselines. Additionally, we also verify the generalization to video-text retrieval on two extensively used datasets of MSRVTT and MSVD, and to text-based person retrieval on CUHK-PEDES. This superiority and effectiveness can also demonstrate that the bounded property of the aperture can better estimate semantic uncertainty when compared to prior remedies.
基于确定性单点嵌入,大多数现有的图像-文本检索方法只关注地面实况的匹配,而存在一对多的对应问题。最近的概率嵌入和矩形映射解决方案虽然在多匹配方面效果显著,但仍存在一些缺陷。同时,对一对多对应关系的探索仍然不够。因此,本文提出了一种新颖的几何表示法,通过扇形嵌入来估算异构数据的语义(称为ESSE)。具体来说,给定的图像/文本可以投影为一个扇形,其中对称轴代表平均语义,光圈则估计不确定性。此外,还引入了扇形匹配损失,通过考虑包含角的正弦作为距离计算来更好地处理多重性,从而鼓励候选者被查询扇形的孔径所包含。在 CUB、Flickr30 K 和 MS-COCO 这三个广泛使用的基准上的实验结果表明,扇形嵌入可以在多重匹配上实现有竞争力的性能,同时还能改善基准的传统地面实况匹配。此外,我们还在两个广泛使用的数据集 MSRVTT 和 MSVD 上验证了视频文本检索的通用性,并在 CUHK-PEDES 上验证了基于文本的人物检索的通用性。这种优越性和有效性还表明,与之前的补救方法相比,孔径的有界属性能更好地估计语义的不确定性。
{"title":"Estimating the Semantics via Sector Embedding for Image-Text Retrieval","authors":"Zheng Wang;Zhenwei Gao;Mengqun Han;Yang Yang;Heng Tao Shen","doi":"10.1109/TMM.2024.3407664","DOIUrl":"10.1109/TMM.2024.3407664","url":null,"abstract":"Based on deterministic single-point embedding, most extant image-text retrieval methods only focus on the match of ground truth while suffering from one-to-many correspondence, where besides annotated positives, many similar instances of another modality should be retrieved by a given query. Recent solutions of probabilistic embedding and rectangle mapping still encounter some drawbacks, albeit their promising effectiveness at multiple matches. Meanwhile, the exploration of one-to-many correspondence is still insufficient. Therefore, this paper proposes a novel geometric representation to \u0000<underline>E</u>\u0000stimate the \u0000<underline>S</u>\u0000emantics of heterogeneous data via \u0000<underline>S</u>\u0000ector \u0000<underline>E</u>\u0000mbedding (dubbed \u0000<bold>ESSE</b>\u0000). Specifically, a given image/text can be projected as a sector, where its symmetric axis represents mean semantics and the aperture estimates uncertainty. Further, a sector matching loss is introduced to better handle the multiplicity by considering the sine of included angles as distance calculation, which encourages candidates to be contained by the apertures of a query sector. The experimental results on three widely used benchmarks CUB, Flickr30 K and MS-COCO reveal that sector embedding can achieve competitive performance on multiple matches and also improve the traditional ground-truth matching of the baselines. Additionally, we also verify the generalization to video-text retrieval on two extensively used datasets of MSRVTT and MSVD, and to text-based person retrieval on CUHK-PEDES. This superiority and effectiveness can also demonstrate that the bounded property of the aperture can better estimate semantic uncertainty when compared to prior remedies.","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"26 ","pages":"10342-10353"},"PeriodicalIF":8.4,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Logit Variated Product Quantization Based on Parts Interaction and Metric Learning With Knowledge Distillation for Fine-Grained Image Retrieval 基于部件交互和度量学习的 Logit 变量产品量化与知识提炼,用于细粒度图像检索
IF 8.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-03 DOI: 10.1109/TMM.2024.3407661
Lei Ma;Xin Luo;Hanyu Hong;Fanman Meng;Qingbo Wu
Image retrieval with fine-grained categories is an extremely challenging task due to the high intraclass variance and low interclass variance. Most previous works have focused on localizing discriminative image regions in isolation, but have rarely exploited correlations across the different discriminative regions to alleviate intraclass differences. In addition, the intraclass compactness of embedding features is ensured by extra regularization terms that only exist during the training phase, which appear to generalize less well in the inference phase. Finally, the information granularity of the distance measure should distinguish subtle visual differences and the correlation between the embedding features and the quantized features should be maximized sufficiently. To address the above issues, we propose a logit variated product quantization method based on part interaction and metric learning with knowledge distillation for fine-grained image retrieval. Specifically, we introduce a causal context module into the deep navigator to generate discriminative regions and utilize a channelwise cross-part fusion transformer to model the part correlations while alleviating intraclass differences. Subsequently, we design a logit variation module based on a weighted sum scheme to further reduce the intraclass variance of the embedding features directly and enhance the learning power of the quantization model. Finally, we propose a novel product quantization loss based on metric learning and knowledge distillation to enhance the correlation between the embedding features and the quantized features and allow the quantization features to learn more knowledge from the embedding features. The experimental results on several fine-grained datasets demonstrate that the proposed method is superior to state-of-the-art fine-grained image retrieval methods.
由于类内差异大而类间差异小,细粒度类别的图像检索是一项极具挑战性的任务。以往的大多数研究都侧重于孤立地定位图像的分辨区域,但很少利用不同分辨区域之间的相关性来减轻类内差异。此外,嵌入特征的类内紧凑性是通过额外的正则化项来保证的,而这些正则化项只存在于训练阶段,在推理阶段的泛化效果似乎较差。最后,距离度量的信息粒度应能区分细微的视觉差异,嵌入特征与量化特征之间的相关性应充分最大化。针对上述问题,我们提出了一种基于部分交互和度量学习的 logit 变积量化方法,并将其用于细粒度图像检索的知识提炼。具体来说,我们在深度导航器中引入了一个因果上下文模块,以生成具有区分性的区域,并利用通道式跨部件融合转换器来建立部件相关性模型,同时减轻类内差异。随后,我们设计了一个基于加权和方案的 logit 变异模块,进一步直接降低嵌入特征的类内方差,增强量化模型的学习能力。最后,我们提出了一种基于度量学习和知识提炼的新型乘积量化损失,以增强嵌入特征与量化特征之间的相关性,让量化特征从嵌入特征中学习更多知识。在多个细粒度数据集上的实验结果表明,所提出的方法优于最先进的细粒度图像检索方法。
{"title":"Logit Variated Product Quantization Based on Parts Interaction and Metric Learning With Knowledge Distillation for Fine-Grained Image Retrieval","authors":"Lei Ma;Xin Luo;Hanyu Hong;Fanman Meng;Qingbo Wu","doi":"10.1109/TMM.2024.3407661","DOIUrl":"10.1109/TMM.2024.3407661","url":null,"abstract":"Image retrieval with fine-grained categories is an extremely challenging task due to the high intraclass variance and low interclass variance. Most previous works have focused on localizing discriminative image regions in isolation, but have rarely exploited correlations across the different discriminative regions to alleviate intraclass differences. In addition, the intraclass compactness of embedding features is ensured by extra regularization terms that only exist during the training phase, which appear to generalize less well in the inference phase. Finally, the information granularity of the distance measure should distinguish subtle visual differences and the correlation between the embedding features and the quantized features should be maximized sufficiently. To address the above issues, we propose a logit variated product quantization method based on part interaction and metric learning with knowledge distillation for fine-grained image retrieval. Specifically, we introduce a causal context module into the deep navigator to generate discriminative regions and utilize a channelwise cross-part fusion transformer to model the part correlations while alleviating intraclass differences. Subsequently, we design a logit variation module based on a weighted sum scheme to further reduce the intraclass variance of the embedding features directly and enhance the learning power of the quantization model. Finally, we propose a novel product quantization loss based on metric learning and knowledge distillation to enhance the correlation between the embedding features and the quantized features and allow the quantization features to learn more knowledge from the embedding features. The experimental results on several fine-grained datasets demonstrate that the proposed method is superior to state-of-the-art fine-grained image retrieval methods.","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"26 ","pages":"10406-10419"},"PeriodicalIF":8.4,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PhotoStyle60: A Photographic Style Dataset for Photo Authorship Attribution and Photographic Style Transfer 照片风格 60:用于照片作者归属和摄影风格转换的摄影风格数据集
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-06-03 DOI: 10.1109/tmm.2024.3408683
Marco Cotogni, Marco Arazzi, Claudio Cusano
{"title":"PhotoStyle60: A Photographic Style Dataset for Photo Authorship Attribution and Photographic Style Transfer","authors":"Marco Cotogni, Marco Arazzi, Claudio Cusano","doi":"10.1109/tmm.2024.3408683","DOIUrl":"https://doi.org/10.1109/tmm.2024.3408683","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"91 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141968865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Symbolic Music Generation from Graph-Learning-based Preference Modeling and Textual Queries 通过基于图学习的偏好建模和文本查询生成符号音乐
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-05-31 DOI: 10.1109/tmm.2024.3408060
Xichu Ma, Yuchen Wang, Ye Wang
{"title":"Symbolic Music Generation from Graph-Learning-based Preference Modeling and Textual Queries","authors":"Xichu Ma, Yuchen Wang, Ye Wang","doi":"10.1109/tmm.2024.3408060","DOIUrl":"https://doi.org/10.1109/tmm.2024.3408060","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"116 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141193073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DEER: Distribution Divergence-based Graph Contrast for Partial Label Learning on Graphs DEER:基于分布发散的图形对比,用于图形上的部分标签学习
IF 7.3 1区 计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2024-05-31 DOI: 10.1109/tmm.2024.3408038
Yiyang Gu, Zihao Chen, Yifang Qin, Zhengyang Mao, Zhiping Xiao, Wei Ju, Chong Chen, Xian-Sheng Hua, Yifan Wang, Xiao Luo, Ming Zhang
{"title":"DEER: Distribution Divergence-based Graph Contrast for Partial Label Learning on Graphs","authors":"Yiyang Gu, Zihao Chen, Yifang Qin, Zhengyang Mao, Zhiping Xiao, Wei Ju, Chong Chen, Xian-Sheng Hua, Yifan Wang, Xiao Luo, Ming Zhang","doi":"10.1109/tmm.2024.3408038","DOIUrl":"https://doi.org/10.1109/tmm.2024.3408038","url":null,"abstract":"","PeriodicalId":13273,"journal":{"name":"IEEE Transactions on Multimedia","volume":"126 1","pages":""},"PeriodicalIF":7.3,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141193254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Multimedia
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1