基于逐像素密度分布模型的半监督计数

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-01-21 DOI:10.1109/TPAMI.2025.3532512

Hui Lin;Zhiheng Ma;Rongrong Ji;Yaowei Wang;Zhou Su;Xiaopeng Hong;Deyu Meng

{"title":"基于逐像素密度分布模型的半监督计数","authors":"Hui Lin;Zhiheng Ma;Rongrong Ji;Yaowei Wang;Zhou Su;Xiaopeng Hong;Deyu Meng","doi":"10.1109/TPAMI.2025.3532512","DOIUrl":null,"url":null,"abstract":"This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled. We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value. On this basis, we propose a semi-supervised crowd counting model. First, we design a pixel-wise distribution matching loss to measure the differences in the pixel-wise density distributions between the prediction and the ground-truth; Second, we enhance the transformer decoder by using <underline>density tokens</u> to specialize the forwards of decoders w.r.t. different density intervals; Third, we design the <underline>interleaving consistency</u> self-supervised learning mechanism to learn from unlabeled data efficiently. Extensive experiments on four datasets are performed to show that our method clearly outperforms the competitors by a large margin under various labeled ratio settings.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 5","pages":"3625-3638"},"PeriodicalIF":18.6000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Semi-Supervised Counting via Pixel-by-Pixel Density Distribution Modeling\",\"authors\":\"Hui Lin;Zhiheng Ma;Rongrong Ji;Yaowei Wang;Zhou Su;Xiaopeng Hong;Deyu Meng\",\"doi\":\"10.1109/TPAMI.2025.3532512\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled. We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value. On this basis, we propose a semi-supervised crowd counting model. First, we design a pixel-wise distribution matching loss to measure the differences in the pixel-wise density distributions between the prediction and the ground-truth; Second, we enhance the transformer decoder by using <underline>density tokens</u> to specialize the forwards of decoders w.r.t. different density intervals; Third, we design the <underline>interleaving consistency</u> self-supervised learning mechanism to learn from unlabeled data efficiently. Extensive experiments on four datasets are performed to show that our method clearly outperforms the competitors by a large margin under various labeled ratio settings.\",\"PeriodicalId\":94034,\"journal\":{\"name\":\"IEEE transactions on pattern analysis and machine intelligence\",\"volume\":\"47 5\",\"pages\":\"3625-3638\"},\"PeriodicalIF\":18.6000,\"publicationDate\":\"2025-01-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on pattern analysis and machine intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848320/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10848320/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文关注的是半监督人群计数，其中只有一小部分训练数据被标记。我们将逐像素的密度值表述为一个概率分布，而不是一个单一的确定性值。在此基础上，提出了一个半监督的人群计数模型。首先，我们设计了一个逐像素分布匹配损失来衡量预测与真实之间逐像素密度分布的差异；其次，利用密度令牌对不同密度区间的译码器的转发进行专门化，增强了变压器译码器；第三，我们设计了交错一致性自监督学习机制，有效地从未标记数据中学习。在四个数据集上进行的大量实验表明，在各种标记比率设置下，我们的方法明显优于竞争对手。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Semi-Supervised Counting via Pixel-by-Pixel Density Distribution Modeling

This paper focuses on semi-supervised crowd counting, where only a small portion of the training data are labeled. We formulate the pixel-wise density value to regress as a probability distribution, instead of a single deterministic value. On this basis, we propose a semi-supervised crowd counting model. First, we design a pixel-wise distribution matching loss to measure the differences in the pixel-wise density distributions between the prediction and the ground-truth; Second, we enhance the transformer decoder by using density tokens to specialize the forwards of decoders w.r.t. different density intervals; Third, we design the interleaving consistency self-supervised learning mechanism to learn from unlabeled data efficiently. Extensive experiments on four datasets are performed to show that our method clearly outperforms the competitors by a large margin under various labeled ratio settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量