Real-time dual-eye collaborative eyeblink detection with contrastive learning

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Pub Date : 2025-06-01 Epub Date: 2025-02-13 DOI:10.1016/j.patcog.2025.111440
Hanli Zhao , Yu Wang , Wanglong Lu , Zili Yi , Jun Liu , Minglun Gong
{"title":"Real-time dual-eye collaborative eyeblink detection with contrastive learning","authors":"Hanli Zhao ,&nbsp;Yu Wang ,&nbsp;Wanglong Lu ,&nbsp;Zili Yi ,&nbsp;Jun Liu ,&nbsp;Minglun Gong","doi":"10.1016/j.patcog.2025.111440","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time detection of eyeblinks in uncontrolled settings is crucial for applications such as driver fatigue monitoring, face spoofing prevention, and emotion analysis. This task, however, is significantly challenged by variations in facial poses, motion blur, and inconsistent lighting conditions, which frequently lead traditional facial landmark analysis tools to perform poorly, especially in low-light and dynamic environments. often lead to imprecise localization of key regions of interest, undermining the effectiveness of subsequent blink detection. To address these issues, we have developed a novel real-time dual-eye collaborative eyeblink detection method that incorporates contrastive learning. Our approach includes a consistent eye feature embedding technique that minimizes the impact of adverse lighting and extraneous noise during feature extraction. Through contrastive learning, we align feature embeddings of coarsely captured, low-light eye patches with those from finely detailed, well-lit patches. Furthermore, to enhance eyeblink detection and reduce false identifications of eye regions, we exploit the natural synchrony in blink patterns between the left and right eyes. We introduce a dual-eye collaborative spatio-temporal attention mechanism that captures both the inter-eye correlations and the temporal dynamics across sequences. Our collaborative learning approach maximizes the inherent synchrony and cooperation between the two eyes, significantly improving detection accuracy. Extensive experiments on three datasets and their low-light variants demonstrate that our method operates in real-time, adjusts effectively to varying lighting conditions, and performs robustly in untrimmed video scenarios.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"162 ","pages":"Article 111440"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325001001","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/13 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Real-time detection of eyeblinks in uncontrolled settings is crucial for applications such as driver fatigue monitoring, face spoofing prevention, and emotion analysis. This task, however, is significantly challenged by variations in facial poses, motion blur, and inconsistent lighting conditions, which frequently lead traditional facial landmark analysis tools to perform poorly, especially in low-light and dynamic environments. often lead to imprecise localization of key regions of interest, undermining the effectiveness of subsequent blink detection. To address these issues, we have developed a novel real-time dual-eye collaborative eyeblink detection method that incorporates contrastive learning. Our approach includes a consistent eye feature embedding technique that minimizes the impact of adverse lighting and extraneous noise during feature extraction. Through contrastive learning, we align feature embeddings of coarsely captured, low-light eye patches with those from finely detailed, well-lit patches. Furthermore, to enhance eyeblink detection and reduce false identifications of eye regions, we exploit the natural synchrony in blink patterns between the left and right eyes. We introduce a dual-eye collaborative spatio-temporal attention mechanism that captures both the inter-eye correlations and the temporal dynamics across sequences. Our collaborative learning approach maximizes the inherent synchrony and cooperation between the two eyes, significantly improving detection accuracy. Extensive experiments on three datasets and their low-light variants demonstrate that our method operates in real-time, adjusts effectively to varying lighting conditions, and performs robustly in untrimmed video scenarios.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于对比学习的实时双眼协同眨眼检测
在不受控制的环境中实时检测眨眼对于驾驶员疲劳监测、面部欺骗预防和情绪分析等应用至关重要。然而,这项任务受到面部姿势变化,运动模糊和不一致的照明条件的显著挑战,这些条件经常导致传统的面部地标分析工具表现不佳,特别是在低光和动态环境中。通常会导致关键感兴趣区域的定位不精确,从而破坏后续眨眼检测的有效性。为了解决这些问题,我们开发了一种新的实时双眼协同眨眼检测方法,该方法结合了对比学习。我们的方法包括一种一致的眼睛特征嵌入技术,该技术可以最大限度地减少特征提取过程中不利光照和外来噪声的影响。通过对比学习,我们将粗糙捕获的低光照眼斑的特征嵌入与精细、光照良好的眼斑的特征嵌入对齐。此外,为了增强眨眼检测和减少眼睛区域的错误识别,我们利用了左眼和右眼之间眨眼模式的自然同步。我们引入了一个双眼协同的时空注意机制,该机制既能捕捉到眼间的相关性,也能捕捉到序列间的时间动态。我们的协同学习方法最大限度地发挥了两只眼睛之间固有的同步性和合作性,显著提高了检测精度。在三个数据集及其低光照变量上进行的大量实验表明,我们的方法可以实时运行,有效地适应不同的光照条件,并在未修剪的视频场景中表现出色。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
期刊最新文献
Multiple similarity and multiple kernel fusion based on graph inference network for circRNA-disease association prediction FMaMIL: Synergistic spatial-frequency Mamba multi-instance learning for weakly supervised pathology lesion segmentation Flexible multi-view feature selection with semi-supervised label semantic alignment Unsupervised feature selection based on dual-graph clustering learning and adaptive weighting Model-based clustering of music pieces
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1