Pub Date : 2022-04-05DOI: 10.2200/s01169ed1v01y202202cov020
G. Csurka, Timothy M. Hospedales, M. Salzmann, T. Tommasi
{"title":"Visual Domain Adaptation in the Deep Learning Era","authors":"G. Csurka, Timothy M. Hospedales, M. Salzmann, T. Tommasi","doi":"10.2200/s01169ed1v01y202202cov020","DOIUrl":"https://doi.org/10.2200/s01169ed1v01y202202cov020","url":null,"abstract":"","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121415489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-10-27DOI: 10.2200/s01127ed1v01y202109cov019
Michael Teutsch, A. Sappa, R. Hammoud
{"title":"Computer Vision in the Infrared Spectrum: Challenges and Approaches","authors":"Michael Teutsch, A. Sappa, R. Hammoud","doi":"10.2200/s01127ed1v01y202109cov019","DOIUrl":"https://doi.org/10.2200/s01127ed1v01y202109cov019","url":null,"abstract":"","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124630202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2021-09-30DOI: 10.2200/s01122ed1v01y202108cov018
R. Panda, A. Roy-Chowdhury
Abstract Person re-identification is the problem of associating observations of targets in different non-overlapping cameras. Most of the existing learning-based methods have resulted in improved p...
{"title":"Person Re-identification with Limited Supervision","authors":"R. Panda, A. Roy-Chowdhury","doi":"10.2200/s01122ed1v01y202108cov018","DOIUrl":"https://doi.org/10.2200/s01122ed1v01y202108cov018","url":null,"abstract":"Abstract Person re-identification is the problem of associating observations of targets in different non-overlapping cameras. Most of the existing learning-based methods have resulted in improved p...","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"200 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121890587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2020-07-27DOI: 10.2200/s01032ed1v01y202007cov017
Jun Wan, G. Guo, Sergio Escalera, H. Escalante, S. Li
Abstract For the last ten years, face biometric research has been intensively studied by the computer vision community. Face recognition systems have been used in mobile, banking, and surveillance ...
近十年来,人脸生物识别研究受到计算机视觉界的广泛关注。人脸识别系统已经应用于手机、银行和监控领域。
{"title":"Multi-Modal Face Presentation Attack Detection","authors":"Jun Wan, G. Guo, Sergio Escalera, H. Escalante, S. Li","doi":"10.2200/s01032ed1v01y202007cov017","DOIUrl":"https://doi.org/10.2200/s01032ed1v01y202007cov017","url":null,"abstract":"Abstract For the last ten years, face biometric research has been intensively studied by the computer vision community. Face recognition systems have been used in mobile, banking, and surveillance ...","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115062016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-09-13DOI: 10.2200/S00819ED1V01Y201712COV014
Kristin J. Dana
Abstract Visual pattern analysis is a fundamental tool in mining data for knowledge. Computational representations for patterns and texture allow us to summarize, store, compare, and label in order...
{"title":"Computational Texture and Patterns: From Textons to Deep Learning","authors":"Kristin J. Dana","doi":"10.2200/S00819ED1V01Y201712COV014","DOIUrl":"https://doi.org/10.2200/S00819ED1V01Y201712COV014","url":null,"abstract":"Abstract Visual pattern analysis is a fundamental tool in mining data for knowledge. Computational representations for patterns and texture allow us to summarize, store, compare, and label in order...","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116573808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-29DOI: 10.2200/S00851ED1V01Y201804COV016
M. Felsberg
Under the title "Probabilistic and Biologically Inspired Feature Representations," this text collects a substantial amount of work on the topic of channel representations. Channel representations a ...
在“概率和生物学启发的特征表示”的标题下,本文收集了大量关于通道表示主题的工作。通道表示a…
{"title":"Probabilistic and Biologically Inspired Feature Representations","authors":"M. Felsberg","doi":"10.2200/S00851ED1V01Y201804COV016","DOIUrl":"https://doi.org/10.2200/S00851ED1V01Y201804COV016","url":null,"abstract":"Under the title \"Probabilistic and Biologically Inspired Feature Representations,\" this text collects a substantial amount of work on the topic of channel representations. Channel representations a ...","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123421643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2016-04-21DOI: 10.2200/s00705ed1v01y201602cov007
Kobus Barnard
Abstract "This is clearly the most comprehensive and thoughtful compendium of knowledge on language/vision integration out there, and I'm sure it will be a valuable resources to many researchers and instructors." - Sven Dickinson, Series Editor (University of Toronto) Modeling data from visual and linguistic modalities together creates opportunities for better understanding of both, and supports many useful applications. Examples of dual visual-linguistic data includes images with keywords, video with narrative, and figures in documents. We consider two key task-driven themes: translating from one modality to another (e.g., inferring annotations for images) and understanding the data using all modalities, where one modality can help disambiguate information in another. The multiple modalities can either be essentially semantically redundant (e.g., keywords provided by a person looking at the image), or largely complementary (e.g., meta data such as the camera used). Redundancy and complementarity are two ...
“这显然是关于语言/视觉整合的最全面和最深思熟虑的知识纲要,我相信它将成为许多研究人员和教师的宝贵资源。”- Sven Dickinson,系列编辑(多伦多大学)来自视觉和语言模式的建模数据一起为更好地理解两者创造了机会,并支持许多有用的应用。双重视觉语言数据的例子包括带有关键词的图像、带有叙述的视频和文档中的数字。我们考虑了两个关键的任务驱动主题:从一种模态转换到另一种模态(例如,推断图像的注释)和使用所有模态理解数据,其中一种模态可以帮助消除另一种模态中的信息歧义。多模态可以本质上是语义冗余的(例如,由查看图像的人提供的关键字),或者很大程度上是互补的(例如,元数据,如使用的相机)。冗余和互补性是两个…
{"title":"Computational Methods for Integrating Vision and Language","authors":"Kobus Barnard","doi":"10.2200/s00705ed1v01y201602cov007","DOIUrl":"https://doi.org/10.2200/s00705ed1v01y201602cov007","url":null,"abstract":"Abstract \"This is clearly the most comprehensive and thoughtful compendium of knowledge on language/vision integration out there, and I'm sure it will be a valuable resources to many researchers and instructors.\" - Sven Dickinson, Series Editor (University of Toronto) Modeling data from visual and linguistic modalities together creates opportunities for better understanding of both, and supports many useful applications. Examples of dual visual-linguistic data includes images with keywords, video with narrative, and figures in documents. We consider two key task-driven themes: translating from one modality to another (e.g., inferring annotations for images) and understanding the data using all modalities, where one modality can help disambiguate information in another. The multiple modalities can either be essentially semantically redundant (e.g., keywords provided by a person looking at the image), or largely complementary (e.g., meta data such as the camera used). Redundancy and complementarity are two ...","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115273253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1900-01-01DOI: 10.1007/978-3-031-32906-7
Jun Wan, G. Guo, Sergio Escalera, H. Escalante, S. Li
{"title":"Advances in Face Presentation Attack Detection","authors":"Jun Wan, G. Guo, Sergio Escalera, H. Escalante, S. Li","doi":"10.1007/978-3-031-32906-7","DOIUrl":"https://doi.org/10.1007/978-3-031-32906-7","url":null,"abstract":"","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126774152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1900-01-01DOI: 10.1007/978-3-031-14595-7
Lei Huang
{"title":"Normalization Techniques in Deep Learning","authors":"Lei Huang","doi":"10.1007/978-3-031-14595-7","DOIUrl":"https://doi.org/10.1007/978-3-031-14595-7","url":null,"abstract":"","PeriodicalId":377202,"journal":{"name":"Synthesis Lectures on Computer Vision","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125694370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}