对齐视觉和词汇语义

Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online) Pub Date : 2022-12-13 DOI:10.48550/arXiv.2212.06629

Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

{"title":"对齐视觉和词汇语义","authors":"Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao","doi":"10.48550/arXiv.2212.06629","DOIUrl":null,"url":null,"abstract":"We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.","PeriodicalId":93543,"journal":{"name":"Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)","volume":"172 1","pages":"294-302"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Aligning Visual and Lexical Semantics\",\"authors\":\"Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao\",\"doi\":\"10.48550/arXiv.2212.06629\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.\",\"PeriodicalId\":93543,\"journal\":{\"name\":\"Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)\",\"volume\":\"172 1\",\"pages\":\"294-302\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2212.06629\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.06629","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

我们讨论了与计算机视觉(CV)系统相关的两种语义——视觉语义和词汇语义。视觉语义学关注的是人类在使用视觉感知目标现实时如何构建概念，而词汇语义学关注的是人类如何通过使用语言构建相同目标现实的概念。视觉语义和词汇语义之间缺乏一致性，反过来又以语义缺口问题(SGP)的形式对CV系统产生重大影响。本文虽然广泛地举例说明了上述巧合的缺乏，但引入了一种通用的、领域不可知论的方法来强制视觉语义和词汇语义之间的对齐。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Aligning Visual and Lexical Semantics

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Diversity, divergence, dialogue : 16th international conference, iConference 2021, Beijing, China, March 17-31, 2021 : proceedings. iConference (Conference) (16th : 2021 : Online)

自引率

0.00%

发文量