{"title":"Beyond Text and Back Again","authors":"Desmond Elliott","doi":"10.1145/3442442.3451896","DOIUrl":null,"url":null,"abstract":"A talk with two parts covering three modalities. In the first part, I will talk about NLP Beyond Text, where we integrate visual context into a speech recognition model and find that the recovery of different types of masked speech inputs is improved by fine-grained visual grounding against detected objects [2]. In the second part, I will come Back Again, and talk about the benefits of textual supervision in cross-modal speech–vision retrieval models [1].","PeriodicalId":129420,"journal":{"name":"Companion Proceedings of the Web Conference 2021","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion Proceedings of the Web Conference 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3442442.3451896","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 0
Abstract
A talk with two parts covering three modalities. In the first part, I will talk about NLP Beyond Text, where we integrate visual context into a speech recognition model and find that the recovery of different types of masked speech inputs is improved by fine-grained visual grounding against detected objects [2]. In the second part, I will come Back Again, and talk about the benefits of textual supervision in cross-modal speech–vision retrieval models [1].
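The first part describes recovering masked speech inputs with the help of fine-grained visual grounding against detected objects. The sketch below is a minimal illustration of that general idea, not the authors' implementation from [2]: masked speech frames attend over detected-object features via cross-attention, and a reconstruction loss is applied only at the masked positions. All module names, dimensions, and the fusion strategy are illustrative assumptions.

```python
# Minimal sketch (assumed architecture, not the talk's actual model):
# masked speech frames are reconstructed with cross-attention over
# detected-object features.
import torch
import torch.nn as nn


class VisuallyGroundedSpeechEncoder(nn.Module):
    def __init__(self, speech_dim=80, object_dim=2048, hidden_dim=256, num_heads=4):
        super().__init__()
        self.speech_proj = nn.Linear(speech_dim, hidden_dim)
        self.object_proj = nn.Linear(object_dim, hidden_dim)
        # Self-attention over the (partially masked) speech sequence.
        self.speech_encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(hidden_dim, num_heads, batch_first=True),
            num_layers=2,
        )
        # Cross-attention: speech frames query the detected objects.
        self.cross_attn = nn.MultiheadAttention(hidden_dim, num_heads, batch_first=True)
        self.reconstruct = nn.Linear(hidden_dim, speech_dim)

    def forward(self, speech_feats, mask, object_feats):
        # speech_feats: (batch, frames, speech_dim), e.g. log-mel features
        # mask:         (batch, frames) boolean, True where frames are masked
        # object_feats: (batch, objects, object_dim), e.g. detector region features
        x = self.speech_proj(speech_feats)
        x = x.masked_fill(mask.unsqueeze(-1), 0.0)  # zero out masked frames
        x = self.speech_encoder(x)
        visual = self.object_proj(object_feats)
        grounded, _ = self.cross_attn(query=x, key=visual, value=visual)
        return self.reconstruct(x + grounded)       # predict the original frames


# Toy usage: the reconstruction loss is computed only on masked positions.
model = VisuallyGroundedSpeechEncoder()
speech = torch.randn(2, 100, 80)
mask = torch.rand(2, 100) < 0.15
objects = torch.randn(2, 10, 2048)
pred = model(speech, mask, objects)
loss = ((pred - speech) ** 2)[mask].mean()
```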