研究自动提取的多模态特征与讲座视频质量的相关性

Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information Pub Date : 2019-10-15 DOI:10.1145/3347451.3356731

Jianwei Shi, Christian Otto, Anett Hoppe, Peter Holtz, R. Ewerth

{"title":"研究自动提取的多模态特征与讲座视频质量的相关性","authors":"Jianwei Shi, Christian Otto, Anett Hoppe, Peter Holtz, R. Ewerth","doi":"10.1145/3347451.3356731","DOIUrl":null,"url":null,"abstract":"Ranking and recommendation of multimedia content such as videos is usually realized with respect to the relevance to a user query. However, for lecture videos and MOOCs (Massive Open Online Courses) it is not only required to retrieve relevant videos, but particularly to find lecture videos of high quality that facilitate learning, for instance, independent of the video's or speaker's popularity. Thus, metadata about a lecture video's quality are crucial features for learning contexts, e.g., lecture video recommendation in search as learning scenarios. In this paper, we investigate whether automatically extracted features are correlated to quality aspects of a video. A set of scholarly videos from a Mass Open Online Course (MOOC) is analyzed regarding audio, linguistic, and visual features. Furthermore, a set of cross-modal features is proposed which are derived by combining transcripts, audio, video, and slide content. A user study is conducted to investigate the correlations between the automatically collected features and human ratings of quality aspects of a lecture video. Finally, the impact of our features on the knowledge gain of the participants is discussed.","PeriodicalId":347114,"journal":{"name":"Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":"{\"title\":\"Investigating Correlations of Automatically Extracted Multimodal Features and Lecture Video Quality\",\"authors\":\"Jianwei Shi, Christian Otto, Anett Hoppe, Peter Holtz, R. Ewerth\",\"doi\":\"10.1145/3347451.3356731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ranking and recommendation of multimedia content such as videos is usually realized with respect to the relevance to a user query. However, for lecture videos and MOOCs (Massive Open Online Courses) it is not only required to retrieve relevant videos, but particularly to find lecture videos of high quality that facilitate learning, for instance, independent of the video's or speaker's popularity. Thus, metadata about a lecture video's quality are crucial features for learning contexts, e.g., lecture video recommendation in search as learning scenarios. In this paper, we investigate whether automatically extracted features are correlated to quality aspects of a video. A set of scholarly videos from a Mass Open Online Course (MOOC) is analyzed regarding audio, linguistic, and visual features. Furthermore, a set of cross-modal features is proposed which are derived by combining transcripts, audio, video, and slide content. A user study is conducted to investigate the correlations between the automatically collected features and human ratings of quality aspects of a lecture video. Finally, the impact of our features on the knowledge gain of the participants is discussed.\",\"PeriodicalId\":347114,\"journal\":{\"name\":\"Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"13\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3347451.3356731\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3347451.3356731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 13

摘要

诸如视频之类的多媒体内容的排序和推荐通常是根据与用户查询的相关性来实现的。然而，对于讲座视频和mooc(大规模在线开放课程)来说，不仅需要检索相关的视频，而且需要找到高质量的、有利于学习的讲座视频，例如，与视频或演讲者的知名度无关。因此，关于讲座视频质量的元数据是学习情境的关键特征，例如，作为学习场景的搜索中的讲座视频推荐。在本文中，我们研究了自动提取的特征是否与视频的质量相关。本文分析了一组来自大规模在线开放课程(MOOC)的学术视频的音频、语言和视觉特征。此外，本文还提出了一套由文本、音频、视频和幻灯片内容组合而成的跨模态特征。进行用户研究，以调查自动收集的特征和讲座视频质量方面的人类评级之间的相关性。最后，讨论了我们的特征对参与者知识获取的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Investigating Correlations of Automatically Extracted Multimodal Features and Lecture Video Quality

Ranking and recommendation of multimedia content such as videos is usually realized with respect to the relevance to a user query. However, for lecture videos and MOOCs (Massive Open Online Courses) it is not only required to retrieve relevant videos, but particularly to find lecture videos of high quality that facilitate learning, for instance, independent of the video's or speaker's popularity. Thus, metadata about a lecture video's quality are crucial features for learning contexts, e.g., lecture video recommendation in search as learning scenarios. In this paper, we investigate whether automatically extracted features are correlated to quality aspects of a video. A set of scholarly videos from a Mass Open Online Course (MOOC) is analyzed regarding audio, linguistic, and visual features. Furthermore, a set of cross-modal features is proposed which are derived by combining transcripts, audio, video, and slide content. A user study is conducted to investigate the correlations between the automatically collected features and human ratings of quality aspects of a lecture video. Finally, the impact of our features on the knowledge gain of the participants is discussed.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information

自引率

0.00%

发文量