首页 > 最新文献

Universal Access in the Information Society最新文献

英文 中文
Accessibility first: detecting frustration in web browsing for visually impaired and sighted smartphone users 无障碍第一:为视障和视力正常的智能手机用户检测网页浏览的挫败感
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-24 DOI: 10.1007/s10209-023-01053-3
Yasmin Abdelaal, Dena Al-Thani
Abstract Smartphone adoption has become increasingly prevalent in modern society, reflecting the widespread integration of these devices into various aspects of daily life, yet accessibility and usability problems persist. Smartphones pose numerous challenges to individuals with visual impairments (VI). This is due to smaller screen sizes, lack of physical buttons, and prevalent problems. These challenges frustrate users as they are unable to accomplish their goals. In this paper, the physiological response of VI and sighted individuals are compared and examined. The participant's frustration is measured through physiological signals. The GSR is the signal captured to detect the user's frustration caused by the superimposed accessibility and usability web-related issues. GSR, being less obtrusive, is an effective way to measure frustration. There were 13 VI and 16 sighted participants. Using a within-subject and between-subject design, participants completed four tasks, each was completed under frustrating and non-frustrating conditions while wearing an Empatica E4 wristband to collect GSR data. Challenges were both group-specific and mutual. A slowed internet connection and constant page refreshes are frustrating tasks for sighted participants. VI participants faced an unsearchable drop-down list and an inaccessible menu. Pop-up ads and session timeout are common tasks. Among VI participants, there was no significant difference between frustrating and non-frustrating tasks. When VI participants experienced a session timeout, they significantly showed a higher level of arousal. When comparing the physiological responses of sighted and VI participants in the page refresh and session timeout tasks, the results show that the arousal level is significantly higher in VI participants, primarily due to accessibility issues.
智能手机的采用在现代社会越来越普遍,反映了这些设备广泛融入日常生活的各个方面,但可访问性和可用性问题仍然存在。智能手机给视力受损的人带来了许多挑战。这是由于屏幕尺寸较小,缺乏物理按钮以及普遍存在的问题。这些挑战会让用户感到沮丧,因为他们无法完成自己的目标。本文比较和考察了视残者和正常人的生理反应。参与者的挫败感是通过生理信号来衡量的。GSR是捕获的信号,用于检测由叠加的可访问性和可用性网络相关问题引起的用户挫折感。GSR不那么突兀,是衡量挫败感的有效方法。有13名视力障碍者和16名视力障碍者。使用受试者内部和受试者之间的设计,参与者完成了四个任务,每个任务在令人沮丧和非令人沮丧的条件下完成,同时佩戴Empatica E4腕带收集GSR数据。挑战既有群体的,也有相互的。对于视力正常的参与者来说,缓慢的互联网连接和不断的页面刷新是令人沮丧的任务。VI参与者面对一个无法搜索的下拉列表和一个无法访问的菜单。弹出广告和会话超时是常见的任务。在VI参与者中,令人沮丧和非令人沮丧的任务之间没有显著差异。当VI参与者经历会话暂停时,他们显着表现出更高水平的唤醒。在页面刷新和会话超时任务的生理反应对比中,结果表明,VI参与者的唤醒水平明显更高,这主要是由于可访问性问题。
{"title":"Accessibility first: detecting frustration in web browsing for visually impaired and sighted smartphone users","authors":"Yasmin Abdelaal, Dena Al-Thani","doi":"10.1007/s10209-023-01053-3","DOIUrl":"https://doi.org/10.1007/s10209-023-01053-3","url":null,"abstract":"Abstract Smartphone adoption has become increasingly prevalent in modern society, reflecting the widespread integration of these devices into various aspects of daily life, yet accessibility and usability problems persist. Smartphones pose numerous challenges to individuals with visual impairments (VI). This is due to smaller screen sizes, lack of physical buttons, and prevalent problems. These challenges frustrate users as they are unable to accomplish their goals. In this paper, the physiological response of VI and sighted individuals are compared and examined. The participant's frustration is measured through physiological signals. The GSR is the signal captured to detect the user's frustration caused by the superimposed accessibility and usability web-related issues. GSR, being less obtrusive, is an effective way to measure frustration. There were 13 VI and 16 sighted participants. Using a within-subject and between-subject design, participants completed four tasks, each was completed under frustrating and non-frustrating conditions while wearing an Empatica E4 wristband to collect GSR data. Challenges were both group-specific and mutual. A slowed internet connection and constant page refreshes are frustrating tasks for sighted participants. VI participants faced an unsearchable drop-down list and an inaccessible menu. Pop-up ads and session timeout are common tasks. Among VI participants, there was no significant difference between frustrating and non-frustrating tasks. When VI participants experienced a session timeout, they significantly showed a higher level of arousal. When comparing the physiological responses of sighted and VI participants in the page refresh and session timeout tasks, the results show that the arousal level is significantly higher in VI participants, primarily due to accessibility issues.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135266484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An audio MOOC framework for the digital inclusion of low literate people in the distance education process 一个音频MOOC框架,用于在远程教育过程中对低文化人群进行数字包容
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-24 DOI: 10.1007/s10209-023-01051-5
Raj Kishen Moloo, Kavi Kumar Khedo, Tadinada Venkata Prabhakar
{"title":"An audio MOOC framework for the digital inclusion of low literate people in the distance education process","authors":"Raj Kishen Moloo, Kavi Kumar Khedo, Tadinada Venkata Prabhakar","doi":"10.1007/s10209-023-01051-5","DOIUrl":"https://doi.org/10.1007/s10209-023-01051-5","url":null,"abstract":"","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135268370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Needs analysis for the design of a digital platform to train professionals in online family intervention through live supervision of real cases 需求分析,设计数字平台,通过对真实案例的现场监督,培训在线家庭干预专业人员
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-20 DOI: 10.1007/s10209-023-01038-2
Sonia Torras, Anna Vilaregut, Xavier Canaleta, Eduard Martí
Abstract Purpose Mental health professionals undergo continuous training throughout their careers. Their training consists in part of the supervision of cases by an entire healthcare team, a practice that allows them to consolidate their understanding of behaviour, emotions and to enhance their relationships with patients and their families. The COVID-19 pandemic has had a great impact on this training methodology, leading to a significant increase in the use of digital platforms, but such digital tools are not well adapted to this context, especially when it comes to the supervision of real online cases. The goals of this study are: (1) to analyse what professionals need in order to carry out online interventions and training through the live supervision of real online cases and (2) to create a prototype of a specific digital platform intended to help meet the detected needs. Methods 28 semi-structured interviews were conducted with supervisors ( N = 14) and professionals in training ( N = 14). Results The results have allowed us to gain a deeper understanding of the difficulties and benefits that professionals are encountering when doing online live supervision using the existing video conference platforms. Conclusion This analysis points to a need to create a platform that can overcome the difficulties and enhance the benefits of the digitalization of family intervention training through the live supervision of real cases. These specific needs have yet to be addressed by existing digital platforms.
目的心理健康专业人员在其职业生涯中接受持续的培训。他们的培训包括整个医疗团队对病例的部分监督,这种做法使他们能够巩固对行为和情绪的理解,并加强与患者及其家属的关系。COVID-19大流行对这种培训方法产生了很大影响,导致数字平台的使用大幅增加,但这些数字工具并不适应这种情况,特别是在对真实在线病例的监管方面。本研究的目标是:(1)通过对真实在线案例的现场监督,分析专业人员开展在线干预和培训需要什么;(2)创建一个特定数字平台的原型,旨在帮助满足检测到的需求。方法采用半结构式访谈28例,分别对管理人员(N = 14)和在职专业人员(N = 14)进行访谈。结果研究结果使我们对专业人员使用现有视频会议平台进行在线直播监管时遇到的困难和好处有了更深入的了解。本分析指出,需要通过对真实案例的现场监督,创建一个平台,克服家庭干预培训数字化的困难,提升其效益。现有的数字平台尚未满足这些具体需求。
{"title":"Needs analysis for the design of a digital platform to train professionals in online family intervention through live supervision of real cases","authors":"Sonia Torras, Anna Vilaregut, Xavier Canaleta, Eduard Martí","doi":"10.1007/s10209-023-01038-2","DOIUrl":"https://doi.org/10.1007/s10209-023-01038-2","url":null,"abstract":"Abstract Purpose Mental health professionals undergo continuous training throughout their careers. Their training consists in part of the supervision of cases by an entire healthcare team, a practice that allows them to consolidate their understanding of behaviour, emotions and to enhance their relationships with patients and their families. The COVID-19 pandemic has had a great impact on this training methodology, leading to a significant increase in the use of digital platforms, but such digital tools are not well adapted to this context, especially when it comes to the supervision of real online cases. The goals of this study are: (1) to analyse what professionals need in order to carry out online interventions and training through the live supervision of real online cases and (2) to create a prototype of a specific digital platform intended to help meet the detected needs. Methods 28 semi-structured interviews were conducted with supervisors ( N = 14) and professionals in training ( N = 14). Results The results have allowed us to gain a deeper understanding of the difficulties and benefits that professionals are encountering when doing online live supervision using the existing video conference platforms. Conclusion This analysis points to a need to create a platform that can overcome the difficulties and enhance the benefits of the digitalization of family intervention training through the live supervision of real cases. These specific needs have yet to be addressed by existing digital platforms.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135616647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic captions on video calls: a must for the older adults 视频通话的自动字幕:老年人的必备功能
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-13 DOI: 10.1007/s10209-023-01048-0
Eduardo Nacimiento-García, Carina S. González-González, Francisco L. Gutiérrez-Vela
Abstract In recent years, the use of video call or video conference tools has not stopped increasing, and especially due to the COVID-19 pandemic, the use of video calls increased in the educational and work spheres, but also in the family sphere, due to the risks of contagion in face-to-face meetings. Throughout the world, many older people are affected by hearing loss. Auditory functional diversity can make it difficult to enjoy video calls. Using automatic captions might help these people, but not all video calling tools offer this functionality, and some offer it in some languages. We developed an automatic conversation captioning tool using Automatic Speech Recognition and Speech to Text, using the free software tool Coqui STT. This automatic captioning tool is independent of the video call platform used and allows older adults or anyone with auditory functional diversity to enjoy video calls in a simple way. A transparent user interface was designed for our tool that overlays the video call window, and the tool allows us to easily change the text size, color, and background settings. It is also important to remember that many older people have visual functional diversity, so they could have problems reading the texts, thus it is important that each person can adapt the text to their needs. An analysis has been carried out that includes older people to analyze the benefits of the interface, as well as some configuration preferences, and a proposal to improve the way the text is displayed on the screen. Spanish and English were tested during the investigation, but the tool allows us to easily install dozens of new languages based on models trained for Coqui STT.
近年来,视频通话或视频会议工具的使用一直在不断增加,特别是由于新冠肺炎疫情,由于面对面会议的传染风险,视频通话在教育和工作领域的使用有所增加,在家庭领域也有所增加。在世界各地,许多老年人都受到听力损失的影响。听觉功能的多样性会使人们难以享受视频通话。使用自动字幕可能会对这些人有所帮助,但并不是所有的视频通话工具都提供这种功能,有些工具只提供某些语言的功能。我们利用免费的Coqui STT软件工具,利用自动语音识别和语音转文本开发了一个自动对话字幕工具。这种自动字幕工具独立于所使用的视频通话平台,允许老年人或任何有听觉功能多样性的人以简单的方式享受视频通话。我们为我们的工具设计了一个透明的用户界面,覆盖视频通话窗口,该工具允许我们轻松更改文本大小,颜色和背景设置。同样重要的是要记住,许多老年人具有视觉功能多样性,因此他们在阅读文本时可能有问题,因此每个人都能根据自己的需要调整文本是很重要的。已经进行了一项分析,包括老年人来分析界面的好处,以及一些配置偏好,并提出了改进文本在屏幕上显示方式的建议。在调查期间测试了西班牙语和英语,但该工具允许我们轻松地安装基于Coqui STT训练的模型的数十种新语言。
{"title":"Automatic captions on video calls: a must for the older adults","authors":"Eduardo Nacimiento-García, Carina S. González-González, Francisco L. Gutiérrez-Vela","doi":"10.1007/s10209-023-01048-0","DOIUrl":"https://doi.org/10.1007/s10209-023-01048-0","url":null,"abstract":"Abstract In recent years, the use of video call or video conference tools has not stopped increasing, and especially due to the COVID-19 pandemic, the use of video calls increased in the educational and work spheres, but also in the family sphere, due to the risks of contagion in face-to-face meetings. Throughout the world, many older people are affected by hearing loss. Auditory functional diversity can make it difficult to enjoy video calls. Using automatic captions might help these people, but not all video calling tools offer this functionality, and some offer it in some languages. We developed an automatic conversation captioning tool using Automatic Speech Recognition and Speech to Text, using the free software tool Coqui STT. This automatic captioning tool is independent of the video call platform used and allows older adults or anyone with auditory functional diversity to enjoy video calls in a simple way. A transparent user interface was designed for our tool that overlays the video call window, and the tool allows us to easily change the text size, color, and background settings. It is also important to remember that many older people have visual functional diversity, so they could have problems reading the texts, thus it is important that each person can adapt the text to their needs. An analysis has been carried out that includes older people to analyze the benefits of the interface, as well as some configuration preferences, and a proposal to improve the way the text is displayed on the screen. Spanish and English were tested during the investigation, but the tool allows us to easily install dozens of new languages based on models trained for Coqui STT.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135858516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward digital inclusion of older adults in e-health: a case study on support for physical activity 将老年人数字化纳入电子保健:关于支持身体活动的案例研究
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-12 DOI: 10.1007/s10209-023-01049-z
Åsa Revenäs, Lars Ström, Antonio Cicchetti, Maria Ehn
Abstract Older adults are a heterogeneous population for which many e-health innovations are inaccessible. Involving older adults in user-centered design (UCD) with a specific focus on inclusive design is important to make e-health more accessible to this user group. This case study aimed to explore the feasibility of a new UCD approach aiming to minimize bias in the design phase of a digital support for older adults’ physical activity (PA). The study used mixed methods and applied UCD principles in a four-iteration design phase followed by an evaluation phase where 11 and 15 older adults participated, respectively. The users’ gender, PA level and technology experience (TE) were considered in recruitment, data analysis and prioritization of improvement efforts. In the design phase, users with different gender, PA level and TE participated and contributed with feedback, which was prioritized in the development. The adaptation included improving readability, simplifying layout and features, clarifying structure, and making the digital content more inclusive and relevant. The evaluation showed that the users had a positive experience of the prototype and could use it with some help. The study demonstrated that adopting e-health to assure digital inclusion among older adults must address several aspects. The UCD approach was feasible for amending user bias and for confirming that users of both genders and with varied PA- and TE level shaped the design. However, evaluation of the method with larger samples is needed. Moreover, further research on methods to involve digitally excluded populations in UCD is needed.
老年人是一个异质人群,许多电子卫生创新是无法获得的。让老年人参与以用户为中心的设计(UCD),并特别注重包容性设计,这对于使这一用户群体更容易获得电子保健非常重要。本案例研究旨在探索一种新的UCD方法的可行性,旨在最大限度地减少老年人体育活动(PA)数字支持设计阶段的偏差。该研究使用混合方法,并将UCD原则应用于四次迭代设计阶段,随后是分别有11名和15名老年人参与的评估阶段。在招聘、数据分析和改进工作的优先次序方面考虑了用户的性别、PA水平和技术经验(TE)。在设计阶段,不同性别、PA级别和TE的用户参与并反馈,这在开发中是优先考虑的。调整包括提高可读性,简化布局和功能,澄清结构,使数字内容更具包容性和相关性。评估结果表明,用户对原型有良好的体验,可以在一些帮助下使用它。该研究表明,采用电子保健以确保老年人的数字包容必须解决几个方面的问题。UCD方法对于修正用户偏见和确认不同性别的用户以及不同PA和TE水平的用户塑造设计是可行的。然而,需要用更大的样本对该方法进行评估。此外,还需要进一步研究将数字排斥人群纳入UCD的方法。
{"title":"Toward digital inclusion of older adults in e-health: a case study on support for physical activity","authors":"Åsa Revenäs, Lars Ström, Antonio Cicchetti, Maria Ehn","doi":"10.1007/s10209-023-01049-z","DOIUrl":"https://doi.org/10.1007/s10209-023-01049-z","url":null,"abstract":"Abstract Older adults are a heterogeneous population for which many e-health innovations are inaccessible. Involving older adults in user-centered design (UCD) with a specific focus on inclusive design is important to make e-health more accessible to this user group. This case study aimed to explore the feasibility of a new UCD approach aiming to minimize bias in the design phase of a digital support for older adults’ physical activity (PA). The study used mixed methods and applied UCD principles in a four-iteration design phase followed by an evaluation phase where 11 and 15 older adults participated, respectively. The users’ gender, PA level and technology experience (TE) were considered in recruitment, data analysis and prioritization of improvement efforts. In the design phase, users with different gender, PA level and TE participated and contributed with feedback, which was prioritized in the development. The adaptation included improving readability, simplifying layout and features, clarifying structure, and making the digital content more inclusive and relevant. The evaluation showed that the users had a positive experience of the prototype and could use it with some help. The study demonstrated that adopting e-health to assure digital inclusion among older adults must address several aspects. The UCD approach was feasible for amending user bias and for confirming that users of both genders and with varied PA- and TE level shaped the design. However, evaluation of the method with larger samples is needed. Moreover, further research on methods to involve digitally excluded populations in UCD is needed.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135969172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Omissions and inferential meaning-making in audio description, and implications for automating video content description 音频描述中的省略和推理意义生成,以及对视频内容描述自动化的启示
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-08 DOI: 10.1007/s10209-023-01045-3
Kim Starr, Sabine Braun
Abstract There is broad consensus that audio description (AD) is a modality of intersemiotic translation, but there are different views in relation to how AD can be more precisely conceptualised. While Benecke (Audiodeskription als partielle Translation. Modell und Methode, LIT, Berlin, 2014) characterises AD as ‘partial translation’, Braun (T 28: 302–313, 2016) hypothesises that what audio describers appear to ‘omit’ from their descriptions can normally be inferred by the audience, drawing on narrative cues from dialogue, mise-en-scène, kinesis, music or sound effects. The study reported in this paper tested this hypothesis using a corpus of material created during the H2020 MeMAD project. The MeMAD project aimed to improve access to audiovisual (AV) content through a combination of human and computer-based methods of description. One of the MeMAD workstreams addressed human approaches to describing visually salient cues. This included an analysis of the potential impact of omissions in AD, which is the focus of this paper. Using a corpus of approximately 500 audio described film extracts we identified the visual elements that can be considered essential for the construction of the filmic narrative and then performed a qualitative analysis of the corresponding audio descriptions to determine how these elements are verbally represented and whether any omitted elements could be inferred from other cues that are accessible to visually impaired audiences. We then identified the most likely source of these inferences and the conditions upon which retrieval could be predicated, preparing the ground for future reception studies to test our hypotheses with target audiences. In this paper, we discuss the methodology used to determine where omissions occur in the analysed audio descriptions, consider worked examples from the MeMAD500 film corpus, and outline the findings of our study namely that various strategies are relevant to inferring omitted information, including the use of proximal and distal contextual cues, and reliance on the application of common knowledge and iconic scenarios. To conclude, consideration is given to overcoming significant omissions in human-generated AD, such as using extended AD formats, and mitigating similar gaps in machine-generated descriptions, where incorporating dialogue analysis and other supplementary data into the computer model could resolve many omissions.
摘要音频描述是一种符码间翻译的形态,这一观点已经得到了广泛的认同,但对于如何更精确地对音频描述进行概念化却存在不同的观点。而Benecke (Audiodeskription)则是粒子翻译。模型与方法,文学,柏林,2014)将AD描述为“部分翻译”,Braun (T 28: 302-313, 2016)假设音频描述者似乎从他们的描述中“省略”的内容通常可以由观众推断出来,从对话,场景,动作,音乐或声音效果中提取叙事线索。本文报道的研究使用H2020 MeMAD项目期间创建的材料语料库验证了这一假设。MeMAD项目旨在通过结合基于人和计算机的描述方法来改善对视听(AV)内容的访问。其中一个MeMAD工作流解决了人类描述视觉显著线索的方法。这包括对AD中遗漏的潜在影响的分析,这是本文的重点。使用大约500个音频描述的电影摘录的语料库,我们确定了可以被认为对电影叙事构建至关重要的视觉元素,然后对相应的音频描述进行定性分析,以确定这些元素是如何口头表达的,以及是否可以从视障观众可以访问的其他线索中推断出任何遗漏的元素。然后,我们确定了这些推论的最有可能的来源,以及可以预测检索的条件,为未来的接受研究奠定基础,以便在目标受众中测试我们的假设。在本文中,我们讨论了用于确定所分析的音频描述中遗漏位置的方法,考虑了MeMAD500电影语料库中的工作示例,并概述了我们的研究结果,即与推断遗漏信息相关的各种策略,包括使用近端和远端上下文线索,以及依赖于常识和标志性场景的应用。综上所述,考虑克服人类生成的AD中的重大遗漏,例如使用扩展的AD格式,并减轻机器生成描述中的类似差距,其中将对话分析和其他补充数据纳入计算机模型可以解决许多遗漏。
{"title":"Omissions and inferential meaning-making in audio description, and implications for automating video content description","authors":"Kim Starr, Sabine Braun","doi":"10.1007/s10209-023-01045-3","DOIUrl":"https://doi.org/10.1007/s10209-023-01045-3","url":null,"abstract":"Abstract There is broad consensus that audio description (AD) is a modality of intersemiotic translation, but there are different views in relation to how AD can be more precisely conceptualised. While Benecke (Audiodeskription als partielle Translation. Modell und Methode, LIT, Berlin, 2014) characterises AD as ‘partial translation’, Braun (T 28: 302–313, 2016) hypothesises that what audio describers appear to ‘omit’ from their descriptions can normally be inferred by the audience, drawing on narrative cues from dialogue, mise-en-scène, kinesis, music or sound effects. The study reported in this paper tested this hypothesis using a corpus of material created during the H2020 MeMAD project. The MeMAD project aimed to improve access to audiovisual (AV) content through a combination of human and computer-based methods of description. One of the MeMAD workstreams addressed human approaches to describing visually salient cues. This included an analysis of the potential impact of omissions in AD, which is the focus of this paper. Using a corpus of approximately 500 audio described film extracts we identified the visual elements that can be considered essential for the construction of the filmic narrative and then performed a qualitative analysis of the corresponding audio descriptions to determine how these elements are verbally represented and whether any omitted elements could be inferred from other cues that are accessible to visually impaired audiences. We then identified the most likely source of these inferences and the conditions upon which retrieval could be predicated, preparing the ground for future reception studies to test our hypotheses with target audiences. In this paper, we discuss the methodology used to determine where omissions occur in the analysed audio descriptions, consider worked examples from the MeMAD500 film corpus, and outline the findings of our study namely that various strategies are relevant to inferring omitted information, including the use of proximal and distal contextual cues, and reliance on the application of common knowledge and iconic scenarios. To conclude, consideration is given to overcoming significant omissions in human-generated AD, such as using extended AD formats, and mitigating similar gaps in machine-generated descriptions, where incorporating dialogue analysis and other supplementary data into the computer model could resolve many omissions.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135198849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An AI-based English education platform during the COVID-19 pandemic 新冠疫情期间基于人工智能的英语教育平台
4区 计算机科学 Q1 Computer Science Pub Date : 2023-10-06 DOI: 10.1007/s10209-023-01046-2
Hansuk Um, Hisam Kim, Dain Choi, Hyungna Oh
Abstract This study examines whether the use of AI-Pengtalk, an AI-based Conversational English programme, provided by a broadcasting company (EBS) that specializes in public education can significantly improve conversational English skills and bridge the English language proficiency gap associated with parental socioeconomic status. Over the course of four weeks from April 27 to May 22 in 2020, 108 fourth-grade classes in 54 elementary schools voluntarily participated in this experiment. Two classes in each school were designated as a treatment group and a control group. For the treatment group, a tablet installed with a pilot version of AI-Pengtalk was provided and students were encouraged to make use of the programme. Two sets of surveys and English tests were placed pre and post hoc. After 4 weeks, test scores, log files, and survey responses of participants were analysed. A series of DID analyses demonstrate that the use of AI-Pengtalk improves the treatment group’s self-evaluation of their English abilities, confidence in using English, preference on English itself, and amount of time spent on studying English during the pilot experimental period compared to the control group. When other variables were controlled, the use of AI-Pengtalk also helped the treatment group achieve higher test scores. This study implicates that the use of smart English education like AI-Pengtalk may especially be able to better compensate for academic setbacks caused by low parental SES or, in the case of English learning, the reluctance to converse in English with other students.
摘要本研究考察了一家专门从事公共教育的广播公司(EBS)提供的基于人工智能的英语会话节目AI-Pengtalk的使用是否能显著提高英语会话技能,并弥合与父母社会经济地位相关的英语能力差距。在2020年4月27日至5月22日的四周时间里,54所小学的108个四年级班级自愿参加了这项实验。每所学校的两个班级被指定为治疗组和对照组。治疗组获得了安装AI-Pengtalk试用版的平板电脑,并鼓励学生使用该程序。课前和课后分别进行了两组调查和英语测试。4周后,对参与者的测试成绩、日志文件和调查反应进行分析。一系列DID分析表明,与对照组相比,AI-Pengtalk的使用提高了试验组对英语能力的自我评价、使用英语的信心、对英语本身的偏好以及在试验期间学习英语的时间。当其他变量得到控制时,AI-Pengtalk的使用也帮助治疗组取得了更高的考试成绩。这项研究表明,使用像AI-Pengtalk这样的智能英语教育可能尤其能够更好地弥补由于父母经济地位低而导致的学业挫折,或者在英语学习的情况下,不愿与其他学生用英语交谈。
{"title":"An AI-based English education platform during the COVID-19 pandemic","authors":"Hansuk Um, Hisam Kim, Dain Choi, Hyungna Oh","doi":"10.1007/s10209-023-01046-2","DOIUrl":"https://doi.org/10.1007/s10209-023-01046-2","url":null,"abstract":"Abstract This study examines whether the use of AI-Pengtalk, an AI-based Conversational English programme, provided by a broadcasting company (EBS) that specializes in public education can significantly improve conversational English skills and bridge the English language proficiency gap associated with parental socioeconomic status. Over the course of four weeks from April 27 to May 22 in 2020, 108 fourth-grade classes in 54 elementary schools voluntarily participated in this experiment. Two classes in each school were designated as a treatment group and a control group. For the treatment group, a tablet installed with a pilot version of AI-Pengtalk was provided and students were encouraged to make use of the programme. Two sets of surveys and English tests were placed pre and post hoc. After 4 weeks, test scores, log files, and survey responses of participants were analysed. A series of DID analyses demonstrate that the use of AI-Pengtalk improves the treatment group’s self-evaluation of their English abilities, confidence in using English, preference on English itself, and amount of time spent on studying English during the pilot experimental period compared to the control group. When other variables were controlled, the use of AI-Pengtalk also helped the treatment group achieve higher test scores. This study implicates that the use of smart English education like AI-Pengtalk may especially be able to better compensate for academic setbacks caused by low parental SES or, in the case of English learning, the reluctance to converse in English with other students.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135350358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring automatic text-to-sign translation in a healthcare setting 探索医疗保健设置中的自动文本到签名转换
4区 计算机科学 Q1 Computer Science Pub Date : 2023-09-29 DOI: 10.1007/s10209-023-01042-6
Lyke Esselink, Floris Roelofsen, Jakub Dotlačil, Shani Mende-Gillings, Maartje de Meulder, Nienke Sijm, Anika Smeijers
Abstract Communication between healthcare professionals and deaf patients has been particularly challenging during the COVID-19 pandemic. We have explored the possibility to automatically translate phrases that are frequently used in the diagnosis and treatment of hospital patients, in particular phrases related to COVID-19, from Dutch or English to Dutch Sign Language (NGT). The prototype system we developed displays translations either by means of pre-recorded videos featuring a deaf human signer (for a limited number of sentences) or by means of animations featuring a computer-generated signing avatar (for a larger, though still restricted number of sentences). We evaluated the comprehensibility of the signing avatar, as compared to the human signer. We found that, while individual signs are recognized correctly when signed by the avatar almost as frequently as when signed by a human, sentence comprehension rates and clarity scores for the avatar are substantially lower than for the human signer. We identify a number of concrete limitations of the JASigning avatar engine that underlies our system. Namely, the engine currently does not offer sufficient control over mouth shapes, the relative speed and intensity of signs in a sentence (prosody), and transitions between signs. These limitations need to be overcome in future work for the engine to become usable in practice.
在COVID-19大流行期间,医护人员与失聪患者之间的沟通尤其具有挑战性。我们探索了将医院患者诊断和治疗中经常使用的短语,特别是与COVID-19相关的短语,从荷兰语或英语自动翻译为荷兰语手语(NGT)的可能性。我们开发的原型系统可以通过预先录制的视频来显示翻译,视频中有一个聋哑人的签名(对于有限数量的句子),或者通过计算机生成的签名化身的动画来显示翻译(对于更大的,尽管仍然有限数量的句子)。与人类签名者相比,我们评估了签名头像的可理解性。我们发现,虽然由虚拟角色签名的个体符号被正确识别的频率几乎与人类签名时一样高,但虚拟角色的句子理解率和清晰度得分明显低于人类签名者。我们确定了jasign头像引擎的一些具体限制,这些限制是我们系统的基础。也就是说,该引擎目前还不能提供足够的控制口型、句子中符号的相对速度和强度(韵律)以及符号之间的转换。这些限制需要在未来的工作中克服,使发动机在实践中可用。
{"title":"Exploring automatic text-to-sign translation in a healthcare setting","authors":"Lyke Esselink, Floris Roelofsen, Jakub Dotlačil, Shani Mende-Gillings, Maartje de Meulder, Nienke Sijm, Anika Smeijers","doi":"10.1007/s10209-023-01042-6","DOIUrl":"https://doi.org/10.1007/s10209-023-01042-6","url":null,"abstract":"Abstract Communication between healthcare professionals and deaf patients has been particularly challenging during the COVID-19 pandemic. We have explored the possibility to automatically translate phrases that are frequently used in the diagnosis and treatment of hospital patients, in particular phrases related to COVID-19, from Dutch or English to Dutch Sign Language (NGT). The prototype system we developed displays translations either by means of pre-recorded videos featuring a deaf human signer (for a limited number of sentences) or by means of animations featuring a computer-generated signing avatar (for a larger, though still restricted number of sentences). We evaluated the comprehensibility of the signing avatar, as compared to the human signer. We found that, while individual signs are recognized correctly when signed by the avatar almost as frequently as when signed by a human, sentence comprehension rates and clarity scores for the avatar are substantially lower than for the human signer. We identify a number of concrete limitations of the JASigning avatar engine that underlies our system. Namely, the engine currently does not offer sufficient control over mouth shapes, the relative speed and intensity of signs in a sentence (prosody), and transitions between signs. These limitations need to be overcome in future work for the engine to become usable in practice.","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135200116","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic detection of accessibility smells 无障碍气味的动态检测
4区 计算机科学 Q1 Computer Science Pub Date : 2023-09-22 DOI: 10.1007/s10209-023-01043-5
Fernando Durgam, Julián Grigera, Alejandra Garrido
{"title":"Dynamic detection of accessibility smells","authors":"Fernando Durgam, Julián Grigera, Alejandra Garrido","doi":"10.1007/s10209-023-01043-5","DOIUrl":"https://doi.org/10.1007/s10209-023-01043-5","url":null,"abstract":"","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136061696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Schools as living labs for the new European bauhaus 学校是新欧洲包豪斯的生活实验室
4区 计算机科学 Q1 Computer Science Pub Date : 2023-09-21 DOI: 10.1007/s10209-023-01044-4
Loukas Katikas, Sofoklis Sotiriou
{"title":"Schools as living labs for the new European bauhaus","authors":"Loukas Katikas, Sofoklis Sotiriou","doi":"10.1007/s10209-023-01044-4","DOIUrl":"https://doi.org/10.1007/s10209-023-01044-4","url":null,"abstract":"","PeriodicalId":49115,"journal":{"name":"Universal Access in the Information Society","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136235233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Universal Access in the Information Society
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1