首页 > 最新文献

IPO Annual Progress Report最新文献

英文 中文
Speech and the user interface of consumer products : the VODIS system 语音与消费产品的用户界面:VODIS系统
Pub Date : 1900-01-01 DOI: 10.1037/e496092004-001
Xhg Xavier Pouteau
Building interfaces where the user-system communication relies on speech is often motivated by the added value expected from speech: a more natural, efficient communication that also frees the hands (and the eyes) of the user. However, when developing such an interface, one has to remember that just like other systems, spoken interfaces require a proper design, implying an adequate analysis of the user's needs throughout the dialogue. The VODIS project has led to the design and development of a spoken interface for the control of car equipment. Due to the workload caused by the task of driving the vehicle, spoken communication provides a potentially safe and efficient mode of controlling the car equipment. Here we report the main characteristics of the central module of the system, the Dialogue Manager, designed and implemented by IPO, which comprises the necessary components aimed at achieving the goal of a robust and efficient dialogue system. We mainly concentrate on two important activities carried out in the project: the integration of the various modules into a task model taking into account the characteristics of a spoken command dialogue, and the necessity of customizing parts of the system to spoken communication.
构建用户-系统通信依赖于语音的界面通常是出于对语音的附加价值的期望:一种更自然、更有效的通信,同时也解放了用户的双手(和眼睛)。然而,在开发这样的界面时,必须记住,就像其他系统一样,语音界面需要适当的设计,这意味着在整个对话过程中对用户需求进行充分的分析。VODIS项目已经导致了用于控制汽车设备的语音接口的设计和开发。由于驾驶车辆的任务带来的工作量,语音通信提供了一种潜在的安全高效的控制汽车设备的方式。在这里,我们报告了由IPO设计和实施的系统中心模块“对话管理器”的主要特征,它包含了旨在实现强大和高效对话系统目标的必要组件。我们主要集中在项目中进行的两项重要活动:考虑到口头命令对话的特点,将各个模块集成到任务模型中,以及定制系统部分以进行口头通信的必要性。
{"title":"Speech and the user interface of consumer products : the VODIS system","authors":"Xhg Xavier Pouteau","doi":"10.1037/e496092004-001","DOIUrl":"https://doi.org/10.1037/e496092004-001","url":null,"abstract":"Building interfaces where the user-system communication relies on speech is often motivated by the added value expected from speech: a more natural, efficient communication that also frees the hands (and the eyes) of the user. However, when developing such an interface, one has to remember that just like other systems, spoken interfaces require a proper design, implying an adequate analysis of the user's needs throughout the dialogue. The VODIS project has led to the design and development of a spoken interface for the control of car equipment. Due to the workload caused by the task of driving the vehicle, spoken communication provides a potentially safe and efficient mode of controlling the car equipment. Here we report the main characteristics of the central module of the system, the Dialogue Manager, designed and implemented by IPO, which comprises the necessary components aimed at achieving the goal of a robust and efficient dialogue system. We mainly concentrate on two important activities carried out in the project: the integration of the various modules into a task model taking into account the characteristics of a spoken command dialogue, and the necessity of customizing parts of the system to spoken communication.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127880381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GUI access for blind users : a sound initiative 为盲人用户提供GUI访问:一个很好的倡议
Pub Date : 1900-01-01 DOI: 10.1037/e492272004-001
L.H.D. Poll, J. H. Eggen
In this paper a description is given of an experiment in which the feasibility of a non-visual GUI device was explored. Both blind and blindfolded subjects participated in the experiment. Because experienced blind GUI us.ers are scarce, a playing card metaphor was used as the basis for the screens presented to the subjects. During the experiment, subjects were asked to locate and select/drag a specific object with the help of the non-visual interaction device. The results of the experiment show that the interaction device is suited for use in a non-visual GUI access system. However, the results also indicated that the addition of an auditory/tactile localization aid is desirable.
本文描述了一个实验,在这个实验中探索了非可视化GUI设备的可行性。盲眼和蒙眼的受试者都参加了实验。因为经历过盲目的GUI我们。er是稀缺的,打牌的比喻被用作展示给受试者的屏幕的基础。在实验过程中,受试者被要求在非视觉交互装置的帮助下定位和选择/拖动特定的物体。实验结果表明,该交互装置适用于非可视化GUI访问系统。然而,结果也表明,增加听觉/触觉定位辅助是可取的。
{"title":"GUI access for blind users : a sound initiative","authors":"L.H.D. Poll, J. H. Eggen","doi":"10.1037/e492272004-001","DOIUrl":"https://doi.org/10.1037/e492272004-001","url":null,"abstract":"In this paper a description is given of an experiment in which the feasibility of a non-visual GUI device was explored. Both blind and blindfolded subjects participated in the experiment. Because experienced blind GUI us.ers are scarce, a playing card metaphor was used as the basis for the screens presented to the subjects. During the experiment, subjects were asked to locate and select/drag a specific object with the help of the non-visual interaction device. The results of the experiment show that the interaction device is suited for use in a non-visual GUI access system. However, the results also indicated that the addition of an auditory/tactile localization aid is desirable.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117184298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effect of image disparity, convergence distance and focal length on perceived quality in stereoscopic displays 立体显示器中图像视差、会聚距离和焦距对感知质量的影响
Pub Date : 1900-01-01 DOI: 10.1037/e492392004-001
W. IJsselsteijn, de H Huib Ridder, R. Hamberg
Within the area of broadcasting and entertainment, stereoscopic displays are used to heighten the viewer's sense of excitement and qual ity. To evaluate these subjective experiences, an appreciation-oriented approach may be appropriate. Within this context, the current study investigates the influence of image disparity, convergence distance and focal length on the subjective assessment of depth, naturalness of depth and quality of depth. Twelve observers viewed a set of stereoscopic still images varying in image disparity, convergence distance and focal length. Each observer was asked to rate his/her impression of depth, naturalness of depth and quality of depth, in separate counterbalanced sessions. Results indicate that observers prefer a stereoscopic presentation of images over a monoscopic presentation. A clear optimum was found at 4 em image disparity for both subjective judgments of naturalness and of quality. A focal length effect was only found for extreme image disparities. Although there was a strong linear relationship between naturalness and quality (a correlation of r=O.96), a small but systematic deviation could be observed. This quality-naturalness shift is discussed in relation to similar, yet more pronounced findings in the domain of colour perception.
在广播和娱乐领域,立体显示器被用来提高观众的兴奋感和质量。为了评估这些主观体验,一种以欣赏为导向的方法可能是合适的。在此背景下,本研究探讨了像差、收敛距离和焦距对深度主观评价、深度自然度和深度质量的影响。12名观察者观看了一组视差、会聚距离和焦距不同的立体静止图像。每个观察者被要求在单独的平衡环节中对深度、深度的自然度和深度的质量进行评价。结果表明,观察者更喜欢立体呈现的图像,而不是单镜呈现。对于自然性和质量的主观判断,在4 em图像差异下发现了一个明显的最佳值。焦距效应只在极端的图像差异中被发现。虽然自然度和质量之间存在很强的线性关系(相关r= 0.96),但可以观察到一个小但系统的偏差。这种质量-自然性的转变与色彩感知领域的类似但更明显的发现有关。
{"title":"The effect of image disparity, convergence distance and focal length on perceived quality in stereoscopic displays","authors":"W. IJsselsteijn, de H Huib Ridder, R. Hamberg","doi":"10.1037/e492392004-001","DOIUrl":"https://doi.org/10.1037/e492392004-001","url":null,"abstract":"Within the area of broadcasting and entertainment, stereoscopic displays are used to heighten the viewer's sense of excitement and qual ity. To evaluate these subjective experiences, an appreciation-oriented approach may be appropriate. Within this context, the current study investigates the influence of image disparity, convergence distance and focal length on the subjective assessment of depth, naturalness of depth and quality of depth. Twelve observers viewed a set of stereoscopic still images varying in image disparity, convergence distance and focal length. Each observer was asked to rate his/her impression of depth, naturalness of depth and quality of depth, in separate counterbalanced sessions. Results indicate that observers prefer a stereoscopic presentation of images over a monoscopic presentation. A clear optimum was found at 4 em image disparity for both subjective judgments of naturalness and of quality. A focal length effect was only found for extreme image disparities. Although there was a strong linear relationship between naturalness and quality (a correlation of r=O.96), a small but systematic deviation could be observed. This quality-naturalness shift is discussed in relation to similar, yet more pronounced findings in the domain of colour perception.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124554227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Pragmatic interpretation and dialogue management in spoken-language systems 口语系统中的语用解释与对话管理
Pub Date : 1900-01-01 DOI: 10.1037/E492692004-001
G. V. V. Zanten
This paper describes the design of pragmatic-interpretation and dialogue-management modules in an automatic enquiry system that can be consulted through spoken natural language over the telephone. The system is designed around a central multi-level data structure representing the discourse that has unfolded during the dialogue. At the highest level of this discourse representation the information exchange is represented as a series of information-state changes or updates. Several conditions in the information state itself give rise to actions of the dialogue manager. The dialogue manager is designed to achieve the user's goal in a manner that is understandable to the user, efficient and correct. This is not a trivial problem because natural language and, in particular, speech understanding lead to many uncertainties. To deal with uncertain information, we have designed feedback and verification mechanisms and means for contextual understanding, underspecification and pragmatic inferencing.
本文介绍了一个可以通过电话语音进行自然语言查询的自动查询系统的语用解释和对话管理模块的设计。该系统是围绕一个中心的多层次数据结构设计的,该数据结构表示在对话期间展开的话语。在这种话语表示的最高层次上,信息交换被表示为一系列信息状态的变化或更新。信息状态本身的几个条件导致对话管理器的操作。对话管理器旨在以用户可以理解、高效和正确的方式实现用户的目标。这不是一个微不足道的问题,因为自然语言,特别是语音理解会导致许多不确定性。为了处理不确定信息,我们设计了反馈和验证机制,以及上下文理解、欠规范和语用推理的方法。
{"title":"Pragmatic interpretation and dialogue management in spoken-language systems","authors":"G. V. V. Zanten","doi":"10.1037/E492692004-001","DOIUrl":"https://doi.org/10.1037/E492692004-001","url":null,"abstract":"This paper describes the design of pragmatic-interpretation and dialogue-management modules in an automatic enquiry system that can be consulted through spoken natural language over the telephone. The system is designed around a central multi-level data structure representing the discourse that has unfolded during the dialogue. At the highest level of this discourse representation the information exchange is represented as a series of information-state changes or updates. Several conditions in the information state itself give rise to actions of the dialogue manager. The dialogue manager is designed to achieve the user's goal in a manner that is understandable to the user, efficient and correct. This is not a trivial problem because natural language and, in particular, speech understanding lead to many uncertainties. To deal with uncertain information, we have designed feedback and verification mechanisms and means for contextual understanding, underspecification and pragmatic inferencing.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133400312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
The role of stress and accent in the perception of speech rhythm 重音和重音在语音节奏感知中的作用
Pub Date : 1900-01-01 DOI: 10.1037/e495112004-001
Cn Grover, J. Terken
Modelling rhythmic characteristics of speech is expected to contribute to the acceptability of synthetic speech. However, before rules for the control of speech rhythm in synthetic speech can be developed, we need to know which properties of speech give rise to the perception of speech rhythm. An experiment is described which investigates how the distributions of stressed syllables and pitch accents contribute to the perceived rhythmicity of speech. The outcomes show that the perception of rhythm is related to the distribution of locally prominent syllables: primarily to accents, but also to stressed syllables in long stretches of speech without accented syllables. Furthermore, it appears that, once a rhythmic pattern has been established by the initial part of an utterance, listeners are quite tolerant of local deviations from this pattern later on in the utterance.
模拟语音的节奏特征有助于合成语音的可接受性。然而,在制定合成语音中控制语音节奏的规则之前,我们需要知道语音的哪些特性会引起语音节奏的感知。本文描述了一项实验,该实验研究了重读音节和音高口音的分布如何影响语音的感知节奏。研究结果表明,节奏感知与局部突出音节的分布有关:主要与口音有关,但也与长段无重音音节的语音中的重音音节有关。此外,一旦一段话语的开头部分形成了一种节奏模式,听者就会对随后话语中与这种模式的局部偏差相当宽容。
{"title":"The role of stress and accent in the perception of speech rhythm","authors":"Cn Grover, J. Terken","doi":"10.1037/e495112004-001","DOIUrl":"https://doi.org/10.1037/e495112004-001","url":null,"abstract":"Modelling rhythmic characteristics of speech is expected to contribute to the acceptability of synthetic speech. However, before rules for the control of speech rhythm in synthetic speech can be developed, we need to know which properties of speech give rise to the perception of speech rhythm. An experiment is described which investigates how the distributions of stressed syllables and pitch accents contribute to the perceived rhythmicity of speech. The outcomes show that the perception of rhythm is related to the distribution of locally prominent syllables: primarily to accents, but also to stressed syllables in long stretches of speech without accented syllables. Furthermore, it appears that, once a rhythmic pattern has been established by the initial part of an utterance, listeners are quite tolerant of local deviations from this pattern later on in the utterance.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121595603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Quantitative assessment of communication efficiency and users' preference: experiments worth improving 通讯效率与用户偏好的定量评估:值得改进的实验
Pub Date : 1900-01-01 DOI: 10.1037/e492452004-001
F. L. Engel, R. Haakma, J. V. D. Vijver
With the expanding possibilities of computer-controlled consumer products and the accordingly increasing complexity of manipulation, ease of use becomes an attribute of growing importance. Since it is a rather vague concept covering a variety of aspects, this paper describes a method which permits quantitative assessment of ease of use and its deciding factors. In particular, the preference subjects show for using specific entry devices as a function of their communication efficiency has been investigated. Users' preference for performing a given data-entry task by means of one of the available input devices could be manipulated by experimental variation of the relative speed and accuracy of the interaction. Our preliminary results show the proposed method to be a promising means of quantitative assessment of the determinants of ease of use. Accordingly, suggestions are given for further improvement of the experiments.
随着计算机控制的消费产品的可能性越来越大,操作的复杂性也相应地增加,易用性变得越来越重要。由于它是一个相当模糊的概念,涵盖了各个方面,本文描述了一种可以定量评估易用性及其决定因素的方法。特别是,研究对象对使用特定输入设备的偏好表现为其通信效率的函数。用户对通过一种可用的输入设备执行给定数据输入任务的偏好可以通过交互的相对速度和准确性的实验变化来操纵。我们的初步结果表明,所提出的方法是一种有前途的手段,定量评估的决定因素的易用性。在此基础上,提出了进一步改进实验的建议。
{"title":"Quantitative assessment of communication efficiency and users' preference: experiments worth improving","authors":"F. L. Engel, R. Haakma, J. V. D. Vijver","doi":"10.1037/e492452004-001","DOIUrl":"https://doi.org/10.1037/e492452004-001","url":null,"abstract":"With the expanding possibilities of computer-controlled consumer products and the accordingly increasing complexity of manipulation, ease of use becomes an attribute of growing importance. Since it is a rather vague concept covering a variety of aspects, this paper describes a method which permits quantitative assessment of ease of use and its deciding factors. In particular, the preference subjects show for using specific entry devices as a function of their communication efficiency has been investigated. Users' preference for performing a given data-entry task by means of one of the available input devices could be manipulated by experimental variation of the relative speed and accuracy of the interaction. Our preliminary results show the proposed method to be a promising means of quantitative assessment of the determinants of ease of use. Accordingly, suggestions are given for further improvement of the experiments.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129562314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonspeech audio in user interfaces for TV 电视用户界面中的非语音音频
Pub Date : 1900-01-01 DOI: 10.1037/e491952004-001
Bm Richard van de Sluis, Jh Berry Eggen, J. Rypkema
This study explores the end-user benefits of using nonspeech audio in television user interfaces. A prototype of an Electronic Programme Guide (EPG) served as a carrier for the research. One of the features of this EPG is the possibility to search for TV programmes in a category-based way. The EPG prototype was 'sonically-enhanced' with so-called category sounds. These category sounds were also used as auditory reminders indicating that a TV programme from a given category is about to start. Furthermore, certain characteristics of the category sound were manipulated to represent the urgency of a reminder. Two experiments are described. In the first experiment, the usability of category sounds was evaluated. In the second experiment, it was tested whether 'listener-source distance' is an appropriate metaphor to inform users about the urgency of an auditory reminder. The results showed that people can easily learn to match the category sounds to the corresponding TV programme categories, that the use of category sounds is effective, and that the category sounds were appreciated by a large part of the subjects. In the second experiment, it was found that the distance of a sound source is a useful metaphor to use in an auditory reminder to indicate the distance in time before a programme is going to start.
本研究探讨了在电视用户界面中使用非语音音频对终端用户的好处。电子节目指南(EPG)的原型作为这项研究的载体。该EPG的一个特点是可以按类别搜索电视节目。EPG原型机通过所谓的分类音进行了“声学增强”。这些类别的声音也被用作听觉提醒,表明某一特定类别的电视节目即将开始。此外,某些类别声音的特征被操纵来表示提醒的紧迫性。描述了两个实验。在第一个实验中,我们评估了类别音的可用性。在第二个实验中,测试了“听者-源距离”是否是一个恰当的隐喻,以告知用户听觉提醒的紧迫性。结果表明,人们可以很容易地学会将类别音与相应的电视节目类别相匹配,类别音的使用是有效的,并且大多数被试都喜欢类别音。在第二个实验中,研究人员发现声源的距离是一个有用的隐喻,可以用来在节目开始前的听觉提醒中指出距离。
{"title":"Nonspeech audio in user interfaces for TV","authors":"Bm Richard van de Sluis, Jh Berry Eggen, J. Rypkema","doi":"10.1037/e491952004-001","DOIUrl":"https://doi.org/10.1037/e491952004-001","url":null,"abstract":"This study explores the end-user benefits of using nonspeech audio in television user interfaces. A prototype of an Electronic Programme Guide (EPG) served as a carrier for the research. One of the features of this EPG is the possibility to search for TV programmes in a category-based way. The EPG prototype was 'sonically-enhanced' with so-called category sounds. These category sounds were also used as auditory reminders indicating that a TV programme from a given category is about to start. Furthermore, certain characteristics of the category sound were manipulated to represent the urgency of a reminder. Two experiments are described. In the first experiment, the usability of category sounds was evaluated. In the second experiment, it was tested whether 'listener-source distance' is an appropriate metaphor to inform users about the urgency of an auditory reminder. The results showed that people can easily learn to match the category sounds to the corresponding TV programme categories, that the use of category sounds is effective, and that the category sounds were appreciated by a large part of the subjects. In the second experiment, it was found that the distance of a sound source is a useful metaphor to use in an auditory reminder to indicate the distance in time before a programme is going to start.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"180 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122279477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
User navigation and guidance 用户导航和指导
Pub Date : 1900-01-01 DOI: 10.1037/e490972004-001
J. Masthoff
A new approach for user navigation and guidance is described in which initiative and topic selection by an interactive instruction system can be combined with initiative and topic selection by a student. The topic selection of the system is based on foreknowledge, goals and the capabilities of the individual student. In an experiment, we found that topic selection by the system had an advantage for students who were unable to monitor their own learning process.
提出了一种新的用户导航和引导方法,该方法将交互式教学系统的主动性和选题与学生的主动性和选题相结合。系统的选题是基于学生个人的预知性、目标和能力。在实验中,我们发现系统的选题对于无法监控自己学习过程的学生有优势。
{"title":"User navigation and guidance","authors":"J. Masthoff","doi":"10.1037/e490972004-001","DOIUrl":"https://doi.org/10.1037/e490972004-001","url":null,"abstract":"A new approach for user navigation and guidance is described in which initiative and topic selection by an interactive instruction system can be combined with initiative and topic selection by a student. The topic selection of the system is based on foreknowledge, goals and the capabilities of the individual student. In an experiment, we found that topic selection by the system had an advantage for students who were unable to monitor their own learning process.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"319 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132793693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic speech recognition in the medical environment 医疗环境中的自动语音识别
Pub Date : 1900-01-01 DOI: 10.1037/e491832004-001
E. Verheijen
Medical specialists are interested in applying Automatic Speech Recognition to save time and money spent on medical reporting. A number of issues need to be resolved in order to apply this technology successfully. This paper concentrates on the issue of feedback. An experiment is described in which the most appropriate feedback modality for presenting the recognition result in the situation of the pathologist is investigated. Although at this moment visual feedback seems the safest solution, error-detection performance is still poor. A totally different approach to medical reporting is presented which may prove to be the best solution.
医学专家对应用自动语音识别来节省花费在医疗报告上的时间和金钱很感兴趣。为了成功地应用这项技术,需要解决许多问题。本文主要讨论反馈问题。一个实验被描述,其中最适当的反馈模式,以提出在病理学家的情况下的识别结果进行了调查。虽然目前视觉反馈似乎是最安全的解决方案,但错误检测性能仍然很差。提出了一种完全不同的医疗报告方法,这可能被证明是最好的解决办法。
{"title":"Automatic speech recognition in the medical environment","authors":"E. Verheijen","doi":"10.1037/e491832004-001","DOIUrl":"https://doi.org/10.1037/e491832004-001","url":null,"abstract":"Medical specialists are interested in applying Automatic Speech Recognition to save time and money spent on medical reporting. A number of issues need to be resolved in order to apply this technology successfully. This paper concentrates on the issue of feedback. An experiment is described in which the most appropriate feedback modality for presenting the recognition result in the situation of the pathologist is investigated. Although at this moment visual feedback seems the safest solution, error-detection performance is still poor. A totally different approach to medical reporting is presented which may prove to be the best solution.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134038834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Explorative strategies while compiling music 编曲时的探索策略
Pub Date : 1900-01-01 DOI: 10.1037/e491082004-001
S. Pauws, J. H. Eggen, D. Bouwhuis
This paper describes the results of an experiment designed to understand task-directed human explorative behaviour in a large music collection. The subject's task was to compile a music programme preferred in a specific context-of-use, e.g., romantic evening, party. Experimental conditions were defined in which subjects were provided with no music recommendations, randomly drawn recommendations, or algorithmically determined recommendations while carrying out the task. The provision of recommendations meant to improve performance in the compilation task. When recommendations were provided, subjects systematically selected, played back, and compiled fewer items by themselves, but instead made use of the recommendations. This observation was not coupled with a reduction in the amount of time spent on the compilation task. But when asked for their preference, subjects chose the provision of algorithmically determined recommendations above the provision of randomly drawn recommendations or no recommendations.
本文描述了一项实验的结果,该实验旨在了解大型音乐收藏中任务导向的人类探索行为。这个主题的任务是编写一个在特定使用环境中喜欢的音乐节目,例如,浪漫的夜晚,派对。在实验条件下,受试者在执行任务时不提供音乐推荐、随机抽取的推荐或算法确定的推荐。提供建议的目的是提高编译任务的性能。当提供推荐时,受试者自己系统地选择、回放和编译较少的项目,而是使用推荐。这一观察结果并没有减少在编译任务上花费的时间。但当被问及他们的偏好时,受试者选择提供算法确定的推荐,而不是提供随机抽取的推荐或不提供推荐。
{"title":"Explorative strategies while compiling music","authors":"S. Pauws, J. H. Eggen, D. Bouwhuis","doi":"10.1037/e491082004-001","DOIUrl":"https://doi.org/10.1037/e491082004-001","url":null,"abstract":"This paper describes the results of an experiment designed to understand task-directed human explorative behaviour in a large music collection. The subject's task was to compile a music programme preferred in a specific context-of-use, e.g., romantic evening, party. Experimental conditions were defined in which subjects were provided with no music recommendations, randomly drawn recommendations, or algorithmically determined recommendations while carrying out the task. The provision of recommendations meant to improve performance in the compilation task. When recommendations were provided, subjects systematically selected, played back, and compiled fewer items by themselves, but instead made use of the recommendations. This observation was not coupled with a reduction in the amount of time spent on the compilation task. But when asked for their preference, subjects chose the provision of algorithmically determined recommendations above the provision of randomly drawn recommendations or no recommendations.","PeriodicalId":369207,"journal":{"name":"IPO Annual Progress Report","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133283285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
IPO Annual Progress Report
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1