Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463410
Kathrin Borchert, Stanislav Lange, T. Zinner, Matthias Hirth
Modern enterprise applications are often designed as distributed architectures, e.g., thin client computing and thus degradations in network related Quality of Service (QoS) parameters may also negatively impact the user-perceived Quality of Experience (QoE) of the application. In this work, we create a model to predict the perceived application quality based on measurements of objective technical parameters. For this, we gathered a data set in a cooperating enterprise over a timespan of nearly three months. As the obtained data set is subject to bias that originates from seasonal effects as well as a limited and predefined set of technical parameters, we further evaluate how to identify segments of the data that lead to misclassification. Last, we quantify the trade-off between the gain in the QoE prediction accuracy and the amount of filtered data.
{"title":"Identification of Delay Thresholds Representing the Perceived Quality of Enterprise Applications","authors":"Kathrin Borchert, Stanislav Lange, T. Zinner, Matthias Hirth","doi":"10.1109/QoMEX.2018.8463410","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463410","url":null,"abstract":"Modern enterprise applications are often designed as distributed architectures, e.g., thin client computing and thus degradations in network related Quality of Service (QoS) parameters may also negatively impact the user-perceived Quality of Experience (QoE) of the application. In this work, we create a model to predict the perceived application quality based on measurements of objective technical parameters. For this, we gathered a data set in a cooperating enterprise over a timespan of nearly three months. As the obtained data set is subject to bias that originates from seasonal effects as well as a limited and predefined set of technical parameters, we further evaluate how to identify segments of the data that lead to misclassification. Last, we quantify the trade-off between the gain in the QoE prediction accuracy and the amount of filtered data.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"20 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73981234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463430
T. Hossfeld, Florian Metzger, D. Rossi
In 2012, Google introduced the Speed Index (SI) metric to quantify the speed of the Web page visual completeness for the actually displayed above-the-fold (ATF) portion of a Web page. In Web browsing a page might appear to the user to be already fully rendered, even though further content may still be retrieved, resulting in the Page Load Time (PLT). This happens due to the browser progressively rendering all objects, part of which can also be located below the browser window's current viewport. The SI metric (and variants) thereof have since established themselves as a de facto standard in Web page and browser testing. While SI is a step in the direction of including the user experience into Web metrics, the actual meaning of the metric and especially its relationship between Speed Index and Web QoE is however far from being clear. The contributions of this paper are thus to first develop an understanding of the SI based on a theoretical analysis and second, to analyze the interdependency between SI and MOS values from an existing public dataset. Specifically, our analysis is based on two well established models that map the user waiting time to a user ACR-rating of the QoE. The analysis show that ATF-based metrics are more appropriate than pure PLT as input to Web QoE models.
2012年,Google引入了速度指数(Speed Index, SI)指标,用于量化Web页面实际显示在页面上方(ATF)部分的Web页面视觉完整性的速度。在Web浏览中,一个页面对用户来说可能已经完全呈现,即使可能仍然检索到更多的内容,从而导致页面加载时间(page Load Time, PLT)。这是由于浏览器逐渐呈现所有对象,其中一部分也可以位于浏览器窗口当前视口的下方。SI度量(及其变体)已经成为Web页面和浏览器测试中的实际标准。虽然SI是将用户体验纳入Web指标的一个步骤,但是指标的实际含义,特别是速度指数和Web QoE之间的关系还远未明确。因此,本文的贡献在于首先在理论分析的基础上发展对SI的理解,其次,从现有的公共数据集中分析SI和MOS值之间的相互依赖性。具体来说,我们的分析是基于两个完善的模型,它们将用户等待时间映射到QoE的用户acr评级。分析表明,基于atf的度量比纯PLT更适合作为Web QoE模型的输入。
{"title":"Speed Index: Relating the Industrial Standard for User Perceived Web Performance to web QoE","authors":"T. Hossfeld, Florian Metzger, D. Rossi","doi":"10.1109/QoMEX.2018.8463430","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463430","url":null,"abstract":"In 2012, Google introduced the Speed Index (SI) metric to quantify the speed of the Web page visual completeness for the actually displayed above-the-fold (ATF) portion of a Web page. In Web browsing a page might appear to the user to be already fully rendered, even though further content may still be retrieved, resulting in the Page Load Time (PLT). This happens due to the browser progressively rendering all objects, part of which can also be located below the browser window's current viewport. The SI metric (and variants) thereof have since established themselves as a de facto standard in Web page and browser testing. While SI is a step in the direction of including the user experience into Web metrics, the actual meaning of the metric and especially its relationship between Speed Index and Web QoE is however far from being clear. The contributions of this paper are thus to first develop an understanding of the SI based on a theoretical analysis and second, to analyze the interdependency between SI and MOS values from an existing public dataset. Specifically, our analysis is based on two well established models that map the user waiting time to a user ACR-rating of the QoE. The analysis show that ATF-based metrics are more appropriate than pure PLT as input to Web QoE models.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"36 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81539166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463374
M. Fiedler, Sebastian Möller, P. Reichl, Min Xie
This short paper presents the recently published Dagstuhl Manifesto ‘QoE Vadis?‘. The Manifesto is the result of a set of three Dagstuhl Seminars and one Dagstuhl Perspectives Workshop, aimed at shaping understanding, development and application of the Quality of Experience (QoE) notion and concept. Its task is to convey the current status, promising developments and future projections for different stakeholders. The latter are summarised in a set of eleven recommendations to academia, industry and funding organisations.
{"title":"A Glance at the Dagstuhl Manifesto ‘QoE Vadis?’","authors":"M. Fiedler, Sebastian Möller, P. Reichl, Min Xie","doi":"10.1109/QoMEX.2018.8463374","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463374","url":null,"abstract":"This short paper presents the recently published Dagstuhl Manifesto ‘QoE Vadis?‘. The Manifesto is the result of a set of three Dagstuhl Seminars and one Dagstuhl Perspectives Workshop, aimed at shaping understanding, development and application of the Quality of Experience (QoE) notion and concept. Its task is to convey the current status, promising developments and future projections for different stakeholders. The latter are summarised in a set of eleven recommendations to academia, industry and funding organisations.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"1 1","pages":"1-3"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91047705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463381
S. Bosse, Milena T. Bagdasarian, W. Samek, G. Curio, T. Wiegand
Steady-state visual evoked potentials (SSVEP) are brain responses elicited by periodic visual stimuli. Recently it was shown that the use of SSVEP in quality studies allows for accurate psychophysiological assessment of perceived visual quality, but the influence of the stimulation frequency is still unclear. This paper studies experimentally the relation between the SNR of the neural signal and the stimulation frequency in an psychophysiological quality assessment setup. For various source images tested at different distortion magnitudes over the range of 6 different stimulation frequencies, we show physiologically plausible results that provide insights into the temporal dynamics of neural distortion processing. Our findings inform a rational choice of stimulation frequency in SSVEP-based image quality assessment studies. This potentially improves the experimental setup of future image quality assessment studies exploiting the SSVEP paradigm.
{"title":"On the Stimulation Frequency in SSVEP-based Image Quality Assessment","authors":"S. Bosse, Milena T. Bagdasarian, W. Samek, G. Curio, T. Wiegand","doi":"10.1109/QoMEX.2018.8463381","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463381","url":null,"abstract":"Steady-state visual evoked potentials (SSVEP) are brain responses elicited by periodic visual stimuli. Recently it was shown that the use of SSVEP in quality studies allows for accurate psychophysiological assessment of perceived visual quality, but the influence of the stimulation frequency is still unclear. This paper studies experimentally the relation between the SNR of the neural signal and the stimulation frequency in an psychophysiological quality assessment setup. For various source images tested at different distortion magnitudes over the range of 6 different stimulation frequencies, we show physiologically plausible results that provide insights into the temporal dynamics of neural distortion processing. Our findings inform a rational choice of stimulation frequency in SSVEP-based image quality assessment studies. This potentially improves the experimental setup of future image quality assessment studies exploiting the SSVEP paradigm.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"104 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91062537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463418
C. Ozcinar, A. Smolic
Understanding of visual attention is crucial for omnidirectional video (ODV) viewed for instance with a head-mounted display (HMD), where only a fraction of an ODV is rendered at a time. Transmission and rendering of ODV can be optimized by understanding how viewers consume a given ODV in virtual reality (VR) applications. In order to predict video regions that might draw the attention of viewers, saliency maps can be estimated by using computational visual attention models. As no such model currently exists for ODV, but given the importance for emerging ODV applications, we create a new visual attention user dataset for ODV, investigate behavior of viewers when consuming the content, and analyze the prediction performance of state-of-the-art visual attention models. Our developed test-bed and dataset will be publicly available with this paper, to stimulate and support research on ODV.
{"title":"Visual Attention in Omnidirectional Video for Virtual Reality Applications","authors":"C. Ozcinar, A. Smolic","doi":"10.1109/QoMEX.2018.8463418","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463418","url":null,"abstract":"Understanding of visual attention is crucial for omnidirectional video (ODV) viewed for instance with a head-mounted display (HMD), where only a fraction of an ODV is rendered at a time. Transmission and rendering of ODV can be optimized by understanding how viewers consume a given ODV in virtual reality (VR) applications. In order to predict video regions that might draw the attention of viewers, saliency maps can be estimated by using computational visual attention models. As no such model currently exists for ODV, but given the importance for emerging ODV applications, we create a new visual attention user dataset for ODV, investigate behavior of viewers when consuming the content, and analyze the prediction performance of state-of-the-art visual attention models. Our developed test-bed and dataset will be publicly available with this paper, to stimulate and support research on ODV.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"1 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88790837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463363
W. Robitza, Dhananjaya G. Kittur, A. Dethof, Steve Goering, B. Feiten, A. Raake
The available Internet bandwidth has a strong impact on the Quality of Experience of video services. In order to manage their network efficiently and prevent customer churn, Internet Service Providers need to constantly monitor the QoE of video services such as YouTube. However, they often only rely on simple measurement scenarios that consider only one video being loaded repeatedly. In this paper we compare this scenario against a new approach in which multiple videos are being loaded in a session, thereby simulating user behavior. Using a testbed, we study the impact of download speeds on Key Performance Indicators (KPIs such as initial loading time and stalling events) and user QoE as measured using the ITU-T P.1203 standard. We show that the monitoring paradigm has a significant impact on the obtained results. We further provide a prediction model for estimating the impact of download speed on KPIs and user QoE.
{"title":"Measuring YouTube QoE with ITU-T P.1203 Under Constrained Bandwidth Conditions","authors":"W. Robitza, Dhananjaya G. Kittur, A. Dethof, Steve Goering, B. Feiten, A. Raake","doi":"10.1109/QoMEX.2018.8463363","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463363","url":null,"abstract":"The available Internet bandwidth has a strong impact on the Quality of Experience of video services. In order to manage their network efficiently and prevent customer churn, Internet Service Providers need to constantly monitor the QoE of video services such as YouTube. However, they often only rely on simple measurement scenarios that consider only one video being loaded repeatedly. In this paper we compare this scenario against a new approach in which multiple videos are being loaded in a session, thereby simulating user behavior. Using a testbed, we study the impact of download speeds on Key Performance Indicators (KPIs such as initial loading time and stalling events) and user QoE as measured using the ITU-T P.1203 standard. We show that the monitoring paradigm has a significant impact on the obtained results. We further provide a prediction model for estimating the impact of download speed on KPIs and user QoE.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"79 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77538559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463378
J. Skowronek, A. Raake
This paper presents a study on the quality perception of multiparty audio- and audiovisual conferencing calls, with a focus on asymmetric conditions, that is, different connection properties and equipment of individual participants. The results show that some mutual influence of the individual links between participants in terms of their perceived quality exists, and that the overall quality of a conference call is not always a simple average over quality ratings associated with individual links. Further, the paper interprets the results in terms of relevant processes of audiovisual scene perception and quality formation. The results can help operators or manufacturers to optimally balance QoS settings for individual participants, shedding light on how the overall impression of a conference may be formed by users.
{"title":"On the quality perception of multiparty conferencing calls","authors":"J. Skowronek, A. Raake","doi":"10.1109/QoMEX.2018.8463378","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463378","url":null,"abstract":"This paper presents a study on the quality perception of multiparty audio- and audiovisual conferencing calls, with a focus on asymmetric conditions, that is, different connection properties and equipment of individual participants. The results show that some mutual influence of the individual links between participants in terms of their perceived quality exists, and that the overall quality of a conference call is not always a simple average over quality ratings associated with individual links. Further, the paper interprets the results in terms of relevant processes of audiovisual scene perception and quality formation. The results can help operators or manufacturers to optimally balance QoS settings for individual participants, shedding light on how the overall impression of a conference may be formed by users.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"10 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74435715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463417
Steven Schmidt, S. Möller, Saman Zadtootaghaj
Subjective tests to assess the Quality of Experience (QoE) of gaming services are necessary to enable service providers to ensure the satisfaction of their customers. Since gaming is an interactive activity, interactive tests are typically conducted to measure the full spectrum of the player experience. However, carrying out such tests is expensive and time-consuming. Furthermore, the results can be influenced by the behavior and abilities of participants. For this reason, it is of high interest whether such interactive tests can be partially replaced with passive viewing-and-listening tests. In this paper, we present a comparison of an interactive gaming test with passive tests using two different durations. To investigate the differences between the test paradigms, we assessed the overall quality, the video quality and the reactiveness of the game as well as other player experience aspect for different frame rates and bit rates. Results show that once certain requirements are fulfilled, passive tests offer indeed a valuable quality assessment method. However, if the duration of the presented video material is too short, the passive test overestimated the gaming and video quality. Furthermore, we show that the player performance has no impact on the video quality ratings.
{"title":"A Comparison of Interactive and Passive Quality Assessment for Gaming Research","authors":"Steven Schmidt, S. Möller, Saman Zadtootaghaj","doi":"10.1109/QoMEX.2018.8463417","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463417","url":null,"abstract":"Subjective tests to assess the Quality of Experience (QoE) of gaming services are necessary to enable service providers to ensure the satisfaction of their customers. Since gaming is an interactive activity, interactive tests are typically conducted to measure the full spectrum of the player experience. However, carrying out such tests is expensive and time-consuming. Furthermore, the results can be influenced by the behavior and abilities of participants. For this reason, it is of high interest whether such interactive tests can be partially replaced with passive viewing-and-listening tests. In this paper, we present a comparison of an interactive gaming test with passive tests using two different durations. To investigate the differences between the test paradigms, we assessed the overall quality, the video quality and the reactiveness of the game as well as other player experience aspect for different frame rates and bit rates. Results show that once certain requirements are fulfilled, passive tests offer indeed a valuable quality assessment method. However, if the duration of the presented video material is too short, the passive test overestimated the gaming and video quality. Furthermore, we show that the player performance has no impact on the video quality ratings.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"103 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85092123","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463371
Stefan Uhrig, S. Möller, Jan-Niklas Voigt-Antons
The present study utilized electroencephalography (EEG) to explore the neuro-electrical correlates of perceptual dimensions underlying speech quality. Specific focus lay on the P300 event-related brain potential (ERP) component to provide indication for internal processes related to attention and stimulus categorization. A high-quality (HQ) recording of a spoken word was impaired on each of three perceptual dimensions at a time, “discontinuity” (F), “noisiness” (N) and “coloration” (C), with F being realized through random erasure of distinct frames in the speech signal parts of the audio file. In an active three-stimulus oddball task, repeated presentations of the HQ stimulus led to the formation of a sensory/perceptual HQ reference, which was interrupted by infrequent occurrences of degraded “oddball” stimuli (F, N, C). Initial analysis of the obtained subjective and electrophysiological data suggested the following conclusions: 1) Participants perceived the three degraded stimuli as clearly impaired, but equal in terms of degradation intensity. Thus, variations in neural responses were assumed to reflect changes in the perceptual dimension along which the speech degradation had been induced. 2) Timing of the evoked P300 corresponded with temporal differences in the impairments, implying a later onset for “discretely” (F) compared to “continuously” (N, C) degraded stimuli after being categorized as task-irrelevant. Hence, P300 peak latency might prove useful to dissociate both classes of speech quality impairments on a neural level of analysis.
本研究利用脑电图(EEG)探讨语音质量感知维度的神经电相关性。具体的重点放在P300事件相关脑电位(ERP)组件,为注意和刺激分类相关的内部过程提供指示。一个高质量(HQ)的口语录音在三个感知维度上都受到了损害,“不连续”(F)、“噪声”(N)和“着色”(C),其中F是通过随机擦除音频文件的语音信号部分中的不同帧来实现的。在积极的三刺激古怪任务中,反复呈现HQ刺激会导致感觉/知觉HQ参考的形成,这一参考会被偶尔出现的退化“古怪”刺激打断(F, N, C)。对获得的主观和电生理数据的初步分析表明了以下结论:1)参与者认为三种退化刺激明显受损,但退化强度相同。因此,神经反应的变化被认为反映了感知维度的变化,而感知维度正是导致语言退化的原因。2)诱发P300的时间与损伤的时间差异相对应,这意味着在被归类为任务无关后,“离散性”(F)比“连续性”(N, C)退化刺激的发作时间晚。因此,P300峰值潜伏期可能被证明有助于在神经分析水平上分离两类语言质量障碍。
{"title":"Dissociating Perceptual Quality Dimensions of Transmitted Speech Using Electroencephalography","authors":"Stefan Uhrig, S. Möller, Jan-Niklas Voigt-Antons","doi":"10.1109/QoMEX.2018.8463371","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463371","url":null,"abstract":"The present study utilized electroencephalography (EEG) to explore the neuro-electrical correlates of perceptual dimensions underlying speech quality. Specific focus lay on the P300 event-related brain potential (ERP) component to provide indication for internal processes related to attention and stimulus categorization. A high-quality (HQ) recording of a spoken word was impaired on each of three perceptual dimensions at a time, “discontinuity” (F), “noisiness” (N) and “coloration” (C), with F being realized through random erasure of distinct frames in the speech signal parts of the audio file. In an active three-stimulus oddball task, repeated presentations of the HQ stimulus led to the formation of a sensory/perceptual HQ reference, which was interrupted by infrequent occurrences of degraded “oddball” stimuli (F, N, C). Initial analysis of the obtained subjective and electrophysiological data suggested the following conclusions: 1) Participants perceived the three degraded stimuli as clearly impaired, but equal in terms of degradation intensity. Thus, variations in neural responses were assumed to reflect changes in the perceptual dimension along which the speech degradation had been induced. 2) Timing of the evoked P300 corresponded with temporal differences in the impairments, implying a later onset for “discretely” (F) compared to “continuously” (N, C) degraded stimuli after being categorized as task-irrelevant. Hence, P300 peak latency might prove useful to dissociate both classes of speech quality impairments on a neural level of analysis.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"7 1","pages":"1-3"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87910668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2018-05-01DOI: 10.1109/QoMEX.2018.8463394
J. Korhonen
This In this paper, we study the problem of detecting packet loss distortion and estimating the perceived visibility of such distortion in decoded video. Our analysis is based on the features of the decoded video signal, and we assume that no information about actual packet losses is available from the underlying network or video decoder. First, we present a full-reference method for assessing packet loss visibility at the macroblock, frame and sequence levels. Second, we propose a no-reference method for detecting defected frames, based on spatiotemporal features and machine learning. Experimental results show that the proposed no-reference method achieves a high correlation with the full-reference method at both sequence and frame level. At sequence level, the no-reference method can also predict the subjective quality ratings at high accuracy.
{"title":"Learning-based Prediction of Packet Loss Artifact Visibility in Networked Video","authors":"J. Korhonen","doi":"10.1109/QoMEX.2018.8463394","DOIUrl":"https://doi.org/10.1109/QoMEX.2018.8463394","url":null,"abstract":"This In this paper, we study the problem of detecting packet loss distortion and estimating the perceived visibility of such distortion in decoded video. Our analysis is based on the features of the decoded video signal, and we assume that no information about actual packet losses is available from the underlying network or video decoder. First, we present a full-reference method for assessing packet loss visibility at the macroblock, frame and sequence levels. Second, we propose a no-reference method for detecting defected frames, based on spatiotemporal features and machine learning. Experimental results show that the proposed no-reference method achieves a high correlation with the full-reference method at both sequence and frame level. At sequence level, the no-reference method can also predict the subjective quality ratings at high accuracy.","PeriodicalId":6618,"journal":{"name":"2018 Tenth International Conference on Quality of Multimedia Experience (QoMEX)","volume":"99 1","pages":"1-6"},"PeriodicalIF":0.0,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78356181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}