2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom): latest papers

Comparison of read and spontaneous speech in case of automatic detection of depression

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268245

G. Kiss, K. Vicsi

In this paper, read and spontaneous speech are compared with respect to automatic depression detection by speech processing. First, statistical analysis was carried out to select the acoustic features that differ significantly between healthy and depressed subjects for each of the two types of speech, separately for both genders. Second, statistical examination and classification experiments were performed to compare the values of the selected features across the two types of speech. The goal was to determine which type of speech yields better automatic depression detection results. As expected, tempo-related features, such as articulation rate, speech rate, and pause lengths, are useful for spontaneous speech, while formant trajectories can be used only for read speech, because their values are mainly influenced by the linguistic content of the speech. Despite the significant differences in feature values between read and spontaneous speech, there were no major differences in detection accuracy: 83% detection accuracy was achieved with read speech samples, and 86% with spontaneous speech samples.
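The abstract names the tempo features but does not define them. As a minimal sketch, assuming one common convention (speech rate counts units over the whole recording including pauses, articulation rate counts units over speaking time only), they could be computed from a labelled segmentation as follows. The function name `tempo_features`, the segment format, and the choice of unit are illustrative assumptions, not the authors' implementation.

```python
def tempo_features(segments, n_units):
    """Compute simple tempo features from a labelled segmentation.

    segments: list of (label, start_s, end_s); label is "speech" or "pause"
              (hypothetical format, chosen for this sketch).
    n_units:  number of spoken units (e.g. phones or syllables) in the recording.
    """
    speech_time = sum(end - start for label, start, end in segments
                      if label == "speech")
    pauses = [end - start for label, start, end in segments
              if label == "pause"]
    total_time = speech_time + sum(pauses)
    return {
        # units per second over the whole recording, pauses included
        "speech_rate": n_units / total_time,
        # units per second over speaking time only, pauses excluded
        "articulation_rate": n_units / speech_time,
        # average pause duration in seconds (0.0 if there are no pauses)
        "mean_pause_length": sum(pauses) / len(pauses) if pauses else 0.0,
    }
```

For example, `tempo_features([("speech", 0.0, 2.0), ("pause", 2.0, 3.0), ("speech", 3.0, 5.0)], 20)` separates the 4 s of speaking time from the 1 s pause, so the two rates differ even though the unit count is the same.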

Citations: 17

A prosody inspired RNN approach for punctuation of machine produced speech transcripts to improve human readability

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268246

A. Moro, György Szaszák

Speech-based human-machine interfaces use automatic speech recognition to implement speech-to-text conversion. Unfortunately, little effort has so far been devoted to adding punctuation marks to the recognized word chain after speech recognition. This impairs human readability and makes interpretation hard. This paper presents an effort to restore punctuation marks while keeping the latency of this post-processing step low. The approach exploits prosodic structure and proposes a sequential modelling paradigm based on recurrent neural networks. Results show satisfactory punctuation restoration, especially given that sentence boundaries are reliably detected. Even if the predicted punctuation sequence is not error-free with respect to writing standards, human readers are expected to “repair” these errors more easily than in the case where no punctuation is given at all and the reader is left confused about the basic segmentation of the word chain.

Citations: 9

Numerical analysis of a network evolution model

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268236

I. Fazekas, Attila Perecsényi, B. Porvázsnyik

In this paper we introduce a new network evolution model. The basic feature of the model is the cooperation (interaction) of N nodes. In every step, m new nodes are born, where m is a discrete random variable with values 0, 1, 2, …, N − 1. The m new nodes then interact with N − m old nodes, so that together they form a complete graph on N vertices. The old nodes can be chosen either uniformly or by the preferential attachment rule. We analyze certain properties of this model by computer simulation. Power-law degree and weight distributions and clustering coefficients are studied.
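The evolution step described in the abstract can be sketched as a short simulation. The abstract leaves several details open, so the following are assumptions made only for illustration: the initial graph is a complete graph on N nodes, the distribution of m is passed in as `m_probs`, an edge's weight counts how many interactions it took part in, and preferential attachment picks old nodes with probability proportional to their current degree. The names `evolve` and `pick_old` are hypothetical.

```python
import itertools
import random
from collections import Counter


def pick_old(rng, nodes, degree, k, preferential):
    """Choose k distinct old nodes, uniformly or proportionally to degree."""
    if not preferential:
        return rng.sample(nodes, k)
    pool = list(nodes)
    chosen = []
    for _ in range(k):
        # weighted draw without replacement (assumed form of the rule)
        w = [degree[v] for v in pool]
        i = rng.choices(range(len(pool)), weights=w)[0]
        chosen.append(pool.pop(i))
    return chosen


def evolve(N, steps, m_probs, preferential=False, seed=None):
    """Run `steps` evolution steps; m_probs[k] = P(m = k) for k = 0..N-1.

    Returns (weights, degree): edge weights keyed by frozenset({u, v})
    and node degrees.
    """
    if N < 2 or len(m_probs) != N:
        raise ValueError("need N >= 2 and one probability per value of m")
    rng = random.Random(seed)
    nodes = list(range(N))
    # assumed initial state: one complete graph on N nodes, all weights 1
    weights = Counter(frozenset(e) for e in itertools.combinations(nodes, 2))
    degree = Counter({v: N - 1 for v in nodes})
    for _ in range(steps):
        m = rng.choices(range(N), weights=m_probs)[0]
        old = pick_old(rng, nodes, degree, N - m, preferential)
        new = list(range(len(nodes), len(nodes) + m))
        nodes.extend(new)
        # the m new and N - m old nodes form a complete graph on N vertices
        for u, v in itertools.combinations(old + new, 2):
            edge = frozenset((u, v))
            if edge not in weights:
                degree[u] += 1
                degree[v] += 1
            weights[edge] += 1
    return weights, degree
```

Calling, say, `evolve(4, 10_000, [0.25] * 4, preferential=True, seed=1)` and tabulating `Counter(degree.values())` gives an empirical degree distribution that could be inspected for power-law behaviour; the per-step weighted draw is linear in the number of nodes, which is acceptable for a sketch but not for large-scale runs.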

Citations: 0

Interaction-dependent e-health hub-software adaptation to cloud-based electronic health records

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268267

A. Adamkó, Abel Garai, István Péntek

The past decade brought a significant breakthrough in healthcare interoperability. Among other things, patient information has been thoroughly digitized and shared between the organizational units involved. Healthcare data is stored in electronic health records (EHR). These records have mostly remained on stand-alone servers. However, cloud technology has also penetrated the healthcare industry. As people move and travel more frequently, they are more likely to receive treatment in foreign healthcare institutions and therefore leave their electronic medical footprint in different countries. Industrial cloud providers offer regional, international, or global solutions. This trend eliminates the former technological barriers to cross-border data exchange. This article summarizes the results of the team's research and focuses on the findings of the latest stage of the three-year exploratory program. From a technical point of view, this research phase focuses on three objectives: capturing raw bio-sensory data from the dedicated e-Health device, aggregating and evaluating the data flow in the hub software, and collecting the data into the EHR. This research also simulates the proposed adaptive, event-based, and interoperable healthcare ecosystem. Owing to the elasticity of the cloud architecture, the findings of this pilot project can later be disseminated internationally, and the hub software can be scaled up to serve complex healthcare ecosystems as well.

Citations: 3

The effects of virtual and augmented learning environments on the learning process in secondary school

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268273

Kinga Biró, G. Molnár, Dalma Pap, Zoltán Szűts

Pedagogy is in dire need of a shift, since students are undermotivated. Virtual and augmented reality offer promising solutions, given that the majority of high school students have heard of them. Augmented reality based applications such as Pokémon Go 3D have proven that smartphone applications are capable of exciting and moving people; therefore, they can be used effectively in education.

Citations: 29

Cognitive and spiritual revolution of the tenth century - Constantine Porphyrogenitus and his hidden world: Part I. The Great Monarch's hidden world in the great medieval mystical writings

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268293

P. Várlaki, P. Baranyi

Through a comparative hermeneutical macro- and microanalysis, the paper deals with the identical and very similar representational and meaning systems of such great medieval mystical writings as the Book Bahir, the Targum to Song of Songs, and the Royal Mirror of St Stephen of Hungary. The basis of the comparison is the hidden (“highest”) presence of the spirit of the Great Monarch. In Part II of the paper, the ‘identified’ mystical patterns are compared with the representational and meaning systems of the great medieval artworks related to Constantine Porphyrogenitus.

Citations: 4

Perception of delay tolerant network behavior with cognitive sonification controller

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268241

Mohamed Amine Korteby, Zoltán Gál

Delay Tolerant Networking (DTN) allows communication in challenging and harsh environments where traditional networking fails and new routing and application protocols are required. In such networks, keeping track of and summarizing node behavior is a constant task, because network resources are scarce: nodes have little energy and memory to buffer messages in transit. It is therefore important to exploit these resources efficiently. Several studies propose that accompanying visualization with sonification can ease some of the challenges of constant visual monitoring. In this paper, we simulate three different routing protocols and four movement models to analyze the energy consumption, buffer occupancy, and interconnection time of the nodes under forty-eight different scenarios. Furthermore, we propose a cognitive sonification controller system to assist network administrators in their network management tasks.

Citations: 1

Pilot corpus of child-robot interaction in therapeutic settings

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268252

M. Gnjatović, Jovica Tasevski, D. Mišković, S. Savic, B. Borovac, A. Mikov, R. Krasnik

This paper reports on a pilot corpus of child-robot interaction in therapeutic settings. The corpus comprises recordings of the interactions between twenty-one children and the conversational humanoid robot MARKO, in the kinesitherapeutic room at the Clinic of Paediatric Rehabilitation in Novi Sad, Serbia. The subject group included both healthy children and children with cerebral palsy and similar movement disorders. Approximately 156 minutes of session time was recorded. All dialogues were transcribed, and nonverbal acts were annotated. The initial evaluation of the corpus indicates that children positively respond to MARKO, engage in interaction with MARKO, perform verbal instructions given by MARKO, and experience increased motivation for therapy.

Citations: 10

Introduction of a multi-leveled E-learning environment with community contribution

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268239

Dávid Sik

In this paper we introduce Sysbook, a new multi-leveled e-learning environment. It is an open-access platform, available on the internet to any user. Its main topics cover the field of systems and control, including some mathematical and even philosophical aspects. The purpose of Sysbook is to present systems and control at different levels, addressing readers of different backgrounds and interests. The platform is extended with case studies from different fields and a student area where users can also contribute.

Citations: 1

A bilingual comparison of MaxEnt- and RNN-based punctuation restoration in speech transcripts

2017 8th IEEE International Conference on Cognitive Infocommunications (CogInfoCom)

Pub Date : 2017-09-01 DOI: 10.1109/COGINFOCOM.2017.8268227

Máté Ákos Tündik, Balázs Tarján, György Szaszák

Closed captioning is a common method for improving the accessibility of TV programs for people who are hearing impaired or hard of hearing, and it is an application relevant to cognitive infocommunications. However, live captions provided by automatic speech recognition systems usually lack punctuation, making them hard to follow. In this paper, Maximum Entropy (MaxEnt) and Recurrent Neural Network (RNN) based punctuation restoration models are compared on two closed captioning tasks in real-time and off-line setups. We present the first results in restoring punctuation for Hungarian broadcast speech, where the RNN significantly outperforms our MaxEnt baseline system. Our approach is also evaluated on TED talks within the IWSLT English dataset, yielding results comparable to state-of-the-art systems.

Citations: 8
