首页 > 最新文献

Int. J. Semantic Comput.最新文献

英文 中文
Graph Computing for Financial Crime and Fraud Detection: Trends, Challenges and Outlook 图形计算用于金融犯罪和欺诈检测:趋势、挑战和展望
Pub Date : 2020-12-01 DOI: 10.1142/S1793351X20300022
Eren Kurshan, Hongda Shen
The rise of digital payments has caused consequential changes in the financial crime landscape. As a result, traditional fraud detection approaches such as rule-based systems have largely become ineffective. Artificial intelligence (AI) and machine learning solutions using graph computing principles have gained significant interest in recent years. Graph-based techniques provide unique solution opportunities for financial crime detection. However, implementing such solutions at industrial-scale in real-time financial transaction processing systems has brought numerous application challenges to light. In this paper, we discuss the implementation difficulties current and next-generation graph solutions face. Furthermore, financial crime and digital payments trends indicate emerging challenges in the continued effectiveness of the detection techniques. We analyze the threat landscape and argue that it provides key insights for developing graph-based solutions.
数字支付的兴起给金融犯罪领域带来了重大变化。因此,传统的欺诈检测方法,如基于规则的系统,在很大程度上已经变得无效。人工智能(AI)和使用图计算原理的机器学习解决方案近年来获得了极大的兴趣。基于图形的技术为金融犯罪检测提供了独特的解决方案。然而,在实时金融交易处理系统中实现这种工业规模的解决方案带来了许多应用挑战。在本文中,我们讨论了当前和下一代图形解决方案所面临的实现困难。此外,金融犯罪和数字支付趋势表明,检测技术的持续有效性面临新的挑战。我们分析了威胁形势,并认为它为开发基于图形的解决方案提供了关键见解。
{"title":"Graph Computing for Financial Crime and Fraud Detection: Trends, Challenges and Outlook","authors":"Eren Kurshan, Hongda Shen","doi":"10.1142/S1793351X20300022","DOIUrl":"https://doi.org/10.1142/S1793351X20300022","url":null,"abstract":"The rise of digital payments has caused consequential changes in the financial crime landscape. As a result, traditional fraud detection approaches such as rule-based systems have largely become ineffective. Artificial intelligence (AI) and machine learning solutions using graph computing principles have gained significant interest in recent years. Graph-based techniques provide unique solution opportunities for financial crime detection. However, implementing such solutions at industrial-scale in real-time financial transaction processing systems has brought numerous application challenges to light. In this paper, we discuss the implementation difficulties current and next-generation graph solutions face. Furthermore, financial crime and digital payments trends indicate emerging challenges in the continued effectiveness of the detection techniques. We analyze the threat landscape and argue that it provides key insights for developing graph-based solutions.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"98 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133553737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A Case for 3D Integrated System Design for Neuromorphic Computing & AI Applications 神经形态计算与人工智能应用的三维集成系统设计案例
Pub Date : 2020-12-01 DOI: 10.1142/S1793351X20500063
Eren Kurshan, H. Li, Mingoo Seok, Yuan Xie
Over the last decade, artificial intelligence has found many applications areas in the society. As AI solutions have become more sophistication and the use cases grew, they highlighted the need to address performance and energy efficiency challenges faced during the implementation process. To address these challenges, there has been growing interest in neuromorphic chips. Neuromorphic computing relies on non von Neumann architectures as well as novel devices, circuits and manufacturing technologies to mimic the human brain. Among such technologies, 3D integration is an important enabler for AI hardware and the continuation of the scaling laws. In this paper, we overview the unique opportunities 3D integration provides in neuromorphic chip design, discuss the emerging opportunities in next generation neuromorphic architectures and review the obstacles. Neuromorphic architectures, which relied on the brain for inspiration and emulation purposes, face grand challenges due to the limited understanding of the functionality and the architecture of the human brain. Yet, high-levels of investments are dedicated to develop neuromorphic chips. We argue that 3D integration not only provides strategic advantages to the cost-effective and flexible design of neuromorphic chips, it may provide design flexibility in incorporating advanced capabilities to further benefits the designs in the future.
在过去的十年中,人工智能在社会中找到了许多应用领域。随着人工智能解决方案变得越来越复杂,用例也越来越多,他们强调需要解决实施过程中面临的性能和能效挑战。为了应对这些挑战,人们对神经形态芯片的兴趣日益浓厚。神经形态计算依赖于非冯·诺伊曼架构以及新颖的设备、电路和制造技术来模拟人类的大脑。在这些技术中,3D集成是人工智能硬件和缩放定律延续的重要推动者。在本文中,我们概述了3D集成在神经形态芯片设计中提供的独特机会,讨论了下一代神经形态架构中出现的机会,并回顾了障碍。由于对人类大脑的功能和结构的有限理解,依赖于大脑的灵感和仿真目的的神经形态架构面临着巨大的挑战。然而,高水平的投资致力于开发神经形态芯片。我们认为,3D集成不仅为神经形态芯片的成本效益和灵活设计提供了战略优势,它还可以为整合先进功能提供设计灵活性,从而进一步促进未来的设计。
{"title":"A Case for 3D Integrated System Design for Neuromorphic Computing & AI Applications","authors":"Eren Kurshan, H. Li, Mingoo Seok, Yuan Xie","doi":"10.1142/S1793351X20500063","DOIUrl":"https://doi.org/10.1142/S1793351X20500063","url":null,"abstract":"Over the last decade, artificial intelligence has found many applications areas in the society. As AI solutions have become more sophistication and the use cases grew, they highlighted the need to address performance and energy efficiency challenges faced during the implementation process. To address these challenges, there has been growing interest in neuromorphic chips. Neuromorphic computing relies on non von Neumann architectures as well as novel devices, circuits and manufacturing technologies to mimic the human brain. Among such technologies, 3D integration is an important enabler for AI hardware and the continuation of the scaling laws. In this paper, we overview the unique opportunities 3D integration provides in neuromorphic chip design, discuss the emerging opportunities in next generation neuromorphic architectures and review the obstacles. Neuromorphic architectures, which relied on the brain for inspiration and emulation purposes, face grand challenges due to the limited understanding of the functionality and the architecture of the human brain. Yet, high-levels of investments are dedicated to develop neuromorphic chips. We argue that 3D integration not only provides strategic advantages to the cost-effective and flexible design of neuromorphic chips, it may provide design flexibility in incorporating advanced capabilities to further benefits the designs in the future.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125118913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Review on Lexicon-Based and Machine Learning Political Sentiment Analysis Using Tweets 基于词典和机器学习的推文政治情绪分析综述
Pub Date : 2020-12-01 DOI: 10.1142/S1793351X20300010
Alexandros Britzolakis, H. Kondylakis, N. Papadakis
Sentiment analysis over social media platforms has been an active case of study for more than a decade. This occurs due to the constant rising of Internet users over these platforms, as well as to the increasing interest of companies for monitoring the opinion of customers over commercial products. Most of these platforms provide free, online services such as the creation of interactive web communities, multimedia content uploading, etc. This new way of communication has affected human societies as it shaped the way by which an opinion can be expressed, sparking the era of digital revolution. One of the most profound examples of social networking platforms for opinion mining is Twitter as it is a great source for extracting news and a platform which politicians tend to use frequently. In addition to that, the character limitation per posted tweet (maximum of 280 characters) makes it easier for automated tools to extract its underlying sentiment. In this review paper, we present a variety of lexicon-based tools as well as machine learning algorithms used for sentiment extraction. Furthermore, we present additional implementations used for political sentiment analysis over Twitter as well as additional open topics. We hope the review will help readers to understand this scientifically rich area, identify best options for their work and work on open topics.
十多年来,社交媒体平台上的情绪分析一直是一个活跃的研究案例。这是由于互联网用户在这些平台上的不断增加,以及公司越来越有兴趣监控客户对商业产品的意见。这些平台大多提供免费的在线服务,如创建交互式网络社区、上传多媒体内容等。这种新的沟通方式影响了人类社会,因为它塑造了表达意见的方式,引发了数字革命时代。在舆论挖掘方面,最具影响力的社交网络平台之一是Twitter,因为它是提取新闻的重要来源,也是政客们经常使用的平台。除此之外,每条推文的字符限制(最多280个字符)使自动化工具更容易提取其潜在的情感。在这篇综述文章中,我们提出了各种基于词典的工具以及用于情感提取的机器学习算法。此外,我们还提供了用于Twitter上的政治情绪分析以及其他开放主题的其他实现。我们希望这篇综述能帮助读者了解这个科学丰富的领域,为他们的工作和开放主题的工作确定最佳选择。
{"title":"A Review on Lexicon-Based and Machine Learning Political Sentiment Analysis Using Tweets","authors":"Alexandros Britzolakis, H. Kondylakis, N. Papadakis","doi":"10.1142/S1793351X20300010","DOIUrl":"https://doi.org/10.1142/S1793351X20300010","url":null,"abstract":"Sentiment analysis over social media platforms has been an active case of study for more than a decade. This occurs due to the constant rising of Internet users over these platforms, as well as to the increasing interest of companies for monitoring the opinion of customers over commercial products. Most of these platforms provide free, online services such as the creation of interactive web communities, multimedia content uploading, etc. This new way of communication has affected human societies as it shaped the way by which an opinion can be expressed, sparking the era of digital revolution. One of the most profound examples of social networking platforms for opinion mining is Twitter as it is a great source for extracting news and a platform which politicians tend to use frequently. In addition to that, the character limitation per posted tweet (maximum of 280 characters) makes it easier for automated tools to extract its underlying sentiment. In this review paper, we present a variety of lexicon-based tools as well as machine learning algorithms used for sentiment extraction. Furthermore, we present additional implementations used for political sentiment analysis over Twitter as well as additional open topics. We hope the review will help readers to understand this scientifically rich area, identify best options for their work and work on open topics.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"87 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120882124","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Vulnerability Rating of Source Code with Token Embedding and Combinatorial Algorithms 基于令牌嵌入和组合算法的源代码漏洞评级
Pub Date : 2020-12-01 DOI: 10.1142/S1793351X20500087
Joseph R. Barr, Peter Shaw, F. Abu-Khzam, Tyler Thatcher, Sheng Yu
We present an empirical analysis of the source code of the Fluoride Bluetooth module, which is a part of standard Android OS distribution, by exhibiting a novel approach for classifying and scoring source code and vulnerability rating. Our workflow combines deep learning, combinatorial optimization, heuristics and machine learning. A combination of heuristics and deep learning is used to embed function (and method) labels into a low-dimensional Euclidean space. Because the corpus of the Fluoride source code is rather limited (containing approximately 12,000 functions), a straightforward embedding (using, e.g. code2vec) is untenable. To overcome the challenge of dearth of data, it is necessary to go through an intermediate step of Byte-Pair Encoding. Subsequently, we embed the tokens from which we assemble an embedding of function/method labels. Long short-term memory network (LSTM) is used to embed tokens. The next step is to form a distance matrix consisting of the cosines between every pairs of vectors (function embedding) which in turn is interpreted as a (combinatorial) graph whose vertices represent functions, and edges correspond to entries whose value exceed some given threshold. Cluster-Editing is then applied to partition the vertex set of the graph into subsets representing “dense graphs,” that are nearly complete subgraphs. Finally, the vectors representing the components, plus additional heuristic-based features are used as features to model the components for vulnerability risk.
我们通过展示一种对源代码进行分类和评分以及漏洞评级的新方法,对标准Android操作系统分发版的一部分氟化物蓝牙模块的源代码进行了实证分析。我们的工作流程结合了深度学习、组合优化、启发式和机器学习。启发式和深度学习的结合用于将函数(和方法)标签嵌入到低维欧几里得空间中。由于氟化物源代码的语料库相当有限(包含大约12,000个函数),因此直接嵌入(例如使用code2vec)是站不住脚的。为了克服数据缺乏的挑战,有必要经过字节对编码的中间步骤。随后,我们嵌入令牌,从中组装函数/方法标签的嵌入。使用长短期记忆网络(LSTM)嵌入令牌。下一步是形成一个距离矩阵,由每对向量(函数嵌入)之间的余弦组成,这反过来被解释为一个(组合)图,其顶点表示函数,而边对应于值超过某个给定阈值的条目。然后应用聚类编辑将图的顶点集划分为代表“密集图”的子集,这些子集是几乎完全的子图。最后,使用表示组件的向量和附加的启发式特征作为特征来对组件进行漏洞风险建模。
{"title":"Vulnerability Rating of Source Code with Token Embedding and Combinatorial Algorithms","authors":"Joseph R. Barr, Peter Shaw, F. Abu-Khzam, Tyler Thatcher, Sheng Yu","doi":"10.1142/S1793351X20500087","DOIUrl":"https://doi.org/10.1142/S1793351X20500087","url":null,"abstract":"We present an empirical analysis of the source code of the Fluoride Bluetooth module, which is a part of standard Android OS distribution, by exhibiting a novel approach for classifying and scoring source code and vulnerability rating. Our workflow combines deep learning, combinatorial optimization, heuristics and machine learning. A combination of heuristics and deep learning is used to embed function (and method) labels into a low-dimensional Euclidean space. Because the corpus of the Fluoride source code is rather limited (containing approximately 12,000 functions), a straightforward embedding (using, e.g. code2vec) is untenable. To overcome the challenge of dearth of data, it is necessary to go through an intermediate step of Byte-Pair Encoding. Subsequently, we embed the tokens from which we assemble an embedding of function/method labels. Long short-term memory network (LSTM) is used to embed tokens. The next step is to form a distance matrix consisting of the cosines between every pairs of vectors (function embedding) which in turn is interpreted as a (combinatorial) graph whose vertices represent functions, and edges correspond to entries whose value exceed some given threshold. Cluster-Editing is then applied to partition the vertex set of the graph into subsets representing “dense graphs,” that are nearly complete subgraphs. Finally, the vectors representing the components, plus additional heuristic-based features are used as features to model the components for vulnerability risk.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"67 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124464191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Biofeedback Enhanced Adaptive Virtual Reality Environment for Managing Surgical Pain and Anxiety 生物反馈增强自适应虚拟现实环境管理手术疼痛和焦虑
Pub Date : 2020-09-01 DOI: 10.1142/s1793351x20400152
Vishnunarayan Girishan Prabhu, L. Stanley, Robert Morgan
Pain and anxiety are common accompaniments of surgery, and opioids have been the mainstay of pain management for decades, with about 80% of the surgical population leaving the hospital with an opioid prescription. Moreover, patients receiving an opioid prescription after short-stay surgeries have a 44% increased risk of long-term opioid use, and about one in 16 surgical patients becomes a long-term user. Current opioid abuse and addiction now place the US in an “opioid epidemic,” and calls for alternative pain management mechanisms. To mitigate the preoperative anxiety and postoperative pain, we developed a virtual reality (VR) experience based on Attention Restoration Theory (ART) and integrated the user’s heart rate variability (HRV) biofeedback to create an adaptive environment. A randomized control trial among 16 Total Knee Arthroplasty (TKA) patients undergoing surgery at Patewood Memorial Hospital, Greenville, SC demonstrated that patients experiencing the adaptive VR environment reported a significant decrease in preoperative anxiety ([Formula: see text]) and postoperative pain ([Formula: see text]) after the VR intervention. These results were also supported by the physiological measures where there was a significant increase in RR Interval (RRI) ([Formula: see text]) and a significant decrease in the low frequency (LF)/high frequency (HF) ratio ([Formula: see text]) and respiration rate (RR) ([Formula: see text]).
疼痛和焦虑是手术的常见伴随症状,几十年来,阿片类药物一直是疼痛治疗的主要手段,大约80%的手术患者出院时都有阿片类药物处方。此外,在短期手术后接受阿片类药物处方的患者长期使用阿片类药物的风险增加了44%,大约每16例手术患者中就有1例成为长期使用者。目前的阿片类药物滥用和成瘾使美国陷入了“阿片类药物流行病”,并呼吁建立替代的疼痛管理机制。为了减轻术前焦虑和术后疼痛,我们开发了一种基于注意力恢复理论(ART)的虚拟现实(VR)体验,并结合用户的心率变异性(HRV)生物反馈来创造一个自适应环境。在南卡罗来纳州Greenville的Patewood纪念医院进行的16例全膝关节置换术(TKA)患者的随机对照试验表明,经历适应性VR环境的患者在VR干预后,术前焦虑([公式:见文])和术后疼痛([公式:见文])显著减少。这些结果也得到了生理测量的支持,即RR间隔(RRI)([公式:见文])显著增加,低频(LF)/高频(HF)比([公式:见文])和呼吸速率(RR)([公式:见文])显著降低。
{"title":"A Biofeedback Enhanced Adaptive Virtual Reality Environment for Managing Surgical Pain and Anxiety","authors":"Vishnunarayan Girishan Prabhu, L. Stanley, Robert Morgan","doi":"10.1142/s1793351x20400152","DOIUrl":"https://doi.org/10.1142/s1793351x20400152","url":null,"abstract":"Pain and anxiety are common accompaniments of surgery, and opioids have been the mainstay of pain management for decades, with about 80% of the surgical population leaving the hospital with an opioid prescription. Moreover, patients receiving an opioid prescription after short-stay surgeries have a 44% increased risk of long-term opioid use, and about one in 16 surgical patients becomes a long-term user. Current opioid abuse and addiction now place the US in an “opioid epidemic,” and calls for alternative pain management mechanisms. To mitigate the preoperative anxiety and postoperative pain, we developed a virtual reality (VR) experience based on Attention Restoration Theory (ART) and integrated the user’s heart rate variability (HRV) biofeedback to create an adaptive environment. A randomized control trial among 16 Total Knee Arthroplasty (TKA) patients undergoing surgery at Patewood Memorial Hospital, Greenville, SC demonstrated that patients experiencing the adaptive VR environment reported a significant decrease in preoperative anxiety ([Formula: see text]) and postoperative pain ([Formula: see text]) after the VR intervention. These results were also supported by the physiological measures where there was a significant increase in RR Interval (RRI) ([Formula: see text]) and a significant decrease in the low frequency (LF)/high frequency (HF) ratio ([Formula: see text]) and respiration rate (RR) ([Formula: see text]).","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131772801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Semantic Analysis for Conversational Datasets: Improving Their Quality Using Semantic Relationships 会话数据集的语义分析:利用语义关系提高其质量
Pub Date : 2020-09-01 DOI: 10.1142/s1793351x2050004x
Maria Krommyda, Verena Kantere
As more and more datasets become available, their utilization in different applications increases in popularity. Their volume and production rate, however, means that their quality and content control is in most cases non-existing, resulting in many datasets that contain inaccurate information of low quality. Especially, in the field of conversational assistants, where the datasets come from many heterogeneous sources with no quality assurance, the problem is aggravated. We present here an integrated platform that creates task- and topic-specific conversational datasets to be used for training conversational agents. The platform explores available conversational datasets, extracts information based on semantic similarity and relatedness, and applies a weight-based score function to rank the information based on its value for the specific task and topic. The finalized dataset can then be used for the training of an automated conversational assistance over accurate data of high quality.
随着越来越多的数据集变得可用,它们在不同应用程序中的使用率也越来越高。然而,它们的数量和生产速度意味着它们的质量和内容控制在大多数情况下不存在,导致许多数据集包含低质量的不准确信息。特别是在会话助手领域,数据集来自许多异构来源,没有质量保证,问题更加严重。我们在这里提出了一个集成平台,它创建特定于任务和主题的会话数据集,用于训练会话代理。该平台探索可用的会话数据集,根据语义相似性和相关性提取信息,并应用基于权重的评分函数根据其对特定任务和主题的价值对信息进行排名。最终的数据集可以用于训练高质量的准确数据上的自动会话辅助。
{"title":"Semantic Analysis for Conversational Datasets: Improving Their Quality Using Semantic Relationships","authors":"Maria Krommyda, Verena Kantere","doi":"10.1142/s1793351x2050004x","DOIUrl":"https://doi.org/10.1142/s1793351x2050004x","url":null,"abstract":"As more and more datasets become available, their utilization in different applications increases in popularity. Their volume and production rate, however, means that their quality and content control is in most cases non-existing, resulting in many datasets that contain inaccurate information of low quality. Especially, in the field of conversational assistants, where the datasets come from many heterogeneous sources with no quality assurance, the problem is aggravated. We present here an integrated platform that creates task- and topic-specific conversational datasets to be used for training conversational agents. The platform explores available conversational datasets, extracts information based on semantic similarity and relatedness, and applies a weight-based score function to rank the information based on its value for the specific task and topic. The finalized dataset can then be used for the training of an automated conversational assistance over accurate data of high quality.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121062559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Multi-Platform Expansion of the Virtual Human Toolkit: Ubiquitous Conversational Agents 虚拟人类工具箱的多平台扩展:无处不在的会话代理
Pub Date : 2020-09-01 DOI: 10.1142/s1793351x20400127
Arno Hartholt, Edward Fast, Adam Reilly, W. Whitcup, Matt Liewer, S. Mozgai
We present an extension of the Virtual Human Toolkit to include a range of computing platforms, including mobile, web, Virtual Reality (VR) and Augmented Reality (AR). The Toolkit uses a mix of in-house and commodity technologies to support audio-visual sensing, speech recognition, natural language processing, nonverbal behavior generation and realization, text-to-speech generation and rendering. It has been extended to support computing platforms beyond Windows by leveraging microservices. The resulting framework maintains the modularity of the underlying architecture, allows re-use of both logic and content through cloud services, and is extensible by porting lightweight clients. We present the current state of the framework, discuss how we model and animate our characters, and offer lessons learned through several use cases, including expressive character animation in seated VR, shared space and navigation in room-scale VR, autonomous AI in mobile AR, and real-time user performance feedback leveraging mobile sensors in headset AR.
我们提出了一个扩展的虚拟人类工具包,包括一系列的计算平台,包括移动,网络,虚拟现实(VR)和增强现实(AR)。该工具包使用内部和商品技术的混合,以支持视听传感、语音识别、自然语言处理、非语言行为生成和实现、文本到语音的生成和渲染。通过利用微服务,它已经扩展到支持Windows以外的计算平台。生成的框架维护了底层体系结构的模块化,允许通过云服务重用逻辑和内容,并且可以通过移植轻量级客户端进行扩展。我们介绍了框架的当前状态,讨论了我们如何建模和动画我们的角色,并通过几个用例提供了经验教训,包括坐式VR中的富有表现力的角色动画,房间级VR中的共享空间和导航,移动AR中的自主AI,以及利用耳机AR中的移动传感器的实时用户性能反馈。
{"title":"Multi-Platform Expansion of the Virtual Human Toolkit: Ubiquitous Conversational Agents","authors":"Arno Hartholt, Edward Fast, Adam Reilly, W. Whitcup, Matt Liewer, S. Mozgai","doi":"10.1142/s1793351x20400127","DOIUrl":"https://doi.org/10.1142/s1793351x20400127","url":null,"abstract":"We present an extension of the Virtual Human Toolkit to include a range of computing platforms, including mobile, web, Virtual Reality (VR) and Augmented Reality (AR). The Toolkit uses a mix of in-house and commodity technologies to support audio-visual sensing, speech recognition, natural language processing, nonverbal behavior generation and realization, text-to-speech generation and rendering. It has been extended to support computing platforms beyond Windows by leveraging microservices. The resulting framework maintains the modularity of the underlying architecture, allows re-use of both logic and content through cloud services, and is extensible by porting lightweight clients. We present the current state of the framework, discuss how we model and animate our characters, and offer lessons learned through several use cases, including expressive character animation in seated VR, shared space and navigation in room-scale VR, autonomous AI in mobile AR, and real-time user performance feedback leveraging mobile sensors in headset AR.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133626965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video with Applications for Virtual Reality 圆柱全景视频深度和自我运动的无监督学习及其在虚拟现实中的应用
Pub Date : 2020-09-01 DOI: 10.1142/S1793351X20400139
Alisha Sharma, Ryan Nett, Jonathan Ventura
We introduce a convolutional neural network model for unsupervised learning of depth and ego-motion from cylindrical panoramic video. Panoramic depth estimation is an important technology for applications such as virtual reality, 3D modeling, and autonomous robotic navigation. In contrast to previous approaches for applying convolutional neural networks to panoramic imagery, we use the cylindrical panoramic projection which allows for the use of the traditional CNN layers such as convolutional filters and max pooling without modification. Our evaluation of synthetic and real data shows that unsupervised learning of depth and ego-motion on cylindrical panoramic images can produce high-quality depth maps and that an increased field-of-view improves ego-motion estimation accuracy. We create two new datasets to evaluate our approach: a synthetic dataset created using the CARLA simulator, and Headcam, a novel dataset of panoramic video collected from a helmet-mounted camera while biking in an urban setting. We also apply our network to the problem of converting monocular panoramas to stereo panoramas.
本文提出了一种卷积神经网络模型,用于对圆柱全景视频的深度和自我运动进行无监督学习。全景深度估计是虚拟现实、三维建模和自主机器人导航等应用中的一项重要技术。与之前将卷积神经网络应用于全景图像的方法相比,我们使用圆柱形全景投影,它允许使用传统的CNN层,如卷积滤波器和最大池化,而无需修改。我们对合成数据和真实数据的评估表明,在圆柱形全景图像上对深度和自我运动进行无监督学习可以生成高质量的深度图,并且增加的视场可以提高自我运动估计的准确性。我们创建了两个新的数据集来评估我们的方法:一个是使用CARLA模拟器创建的合成数据集,另一个是Headcam,这是一个在城市环境中骑自行车时从头盔摄像头收集的全景视频的新数据集。我们还将我们的网络应用于将单目全景图转换为立体全景图的问题。
{"title":"Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video with Applications for Virtual Reality","authors":"Alisha Sharma, Ryan Nett, Jonathan Ventura","doi":"10.1142/S1793351X20400139","DOIUrl":"https://doi.org/10.1142/S1793351X20400139","url":null,"abstract":"We introduce a convolutional neural network model for unsupervised learning of depth and ego-motion from cylindrical panoramic video. Panoramic depth estimation is an important technology for applications such as virtual reality, 3D modeling, and autonomous robotic navigation. In contrast to previous approaches for applying convolutional neural networks to panoramic imagery, we use the cylindrical panoramic projection which allows for the use of the traditional CNN layers such as convolutional filters and max pooling without modification. Our evaluation of synthetic and real data shows that unsupervised learning of depth and ego-motion on cylindrical panoramic images can produce high-quality depth maps and that an increased field-of-view improves ego-motion estimation accuracy. We create two new datasets to evaluate our approach: a synthetic dataset created using the CARLA simulator, and Headcam, a novel dataset of panoramic video collected from a helmet-mounted camera while biking in an urban setting. We also apply our network to the problem of converting monocular panoramas to stereo panoramas.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"516 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133132860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Emergent Languages from Pretrained Embeddings Characterize Latent Concepts in Dynamic Imagery 基于预训练嵌入的涌现语言表征动态图像中的潜在概念
Pub Date : 2020-09-01 DOI: 10.1142/s1793351x20400140
James R. Kubricht, A. Santamaría-Pang, Chinmaya Devaraj, Aritra Chowdhury, P. Tu
Recent unsupervised learning approaches have explored the feasibility of semantic analysis and interpretation of imagery using Emergent Language (EL) models. As EL requires some form of numerical embedding as input, it remains unclear which type is required in order for the EL to properly capture key semantic concepts associated with a given domain. In this paper, we compare unsupervised and supervised approaches for generating embeddings across two experiments. In Experiment 1, data are produced using a single-agent simulator. In each episode, a goal-driven agent attempts to accomplish a number of tasks in a synthetic cityscape environment which includes houses, banks, theaters and restaurants. In Experiment 2, a comparatively smaller dataset is produced where one or more objects demonstrate various types of physical motion in a 3D simulator environment. We investigate whether EL models generated from embeddings of raw pixel data produce expressions that capture key latent concepts (i.e. an agent’s motivations or physical motion types) in each environment. Our initial experiments show that the supervised learning approaches yield embeddings and EL descriptions that capture meaningful concepts from raw pixel inputs. Alternatively, embeddings from an unsupervised learning approach result in greater ambiguity with respect to latent concepts.
最近的无监督学习方法已经探索了使用紧急语言模型对图像进行语义分析和解释的可行性。由于EL需要某种形式的数字嵌入作为输入,因此尚不清楚EL需要哪种类型才能正确捕获与给定领域相关的关键语义概念。在本文中,我们比较了在两个实验中生成嵌入的无监督和有监督方法。在实验1中,数据是使用单代理模拟器生成的。在每一集中,一个目标驱动的代理人试图在一个合成的城市景观环境中完成一些任务,包括房屋、银行、剧院和餐馆。在实验2中,生成了一个相对较小的数据集,其中一个或多个对象在3D模拟器环境中展示了各种类型的物理运动。我们研究了从原始像素数据嵌入中生成的EL模型是否产生了在每个环境中捕获关键潜在概念(即代理的动机或物理运动类型)的表达式。我们的初步实验表明,监督学习方法产生了从原始像素输入中捕获有意义概念的嵌入和EL描述。另外,来自无监督学习方法的嵌入会导致相对于潜在概念的更大的模糊性。
{"title":"Emergent Languages from Pretrained Embeddings Characterize Latent Concepts in Dynamic Imagery","authors":"James R. Kubricht, A. Santamaría-Pang, Chinmaya Devaraj, Aritra Chowdhury, P. Tu","doi":"10.1142/s1793351x20400140","DOIUrl":"https://doi.org/10.1142/s1793351x20400140","url":null,"abstract":"Recent unsupervised learning approaches have explored the feasibility of semantic analysis and interpretation of imagery using Emergent Language (EL) models. As EL requires some form of numerical embedding as input, it remains unclear which type is required in order for the EL to properly capture key semantic concepts associated with a given domain. In this paper, we compare unsupervised and supervised approaches for generating embeddings across two experiments. In Experiment 1, data are produced using a single-agent simulator. In each episode, a goal-driven agent attempts to accomplish a number of tasks in a synthetic cityscape environment which includes houses, banks, theaters and restaurants. In Experiment 2, a comparatively smaller dataset is produced where one or more objects demonstrate various types of physical motion in a 3D simulator environment. We investigate whether EL models generated from embeddings of raw pixel data produce expressions that capture key latent concepts (i.e. an agent’s motivations or physical motion types) in each environment. Our initial experiments show that the supervised learning approaches yield embeddings and EL descriptions that capture meaningful concepts from raw pixel inputs. Alternatively, embeddings from an unsupervised learning approach result in greater ambiguity with respect to latent concepts.","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116135363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Accurate Lane Detection for Self-Driving Cars: An Approach Based on Color Filter Adjustment and K-Means Clustering Filter 自动驾驶汽车精确车道检测:基于颜色滤波调整和k均值聚类滤波的方法
Pub Date : 2020-06-09 DOI: 10.1142/s1793351x20500038
Dongfang Liu, Yaqin Wang, Tian Chen, E. Matson
Lane detection is a crucial factor for self-driving cars to achieve a fully autonomous mode. Due to its importance, lane detection has drawn wide attention in recent years for autonomous driving. O...
车道检测是自动驾驶汽车实现完全自动驾驶的关键因素。由于车道检测的重要性,近年来在自动驾驶领域受到了广泛关注。阿……
{"title":"Accurate Lane Detection for Self-Driving Cars: An Approach Based on Color Filter Adjustment and K-Means Clustering Filter","authors":"Dongfang Liu, Yaqin Wang, Tian Chen, E. Matson","doi":"10.1142/s1793351x20500038","DOIUrl":"https://doi.org/10.1142/s1793351x20500038","url":null,"abstract":"Lane detection is a crucial factor for self-driving cars to achieve a fully autonomous mode. Due to its importance, lane detection has drawn wide attention in recent years for autonomous driving. O...","PeriodicalId":217956,"journal":{"name":"Int. J. Semantic Comput.","volume":"295 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123239948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Int. J. Semantic Comput.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1