
Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web: Latest Articles

Proposal to Use of the Websocket Protocol for Web Device Control
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3126887
Adriano H. O. Maia, D. Silva
The Websocket protocol enables full-duplex communication; it also simplifies data exchange and reduces network overload. This paper proposes using the Websocket protocol to control and service devices over the web under real-time requirements. Tests in a virtual environment and in an embedded experiment validate an initial proposal for implementing the Websocket protocol. Analysis of the results shows that the proposal considerably reduces the number of requests and the amount of transferred data compared with the traditional approach to sending data in HTTP-based communications. Consequently, it seems to be a very promising technique for this type of application.
Citations: 3
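As a rough illustration of the reduction in requests and transferred data that the abstract above reports, the following sketch compares per-message byte counts for HTTP polling and for a single WebSocket connection. The frame header sizes follow the WebSocket framing rules (2 bytes for payloads up to 125 bytes, longer extended-length fields beyond that, plus a 4-byte masking key on client-to-server frames). The HTTP header size, the handshake size, and all function names are assumptions chosen for illustration; they are not figures or code from the paper, and TCP/IP overhead is ignored.

```python
def ws_frame_overhead(payload_len: int, masked: bool) -> int:
    """Header bytes of a single WebSocket frame for a given payload length."""
    if payload_len <= 125:
        header = 2       # base header only
    elif payload_len <= 0xFFFF:
        header = 4       # base header + 2-byte extended length
    else:
        header = 10      # base header + 8-byte extended length
    # Client-to-server frames carry a 4-byte masking key.
    return header + (4 if masked else 0)


def http_polling_bytes(n_messages: int, payload_len: int,
                       header_bytes: int = 400) -> int:
    """Bytes moved when each message costs one request and one response.

    header_bytes is an assumed average HTTP header size per direction.
    """
    return n_messages * (2 * header_bytes + payload_len)


def websocket_bytes(n_messages: int, payload_len: int,
                    handshake_bytes: int = 800) -> int:
    """Bytes moved with one upgrade handshake, then one unmasked
    server-to-client frame per message.

    handshake_bytes is an assumed size for the HTTP upgrade exchange.
    """
    per_frame = ws_frame_overhead(payload_len, masked=False) + payload_len
    return handshake_bytes + n_messages * per_frame
```

Under these assumed sizes, 1000 messages of 20 bytes each come to 820,000 bytes with polling and 22,800 bytes over the WebSocket connection; real ratios depend on the actual headers in use.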
Evaluating Ensemble Strategies for Recommender Systems under Metadata Reduction
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3126879
Lassion Laique Bomfim de Souza Santana, Alesson Bruno Santos Souza, Diego Lima Santana, Wendel Araújo Dourado, F. Durão
Recommender systems are information filtering tools that aspire to predict accurate ratings for users and items, with the ultimate goal of providing users with personalized and relevant recommendations. Recommender systems that rely on the combination of quality metadata, i.e., all descriptive information about an item, are likely to be successful in finding what is or is not relevant for a target user. The problem arises when either data is sparse or important metadata is not available, making it hard for recommender systems to predict proper user-item ratings. In particular, this study investigates how our proposed collaborative-filtering recommender performs when important metadata is reduced from a dataset. To evaluate our approach, we use the HetRec 2011 2k dataset with five different types of movie metadata (genres, tags, directors, actors and countries). By applying our approach of metadata reduction, we provide a comprehensive analysis of how mean average precision is affected as important metadata becomes unavailable.
Citations: 2
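The abstract above evaluates recommenders by mean average precision (MAP). A minimal sketch of that metric follows, assuming each user has a ranked recommendation list without duplicates and a set of relevant items. This version normalizes by the number of relevant items; the abstract does not say which MAP variant or cutoff the authors used, so the details and the function names here are assumptions.

```python
from typing import Hashable, Sequence, Set


def average_precision(ranked: Sequence[Hashable],
                      relevant: Set[Hashable]) -> float:
    """Average precision of one ranked list against a set of relevant items."""
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            # Precision at this rank, counted only where a relevant item appears.
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(rankings: Sequence[Sequence[Hashable]],
                           relevants: Sequence[Set[Hashable]]) -> float:
    """Mean of per-user average precision; 0.0 when there are no users."""
    scores = [average_precision(r, rel) for r, rel in zip(rankings, relevants)]
    return sum(scores) / len(scores) if scores else 0.0
```

Comparing this score on the full dataset against the score after one metadata type is removed is one way to read the effect the abstract describes.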
An Adaptable Transmission Management Framework for Push-mode Hypermedia Content
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3126869
M. Josué, M. Moreno, R. Costa
In the delivery of hypermedia content over communication networks, the specified intermedia synchronization must be assured despite the inherent delay and jitter of most transmission media and networks. This kind of content typically offers users multiple interaction paths, each with a different set of media objects. Nevertheless, when hypermedia content is transmitted in push mode, users receive all media objects, regardless of the chosen interaction path. Transmission strategies that take into account the occurrence of both deterministic and non-deterministic hypermedia presentation events can decrease the waste of storage resources on the receiver side, as well as the need for network bandwidth. This work proposes a framework for adaptable management of push-mode hypermedia content transmission. Adaptability is achieved by supporting multiple transmission strategies that may employ multiple transmission channels, which are built upon a content analysis for the identification of deterministic and non-deterministic hypermedia presentation events. Methods for instantiating the framework in the context of Ginga-NCL application transmission are also discussed over multiple transmission scenarios, in comparison with the existing, unmanaged content transmission.
Citations: 2
Profiling ISIS Supporters on Twitter
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3131597
Bruno Guilherme Gomes, Pedro Holanda, Ana Paula Couto da Silva, Olga Goussevskaia
Cyber terrorism is a real threat to modern society. Many terrorist organizations spread their ideas and recruit new supporters over Online Social Networks. Among all terrorist organizations, ISIS can be considered the biggest one, and it is responsible for inspiring terrorist actions in more than 20 countries. As expected, ISIS uses Twitter to spread its hatred, and an important issue is how to characterize these supporters in order to understand their motivation. Our work investigates and discusses the way ISIS organizes within Twitter. We base our analyses on two curated datasets. The first dataset, "How ISIS Uses Twitter" (HIUT), is provided by the Fifth Tribe digital agency. The second dataset, "Syria and ISIS Mentioners" (SIM), we collected ourselves and curated without the participation of experts in the field. We made the SIM dataset publicly available to support new studies of ISIS supporters' profiles on Twitter. The main contribution of this work is a characterization of both the HIUT and SIM datasets.
Citations: 4
Dynamic Buffer Management for IPTV Video Players
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3131580
Marcos Paulo Mendes, M. Moreno
Due to recent technological evolution, the provision of IPTV services has grown considerably. One of the services normally included in IPTV is Linear TV, where audiovisual content is made available in the form of program schedules. Another service is Video on Demand (VoD), where the viewer can perform actions like pause, play and seek (trick mode). In this context, challenges emerge regarding the distribution of multimedia content over these services. One of these challenges is the user's perception of the quality of the IPTV service, measured in terms of quality of experience (QoE). Thus, this article aims to analyze the problems that may occur when receiving Linear TV and VoD content and to propose solutions to these problems. Specifically, due to congestion of the transmission media or overload in the endpoints, a key issue is the statistical variation of time delay in multimedia content delivery (packet jitter). This paper's proposal comprises a novel dynamic management of buffers in IPTV terminal devices, taking into account the characteristics of both Linear TV and VoD services.
Citations: 0
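To make the packet jitter mentioned in the abstract above concrete, the sketch below computes the smoothed interarrival jitter estimate defined for RTP (each new delay variation sample moves the estimate by one sixteenth of the difference) and derives a playout buffer target from it. The buffer rule, its `base_ms` and `factor` parameters, and the function names are hypothetical; the abstract does not describe the authors' actual buffer management algorithm.

```python
from typing import Sequence


def interarrival_jitter(send_times: Sequence[float],
                        recv_times: Sequence[float]) -> float:
    """Smoothed interarrival jitter, in the same unit as the timestamps.

    For consecutive packets, D is the change in transit time; the estimate
    is updated as J += (|D| - J) / 16.
    """
    jitter = 0.0
    for i in range(1, min(len(send_times), len(recv_times))):
        d = ((recv_times[i] - recv_times[i - 1])
             - (send_times[i] - send_times[i - 1]))
        jitter += (abs(d) - jitter) / 16.0
    return jitter


def target_buffer_ms(jitter_ms: float, base_ms: float = 100.0,
                     factor: float = 4.0) -> float:
    """Hypothetical sizing rule: a fixed floor plus a multiple of the jitter."""
    return base_ms + factor * jitter_ms
```

A player could re-evaluate `target_buffer_ms` periodically and grow or shrink its buffer accordingly, with different `base_ms` values for Linear TV and VoD; that split is an illustrative design choice, not a result from the paper.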
Gap Filling of Missing Streaming Data in a Network of Intelligent Surveillance Cameras
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3131585
G. Lecomte, Vinícius Hipolito, B. Batista, B. Kuehne, Dionisio Machado Leite Filho, J. Martins, M. Peixoto
The growth of video surveillance devices increases the rate of streaming data. However, even when working in a Fog Computing environment, these smart devices may fail to collect information, producing missing or invalid data. This issue can affect the user's quality of experience, because the PTZ controller may lose track of the target object. Therefore, this paper presents Singular Spectrum Analysis (SSA) as a method to replace missing values in this complex environment of intelligent surveillance cameras. Within the time series field, SSA is characterized by performing a non-parametric spectral estimation with spatial-temporal correlations. Values that were not correctly monitored were estimated accurately by SSA, allowing the tracking of a suspect object.
Citations: 3
Video Annotation by Cascading Microtasks: a Crowdsourcing Approach
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3126897
M. N. Amorim, R. M. C. Segundo, Celso A. S. Santos, O. L. Tavares
This paper presents a general approach to crowdsourced video annotation that requires neither trained workers nor experts. It consists of dividing complex annotation tasks into simple, small microtasks and cascading them to generate a final result. Moreover, this approach allows the use of simple annotation tools rather than complex and expensive annotation systems. It also tends to avoid activities that may be tedious and time-consuming for workers. The cascading microtasks strategy is included in a workflow of three steps: Preparation, Annotation, and Presentation. A crowdsourcing video annotation process in which four different microtasks were cascaded was developed to evaluate the proposed approach. In the process, extra content such as images, text, hyperlinks and other elements is applied to enrich the video. A toolkit was developed to support the experiment; it includes Web-based annotation tools and aggregation methods, as well as a presentation system for the annotated videos. This toolkit is open source and can be downloaded and used to replicate this experiment, as well as to construct different crowdsourcing video annotation systems.
Citations: 3
Automatic Text Recognition in Web Images
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3131570
Rodolfo Valiente, José C. Gutiérrez, M. T. Sadaike, G. Bressan
Web images play an important role in delivering multimedia content on the Web. The text embedded in web images carries semantic information related to the layout and content of the pages. Statistics show that there is a significant need to detect and recognize text in web images. This paper presents an architecture that efficiently integrates localization, extraction and recognition algorithms applied to text recognition in web images. For the recognition step, a procedure based on super-resolution and an iterative method is proposed to improve performance. The approach is implemented and evaluated using Matlab and cloud computing, making the system flexible, scalable and robust in detecting text in complex web images with different orientations, dimensions and colors. Competitive results are presented, both in precision and recognition rate, when compared with other systems in the existing literature.
Citations: 2
An Approach for Automatic Segmentation of Scenes in Educational Videos through the use of Audio Transcription and Semantic Annotation
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3126870
Eduardo R. Soares, E. Barrére
In recent years, educational videos have become more and more popular. Given this increase in the amount of didactic content in video format on the web, it is useful to make it possible for a search term to be related to a specific segment of a video. Better navigability gives users quicker access to the topics that interest them, avoiding irrelevant content. This article proposes a method for automatic segmentation of scenes in educational videos through the use of automatic audio transcription and semantic annotation. With this segmentation, content search in these videos can be improved, improving the user experience on e-learning platforms or educational video repositories.
Citations: 2
Face Classification using a New Local Texture Descriptor
Pub Date : 2017-10-17 DOI: 10.1145/3126858.3131584
C. T. Ferraz, M. Manzato, A. Gonzaga
Face recognition has received significant attention during the past several years. It is a challenging task because faces can be affected by scale, noise, facial expression, illumination, color or pose variations. The methodologies most robust to these variations are based on "key points" localization, followed by the application of a local descriptor to each surrounding region. Such descriptors are associated with clustering algorithms or histogram representations based on Bag of Features (BoF). In the BoF approach, the codebook can effectively describe objects by their appearance based on local texture. Building on texture descriptors previously proposed for image detection, we propose in this paper the application of such descriptors to face recognition. We evaluate the performance of our methodology using the Feret, ORL and Yale databases, comparing our descriptor against the SIFT and LIOP descriptors, as well as other methodologies recently published in the literature.
Citations: 1