Frontiers Digit. Humanit.最新文献_第5页

The Gutenberg English Poetry Corpus: Exemplary Quantitative Narrative Analyses 古腾堡英语诗歌语料库:典型的定量叙事分析

Frontiers Digit. Humanit.

Pub Date : 2018-04-05 DOI: 10.3389/fdigh.2018.00005

A. Jacobs

This paper describes a corpus of about 3000 English literary texts with about 250 million words extracted from the Gutenberg project that span a range of genres from both fiction and non-fiction written by more than 130 authors (e.g., Darwin, Dickens, Shakespeare). Quantitative Narrative Analysis (QNA) is used to explore a cleaned subcorpus, the Gutenberg English Poetry Corpus (GEPC) which comprises over 100 poetic texts with around 2 million words from about 50 authors (e.g., Keats, Joyce, Wordsworth). Some exemplary QNA studies show author similarities based on latent semantic analysis, significant topics for each author or various text-analytic metrics for George Eliot’s poem ‘How Lisa Loved the King’ and James Joyce’s ’Chamber Music’, concerning e.g. lexical diversity or sentiment analysis. The GEPC is particularly suited for research in Digital Humanities, Natural Language Processing or Neurocognitive Poetics, e.g. as training and test corpus, or for stimulus development and control.

本文描述了一个从古腾堡计划中提取的约3000个英语文学文本的语料库，约2.5亿字，涵盖了130多位作者(如达尔文、狄更斯、莎士比亚)的小说和非小说等一系列体裁。定量叙事分析(QNA)用于探索一个经过清理的子语料库，即古腾堡英语诗歌语料库(GEPC)，该语料库包含约50位作者(如济慈、乔伊斯、华兹华斯)的100多篇诗歌文本，约200万字。一些典型的QNA研究显示了基于潜在语义分析的作者相似性，每个作者的重要主题或乔治·艾略特的诗歌“丽莎如何爱国王”和詹姆斯·乔伊斯的“室内乐”的各种文本分析指标，例如词汇多样性或情感分析。GEPC特别适合于数字人文、自然语言处理或神经认知诗学的研究，例如作为训练和测试语料库，或用于刺激的开发和控制。

引用次数: 43

Ensemble Named Entity Recognition (NER): Evaluating NER Tools in the Identification of Place Names in Historical Corpora 集成命名实体识别(NER):评估历史语料库中地名识别中的NER工具

Frontiers Digit. Humanit.

Pub Date : 2018-03-09 DOI: 10.3389/fdigh.2018.00002

Miguel Won, Patricia Murrieta-Flores, Bruno Martins

The field of Spatial Humanities has advanced substantially in the past years. The identification and extraction of toponyms and spatial information mentioned in historical text collections has allowed its use in innovative ways, making possible the application of spatial analysis and the mapping of these places with Geographic Information Systems. For instance, automated place name identification is nowadays possible with Named Entity Recognition (NER) systems. Statistical NER methods based on supervised learning, in particular, are highly successful with modern datasets. However, there are still major challenges to address when dealing with historical corpora. These challenges include language changes over time, spelling variations, transliterations, OCR errors, and sources written in multiple languages among others. In this article, considering a task of place name recognition over two collections of historical correspondence, we report an evaluation of five NER systems and an approach that combines these through a voting system. We found that although individual performance of each NER system was corpus dependent, the ensemble combination was able to achieve consistent measures of precision and recall, outperforming the individual NER systems. Additionally, the results showed that these NER system are not strongly dependent on pre-processing and translation to modern English.

空间人文学科近年来取得了长足的发展。对历史文本集合中提到的地名和空间信息的识别和提取使其以创新的方式使用，使空间分析的应用和地理信息系统对这些地方的映射成为可能。例如，如今使用命名实体识别(NER)系统可以自动识别地名。特别是，基于监督学习的统计NER方法在现代数据集上非常成功。然而，在处理历史语料库时，仍然存在一些重大挑战需要解决。这些挑战包括语言随时间的变化、拼写变化、音译、OCR错误以及用多种语言编写的来源等。在本文中，考虑到两个历史通信集合的地名识别任务，我们报告了五个NER系统的评估以及通过投票系统将这些系统组合在一起的方法。我们发现，尽管每个NER系统的单个性能依赖于语料库，但集成组合能够实现一致的精度和召回率测量，优于单个NER系统。此外，结果表明，这些NER系统并不强烈依赖于预处理和翻译成现代英语。

{"title":"Ensemble Named Entity Recognition (NER): Evaluating NER Tools in the Identification of Place Names in Historical Corpora","authors":"Miguel Won, Patricia Murrieta-Flores, Bruno Martins","doi":"10.3389/fdigh.2018.00002","DOIUrl":"https://doi.org/10.3389/fdigh.2018.00002","url":null,"abstract":"The field of Spatial Humanities has advanced substantially in the past years. The identification and extraction of toponyms and spatial information mentioned in historical text collections has allowed its use in innovative ways, making possible the application of spatial analysis and the mapping of these places with Geographic Information Systems. For instance, automated place name identification is nowadays possible with Named Entity Recognition (NER) systems. Statistical NER methods based on supervised learning, in particular, are highly successful with modern datasets. However, there are still major challenges to address when dealing with historical corpora. These challenges include language changes over time, spelling variations, transliterations, OCR errors, and sources written in multiple languages among others. In this article, considering a task of place name recognition over two collections of historical correspondence, we report an evaluation of five NER systems and an approach that combines these through a voting system. We found that although individual performance of each NER system was corpus dependent, the ensemble combination was able to achieve consistent measures of precision and recall, outperforming the individual NER systems. Additionally, the results showed that these NER system are not strongly dependent on pre-processing and translation to modern English.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131068196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 58

Bridging the Gap: Enriching YouTube Videos with Jazz Music Annotations 弥合差距:丰富YouTube视频与爵士音乐注释

Frontiers Digit. Humanit.

Pub Date : 2018-02-20 DOI: 10.3389/fdigh.2018.00001

S. Balke, C. Dittmar, J. Abeßer, K. Frieler, Martin Pfleiderer, Meinard Müller

Web services allow permanent access to music from all over the world. Especially in the case of web services with user-supplied content, e.g., YouTube(TM), the available metadata is often incomplete or erroneous. On the other hand, a vast amount of high-quality and musically relevant metadata has been annotated in research areas such as Music Information Retrieval (MIR). Although they have great potential, these musical annotations are ofter inaccessible to users outside the academic world. With our contribution, we want to bridge this gap by enriching publicly available multimedia content with musical annotations available in research corpora, while maintaining easy access to the underlying data. Our web-based tools offer researchers and music lovers novel possibilities to interact with and navigate through the content. In this paper, we consider a research corpus called the Weimar Jazz Database (WJD) as an illustrating example scenario. The WJD contains various annotations related to famous jazz solos. First, we establish a link between the WJD annotations and corresponding YouTube videos employing existing retrieval techniques. With these techniques, we were able to identify 988 corresponding YouTube videos for 329 solos out of 456 solos contained in the WJD. We then embed the retrieved videos in a recently developed web-based platform and enrich the videos with solo transcriptions that are part of the WJD. Furthermore, we integrate publicly available data resources from the Semantic Web in order to extend the presented information, for example, with a detailed discography or artists-related information. Our contribution illustrates the potential of modern web-based technologies for the digital humanities, and novel ways for improving access and interaction with digitized multimedia content.

网络服务允许永久访问来自世界各地的音乐。特别是在用户提供内容的web服务中，例如YouTube(TM)，可用的元数据通常是不完整或错误的。另一方面，在音乐信息检索(MIR)等研究领域，大量高质量的音乐相关元数据已被注释。虽然它们有很大的潜力，但这些音乐注释通常是学术界以外的用户无法访问的。通过我们的贡献，我们希望通过在研究语料库中提供音乐注释来丰富公开可用的多媒体内容，同时保持对底层数据的轻松访问，从而弥合这一差距。我们的网络工具为研究人员和音乐爱好者提供了互动和浏览内容的新可能性。在本文中，我们考虑一个名为Weimar Jazz数据库(WJD)的研究语料库作为说明示例场景。WJD包含与著名爵士独奏相关的各种注释。首先，我们利用现有的检索技术在WJD注释和相应的YouTube视频之间建立链接。通过这些技术，我们能够为WJD中包含的456个独奏中的329个独奏识别988个对应的YouTube视频。然后，我们将检索到的视频嵌入到最近开发的基于web的平台中，并使用作为WJD一部分的独奏转录来丰富视频。此外，我们集成了来自语义网的公开可用的数据资源，以扩展所呈现的信息，例如，使用详细的唱片目录或与艺术家相关的信息。我们的贡献说明了现代基于网络的技术对数字人文学科的潜力，以及改进与数字化多媒体内容的访问和交互的新方法。

{"title":"Bridging the Gap: Enriching YouTube Videos with Jazz Music Annotations","authors":"S. Balke, C. Dittmar, J. Abeßer, K. Frieler, Martin Pfleiderer, Meinard Müller","doi":"10.3389/fdigh.2018.00001","DOIUrl":"https://doi.org/10.3389/fdigh.2018.00001","url":null,"abstract":"Web services allow permanent access to music from all over the world. Especially in the case of web services with user-supplied content, e.g., YouTube(TM), the available metadata is often incomplete or erroneous. On the other hand, a vast amount of high-quality and musically relevant metadata has been annotated in research areas such as Music Information Retrieval (MIR). Although they have great potential, these musical annotations are ofter inaccessible to users outside the academic world. With our contribution, we want to bridge this gap by enriching publicly available multimedia content with musical annotations available in research corpora, while maintaining easy access to the underlying data. Our web-based tools offer researchers and music lovers novel possibilities to interact with and navigate through the content. In this paper, we consider a research corpus called the Weimar Jazz Database (WJD) as an illustrating example scenario. The WJD contains various annotations related to famous jazz solos. First, we establish a link between the WJD annotations and corresponding YouTube videos employing existing retrieval techniques. With these techniques, we were able to identify 988 corresponding YouTube videos for 329 solos out of 456 solos contained in the WJD. We then embed the retrieved videos in a recently developed web-based platform and enrich the videos with solo transcriptions that are part of the WJD. Furthermore, we integrate publicly available data resources from the Semantic Web in order to extend the presented information, for example, with a detailed discography or artists-related information. Our contribution illustrates the potential of modern web-based technologies for the digital humanities, and novel ways for improving access and interaction with digitized multimedia content.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115638058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Topic Modeling of Everyday Sexism Project Entries 日常性别歧视项目条目的主题建模

Frontiers Digit. Humanit.

Pub Date : 2017-11-24 DOI: 10.3389/fdigh.2018.00028

Sophie Melville, Kathryn Eccles, T. Yasseri

The Everyday Sexism Project documents everyday examples of sexism reported by volunteer contributors from all around the world. It collected 100,000 entries in 13+ languages within the first 3 years of its existence. The content of reports in various languages submitted to Everyday Sexism is a valuable source of crowdsourced information with great potential for feminist and gender studies. In this paper, we take a computational approach to analyze the content of reports. We use topic-modelling techniques to extract emerging topics and concepts from the reports, and to map the semantic relations between those topics. The resulting picture closely resembles and adds to that arrived at through qualitative analysis, showing that this form of topic modeling could be useful for sifting through datasets that had not previously been subject to any analysis. More precisely, we come up with a map of topics for two different resolutions of our topic model and discuss the connection between the identified topics. In the low-resolution picture, for instance, we found Public space/Street, Online, Work related/Office, Transport, School, Media harassment, and Domestic abuse. Among these, the strongest connection is between Public space/Street harassment and Domestic abuse and sexism in personal relationships. The strength of the relationships between topics illustrates the fluid and ubiquitous nature of sexism, with no single experience being unrelated to another.

日常性别歧视项目记录了来自世界各地的志愿者报告的日常性别歧视案例。它在成立的头三年收集了超过13种语言的100,000个条目。“日常性别歧视”网站提交的各种语言报告内容是一个有价值的众包信息来源，对女权主义和性别研究具有巨大潜力。在本文中，我们采用计算方法来分析报告的内容。我们使用主题建模技术从报告中提取新出现的主题和概念，并映射这些主题之间的语义关系。所得到的图像与定性分析得到的图像非常相似，并增加了定性分析得到的图像，表明这种形式的主题建模对于筛选以前没有经过任何分析的数据集非常有用。更准确地说，我们为主题模型的两种不同分辨率提供主题映射，并讨论已识别主题之间的连接。例如，在低分辨率的图片中，我们发现了公共空间/街道、网络、工作相关/办公室、交通、学校、媒体骚扰和家庭暴力。其中，公共空间/街头骚扰与家庭暴力和个人关系中的性别歧视之间的联系最为密切。主题之间关系的强度说明了性别歧视的流动和无处不在的本质，没有任何一种经历与另一种经历无关。

{"title":"Topic Modeling of Everyday Sexism Project Entries","authors":"Sophie Melville, Kathryn Eccles, T. Yasseri","doi":"10.3389/fdigh.2018.00028","DOIUrl":"https://doi.org/10.3389/fdigh.2018.00028","url":null,"abstract":"The Everyday Sexism Project documents everyday examples of sexism reported by volunteer contributors from all around the world. It collected 100,000 entries in 13+ languages within the first 3 years of its existence. The content of reports in various languages submitted to Everyday Sexism is a valuable source of crowdsourced information with great potential for feminist and gender studies. In this paper, we take a computational approach to analyze the content of reports. We use topic-modelling techniques to extract emerging topics and concepts from the reports, and to map the semantic relations between those topics. The resulting picture closely resembles and adds to that arrived at through qualitative analysis, showing that this form of topic modeling could be useful for sifting through datasets that had not previously been subject to any analysis. More precisely, we come up with a map of topics for two different resolutions of our topic model and discuss the connection between the identified topics. In the low-resolution picture, for instance, we found Public space/Street, Online, Work related/Office, Transport, School, Media harassment, and Domestic abuse. Among these, the strongest connection is between Public space/Street harassment and Domestic abuse and sexism in personal relationships. The strength of the relationships between topics illustrates the fluid and ubiquitous nature of sexism, with no single experience being unrelated to another.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129726156","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

Using Semantic Linking to Understand Persons’ Networks Extracted from Text 利用语义连接理解文本中提取的人物网络

Frontiers Digit. Humanit.

Pub Date : 2017-11-16 DOI: 10.3389/fdigh.2017.00022

Alessio Palmero Aprosio, Sara Tonelli, S. Menini, Giovanni Moretti

In this work, we describe a methodology to interpret large persons' networks extracted from text by classifying cliques using the DBpedia ontology. The approach relies on a combination of NLP, Semantic web technologies and network analysis. The classification methodology that first starts from single nodes and then generalises to cliques is effective in terms of performance and is able to deal also with nodes that are not linked to Wikipedia. The gold standard manually developed for evaluation shows that groups of co-occurring entities share in most of the cases a category that can be automatically assigned. This holds for both languages considered in this study. The outcome of this work may be of interest to enhance the readability of large networks and to provide an additional semantic layer on top of cliques. This would greatly help humanities scholars when dealing with large amounts of textual data that need to be interpreted or categorised. Furthermore, it represents an unsupervised approach to automatically extend DBpedia starting from a corpus.

在这项工作中，我们描述了一种方法，通过使用DBpedia本体对派系进行分类，来解释从文本中提取的大型人员网络。该方法依赖于自然语言处理、语义网技术和网络分析的结合。首先从单个节点开始，然后推广到小组的分类方法在性能方面是有效的，并且能够处理没有链接到维基百科的节点。手动开发用于评估的黄金标准表明，在大多数情况下，共同出现的实体组共享一个可以自动分配的类别。这项研究中考虑的两种语言都是如此。这项工作的结果可能会增强大型网络的可读性，并在派系之上提供额外的语义层。这将极大地帮助人文学者在处理需要解释或分类的大量文本数据时。此外，它代表了一种从语料库开始自动扩展DBpedia的无监督方法。

引用次数: 1

#Halal Culture on Instagram Instagram上的清真文化

Frontiers Digit. Humanit.

Pub Date : 2017-11-02 DOI: 10.3389/fdigh.2017.00021

Yelena Mejova, Youcef Benkhedda, Khairani

Halal is a notion that applies to both objects and actions, and means permissible according to Islamic law. It may be most often associated with food and the rules of selecting, slaughtering, and cooking animals. In the globalized world, halal can be found in street corners of New York and beauty shops of Manila. In this study, we explore the cultural diversity of the concept, as revealed through social media, and specifically the way it is expressed by different populations around the world, and how it relates to their perception of (i) religious and (ii) governmental authority, and (iii) personal health. Here, we analyze two Instagram datasets, using Halal in Arabic (325,665 posts) and in English (1,004,445 posts), which provide a global view of major Muslim populations around the world. We find a great variety in the use of halal within Arabic, English, and Indonesian-speaking populations, with animal trade emphasized in first (making up 61% of the language's stream), food in second (80%), and cosmetics and supplements in third (70%). The commercialization of the term halal is a powerful signal of its detraction from its traditional roots. We find a complex social engagement around posts mentioning religious terms, such that when a food-related post is accompanied by a religious term, it on average gets more likes in English and Indonesian, but not in Arabic, indicating a potential shift out of its traditional moral framing.

清真是一个概念，适用于对象和行动，并意味着允许根据伊斯兰法律。它可能最常与食物和选择、屠宰和烹饪动物的规则联系在一起。在全球化的世界里，清真可以在纽约的街角和马尼拉的美容店找到。在本研究中，我们探索了通过社交媒体揭示的这一概念的文化多样性，特别是世界各地不同人群表达这一概念的方式，以及它与他们对(i)宗教和(ii)政府权威以及(iii)个人健康的看法之间的关系。在这里，我们分析了两个Instagram数据集，使用阿拉伯语的清真(325,665个帖子)和英语的清真(1,004,445个帖子)，它们提供了世界各地主要穆斯林人口的全球视图。我们发现，在阿拉伯语、英语和印尼语人群中，清真的使用种类繁多，首先强调的是动物贸易(占语言流的61%)，其次是食品(80%)，第三是化妆品和补充剂(70%)。“清真”一词的商业化是一个强有力的信号，表明它偏离了它的传统根源。我们发现，在提到宗教术语的帖子周围，有一种复杂的社会参与，比如，当一条与食物相关的帖子伴随着宗教术语时，它在英语和印尼语中平均得到更多的点赞，但在阿拉伯语中却没有，这表明传统道德框架的潜在转变。

{"title":"#Halal Culture on Instagram","authors":"Yelena Mejova, Youcef Benkhedda, Khairani","doi":"10.3389/fdigh.2017.00021","DOIUrl":"https://doi.org/10.3389/fdigh.2017.00021","url":null,"abstract":"Halal is a notion that applies to both objects and actions, and means permissible according to Islamic law. It may be most often associated with food and the rules of selecting, slaughtering, and cooking animals. In the globalized world, halal can be found in street corners of New York and beauty shops of Manila. In this study, we explore the cultural diversity of the concept, as revealed through social media, and specifically the way it is expressed by different populations around the world, and how it relates to their perception of (i) religious and (ii) governmental authority, and (iii) personal health. Here, we analyze two Instagram datasets, using Halal in Arabic (325,665 posts) and in English (1,004,445 posts), which provide a global view of major Muslim populations around the world. We find a great variety in the use of halal within Arabic, English, and Indonesian-speaking populations, with animal trade emphasized in first (making up 61% of the language's stream), food in second (80%), and cosmetics and supplements in third (70%). The commercialization of the term halal is a powerful signal of its detraction from its traditional roots. We find a complex social engagement around posts mentioning religious terms, such that when a food-related post is accompanied by a religious term, it on average gets more likes in English and Indonesian, but not in Arabic, indicating a potential shift out of its traditional moral framing.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"2013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127426723","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 7

Aspects of Tempo and Rhythmic Elaboration in Hindustani Music: A Corpus Study 印度斯坦音乐中节奏与节奏精雕细琢方面的语料库研究

Frontiers Digit. Humanit.

Pub Date : 2017-10-31 DOI: 10.3389/fdigh.2017.00020

A. Srinivasamurthy, A. Holzapfel, K. Ganguli, Xavier Serra

This paper provides insights into aspects of tempo and rhythmic elaboration in Hindustani music, based on a study of a large corpus of recorded performances. Typical tempo developments and stress patterns within a metrical cycle are computed, which we refer to as tempo and rhythm patterns, respectively. Rhythm patterns are obtained by aggregating spectral features over metrical cycles. They reflect percussion patterns that are frequent in the corpus, and enable a discussion of the relation between such patterns and the underlying metrical framework, the taal. Tempo patterns, on the other hand, are computed using reference beat annotations. They document the dynamic development of tempo throughout a metrical cycle, and reveal insights into the flexibility of time in Hindustani music for the first time using quantitative methods on a large set of performances. Focusing on aspects of tempo and rhythm, we demonstrate the value of a computational methodology for the analysis of large music corpora by revealing the range of tempi used in performances, intra-cycle tempo dynamics and percussion accents at different positions of the taal cycle.

本文基于对大量录制表演的研究，提供了对印度斯坦音乐节奏和节奏阐述方面的见解。在一个韵律循环中计算出典型的速度发展和重音模式，我们分别称之为速度模式和节奏模式。节奏模式是通过在韵律周期上聚合光谱特征获得的。它们反映了语料库中频繁出现的打击乐模式，并使我们能够讨论这些模式与潜在的格律框架之间的关系。另一方面，节奏模式是使用参考节拍注释计算的。他们记录了节奏在整个韵律循环中的动态发展，并首次使用大量表演的定量方法揭示了印度斯坦音乐中时间的灵活性。专注于节奏和节奏方面，我们通过揭示表演中使用的速度范围，周期内节奏动态和节拍周期不同位置的打击乐重音，展示了大型音乐语料库分析的计算方法的价值。

{"title":"Aspects of Tempo and Rhythmic Elaboration in Hindustani Music: A Corpus Study","authors":"A. Srinivasamurthy, A. Holzapfel, K. Ganguli, Xavier Serra","doi":"10.3389/fdigh.2017.00020","DOIUrl":"https://doi.org/10.3389/fdigh.2017.00020","url":null,"abstract":"This paper provides insights into aspects of tempo and rhythmic elaboration in Hindustani music, based on a study of a large corpus of recorded performances. Typical tempo developments and stress patterns within a metrical cycle are computed, which we refer to as tempo and rhythm patterns, respectively. Rhythm patterns are obtained by aggregating spectral features over metrical cycles. They reflect percussion patterns that are frequent in the corpus, and enable a discussion of the relation between such patterns and the underlying metrical framework, the taal. Tempo patterns, on the other hand, are computed using reference beat annotations. They document the dynamic development of tempo throughout a metrical cycle, and reveal insights into the flexibility of time in Hindustani music for the first time using quantitative methods on a large set of performances. Focusing on aspects of tempo and rhythm, we demonstrate the value of a computational methodology for the analysis of large music corpora by revealing the range of tempi used in performances, intra-cycle tempo dynamics and percussion accents at different positions of the taal cycle.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122027888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15

Heritage As a Source of Studies into Industrial History: Using Digital Tools to Explore the Geography of the Industrialization 遗产作为工业历史研究的源泉:利用数字工具探索工业化的地理

Frontiers Digit. Humanit.

Pub Date : 2017-10-04 DOI: 10.3389/fdigh.2017.00019

Guillermo Esteban-Oliver, A. José, Jordi Martí-Henneberg

The main objective of this article is to explore the possibility of combining two very different sources in order to study the distribution of industrial activity throughout history. The traditional primary sources to use for this purpose are the official censuses on population and economic activity that have been conducted in the majority of countries since the mid-19th century. However, the majority of these lack detail at the regional level and also with respect to the types of professional occupations that they quantify. In order to complement and profile these census data, we propose the use of another type of information which can also be quantified, but whose characteristics are very different. We refer to the industrial heritage sites identified in digital format in a given territory, which in this case is Catalonia, Spain. This innovative dataset was obtained using digital tools like web scraping and data mining techniques.This type of historical information was used to check whether it is reliable and valid for interpreting the spatial impact of the introduction of industrial activity. The article also shows that the systematic identification of elements of industrial heritage offers a new and very useful source of information for interpreting the history of industrial geography.

本文的主要目的是探索将两个非常不同的来源结合起来的可能性，以便研究整个历史上工业活动的分布。用于此目的的传统主要来源是自19世纪中期以来在大多数国家进行的人口和经济活动的官方人口普查。但是，其中大多数缺乏区域一级的细节，也缺乏它们所量化的专业职业类型的细节。为了补充和分析这些人口普查数据，我们建议使用另一种类型的信息，这种信息也可以量化，但其特征非常不同。我们指的是在特定地区以数字格式确定的工业遗产遗址，在这种情况下是西班牙加泰罗尼亚。这个创新的数据集是通过网络抓取和数据挖掘技术等数字工具获得的。这种类型的历史信息被用来检验它是否可靠和有效地解释引入工业活动的空间影响。工业遗产要素的系统识别为解释工业地理学的历史提供了一种新的、非常有用的信息来源。

{"title":"Heritage As a Source of Studies into Industrial History: Using Digital Tools to Explore the Geography of the Industrialization","authors":"Guillermo Esteban-Oliver, A. José, Jordi Martí-Henneberg","doi":"10.3389/fdigh.2017.00019","DOIUrl":"https://doi.org/10.3389/fdigh.2017.00019","url":null,"abstract":"The main objective of this article is to explore the possibility of combining two very different sources in order to study the distribution of industrial activity throughout history. The traditional primary sources to use for this purpose are the official censuses on population and economic activity that have been conducted in the majority of countries since the mid-19th century. However, the majority of these lack detail at the regional level and also with respect to the types of professional occupations that they quantify. In order to complement and profile these census data, we propose the use of another type of information which can also be quantified, but whose characteristics are very different. We refer to the industrial heritage sites identified in digital format in a given territory, which in this case is Catalonia, Spain. This innovative dataset was obtained using digital tools like web scraping and data mining techniques.This type of historical information was used to check whether it is reliable and valid for interpreting the spatial impact of the introduction of industrial activity. The article also shows that the systematic identification of elements of industrial heritage offers a new and very useful source of information for interpreting the history of industrial geography.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132608439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

DigDesFab15 Research Pavilion DigDesFab15研究馆

Frontiers Digit. Humanit.

Pub Date : 2017-09-27 DOI: 10.3389/fdigh.2017.00018

Andrei Gheorghe, R. Vierlinger

This full-scale research pavilion exercises the application of timber and polymer concrete in architectural production (Fig. 1). It attempts to develop and test a new hybrid construction technique using composite joints (as introduced in Schober, 2014) within a modular geometric system and no need for formwork. The structure was designed and erected by students and instructors of the Digital Design and Full Scale Fabrication seminar taught at the University of Applied Arts Vienna. CNC milled, 3-layer spruce laminated timber boards are used for construction, which are temporarily fixed, then rigidized with polymer concrete. The cured composite node proves high structural capabilities, as polymer concrete withstands both pressure and tensile forces, and the bond between the materials is as strong as the wood itself. Compared to traditional timber construction, no metal bolting is needed for the creation of the node, while at the same time the node geometry becomes more flexible, meaning any three-dimensional layout can be produced, as long as a temporary containment and fixation can be implemented until the chemical curing process is completed (Becker, Schober, Weber, 2016). The geometry is developed as an interpretation of the Zollinger (Menges A. et al., 2016) grid, where members originally are of twice the grid length (Fig. 2) and reciprocally reliant on each other (Fig. 3). Instead, every second grid cell is made a joint node when cast out with concrete, making the structural members a lost formwork at the same time (Fig. 4). Double-layering each makes it possible to cast all 122 nodes of the pavilion structure separately and flat-bolt them together on-site with metal screws. The software plugin RhinoVault is used as a design tool to produce an efficient, compression only basic shell surface, although the subsequent imposition of the grid system introduces eccentricities and local imperfections. A parametric model in Rhinoceros and Grasshopper tests various subdivision densities and node sizes, and evaluates the overall performance with the structural analysis tool Karamba. The different heights of the lamellas in the structure arise from the analysis and parametric interpretation of stresses under vertical and horizontal loading.

这个全尺寸的研究馆在建筑生产中应用了木材和聚合物混凝土(图1)。它试图在模块化几何系统中开发和测试一种新的混合建筑技术，使用复合接缝(如Schober, 2014年所介绍的)，不需要模板。该结构是由维也纳应用艺术大学数字设计和全尺寸制造研讨会的学生和教师设计和建造的。CNC铣削，3层云杉层压木板用于施工，暂时固定，然后用聚合物混凝土固化。固化的复合材料节点证明了高结构能力，因为聚合物混凝土可以承受压力和拉力，材料之间的结合与木材本身一样牢固。与传统木结构相比，节点的创建不需要金属螺栓，同时节点的几何形状变得更加灵活，这意味着只要在化学固化过程完成之前可以实施临时密封和固定，就可以产生任何三维布局(Becker, Schober, Weber, 2016)。几何图形是作为Zollinger (Menges a . et al.， 2016)网格的解释而开发的，其中成员最初是网格长度的两倍(图2)，并且相互依赖(图3)。相反，当用混凝土浇铸时，每隔一秒网格单元就会成为一个联合节点。使结构构件同时成为一个丢失的模板(图4)。双层结构使得展馆结构的122个节点可以单独铸造，并在现场用金属螺钉平栓在一起。软件插件RhinoVault被用作设计工具，以产生有效的、仅压缩的基本外壳表面，尽管随后的网格系统的强加引入了偏心和局部缺陷。一个参数化模型在Rhinoceros和Grasshopper中测试了各种细分密度和节点大小，并使用结构分析工具Karamba评估了整体性能。结构中片层高度的不同源于对垂直和水平荷载作用下应力的分析和参数解释。

{"title":"DigDesFab15 Research Pavilion","authors":"Andrei Gheorghe, R. Vierlinger","doi":"10.3389/fdigh.2017.00018","DOIUrl":"https://doi.org/10.3389/fdigh.2017.00018","url":null,"abstract":"This full-scale research pavilion exercises the application of timber and polymer concrete in architectural production (Fig. 1). It attempts to develop and test a new hybrid construction technique using composite joints (as introduced in Schober, 2014) within a modular geometric system and no need for formwork. The structure was designed and erected by students and instructors of the Digital Design and Full Scale Fabrication seminar taught at the University of Applied Arts Vienna. CNC milled, 3-layer spruce laminated timber boards are used for construction, which are temporarily fixed, then rigidized with polymer concrete. The cured composite node proves high structural capabilities, as polymer concrete withstands both pressure and tensile forces, and the bond between the materials is as strong as the wood itself. Compared to traditional timber construction, no metal bolting is needed for the creation of the node, while at the same time the node geometry becomes more flexible, meaning any three-dimensional layout can be produced, as long as a temporary containment and fixation can be implemented until the chemical curing process is completed (Becker, Schober, Weber, 2016). The geometry is developed as an interpretation of the Zollinger (Menges A. et al., 2016) grid, where members originally are of twice the grid length (Fig. 2) and reciprocally reliant on each other (Fig. 3). Instead, every second grid cell is made a joint node when cast out with concrete, making the structural members a lost formwork at the same time (Fig. 4). Double-layering each makes it possible to cast all 122 nodes of the pavilion structure separately and flat-bolt them together on-site with metal screws. The software plugin RhinoVault is used as a design tool to produce an efficient, compression only basic shell surface, although the subsequent imposition of the grid system introduces eccentricities and local imperfections. A parametric model in Rhinoceros and Grasshopper tests various subdivision densities and node sizes, and evaluates the overall performance with the structural analysis tool Karamba. The different heights of the lamellas in the structure arise from the analysis and parametric interpretation of stresses under vertical and horizontal loading.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134518376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

“3D·CoD”: A New Methodology for the Design of Virtual Reality-Mediated Experiences in Digital Archeology “3D·CoD”:数字考古中虚拟现实媒介体验设计的新方法

Frontiers Digit. Humanit.

Pub Date : 2017-08-11 DOI: 10.3389/fdigh.2017.00016

Laia Pujol-Tost

Despite the capacity of Virtual Reality (VR) to recreate and enhance real and virtual worlds, many applications in Archaeology aim at the photorealistic depiction of architectural spaces. On the other hand, little is known about their real communicational effectiveness. In this context, the EU-funded project {LEAP] proposed the concept of Cultural Presence as the theoretical and methodological foundation for a new kind of VR-mediated experience, and the UNESCO World Heritage Neolithic site of Catalhoyuk (Turkey) was chosen as case of application. During this process, a survey of design pipelines in Digital Archaeology indicated that, to build such experiences, a new design and evaluation method may need to be adopted. This paper presents the process of building and testing “3D·CoD”, a new methodology for the design of VR-mediated experiences. Initially, different archaeologists working at Catalhoyuk were engaged in a first workshop, aimed at establishing a specific instantiation of Cultural Presence and how to depict it by means of VR. To that end, observation, questionnaires, multimodal, and statistical analyses were used. The results of this field work were translated into a co-design hands-on methodology (“3D·CoD”), which was tested in a second workshop, with a different group of archaeologists. In this case, observation and debriefing were used. The results of this evaluation suggest that co-design strategies are suitable for the creation of VR-mediated experiences, but that equally important is 1) to consider the co-designers’ concept of Archaeology; and 2) to think in terms, not of 3D models, but of Cultural Heritage goals and human experiences.

尽管虚拟现实(VR)能够重建和增强真实世界和虚拟世界，但考古学中的许多应用都是为了逼真地描绘建筑空间。另一方面，人们对它们真正的沟通效果知之甚少。在这种背景下，欧盟资助的项目{LEAP]提出了文化存在的概念，作为一种新的vr介导体验的理论和方法基础，并选择了联合国教科文组织世界遗产Catalhoyuk(土耳其)的新石器时代遗址作为应用案例。在这一过程中，对《数字考古》中设计管道的调查表明，要建立这样的体验，可能需要采用一种新的设计和评估方法。本文介绍了构建和测试“3D·CoD”的过程，这是一种用于vr介导体验设计的新方法。最初，在Catalhoyuk工作的不同考古学家参与了第一次研讨会，旨在建立文化存在的具体实例，以及如何通过VR来描绘它。为此，采用了观察、问卷调查、多模态分析和统计分析。这次实地工作的结果被转化为一种共同设计的实践方法(“3D·CoD”)，并在另一组考古学家的第二个研讨会上进行了测试。在这种情况下，采用了观察和汇报。评估结果表明，协同设计策略适用于vr媒介体验的创造，但同样重要的是:1)考虑协同设计师的考古学概念;2)考虑文化遗产的目标和人类经验，而不是3D模型。

{"title":"“3D·CoD”: A New Methodology for the Design of Virtual Reality-Mediated Experiences in Digital Archeology","authors":"Laia Pujol-Tost","doi":"10.3389/fdigh.2017.00016","DOIUrl":"https://doi.org/10.3389/fdigh.2017.00016","url":null,"abstract":"Despite the capacity of Virtual Reality (VR) to recreate and enhance real and virtual worlds, many applications in Archaeology aim at the photorealistic depiction of architectural spaces. On the other hand, little is known about their real communicational effectiveness. In this context, the EU-funded project {LEAP] proposed the concept of Cultural Presence as the theoretical and methodological foundation for a new kind of VR-mediated experience, and the UNESCO World Heritage Neolithic site of Catalhoyuk (Turkey) was chosen as case of application. During this process, a survey of design pipelines in Digital Archaeology indicated that, to build such experiences, a new design and evaluation method may need to be adopted. This paper presents the process of building and testing “3D·CoD”, a new methodology for the design of VR-mediated experiences. Initially, different archaeologists working at Catalhoyuk were engaged in a first workshop, aimed at establishing a specific instantiation of Cultural Presence and how to depict it by means of VR. To that end, observation, questionnaires, multimodal, and statistical analyses were used. The results of this field work were translated into a co-design hands-on methodology (“3D·CoD”), which was tested in a second workshop, with a different group of archaeologists. In this case, observation and debriefing were used. The results of this evaluation suggest that co-design strategies are suitable for the creation of VR-mediated experiences, but that equally important is 1) to consider the co-designers’ concept of Archaeology; and 2) to think in terms, not of 3D models, but of Cultural Heritage goals and human experiences.","PeriodicalId":227954,"journal":{"name":"Frontiers Digit. Humanit.","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130210377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 15