The Development of Vocabularies of Historical Period Names from Web Acquired Corpora

Mediterranean Archaeology & Archaeometry; IF 0.8; Region 2 (History); JCR 0, ARCHAEOLOGY; Pub Date: 2014-05-08; DOI: 10.5281/ZENODO.13717
M. Mouroutsou, Stella Markantonatou, V. Papavassiliou
Citations: 1

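Abstract

Periodization is a universal and very popular system of organizing History (Petras et al., 2006) by arbitrarily dividing time into periods such as “Δικτατορία” (dictatorship) in a way that is specific to places and communities. Structured collections of time period names and timelines are considered very useful in cultural content documentation and temporal information extraction. However, to the best of our knowledge, this is the first report on the systematic collection of period names of Greek History. New period names are constantly created, and existing ones fall out of use. Aiming to capture this combination of dispersed specificity and constant evolution, we used the Focused Monolingual Crawler (FMC) (Mastropavlos et al., 2011) and an initial list of 25 “seed-terms” to develop corpora dense in period names from Web-retrieved documents. Period names were manually retrieved from the accumulated corpora and annotated for a set of features: the allomorphs that occurred in the collected corpora; whether the term denoted a fact, a time period, or something else; and the persons, places, and other period names related to the term. The linguistic environments in which the terms occurred were identified, and some of them were fed to the FMC as new “seed-terms”. This cycle was repeated three times and yielded 78 period names, with an average of 16 paradigms per term, and a corpus of 3020 valid XML documents. Some first observations on the strategies employed by Greek communities to coin time period names are reported.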
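The crawl, annotate, and reseed cycle described in the abstract can be sketched roughly as follows. This is only an illustration of the loop's shape, not the authors' implementation: the abstract does not describe the FMC's interface, so `crawl` is a hypothetical stand-in for it, and `annotate` is a hypothetical stand-in for the annotation step, which in the study was done manually. The toy documents and names are invented for the example and do not come from the paper's corpus.

```python
from __future__ import annotations

from typing import Callable, Iterable


def bootstrap_period_names(
    seeds: Iterable[str],
    crawl: Callable[[set[str]], list[str]],
    annotate: Callable[[str], tuple[set[str], set[str]]],
    cycles: int = 3,
) -> tuple[set[str], list[str]]:
    """Run the crawl -> annotate -> reseed cycle a fixed number of times."""
    seed_terms = set(seeds)
    period_names: set[str] = set()
    corpus: list[str] = []
    seen: set[str] = set()
    for _ in range(cycles):
        # Hypothetical crawler call: returns documents matching the seeds.
        for doc in crawl(seed_terms):
            if doc in seen:
                continue  # keep each document only once
            seen.add(doc)
            corpus.append(doc)
            # Hypothetical annotation step (manual in the study): returns the
            # period names found and the contexts chosen as new seed terms.
            names, contexts = annotate(doc)
            period_names |= names
            seed_terms |= contexts  # new seeds take effect in the next cycle
    return period_names, corpus


# Invented toy data, used only to exercise the loop.
TOY_WEB = [
    "during the Dictatorship censorship was common",
    "in the years of the Occupation food was scarce",
    "in the years of the Civil War villages emptied",
]
TOY_NAMES = {"Dictatorship", "Occupation", "Civil War"}
TOY_CONTEXT = "in the years of"


def toy_crawl(seed_terms: set[str]) -> list[str]:
    # A document "matches" if it contains any seed term.
    return [d for d in TOY_WEB if any(s in d for s in seed_terms)]


def toy_annotate(doc: str) -> tuple[set[str], set[str]]:
    names = {n for n in TOY_NAMES if n in doc}
    contexts = {TOY_CONTEXT} if TOY_CONTEXT in doc else set()
    return names, contexts


names, corpus = bootstrap_period_names(
    {"Dictatorship", "Occupation"}, toy_crawl, toy_annotate
)
```

In the toy run, the third document matches neither initial seed and is reached only after the context phrase found in the first cycle is fed back as a seed, which is the effect the reseeding step is meant to have.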
Source journal
CiteScore: 2.60
Self-citation rate: 20.00%
Articles published: 0
About the journal: The Mediterranean Archaeology and Archaeometry (MAA) is an Open Access Journal that covers the following interdisciplinary topics:
1. Natural Sciences applied to Archaeology (Archaeometry): Methods and Techniques of Dating, Analysis, Provenance, Archaeogeophysical surveys and Remote Sensing, Geochemical surveys, Statistics, Artifact and Conservation studies, Ancient Astronomy of both the Old and New Worlds, all applied to Archaeology, History of Art, and in general the Hominid Biological and Cultural evolution.
2. Biomolecular Archaeology.
3. Environmental Archaeology.
4. Osteoarchaeology.
5. Digital Archaeology.
6. Palaeo-climatological/geographical/ecological impact on ancient humans.
7. STEMAC (Science, Technology, Engineering, Mathematics in Art and Culture).
8. Reports on Early Science and Ancient Technology.
9. Special Issues on Archaeology and Archaeometry.
10. Palaeolithic, Prehistoric, Classical, Hellenistic, Roman, Protochristian, Byzantine, Etruscan periods, and Megalithic cultures in the Mediterranean region.
11. Egyptian and Middle Eastern Archaeology.
12. Biblical Archaeology.
13. Early Arab cultures.
14. Ethnoarchaeology.
15. Theoretical and Experimental Archaeology.
16. Mythology and Archaeology.
17. Archaeology and International Law.
18. Cultural Heritage Management.
19. Completed Excavation Reports.
20. Archaeology and the Origins of Writing.
21. Cultural interactions of the ancient Mediterraneans with people further inland.
Latest articles from the journal
Archaeometric analysis of Late Bronze Age and Early Iron Age pottery from Setefilla (SW Spain)
Roman land division in Istria, Croatia: historiography, LIDAR, structural survey and excavations
PANDEMICS - FROM ANCIENT TIMES TO COVID19 SOME THOUGHTS
Aristotle, King David, King Zhou and Pharao Thutmosis III have seen comet Encke
Evaluation of Mallorca Cathedral seismic behavior using different analysis techniques