{"title":"Big Software Data Analysis","authors":"M. Lungu, Oscar Nierstrasz, Niko Schwarz","doi":"10.7892/BORIS.17295","DOIUrl":"https://doi.org/10.7892/BORIS.17295","url":null,"abstract":"","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"2012 1","pages":""},"PeriodicalIF":0.1,"publicationDate":"2012-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"71357667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2011-09-26 DOI: 10.1007/978-3-642-24469-8_54
N. Aloia, C. Concordia, A. V. Gerwen, C. Meghini, Nicola Zeni
{"title":"Design, Implementation and Evaluation of a User Generated Content Service for Europeana","authors":"N. Aloia, C. Concordia, A. V. Gerwen, C. Meghini, Nicola Zeni","doi":"10.1007/978-3-642-24469-8_54","DOIUrl":"https://doi.org/10.1007/978-3-642-24469-8_54","url":null,"abstract":"","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"2011 1","pages":""},"PeriodicalIF":0.1,"publicationDate":"2011-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"51081557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
To improve OCR quality in texts originally typeset in Gothic script, we have built an automated correction system that is highly specialized for the given text. Our approach draws on external dictionary resources as well as information derived from the text itself. The focus lies on testing and improving different methods for classifying words as correct or erroneous. Different techniques are also applied to find and rate correction candidates. In addition, we are working on a web application that enables users to read and edit the digitized text online.
{"title":"Reducing OCR Errors in Gothic-Script Documents","authors":"Lenz Furrer, M. Volk","doi":"10.5167/UZH-49812","DOIUrl":"https://doi.org/10.5167/UZH-49812","url":null,"abstract":"In order to improve OCR quality in texts originally typeset in Gothic script, we have built an automated correction system which is highly specialized for the given text. Our approach includes external dictionary resources as well as information derived from the text itself. The focus lies on testing and improving different methods for classifying words as correct or erroneous. Also, different techniques are applied to find and rate correction candidates. In addition, we are working on a web application that enables users to read and edit the digitized text online.","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"2011 1","pages":""},"PeriodicalIF":0.1,"publicationDate":"2011-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70654094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The goal of the Dicode project is to facilitate and augment collaboration and decision making in data-intensive and cognitively-complex settings. To do so, it will exploit and build on the most prominent high-performance computing paradigms and large data processing technologies to meaningfully search, analyze and aggregate data existing in diverse, extremely large, and rapidly evolving sources.
{"title":"Mastering Data-Intensive Collaboration and Decision Making through a Cloud Infrastructure: The Dicode EU project","authors":"N. Karacapilidis","doi":"10.14806/EJ.17.1.202","DOIUrl":"https://doi.org/10.14806/EJ.17.1.202","url":null,"abstract":"The goal of the Dicode project is to facilitate and augment collaboration and decision making in data-intensive and cognitively-complex settings. To do so, it will exploit and build on the most prominent high-performance computing paradigms and large data processing technologies to meaningfully search, analyze and aggregate data existing in diverse, extremely large, and rapidly evolving sources.","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"17 1","pages":"3"},"PeriodicalIF":0.1,"publicationDate":"2011-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66659506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2011-01-01 DOI: 10.2312/PE/VAST/VAST11S/041-044
Andrea Marchetti, E. J. Shepherd, M. Tesconi
The AeroFototeca Nazionale of the Italian Ministry of Cultural Heritage in Rome maintains an extensive set of some million aerial photographs, constituting an important memory archive of the Italian territory throughout the 20th century. The GeoMemories project was launched together with the Institute of Informatics and Telematics of CNR in Pisa, with the aim of creating a web platform that covers spatial-temporal dimensions, integrates multimedia data from other archives, and displays the evolution of the Italian landscape. We present some challenges of the project and its achievements so far, as well as examples of how the tool has great potential to become a valuable resource for both historians and archaeologists.
{"title":"GeoMemories: A Spatial-Temporal Atlas of the Italian Landscape","authors":"Andrea Marchetti, E. J. Shepherd, M. Tesconi","doi":"10.2312/PE/VAST/VAST11S/041-044","DOIUrl":"https://doi.org/10.2312/PE/VAST/VAST11S/041-044","url":null,"abstract":"The AeroFototeca Nazionale of the Italian Ministry of Cultural Heritage in Rome maintains an extensive set of some million aerial photographs constituting an important memory archive of the Italian territory throughout the 20th century. Together with the Institute of Informatics and Telematics of CNR in Pisa the GeoMemories project was launched with the aim of creating a web platform covering spatial-temporal dimensions and also integrating multimedia data from other archives that displays the evolution of the Italian Landscape. We present some challenges of the project and achievements so far as well as examples of how the tool presented here has a great potential to become a valuable resource for both historians and archaeologists.","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"2011 1","pages":""},"PeriodicalIF":0.1,"publicationDate":"2011-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"68642290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2010-09-06 DOI: 10.1007/978-3-642-15464-5_42
Brian Aitken, Andrew Lindley
{"title":"The Planets Testbed - A Collaborative Research Environment for Digital Preservation","authors":"Brian Aitken, Andrew Lindley","doi":"10.1007/978-3-642-15464-5_42","DOIUrl":"https://doi.org/10.1007/978-3-642-15464-5_42","url":null,"abstract":"","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"15 1","pages":"401-404"},"PeriodicalIF":0.1,"publicationDate":"2010-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85631626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper we extend the original heterogeneous agent model by introducing smart traders and changes in agents' sentiment. The idea of smart traders is based on the endeavor of market agents to estimate future price movements. By adding smart traders and changes in sentiment, we try to improve the original heterogeneous agent model so that it provides a closer description of real markets. The main result of the simulations is that the probability distribution functions of the price deviations change significantly when smart traders are added to the model, and they also change significantly when changes in sentiment are introduced. We also use the Hurst exponent to measure the persistence of the price deviations, and we find that it increases significantly with the number of smart traders in the simulations. This means that introducing the smart traders concept into the model results in significantly higher persistence of the simulated price deviations. On the other hand, introducing changing sentiment in the proposed form does not change the persistence of the simulated prices significantly.
{"title":"Smart Agents and Sentiment in the Heterogeneous Agent Model","authors":"Lukáš Vácha, Jozef Baruník, M. Vosvrda","doi":"10.18267/J.PEP.350","DOIUrl":"https://doi.org/10.18267/J.PEP.350","url":null,"abstract":"In this paper we extend the original heterogeneous agent model by introducing smart traders and changes in agents' sentiment. The idea of smart traders is based on the endeavor of market agents to estimate future price movements. By adding smart traders and changes in sentiment we try to improve the original heterogeneous agents model so that it provides a closer description of real markets. The main result of the simulations is that the probability distribution functions of the price deviations change significantly when smart traders are added to the model, and they also change significantly when changes in sentiment are introduced. We also use the Hurst exponent to measure the persistence of the price deviations and we find that the Hurst exponent is significantly increasing with the number of smart traders in the simulations. This means that the introduction of the smart traders concept into the model results in significantly higher persistence of the simulated price deviations. On the other hand, the introduction of changing sentiment in the proposed form does not change the persistence of the simulated prices significantly.","PeriodicalId":44543,"journal":{"name":"ERCIM News","volume":"2010 1","pages":""},"PeriodicalIF":0.1,"publicationDate":"2010-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"67804391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}