Pub Date: 2012-05-16 DOI: 10.1109/RCIS.2012.6240440
Ines Ben Messaoud, J. Feki, G. Zurfluh
The Web plays a key role in the publication and exchange of information between organizations. In this context, XML has become a common standard for data representation and exchange. XML documents are also an important source for decision-support analyses, since they help decision makers better understand and control the evolution of their business processes. However, even when several XML documents belong to the same domain, they may be described by multiple structures. In this paper, we present a method to unify XML document structures in order to build a global, generic view of heterogeneous documents, to store them as a document warehouse, and finally to query them easily. We also describe our software prototype USD (Unification of Structures of XML Documents), which supports the proposed method, and illustrate its functionality through an example.
Title: A first step for building a document warehouse: Unification of XML documents
Venue: 2012 Sixth International Conference on Research Challenges in Information Science (RCIS)
Pub Date: 2012-05-16 DOI: 10.1109/RCIS.2012.6240415
M. Villanueva, Ana Rosa Guzman, Francisco Valverde, A. M. Levin
Geneticists who use software tools to carry out their diagnoses report that current solutions do not completely fulfill their requirements. From an Information Systems perspective, this issue is a consequence of the lack of formal data descriptions, which has led to genetic repositories full of heterogeneous data and inconsistencies. The same lack of formalization is found in software tools for genetic data analysis. As a solution, we provide a unified view that formalizes genetic concepts through the definition of a Conceptual Schema of the Human Genome (CSHG). To demonstrate the benefits of this approach, a Web application for genetic analysis, named Diagen, has been developed by applying the CSHG: Diagen is a model-based tool, since each of its software components is a projection of the proposed CSHG.
Title: Diagen: A model-based bioinformatic tool for genetic analysis
Venue: 2012 Sixth International Conference on Research Challenges in Information Science (RCIS)
Pub Date: 2012-05-16 DOI: 10.1109/RCIS.2012.6240442
Jacques Simonin, Sébastien Bigaret, J. Gourmelen
Designing a data warehouse from an enterprise's databases is an important step in specifying the enterprise's strategy, since that strategy is deduced from the enterprise results stored in those databases. The enterprise's business processes must, moreover, be consistent with this strategy. We therefore propose a rule-based data warehouse design method. The first rules align the data model with the business process model. The remaining rules are expressed through a data pattern that proposes a data warehouse design based on the categorization of the data. The pattern complies with the life cycle of a business process instance and is also relevant for specifying queries on the data warehouse. The method is tested on two epidemiological databases being built up for medical research.
Title: A data warehouse logical design method based on the alignment with business processes
Venue: 2012 Sixth International Conference on Research Challenges in Information Science (RCIS)