MyETL: A Java Software Tool to Extract, Transform, and Load Your Business

Nuovo Michele
{"title":"MyETL: A Java Software Tool to Extract, Transform, and Load Your Business","authors":"Nuovo Michele","doi":"10.1515/cris-2015-0011","DOIUrl":null,"url":null,"abstract":"Abstract The project follows the development of a Java Software Tool that extracts data from Flat File (Fixed Length Record Type), CSV (Comma Separated Values), and XLS (Microsoft Excel 97-2003 Worksheet file), apply transformation to those sources, and finally load the data into the end target RDBMS. The software refers to a process known as ETL (Extract Transform and Load). Those kinds of systems are called ETL systems. The analysis involved research on the theory behind the ETL process as well as the theory behind the various phases of the applied methodology. Also an in-depth look at the design and architecture of the software has been made. To create a complete design needed to be used for the implementation, different techniques and diagrams where used to visualise and refine ideas: UML class diagrams, System Architecture Diagrams, Physical Data Model, and Project Timeline. The implementation of the project involved the translation of the system architecture into working software using the Extreme Programming Methodology and the Java programming language. A mapping algorithm module and design patterns have been used in the implementation phase. A transformation syntax has been defined to achieve data transformation. The testing of the software was done in the form of a unit test. A formal test plan was prepared to ensure that the main features of the system worked as defined. 
An error handling code implementation has been developed to avoid an unexpected crash of the system and to communicate to the user problems or errors.","PeriodicalId":440425,"journal":{"name":"CRIS - Bulletin of the Centre for Research and Interdisciplinary Study","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CRIS - Bulletin of the Centre for Research and Interdisciplinary Study","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/cris-2015-0011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

This project follows the development of a Java software tool that extracts data from flat files (fixed-length record type), CSV (comma-separated values) files, and XLS (Microsoft Excel 97-2003 worksheet) files, applies transformations to those sources, and finally loads the data into the target RDBMS. The software implements a process known as ETL (Extract, Transform, and Load); systems of this kind are called ETL systems. The analysis involved research on the theory behind the ETL process and behind the various phases of the applied methodology, together with an in-depth look at the design and architecture of the software. To produce a complete design for the implementation, several techniques and diagrams were used to visualise and refine ideas: UML class diagrams, system architecture diagrams, a physical data model, and a project timeline. The implementation translated the system architecture into working software using the Extreme Programming methodology and the Java programming language. A mapping algorithm module and design patterns were used in the implementation phase, and a transformation syntax was defined to express data transformations. The software was tested with unit tests, and a formal test plan was prepared to ensure that the main features of the system worked as defined. Error-handling code was developed to prevent unexpected crashes of the system and to report problems or errors to the user.
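The extract, transform, and load stages described in the abstract can be illustrated with a minimal Java sketch. It is not taken from MyETL: the class `MiniEtl`, its method names, the column widths, and the `customer` table are all hypothetical, and the per-column operators only stand in for the transformation syntax the abstract mentions, which is not specified here. XLS extraction is left out because it would need a third-party library, and the CSV parser assumes no quoted fields or embedded commas.

```java
import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.UnaryOperator;

/** Hypothetical sketch of an ETL pipeline; none of these names come from MyETL. */
final class MiniEtl {

    /** Extract: cut one fixed-length record into trimmed fields using column widths. */
    static String[] parseFixedLength(String record, int... widths) {
        String[] fields = new String[widths.length];
        int pos = 0;
        for (int i = 0; i < widths.length; i++) {
            int start = Math.min(pos, record.length());
            int end = Math.min(pos + widths[i], record.length());
            fields[i] = record.substring(start, end).trim();
            pos += widths[i];
        }
        return fields;
    }

    /** Extract: split a simple CSV line (assumes no quoted fields or embedded commas). */
    static String[] parseCsvLine(String line) {
        String[] fields = line.split(",", -1); // -1 keeps trailing empty fields
        for (int i = 0; i < fields.length; i++) {
            fields[i] = fields[i].trim();
        }
        return fields;
    }

    /** Transform: apply one operator per column; extra columns pass through unchanged. */
    static String[] transform(String[] row, List<UnaryOperator<String>> ops) {
        String[] out = new String[row.length];
        for (int i = 0; i < row.length; i++) {
            out[i] = i < ops.size() ? ops.get(i).apply(row[i]) : row[i];
        }
        return out;
    }

    /** Load helper: build a parameterised INSERT (table and column names must be trusted). */
    static String insertSql(String table, String... columns) {
        String placeholders = String.join(", ", Collections.nCopies(columns.length, "?"));
        return "INSERT INTO " + table + " (" + String.join(", ", columns)
                + ") VALUES (" + placeholders + ")";
    }

    /** Load: write all rows to the target RDBMS in one JDBC batch. */
    static void load(Connection conn, String table, String[] columns, List<String[]> rows)
            throws SQLException {
        try (PreparedStatement ps = conn.prepareStatement(insertSql(table, columns))) {
            for (String[] row : rows) {
                for (int i = 0; i < row.length; i++) {
                    ps.setString(i + 1, row[i]); // JDBC parameters are 1-based
                }
                ps.addBatch();
            }
            ps.executeBatch();
        }
    }

    public static void main(String[] args) {
        // Assumed layout: id (4 chars), name (10 chars), score (4 chars).
        String fixed = String.format("%-4s%-10s%-4s", "0001", "alice", "0042");
        List<UnaryOperator<String>> ops = Arrays.asList(
                s -> s,                                    // id: unchanged
                s -> s.toUpperCase(),                      // name: upper-case
                s -> String.valueOf(Integer.parseInt(s))); // score: strip leading zeros

        String[] fromFlatFile = transform(parseFixedLength(fixed, 4, 10, 4), ops);
        String[] fromCsv = transform(parseCsvLine("0002, bob ,7"), ops);

        System.out.println(String.join("|", fromFlatFile)); // 0001|ALICE|42
        System.out.println(String.join("|", fromCsv));      // 0002|BOB|7
        System.out.println(insertSql("customer", "id", "name", "score"));
    }
}
```

In this sketch both source formats are reduced to the same `String[]` row shape before transformation, so a single `load` method can serve every extractor. Running `load` needs a JDBC driver and a live `Connection`, which is why `main` only prints the generated SQL.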