{"title":"Biomedical Research Data Cloud Services with Duckling Collaboration LiBrary (CLB)","authors":"Kejun Dong, Ji Li, Kai Nan, Wilfred W. Li","doi":"10.1109/eScience.2013.17","DOIUrl":null,"url":null,"abstract":"Rapid advances in scientific research have led to unprecedented data deluge and significant challenges in data interoperability, certification and collaboration. The Collaboration LiBrary (CLB) is designed to manage and collate millions of data files by setting up a unified, robust, and scalable data repository, especially in support of experimental data collaboration and timeline-based data life cycle management. It has recently been released as a component of Duckling, an open-source collaboration environment toolkit developed by the Chinese Academy of Sciences (CAS) and widely adopted in many disciplines. In this paper, we present newly developed components for data synchronization and snapshots in an updated architecture for CLB. We have also extended CLB with new data cloud service modules (CLB+) that enables data mapping and synchronization from the cloud to user workspace. CLB+ is implemented as CLB plugins that provide interfaces with biomedical research cloud services from a computer aided drug discovery (CADD) workflow for ensemble-based virtual screening. The flexible plug in architecture of CLB makes it easy to develop a prototype biomedical research data cloud environment. Many other e-science applications may leverage or expand CLB functionalities in data life cycle management in a similar fashion.","PeriodicalId":325272,"journal":{"name":"2013 IEEE 9th International Conference on e-Science","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 9th International Conference on e-Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/eScience.2013.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Rapid advances in scientific research have led to unprecedented data deluge and significant challenges in data interoperability, certification and collaboration. The Collaboration LiBrary (CLB) is designed to manage and collate millions of data files by setting up a unified, robust, and scalable data repository, especially in support of experimental data collaboration and timeline-based data life cycle management. It has recently been released as a component of Duckling, an open-source collaboration environment toolkit developed by the Chinese Academy of Sciences (CAS) and widely adopted in many disciplines. In this paper, we present newly developed components for data synchronization and snapshots in an updated architecture for CLB. We have also extended CLB with new data cloud service modules (CLB+) that enables data mapping and synchronization from the cloud to user workspace. CLB+ is implemented as CLB plugins that provide interfaces with biomedical research cloud services from a computer aided drug discovery (CADD) workflow for ensemble-based virtual screening. The flexible plug in architecture of CLB makes it easy to develop a prototype biomedical research data cloud environment. Many other e-science applications may leverage or expand CLB functionalities in data life cycle management in a similar fashion.