Developing Sustainable Data Services in Cyberinfrastructure for Higher Education: Requirements and Lessons Learned
Wilfred W. Li, R. Moore, Matthew Kullberg, B. Battistuz, S. Meier, Ronald Joyce, R. Wagner, T. Reynales, Qian Liu
2013 IEEE 9th International Conference on e-Science, published 2013-10-22
DOI: 10.1109/eScience.2013.46 (https://doi.org/10.1109/eScience.2013.46)
Citations: 1
Abstract
The University of California, San Diego (UC San Diego) Research Cyberinfrastructure (RCI) program provides long-term, high-quality services in centralized storage, colocation, computing, data curation, networking, and technical expertise. To help define data storage needs and set priorities, the RCI data services (RCIDS) team conducted a series of interviews with faculty and senior staff members between September 2012 and February 2013. A total of 50 groups from 29 separate departments and organized research units (ORUs) participated in the interviews, representing more than 600 UC San Diego researchers. Their diverse datasets, which range from human genomic sequences and marine natural products to cosmological simulations, are shared with hundreds of thousands of users worldwide. The interviews identified the top 10 requirements for data services and the top 5 existing challenges and risks reported by UC San Diego researchers. Based on these requirements, the RCIDS team recommends deploying a Network Attached Storage (NAS) data service first, supported by a sustainable business model. Additional services will be developed through further discussion with the research community and in view of emerging cloud computing technologies. The paper discusses in detail the implementation plan, cloud-based data services, and the lessons learned in building sustainable e-science infrastructure for higher education research.