Jong Youl Choi, Youngik Yang, Sun Kim, Dennis Gannon
{"title":"V-Lab-Protein: Virtual Collaborative Lab for protein sequence analysis","authors":"Jong Youl Choi, Youngik Yang, Sun Kim, Dennis Gannon","doi":"10.1109/BIBMW.2007.4425417","DOIUrl":null,"url":null,"abstract":"Recent development of genome and gene analysis technology enabled rapid accumulation of biological data. To utilize such huge data, a biologist needs to have resource-rich computing environment and user-friendly analysis tool invocation. To response such requirements, we designed and implemented a virtual lab, named Virtual Collaborative Lab (V-Lab-Protein), using an efficient and flexible computing resource management and workflow engine with a user-friendly graphical workflow composer. Utility of our system is demonstrated by analyzing sample protein sequence sets. This is the first system of its kind that combines flexible workflow systems and on-demand compute and data resources (Amazon EC2/S3 in this case). We believe that this system design principle will be a new and effective paradigm for small biology research labs to handle the ever-increasing biological data.","PeriodicalId":260286,"journal":{"name":"2007 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2007.4425417","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Recent development of genome and gene analysis technology enabled rapid accumulation of biological data. To utilize such huge data, a biologist needs to have resource-rich computing environment and user-friendly analysis tool invocation. To response such requirements, we designed and implemented a virtual lab, named Virtual Collaborative Lab (V-Lab-Protein), using an efficient and flexible computing resource management and workflow engine with a user-friendly graphical workflow composer. Utility of our system is demonstrated by analyzing sample protein sequence sets. This is the first system of its kind that combines flexible workflow systems and on-demand compute and data resources (Amazon EC2/S3 in this case). We believe that this system design principle will be a new and effective paradigm for small biology research labs to handle the ever-increasing biological data.