{"title":"Challenges in data extraction from Open Source software repositories","authors":"Arvinder Kaur, Vidhi Vig","doi":"10.1109/CONFLUENCE.2016.7508135","DOIUrl":null,"url":null,"abstract":"Open source softwares (OSS) are a boon to research and development. Their free access and availability of thousands of projects pushes research into new dimensions. But are these repositories actually helpful in quality research? Heterogeneous and incomplete data, lack of integration between repositories, performance and usability issues and lack of documentation are few factors that affect the quality of research in Open Source. In this paper, we lay down the difficulties experienced while working on extraction and analysis of data from open source repositories for quality research.","PeriodicalId":299044,"journal":{"name":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","volume":"196 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 6th International Conference - Cloud System and Big Data Engineering (Confluence)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CONFLUENCE.2016.7508135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Open source softwares (OSS) are a boon to research and development. Their free access and availability of thousands of projects pushes research into new dimensions. But are these repositories actually helpful in quality research? Heterogeneous and incomplete data, lack of integration between repositories, performance and usability issues and lack of documentation are few factors that affect the quality of research in Open Source. In this paper, we lay down the difficulties experienced while working on extraction and analysis of data from open source repositories for quality research.