TechSpaces: Identifying and Clustering Popular Programming Technologies

G. Miranda, João Eduardo Montandon, M. T. Valente
{"title":"TechSpaces: Identifying and Clustering Popular Programming Technologies","authors":"G. Miranda, João Eduardo Montandon, M. T. Valente","doi":"10.1145/3559712.3559715","DOIUrl":null,"url":null,"abstract":"Background: Software ecosystems are becoming increasingly complex and large. Therefore, discovering and selecting the right libraries and frameworks for use in a project is becoming a challenging task. Existing commercial services that support this task rely on annual surveys with developers to provide a landscape of the most popular technologies in a given ecosystem. Aims: In this paper, we outline a semi-automated technique for this purpose, which we call TechSpaces. Method: Our proposal relies on community detection and well-known NLP algorithms to automatically extract groups of related technologies, using as primary data source tags associated with Stack Overflow questions. Results: We describe the first results of using our technique to identify popular and inter-related technologies in five programming language ecosystems. Evaluation: We compare our technique against two other tools in the literature. Conclusions: The proposed technique shows potential to assist IT professionals in taking technical decisions supported by crowd knowledge. However, further improvements are needed to make it a viable choice. For instance, we envision the usage of other data sources (e.g., GitHub and Wikipedia) can contribute to improve the accuracy and expressiveness of our graph representations.","PeriodicalId":119656,"journal":{"name":"Proceedings of the 16th Brazilian Symposium on Software Components, Architectures, and Reuse","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 16th Brazilian Symposium on Software Components, Architectures, and Reuse","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3559712.3559715","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Software ecosystems are becoming increasingly complex and large. Therefore, discovering and selecting the right libraries and frameworks for use in a project is becoming a challenging task. Existing commercial services that support this task rely on annual surveys with developers to provide a landscape of the most popular technologies in a given ecosystem. Aims: In this paper, we outline a semi-automated technique for this purpose, which we call TechSpaces. Method: Our proposal relies on community detection and well-known NLP algorithms to automatically extract groups of related technologies, using as primary data source tags associated with Stack Overflow questions. Results: We describe the first results of using our technique to identify popular and inter-related technologies in five programming language ecosystems. Evaluation: We compare our technique against two other tools in the literature. Conclusions: The proposed technique shows potential to assist IT professionals in taking technical decisions supported by crowd knowledge. However, further improvements are needed to make it a viable choice. For instance, we envision the usage of other data sources (e.g., GitHub and Wikipedia) can contribute to improve the accuracy and expressiveness of our graph representations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
技术空间:识别和聚集流行的编程技术
背景:软件生态系统正变得越来越复杂和庞大。因此,发现和选择项目中使用的正确库和框架正在成为一项具有挑战性的任务。支持这项任务的现有商业服务依赖于对开发人员的年度调查,以提供给定生态系统中最流行技术的概况。目的:在本文中,我们概述了一种半自动技术,我们称之为TechSpaces。方法:我们的建议依赖于社区检测和著名的NLP算法来自动提取相关技术组,使用与堆栈溢出问题相关的主要数据源标签。结果:我们描述了使用我们的技术识别五种编程语言生态系统中流行的和相互关联的技术的第一个结果。评价:我们将我们的技术与文献中其他两种工具进行比较。结论:所建议的技术显示出帮助IT专业人员在群体知识支持下进行技术决策的潜力。然而,要使其成为可行的选择,还需要进一步的改进。例如,我们设想使用其他数据源(例如,GitHub和Wikipedia)可以有助于提高我们的图形表示的准确性和表达性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of Microservice Evolution using Cohesion Metrics On the Nature of Duplicate Pull Requests: An Empirical Study Using Association Rules A Field Study on Reference Architectural Decisions for Developing a UML-based Software Product Line Tool TechSpaces: Identifying and Clustering Popular Programming Technologies OfflineManager: A Lightweight Approach for Managing Offline Status in Mobile Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1