DSpace Repository

Sonarlizer Xplorer Dataset

Show simple item record

dc.contributor.author Pina, Diogo
dc.contributor.author Goldman, Alfredo
dc.contributor.author Seaman, Carolyn
dc.date.accessioned 2022-04-01T10:32:28Z
dc.date.available 2022-04-01T10:32:28Z
dc.date.issued 2022-04-01T07:32:28Z
dc.identifier.uri http://repositorio.uspdigital.usp.br/handle/item/350
dc.description The advancement of artificial intelligence and the implementation of machine learning capabilities in programming languages such as Python, along with cloud services, allow researchers to apply methods to cluster and predict behaviors and patterns in software engineering data. On the other hand, these methods need a large amount of data in order to work with high accuracy in different contexts. This paper introduces Sonarlizer Xplorer: a tool that captures a large number of technical debt items and code metrics from public GitHub projects. Sonarlizer Xplorer is composed of two sub-tools. The first is Github Xplorer, responsible for mining public Github repositories from an initial project. The second is Sonarlizer, responsible for taking projects and analyzing them using SonarQube. We used the tool over four months, collecting technical debt items and code metrics on almost 46,000 public Java projects. In addition, we mined over 57 million repositories and 4 million users.
dc.subject Sonarlizer Xplorer Dataset
dc.subject Technical debt
dc.subject Github mining
dc.title Sonarlizer Xplorer Dataset
dc.type Dataset


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Browse

My Account

Statistics