Topics are collections of words that co-occur frequently in a text corpus. Topics have been found to be effective tools for describing the major themes spanning a corpus. In this work I developed a java tool for extract topics over Android bug report dataset applying a topic model called LDA. This tool also compute various metrics on the identif…
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
data
db
licenses
log
nbproject
properties
squemas
src
stopwords
.gitignore
LICENSE
README.md
build.xml
manifest.mf
run_linux
run_windows.bat

README.md

Extraction and Analysis System of Topics for Software History Reports

Faculty of Sciences, National University of the Center of Buenos Aires (UNICEN). Tandil, Argentina.

Director: PhD. Daniela Godoy, Co-Director: Eng. Alejandro Corbellini.

Abstract: Topics are collections of words that co-occur frequently in a text corpus. Topics have been found to be effective tools for describing the major themes spanning a corpus. In this work I developed a java tool for extract topics over Android bug report dataset applying a topic model called LDA. This tool also compute various metrics on the identifed topics and manually investigate how the metrics evolve over time. Other additional feature is full-text search engine based on Lucene and full-text retrieval technology, including indexing.