Skip to content

Commit

Permalink
feat(#307): more
Browse files Browse the repository at this point in the history
  • Loading branch information
h1alexbel committed May 9, 2024
1 parent 0cc376d commit 44bdcbf
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion tex/report.tex
Original file line number Diff line number Diff line change
Expand Up @@ -137,7 +137,8 @@ \section{Methodology}\label{sec:method}
tag, have reasonably big but not too big number of stars, have either MIT or Apache License,
and are of certain minimum size;
\item Filter out repositories those contain samples, instead real project,
framework or library.
framework or library utilizing Machine Learning techniques like text
classification.
\item Remove files without \ff{.java} extension, Java files with syntax errors,
supplementary files such as \ff{package-info.java} and \ff{module-info.java},
files with very long lines, and unit tests;
Expand All @@ -161,6 +162,7 @@ \section{Results}\label{sec:results}
\iexec{cat "${TARGET}/temp/repo-details.tex"}
The full list of them is in the \ff{repositories.csv} file.
The \ff{hashes.csv} file has a list of Git hashes of their latest commits.
Predictions about each repository being sample or not located in \ff{predictions.csv} file.

The filtering process was the following:

Expand Down

0 comments on commit 44bdcbf

Please sign in to comment.