My thesis on ranking algorithms, submitted in partial fulfilment of the requirements for the degree of Master of Science in Computer Science (Data Analytics).
The thesis was awarded first class honours.
Citation analysis is an important tool for evaluating researchers and their scientific work. The most common evaluation metrics in use today are the impact factor for journals and the h-index for authors. In recent years these metrics have increasingly been used to decide whether a researcher is considered for a job, a promotion, or a government grant. The problem is that they are easily manipulated through self-citations and, more seriously, through the recent emergence of citation cartels: self-citations are easy to detect, but citation cartels are not.

This research project introduces alternative approaches, based on Google's PageRank algorithm, for evaluating researchers and journals. A citation dataset compiled by Václav Belák, ArnetCite, was used. The project first examined how these algorithms rank papers compared with raw citation counts, and then measured their robustness against author self-citations. Next, four of the lowest-ranking papers under both algorithms were selected, and a citation cartel was formed by modifying existing entries to create synthetic citation data with cartel features. The performance of each algorithm is measured by how robust its recalculated scores are once the cartel is introduced. The methodologies and results are discussed, and limitations and future work are also provided.
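As a rough illustration of the pipeline the abstract describes, the sketch below ranks papers by PageRank over a citation graph and then injects a synthetic reciprocal-citation "cartel" among a few low-ranked papers to see how much their positions shift. This is a minimal sketch, not the thesis code: the use of networkx, the `citations.txt` edge-list file, and its "citing cited" line format are assumptions for illustration only.

```python
# Minimal sketch: PageRank over a citation graph plus a synthetic citation cartel.
# The file name and edge-list format are hypothetical, not the ArnetCite format.
import networkx as nx


def load_citation_graph(path):
    """Build a directed graph where an edge (a, b) means paper a cites paper b."""
    g = nx.DiGraph()
    with open(path) as f:
        for line in f:
            citing, cited = line.split()
            g.add_edge(citing, cited)
    return g


def rank_papers(graph, damping=0.85):
    """Return (paper, score) pairs sorted by PageRank score, highest first."""
    scores = nx.pagerank(graph, alpha=damping)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def inject_cartel(graph, members):
    """Add reciprocal citations among the given papers to mimic a citation cartel."""
    cartel = graph.copy()
    for a in members:
        for b in members:
            if a != b:
                cartel.add_edge(a, b)
    return cartel


if __name__ == "__main__":
    g = load_citation_graph("citations.txt")  # assumed format: "citing cited" per line
    baseline = rank_papers(g)

    # Pick four of the lowest-ranked papers and form a synthetic cartel among them.
    cartel_members = [paper for paper, _ in baseline[-4:]]
    perturbed = rank_papers(inject_cartel(g, cartel_members))

    # A robust ranking should barely move the cartel members' positions.
    baseline_pos = {p: i for i, (p, _) in enumerate(baseline)}
    perturbed_pos = {p: i for i, (p, _) in enumerate(perturbed)}
    for p in cartel_members:
        print(f"{p}: rank {baseline_pos[p]} -> {perturbed_pos[p]}")
```

The comparison of rank positions before and after the cartel is injected mirrors the robustness measure described above; the thesis itself uses the ArnetCite dataset and its own evaluation procedure rather than this simplified setup.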