HKBU-Search-Engine

A search engine written as a group project for CSC 4047 (Internet and World Wide Web) at Hong Kong Baptist University.

It consists of a web crawler and a server.

The web crawler uses jsoup to connect to websites and stores information about their contents in an inverted index, implemented with a hash map for fast lookups. The position of each word within a site is saved as well. A set of arrays holds per-site information, including the total text length, the site title from the metadata (if present), and a calculated PageRank score.
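
As a rough illustration of the positional inverted index described above, here is a minimal sketch in plain Java. The class name `InvertedIndexSketch`, its methods, and the tokenization rule (lowercasing and splitting on non-word characters) are assumptions made for this example and are not taken from the project's source. It also keeps document lengths in a map rather than the arrays the crawler uses, purely to keep the sketch short.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch of a positional inverted index; not the project's actual class. */
class InvertedIndexSketch {
    // term -> (document id -> positions at which the term occurs in that document)
    private final Map<String, Map<Integer, List<Integer>>> postings = new HashMap<>();
    // document id -> total number of tokens in the document
    private final Map<Integer, Integer> docLengths = new HashMap<>();

    /** Tokenizes the text and records the position of every token (assumed tokenization). */
    void addDocument(int docId, String text) {
        int position = 0;
        for (String token : text.toLowerCase().split("\\W+")) {
            if (token.isEmpty()) {
                continue; // split() can yield a leading empty string
            }
            postings.computeIfAbsent(token, t -> new HashMap<>())
                    .computeIfAbsent(docId, d -> new ArrayList<>())
                    .add(position);
            position++;
        }
        docLengths.put(docId, position);
    }

    /** Returns document id -> positions for the term, or an empty map if the term is unknown. */
    Map<Integer, List<Integer>> lookup(String term) {
        return postings.getOrDefault(term.toLowerCase(), Collections.emptyMap());
    }

    /** Returns the token count recorded for the document, or 0 if it was never indexed. */
    int length(int docId) {
        return docLengths.getOrDefault(docId, 0);
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.addDocument(0, "Hong Kong Baptist University");
        index.addDocument(1, "Search engine for Hong Kong");
        System.out.println(index.lookup("kong"));
    }
}
```

Storing positions alongside document ids is what would make phrase or proximity matching possible at query time; whether the server actually uses them that way is not stated here.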

The server uses the Spring Framework to accept queries and return results. It deserializes the data files created by the crawler from storage.
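
The hand-off between crawler and server could look roughly like the sketch below. It assumes Java's built-in object serialization (`ObjectOutputStream` / `ObjectInputStream`); the README does not say which serialization mechanism or file layout the project uses, and the class `IndexStoreSketch`, its methods, and the `HashMap<String, Integer>` payload are hypothetical stand-ins.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.util.HashMap;

/** Hypothetical sketch of saving and loading crawler data; not the project's actual code. */
class IndexStoreSketch {
    /** Crawler side: writes the map to a file using Java serialization (assumed mechanism). */
    static void save(HashMap<String, Integer> data, File file) throws IOException {
        try (ObjectOutputStream out = new ObjectOutputStream(new FileOutputStream(file))) {
            out.writeObject(data);
        }
    }

    /** Server side: reads the map back from the file written by save(). */
    @SuppressWarnings("unchecked")
    static HashMap<String, Integer> load(File file) throws IOException, ClassNotFoundException {
        try (ObjectInputStream in = new ObjectInputStream(new FileInputStream(file))) {
            return (HashMap<String, Integer>) in.readObject();
        }
    }

    public static void main(String[] args) throws Exception {
        File file = File.createTempFile("index-sketch", ".ser");
        file.deleteOnExit();

        HashMap<String, Integer> lengths = new HashMap<>();
        lengths.put("example-site", 1200); // made-up value for illustration

        save(lengths, file);
        System.out.println(load(file).equals(lengths));
    }
}
```

With this approach the server only needs to load the files once and can then answer queries from memory.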


KevinAiken/HKBU-Search-Engine
