Skip to content

Binary Independence Model algorithm to search the relevant news article from the list of documents.

Notifications You must be signed in to change notification settings

cricketjanoon/binary-independence-model

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Binary Independence Model

This was my first project in Python so most of the implementation may be naive and may not be pythonic. But the code works perfectly fine.

There are 32604 news in the dataset. Programs ask you a query (what you want to search) and gives you the most relevant news articles. Just working like a search engine.

Parameters

There are only two parameters. Because the dataset set is large and it takes 30/40 minutes to process all of them. You can work with small number of examples by setting "trainingSetCount" equal to 100 or maybe 1000. "numberOfSearchesToShow" is the number of results returned by the algorithm.

##----------------------------
trainingSetCount = 32604
numberOfSearchesToShow = 5
##----------------------------

Refrences Material

BIM by Standford / Derrivation of BIM

About

Binary Independence Model algorithm to search the relevant news article from the list of documents.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages