Bangla_n-gram_Stemmer

Code base for the paper "N-gram Statistical Stemmer for Bangla Corpus".

Please do not use this code without proper citation (follow the ethics and be honest ^_^)

Author: Md Ataur Rahman

Dept. of Language Science and Technology, University of Saarland

Email: ataur[at]coli[dot]uni-saarland.de

* INSTRUCTIONS ON HOW TO RUN *

Run any version of the stemmer (min - based on minimum edit distance | kmeans - based on K-means Clustering)
It might take some time w.r.t the input data provided/used

Citation Information

Paper

@misc{https://doi.org/10.48550/arxiv.1912.11612,
  doi = {10.48550/ARXIV.1912.11612},
  url = {https://arxiv.org/abs/1912.11612},
  author = {Sadia, Rabeya and Rahman, Md Ataur and Seddiqui, Md Hanif},
  keywords = {Computation and Language (cs.CL), Information Retrieval (cs.IR), FOS: Computer and information sciences, FOS: Computer and information sciences},
  title = {N-gram Statistical Stemmer for Bangla Corpus},
  publisher = {arXiv},
  year = {2019},
  copyright = {arXiv.org perpetual, non-exclusive license}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
#Paper		#Paper
sample_outputs		sample_outputs
test_data		test_data
LICENSE		LICENSE
README.md		README.md
kmeans.py		kmeans.py
min.py		min.py
punctuations.txt		punctuations.txt
sample_input.txt		sample_input.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#Paper

#Paper

sample_outputs

sample_outputs

test_data

test_data

LICENSE

LICENSE

README.md

README.md

kmeans.py

kmeans.py

min.py

min.py

punctuations.txt

punctuations.txt

sample_input.txt

sample_input.txt

Repository files navigation

Bangla_n-gram_Stemmer

Please do not use this code without proper citation (follow the ethics and be honest ^_^)

Author: Md Ataur Rahman

* INSTRUCTIONS ON HOW TO RUN *

Citation Information

Paper

About

Releases

Packages

Languages

License

shaoncsecu/Bangla_n-gram_Stemmer

Folders and files

Latest commit

History

Repository files navigation

Bangla_n-gram_Stemmer

Please do not use this code without proper citation (follow the ethics and be honest ^_^)

Author: Md Ataur Rahman

*** INSTRUCTIONS ON HOW TO RUN ***

Citation Information

Paper

About

Resources

License

Stars

Watchers

Forks

Languages

* INSTRUCTIONS ON HOW TO RUN *