Türkçe Doküman Konusu Tespit Sınıfı | 🇹🇷

Bu Python sınıfını kullanarak dahili dataset ile Türkçe dokümanların konularını birkaç satır kod ile tahmin edebilirsiniz. İçerisinde dahili olarak yer alan ve 4000 içerik, 1060500 kelimeden oluşan verisetinden yararlanır. Multinomial Naive Bayes Sınıflandırıcısı kullanılmıştır.

Emre Yavuz tarafından Python ile yazılmıştır | github.com/emreyvz

Kullanılabilecek Methotlar

İçeriğin tahmini konusunu bulma | getPrediction(@param)
Veriseti başarı skorunu getirme | getAccuracyScore()
Verisetinde yer alan konu adları | getTopics()

Verisetinde Yer Alan Konular

Politika
Spor
Sağlık
Teknoloji
Ekonomi
Magazin
Askeri

Turkish Document Topic Prediction Library | 🇬🇧

You can get topic of given Turkish text content with couple lines of code by using this library. The library use an internal dataset that include 4000 different contents and around 1060500 words. Multinomial Naive Bayes Classifer were used in this library.

Written by Emre Yavuz | github.com/emreyvz

Available Methods

Get topic prediction of given text | getPrediction(@param)
Get Accuracy score of dataset | getAccuracyScore()
Get unique topic list from dataset file | getTopics()

Avaible Topics

Politics
Sports
Health
Technology
Economics
Magazine
Military

Sample Codes / Örnek Kodlar (🇬🇧 / 🇹🇷)

Get prediction of given text content | Verilen dokümanın konusunu bulma

from TurkishTopicPredictLibrary import *

TopicPredict = TurkishTopicPredict()
detectedTopic = TopicPredict.getPrediction("İklim değişikliğinin en önemli nedenlerinden biri, sera gazlarına yol açan fosil yakıtlar. Bu nedenle fosil yakıtlara yeni alternatifler aranıyor. Bu alternatiflerden biri de biyoyakıt üretiminde kullanılabilen yosunlar. Yosunlar, verimli tarım arazilerine ihtiyaç duymadıkları için tarımsal faaliyetler ile rekabet etmiyorlar. Güneş ışığı ve karbondioksit kullanarak fotosentez yaptıklarından sürdürülebilir koşullar altında üretiliyorlar. Biyorafineriler, konvansiyonel rafinerilerde işlenen fosil yakıtlara alternatif biyoyakıt üretimi yanında katma değerli ek ürünlerin de elde edildiği, sıfır atık temelli ve sürdürülebilir kalkınma odaklı bir model sunuyor.")
print(detectedTopic)

Get unique topic list from dataset | Verisetinde yer alan konu adlarını göstermek

from TurkishTopicPredictLibrary import *

TopicPredict = TurkishTopicPredict()
print(TopicPredict.getTopics())

Check Dataset Accuracy Score | Veriseti Doğruluk Skorunu Kontrol Etme

from TurkishTopicPredictLibrary import *

TopicPredict = TurkishTopicPredict()
if TopicPredict.getAccuracyScore() < 50:
    print('Accuracy Score must be greater than 50')

License / Lisans

Apache 2.0 License

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
LICENSE		LICENSE
README.md		README.md
TurkishTopicPredictLibrary.py		TurkishTopicPredictLibrary.py
__init__.py		__init__.py
dataset		dataset

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Türkçe Doküman Konusu Tespit Sınıfı | 🇹🇷

Turkish Document Topic Prediction Library | 🇬🇧

Sample Codes / Örnek Kodlar (🇬🇧 / 🇹🇷)

Get prediction of given text content | Verilen dokümanın konusunu bulma

Get unique topic list from dataset | Verisetinde yer alan konu adlarını göstermek

Check Dataset Accuracy Score | Veriseti Doğruluk Skorunu Kontrol Etme

License / Lisans

About

Releases

Packages

Languages

License

emreyvz/turkish-document-topic-prediction-library

Folders and files

Latest commit

History

Repository files navigation

Türkçe Doküman Konusu Tespit Sınıfı | 🇹🇷

Turkish Document Topic Prediction Library | 🇬🇧

Sample Codes / Örnek Kodlar (🇬🇧 / 🇹🇷)

Get prediction of given text content | Verilen dokümanın konusunu bulma

Get unique topic list from dataset | Verisetinde yer alan konu adlarını göstermek

Check Dataset Accuracy Score | Veriseti Doğruluk Skorunu Kontrol Etme

License / Lisans

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages