Can i just specify languages that i want to detect, such as only detect en, ja and zh-cn? #71

maliho0803 · 2020-04-20T02:05:01Z

Can I specify just the languages I want to detect, for example only en, ja and zh-cn?

Zsub · 2020-05-18T14:56:20Z

You can do this by instantiating the detector yourself:

import csv

import langdetect

# index of the CSV column that holds the text (assumption: adjust to your file)
column = 0

with open('rawdata.csv', newline='', encoding="UTF-8") as rawdata:
    rawreader = csv.reader(rawdata, delimiter=',', quotechar='"')

    # instantiate the DetectorFactory and load the bundled language profiles once
    factory = langdetect.detector_factory.DetectorFactory()
    factory.load_profile(langdetect.detector_factory.PROFILES_DIRECTORY)

    for row in rawreader:
        # a fresh detector is created for each text
        detector = factory.create()
        # prior probabilities for the languages to consider;
        # languages left out of the map get a prior of zero
        detector.set_prior_map({"en": 0.5, "de": 0.5})
        # give the detector the text to run on
        detector.append(row[column])
        # let the detector run!
        print(detector.detect())
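
For the languages in the original question, the same approach could be wrapped in a small helper. This is only a sketch built on the calls used above (`DetectorFactory`, `load_profile`, `create`, `set_prior_map`, `append`, `detect`). The names `uniform_prior`, `_factory` and `detect_among` are made up for illustration and are not part of langdetect. It assumes that a language missing from the prior map keeps a probability of zero and so is never returned.

```python
from functools import lru_cache


def uniform_prior(languages):
    """Give each allowed language code the same prior probability."""
    codes = list(dict.fromkeys(languages))  # drop duplicates, keep order
    if not codes:
        raise ValueError("at least one language code is required")
    return {code: 1.0 / len(codes) for code in codes}


@lru_cache(maxsize=1)
def _factory():
    """Build the factory on first use and reuse it afterwards."""
    # Imported here so the profiles are only loaded when detection is needed.
    from langdetect.detector_factory import DetectorFactory, PROFILES_DIRECTORY

    factory = DetectorFactory()
    factory.load_profile(PROFILES_DIRECTORY)
    return factory


def detect_among(text, languages):
    """Detect the language of `text`, considering only `languages`."""
    detector = _factory().create()
    # Hypothetical helper: equal priors for the allowed codes, none for the rest.
    detector.set_prior_map(uniform_prior(languages))
    detector.append(text)
    return detector.detect()
```

A call would look like `detect_among(some_text, ["en", "ja", "zh-cn"])`. The codes have to match the profile names that langdetect ships (for example `zh-cn`, not `zh`).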

batara666 · 2020-11-26T23:41:15Z

@Mimino666 Can we just ignore specified languages? And wouldn't it be nice to have that as a method?

eduamf mentioned this issue Apr 22, 2020

detect confidence of a single language #41

Open

hugovk mentioned this issue May 16, 2021

Can you force the detection algorithm only to choose between a short list of languages? #84

Closed
