TextExtraction

An AutoML about text extraction, you can use three or four lines of code to extract text.

Read in other languages：简体中文、English

example

pipeline

This is a very simple example.(Suitable for beginners)

from TextExtraction import TextExtractionPipeline
pipeline = TextExtractionPipeline(train_dataset_path='../Sentiment_Extraction103/train.csv')
pipeline.run()

The dataset used in the example, clickhere

custom

If you want higher accuracy, you can use custom mode (Suitable for experts)

You can use the model API and tools API to build your own model.
If you have any questions, welcome to communicate with the author and contact information qq:1638650145, email:s1638650145@gmail.com, and issue.

Performance

Use Jaccard coefficient to evaluate, Jaccard coefficient is between 0.69-0.70.
Use tf.data.Datasets to load the dataset and use the default parameters. The reference speed on Nvidia Tesla P100 is 370ms/step, and the reference speed on Google TPU is 101ms/step.
Support running with TPU.

If you want to do something

If you want to code, please use the PEP8, otherwise it must not pass.
If you want to use other models such as Bert and Albert, please communicate with the author. The contact information is above.
If you want to star and fork, just do it. Your idea is very wise. Finally, thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README-en.md

README-en.md

TextExtraction

example

pipeline

custom

Performance

If you want to do something

Files

README-en.md

Latest commit

History

README-en.md

File metadata and controls

TextExtraction

example

pipeline

custom

Performance

If you want to do something