-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
any paper or algorithm description about text extraction? #665
Comments
Hi @whqwill , can you please specify your needs in detail. Thanks :) |
I mean how it selects the important parts as the 'main text' and if possible any comparison with other methods. @Ask149 |
Not exactly for this newspaper lib, but the slides in this link is very useful overview of the problem: |
Oh, it is helpful for me. Thanks.
Haiqing
bact <notifications@github.com> 于2019年1月22日周二 上午11:28写道:
… Not exactly for this newspaper lib, but the slides in this link is very
useful overview of the problem:
Boilerplate Detection using Shallow Text Features
http://www.l3s.de/%7Ekohlschuetter/boilerplate/
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#665 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AHCjdNxirdbTa-jTvWVcJZlEzDxpFxk8ks5vFoVJgaJpZM4Z0uAq>
.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
any paper or algorithm description about text extraction? I want to know its theory details, thanks
The text was updated successfully, but these errors were encountered: