Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Boilerpipe 1.2 port for .NET
C#
branch: master

Update Sharpen/Extensions.cs

In SubList section, it can give a "Out of Range" Exception
latest commit de7f11b4e3
@jordivicedo jordivicedo authored

README.md

NBoilerpipe is a C# port of boilerpipe 1.2 (http://code.google.com/p/boilerpipe/) library. Most of the code is converted with the Sharpen tool (https://github.com/slluis/sharpen). The code uses the Sharpen libary (with modification) from NGit project (https://github.com/slluis/ngit) and HmtlAgilityPack (http://htmlagilitypack.codeplex.com/).

NBoilerpipe is only been tested with Mono.

Usage:

using NBoilerpipe.Extractors;
...
String html = GetHtmlText();
var text = ArticleExtractor.INSTANCE.GetText (html);
//var text = DefaultExtractor.INSTANCE.GetText (html);
...

Something went wrong with that request. Please try again.