Permalink
Browse files

Updated comments

  • Loading branch information...
1 parent 8385063 commit 69700a9b12ec46998b8fed5c20237a8fae6951ca @skid committed Feb 28, 2012
Showing with 4 additions and 3 deletions.
  1. +3 −2 extractor.js
  2. +1 −1 package.json
View
@@ -175,10 +175,11 @@ function printTree(tree, options, depth) {
score: [Number] Repetitiveness of the node's contents. Calculated as:
- PL * { SUM( (score(Mi) + 1) * count(Mi) * height(Mi) ) + SUM( score(Ni) + 1 ) } / (cM + cN)
+ ( SUM( (score(Mi) + 1) * C(Mi) * H(Mi) ) + SUM( score(Ni) + 1 ) ) / (cM + cN)
- Mi = Group of children with identical patterns (groups)
+ Mi = Children nodes with identical patterns (groups)
cM = Total number of groups of children with identical patterns (gcount)
+ C = Number of nodes in a group
Ni = Children that have a unique pattern within the node
cN = Total number of children with unique patterns (count)
H = Node's pattern length. The pattern length depends on the number of children and
View
@@ -1,6 +1,6 @@
{
"name": "picksy",
- "description": "Extracts the relevant text from a html page",
+ "description": "Extracts the relevant text from a html article page",
"keywords": ["html", "scrape", "extract", "text"],
"author": "Dusko Jordanovski <jordanovskid@gmail.com>",
"version": "0.1.0",

0 comments on commit 69700a9

Please sign in to comment.