Commits on Feb 21, 2011
  1. url scheme added if none

    committed Feb 21, 2011
  2. rebar and makefile added

    committed Feb 21, 2011
Commits on Feb 18, 2011
  1. support files added

    mochiweb is in rebar deps now
    committed Feb 18, 2011
  2. minor changes

    committed Feb 18, 2011
  3. full mapreduce in score_tree/1

    committed Feb 18, 2011
  4. gather now works without order

    committed Feb 18, 2011
  5. score_tree parallelized (map is done in parallel).

    Not very effective: results are gathered in the same order in which
    the elements were dispatched, so gather() may wait too long on one
    slow element.

    Should be rewritten as map/reduce without ordering (order is not
    important here).
    committed Feb 18, 2011
  6. comments changed

    committed Feb 18, 2011
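The difference between the ordered and unordered gather described in the score_tree commits above can be sketched as follows. This is a minimal Python illustration, not the project's Erlang code: the `score` function, both `gather_*` names, and the use of a sum as the reduce step are assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def score(element):
    # Hypothetical stand-in for per-element scoring: count the commas.
    return element.count(",")


def gather_ordered(elements):
    # Results are collected in submission order, so one slow element
    # delays the collection of every result queued behind it.
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(score, e) for e in elements]
        return sum(f.result() for f in futures)


def gather_unordered(elements):
    # Results are collected as soon as each worker finishes. This is
    # safe only because the reduce step (a sum) ignores ordering.
    with ThreadPoolExecutor() as pool:
        futures = [pool.submit(score, e) for e in elements]
        return sum(f.result() for f in as_completed(futures))
```

Both functions return the same total; only the waiting behaviour differs when some elements take much longer to score than others.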
Commits on Feb 1, 2011
  1. fix

    committed Feb 1, 2011
  2. replace_node can now work with a list of keys

    and process multiple tag replacements at once in one walk of the tree
    committed Feb 1, 2011
  3. @spec of replace_node changed

    committed Feb 1, 2011
  4. or changed to orelse in fold

    committed Feb 1, 2011
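The idea behind the replace_node commit above, several tag replacements applied in a single walk of the tree, might look like the sketch below. It is a Python illustration under stated assumptions: the `(tag, attrs, children)` tuple layout, the `replace_tags` name, and the use of a dict for the replacements are all hypothetical and do not reflect the real signature of replace_node.

```python
def replace_tags(node, replacements):
    # A node is either a text string or a (tag, attrs, children) tuple.
    # This layout is an assumption made for the example.
    if isinstance(node, str):
        return node
    tag, attrs, children = node
    # Look up the tag once per node; unknown tags are kept unchanged.
    new_tag = replacements.get(tag, tag)
    return (new_tag, attrs, [replace_tags(c, replacements) for c in children])
```

With a mapping such as `{"b": "strong", "i": "em"}`, both replacements are handled in the same traversal instead of walking the tree once per key.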
Commits on Jan 30, 2011
  1. .gitignore & TODO changes

    committed Jan 30, 2011
  2. <div> fix continues

    committed Jan 30, 2011
  3. fixed bug with recursively adding multiple <h1>

    (when doing simplify_page(simplify_page(...))).
    committed Jan 30, 2011
Commits on Jan 29, 2011
  1. TODO

    committed Jan 29, 2011
  2. Intelligent title detection added.

    If the resulting page starts with <h1>, that title
    is kept. Otherwise the title from the page's
    <title> is added as <h1>.
    committed Jan 29, 2011
  3. Page header added

    committed Jan 29, 2011
  4. Merge branch 'develop'

    committed Jan 29, 2011
  5. TODO

    committed Jan 29, 2011
  6. optimization:

    init_scores/2 no longer calculates a score for
    each element. Scores are calculated by
    score_tree/1 only for parents of <p> tags (the
    list of parents is scored by comma count, so it
    is already built; the existing structure is reused).

    Debug output of calculated scores to html tag
    attrs removed (commented out).
    committed Jan 29, 2011
  7. Count scores by commas fixed (habrahabr.ru now

    works correctly).
    
    Scoring by commas is done for the parent, not
    for <p> itself (because <p /> has an empty subnode,
    thus the number of "," == 0)
    committed Jan 29, 2011
  8. Initial scores now built depending on html tag

    id or class name.

    This is done in one walk of the tree, which is faster
    than building the score by id or class name for each
    parent of a <p> tag, where the tree is walked as many
    times as there are <p> tags in it.

    So the score_one_p function is not required yet,
    because scoring by commas should be done with the
    parent, not with <p> itself (because <p> is empty
    and the number of "," == 0)
    committed Jan 29, 2011
  9. readability score now saved to html tag attribs

    as readability=Score (for debug)
    committed Jan 29, 2011
  10. bug with read_file fixed

    committed Jan 29, 2011
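The comma-based scoring of <p> parents described in the Jan 29 commits above can be illustrated with a short sketch. It is written in Python rather than the project's Erlang, and the `(tag, attrs, children)` node layout and the `text_of` and `score_parents` names are assumptions for the example, not the project's API.

```python
def text_of(node):
    # Concatenate all text found below a node.
    if isinstance(node, str):
        return node
    _tag, _attrs, children = node
    return "".join(text_of(c) for c in children)


def score_parents(node, out=None):
    # Collect (parent_tag, comma_count) for every element that has at
    # least one direct <p> child. The score is taken from the parent's
    # text, not from the <p> node itself.
    if out is None:
        out = []
    if isinstance(node, str):
        return out
    tag, _attrs, children = node
    if any(not isinstance(c, str) and c[0] == "p" for c in children):
        out.append((tag, text_of(node).count(",")))
    for child in children:
        score_parents(child, out)
    return out
```

Elements without a direct <p> child get no comma score at all here, which mirrors the optimization of scoring only the parents of <p> tags.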
Commits on Jan 28, 2011
  1. TODO changed

    committed Jan 28, 2011
  2. Support for page charset added.

    Charset & content-type are extracted from the
    http response and from the html <meta> tag
    (the html meta tag is preferred).

    Charset & content type are added to the resulting
    html page as a meta tag.
    committed Jan 28, 2011
  3. Merge branch 'develop'

    committed Jan 28, 2011
  4. Body analysis improved.

    committed Jan 28, 2011
  5. Bug in get_max_score_ref

    committed Jan 28, 2011
  6. app.src from rebar added

    committed Jan 28, 2011
  7. README

    committed Jan 28, 2011
  8. README.md

    committed Jan 28, 2011
  9. modified: README.md

    committed Jan 28, 2011
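The charset precedence described in the "Support for page charset added" commit above (the html <meta> tag wins over the http header) could be sketched like this. The function names and the regular expressions are assumptions for the illustration; the project's own extraction logic is not shown in this log.

```python
import re


def charset_from_content_type(value):
    # Pull "charset=..." out of a Content-Type style string.
    match = re.search(r"charset=([\w.:-]+)", value or "", re.IGNORECASE)
    return match.group(1).lower() if match else None


def pick_charset(http_content_type, html):
    # Prefer the charset declared in an html <meta> tag and fall back
    # to the http Content-Type header when the page declares none.
    meta = re.search(
        r'<meta[^>]+content=["\']([^"\']*charset=[^"\']*)["\']',
        html,
        re.IGNORECASE,
    )
    from_meta = charset_from_content_type(meta.group(1)) if meta else None
    return from_meta or charset_from_content_type(http_content_type)
```

A regular expression is used only to keep the sketch short; a real page would normally be inspected through its parsed tree.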