Skip to content

Commit

Permalink
remove mystery code
Browse files Browse the repository at this point in the history
  • Loading branch information
fgregg committed Jan 28, 2015
1 parent 8e31d9e commit cd3ee7d
Showing 1 changed file with 0 additions and 7 deletions.
7 changes: 0 additions & 7 deletions dedupe/training.py
Original file line number Diff line number Diff line change
Expand Up @@ -353,13 +353,6 @@ def __init__(self, predicate_set, pairs) :
def stopWords(data) :
tf_index = index.CanopyIndex([])

for i, (_, doc) in enumerate(data, 1) :
try :
tf_index.index_doc(i, doc)
except :
print doc
raise

doc_freq = [(len(tf_index.index._wordinfo[wid]), word)
for word, wid in tf_index.lexicon.items()]

Expand Down

0 comments on commit cd3ee7d

Please sign in to comment.