Skip to content

Commit

Permalink
setting minimum sentences to 5 -.-
Browse files Browse the repository at this point in the history
  • Loading branch information
Robert Meyer committed Feb 20, 2018
1 parent 6e42f82 commit e49353b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion trufflepig/preprocessing.py
Original file line number Diff line number Diff line change
Expand Up @@ -42,7 +42,7 @@ def apply_parallel(function, iterable, ncores, chunksize=1000):

def preprocess(post_df, ncores=4, chunksize=500,
detect_seed=42, detect_max_length=2000,
grammar_max_sentences=10,
grammar_max_sentences=5,
min_en_prob=0.9,
min_max_body_length=(500, 25000),
min_max_letter_ratio=(0.5, 0.85),
Expand Down

0 comments on commit e49353b

Please sign in to comment.