Skip to content

Commit

Permalink
clarify docs, align prepare_training, closes #955
Browse files Browse the repository at this point in the history
  • Loading branch information
fgregg committed Feb 4, 2022
1 parent 5c9c550 commit 9ba44fa
Showing 1 changed file with 5 additions and 5 deletions.
10 changes: 5 additions & 5 deletions dedupe/api.py
Original file line number Diff line number Diff line change
Expand Up @@ -1277,7 +1277,7 @@ def prepare_training(self,
with the keys being field names
training_file: file object containing training data
sample_size: Size of the sample to draw
blocked_proportion: The proportion of record pairs to be sampled from similar records, as opposed to randomly selected pairs. Defaults to 0.9.
blocked_proportion: The proportion of record pairs to be sampled from similar records, as opposed to randomly selected pairs.
.. code:: python
Expand Down Expand Up @@ -1349,8 +1349,8 @@ def prepare_training(self,
data_1: Data,
data_2: Data,
training_file: Optional[TextIO] = None,
sample_size: int = 15000,
blocked_proportion: float = 0.5) -> None:
sample_size: int = 1500,
blocked_proportion: float = 0.9) -> None:
'''
Initialize the active learner with your data and, optionally,
existing training data.
Expand All @@ -1363,12 +1363,12 @@ def prepare_training(self,
form as data_1
training_file: file object containing training data
sample_size: The size of the sample to draw. Defaults to 150,000
sample_size: The size of the sample to draw.
blocked_proportion: The proportion of record pairs to
be sampled from similar records,
as opposed to randomly selected
pairs. Defaults to 0.5.
pairs.
.. code:: python
Expand Down

0 comments on commit 9ba44fa

Please sign in to comment.