Merge pull request #1020 from HazyResearch/lukehsiao-patch-1
Fix bug in Ngram splitting
ajratner committed Aug 22, 2018
2 parents 687cb62 + 93f75fa commit d0bbf36
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions snorkel/candidates.py
@@ -170,11 +170,11 @@ def apply(self, context):
 m = re.search(self.split_rgx, context.text[start-offsets[0]:end-offsets[0]+1])
 if m is not None and l < self.n_max + 1:
     ts1 = TemporarySpan(char_start=start, char_end=start + m.start(1) - 1, sentence=context)
-    if ts1 not in seen:
+    if ts1 not in seen and ts1.get_span():
         seen.add(ts1)
-        yield ts
+        yield ts1
     ts2 = TemporarySpan(char_start=start + m.end(1), char_end=end, sentence=context)
-    if ts2 not in seen:
+    if ts2 not in seen and ts1.get_span():
         seen.add(ts2)
         yield ts2
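
The effect of the added `get_span()` guard can be illustrated outside of snorkel. The sketch below is a minimal stand-in, not the library's code: `Span`, `split_halves`, and the pattern `(-|/)` are hypothetical names and values chosen for illustration, and the span indexes a plain string rather than a sentence context. It assumes inclusive character offsets, as the `char_end=start + m.start(1) - 1` expression in the diff suggests. Unlike the second added line in the diff as shown, the sketch checks each half against its own text.

```python
import re
from collections import namedtuple


# Hypothetical stand-in for TemporarySpan: inclusive character offsets
# into a plain string (the real class takes a sentence context).
class Span(namedtuple("Span", ["char_start", "char_end", "text"])):
    def get_span(self):
        # Inclusive end offset, so an end before the start gives "".
        return self.text[self.char_start:self.char_end + 1]


def split_halves(text, start, end, split_rgx, seen):
    """Yield the unseen, non-empty halves of text[start:end+1]
    on either side of the first match of group 1 in split_rgx."""
    m = re.search(split_rgx, text[start:end + 1])
    if m is None:
        return
    left = Span(start, start + m.start(1) - 1, text)
    # Without the get_span() check, a leading split token would
    # produce an empty left half.
    if left not in seen and left.get_span():
        seen.add(left)
        yield left
    right = Span(start + m.end(1), end, text)
    # Assumption: the right half is checked against its own text.
    if right not in seen and right.get_span():
        seen.add(right)
        yield right


rgx = r"(-|/)"  # assumed split pattern, for illustration only
print([s.get_span() for s in split_halves("T-cell", 0, 5, rgx, set())])
# prints ['T', 'cell']
print([s.get_span() for s in split_halves("-cell", 0, 4, rgx, set())])
# prints ['cell']
```

With a leading split token, the left half has `char_end = char_start - 1`, so its text is empty and the guard drops it instead of yielding an empty span.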
