[question] Partial matching of strings #57

andrei-volkau · 2021-05-13T09:06:41Z

Goal: The goal is to group the following strings into the same group.

Should you raise an adverse event in a specific patient?
Although what you say will be treated in confidence, should you raise an adverse event in a specific patient?

The following code creates separate groups for those strings.

string_grouper = StringGrouper(question_table["question"])
string_grouper = string_grouper.fit()
question_table["labels"] = string_grouper.get_groups()

Question: is it possible to adjust string matching to reach the goal?

Thank you in advance for any hints!

The text was updated successfully, but these errors were encountered:

ParticularMiner · 2021-05-13T09:26:42Z

Hi @andrei-volkau

Simply lower the similarity-threshold (the default is 0.8). For example, you could try the following:

string_grouper = StringGrouper(question_table["question"], min_similarity=0.5)
string_grouper = string_grouper.fit()
question_table["labels"] = string_grouper.get_groups()

Continue lowering if it doesn't work.

Goal: The goal is to group the following strings into the same group.

Should you raise an adverse event in a specific patient?

Although what you say will be treated in confidence, should you raise an adverse event in a specific patient?

The following code creates separate groups for those strings.
string_grouper = StringGrouper(question_table["question"])

string_grouper = string_grouper.fit()

question_table["labels"] = string_grouper.get_groups()
Question: is it possible to adjust string matching to reach the goal?

Thank you in advance for any hints!

ParticularMiner · 2021-05-13T09:38:28Z

Notify: @andrei-volkau

There are a few more options described in the README (follow the link).

Hi @andrei-volkau

Simply lower the similarity-threshold (the default is 0.8). For example, you could try the following:
string_grouper = StringGrouper(question_table["question"], min_similarity=0.5)
string_grouper = string_grouper.fit()
question_table["labels"] = string_grouper.get_groups()
Continue lowering if it doesn't work.
Goal: The goal is to group the following strings into the same group.

Should you raise an adverse event in a specific patient?

Although what you say will be treated in confidence, should you raise an adverse event in a specific patient?

The following code creates separate groups for those strings.
string_grouper = StringGrouper(question_table["question"])

string_grouper = string_grouper.fit()

question_table["labels"] = string_grouper.get_groups()
Question: is it possible to adjust string matching to reach the goal?

Thank you in advance for any hints!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[question] Partial matching of strings #57

[question] Partial matching of strings #57

andrei-volkau commented May 13, 2021

ParticularMiner commented May 13, 2021

ParticularMiner commented May 13, 2021

[question] Partial matching of strings #57

[question] Partial matching of strings #57

Comments

andrei-volkau commented May 13, 2021

ParticularMiner commented May 13, 2021

ParticularMiner commented May 13, 2021