Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

StringSimilarityProcessor #60

Open
davidfrigola opened this issue Apr 8, 2014 · 0 comments
Open

StringSimilarityProcessor #60

davidfrigola opened this issue Apr 8, 2014 · 0 comments

Comments

@davidfrigola
Copy link
Owner

Feature to accomplish:

As a {User} I want to {get the list of similar downloaded items} so I can { remove duplicates}

Calculating the distance between element values (name) can achieve this, i.e. underscore differences, videos 1080/720, different video extension with same name, etc

See http://en.wikipedia.org/wiki/Levenshtein_distance and others (take a look to the see also section)

This processor should be a pass-through (no modification, or just the metadata) with a report output ( using a storage processor configurable?)

@davidfrigola davidfrigola added this to the Task pool milestone Apr 8, 2014
@davidfrigola davidfrigola self-assigned this Apr 8, 2014
@davidfrigola davidfrigola modified the milestones: 0.5, Task pool Apr 8, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant