Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Web-based Carrot2 Clustering Workbench #36

Closed
23 tasks done
stanislawosinski opened this issue Nov 16, 2020 · 0 comments
Closed
23 tasks done

Web-based Carrot2 Clustering Workbench #36

stanislawosinski opened this issue Nov 16, 2020 · 0 comments
Assignees
Milestone

Comments

@stanislawosinski
Copy link
Member

stanislawosinski commented Nov 16, 2020

Implementation of the clustering workbench as part of the frontend application delivered with Carrot2 Document Clustering Server.

  • Clustering language choice
  • Pass query hint to the clustering algorithms
  • export of parameters JSON for pasting into code
  • Custom schema results: display URL-typed fields
  • Export of clusters and documents as Excel/OpenOffice/CSV
  • Solr data source
    • Limit the number of values we display for multi-value fields
    • Scroll a long list of Fields to cluster
    • Improve autodetection of natural text fields
    • Show search errors returned by Solr (e.g. query syntax problems)
  • ElasticSearch data source
  • Rewrite the About section
    • Add a note on what happens with the data the user submits to Workbench (search results are sent to our server for clustering, data is not logged or stored on our server). Display the note only for our public demo or show the URL of the DCS where the data is processed.
  • Perform clustering on Ctrl+Enter
  • Filtering of advanced settings
  • Review attribute documentation
  • Improve initial workbench rendering time
  • Fix broken visualization export
  • Fix flickering loading overlay on visualizations in workbench
  • Option to export only non-default parameters from the UI
  • Mention workbench as part of "Getting started" information in the documentation.
  • Option to override the autodetected list of natural text fields (in case we were wrong and the field the user wishes to cluster is not on the list).
  • Code splitting
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant