Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Workflow to add many entities at once #109

Closed
pudo opened this issue Nov 11, 2016 · 1 comment
Closed

Workflow to add many entities at once #109

pudo opened this issue Nov 11, 2016 · 1 comment
Labels
ui Issues related to Aleph’s frontend

Comments

@pudo
Copy link
Contributor

pudo commented Nov 11, 2016

Users have requested that it should be easier to add many new entities to a collection at once. Our users often have existing search lists, which come in two formats:

  • simple, line-delimited name lists used during the Panama Papers for bulk search (without further details, such as jurisdictions, birth dates etc.)
  • multi-sheet Excel files created as "personal databases" for an investigation. These will typically have one sheet each for People, Companies, Lawyers, and Links between these.

With this ticket, we'd like to prototype a simple solution that allows users to submit these lists and turn them into Aleph collections. This could take the form of a large textarea, a CSV importer, or a CSV and Excel importer. If structured data (CSV) is to be imported, the users would probably need a mapping screen, on which they can describe the functionality of each column in the source data.

Some random questions on this:

  • Do we want to implement this as a JavaScript-only component? There are good JS CSV parsers, the lists are typically small (never more than thousands of entries) and we have an existing entity creation API?
  • Can we do something to de-duplicate new entities that come in against those that already exist in the collection? Should this be it's own screen?

Relates to #64, which is the inverse -- exporting entity lists as lead lists to be used in other search engines.

cc: @stas - I will provide you with samples of both import formats.

@pudo pudo added import ui Issues related to Aleph’s frontend labels Nov 11, 2016
@stas
Copy link
Contributor

stas commented Nov 15, 2016

Do we want to implement this as a JavaScript-only component? There are good JS CSV parsers, the lists are typically small (never more than thousands of entries) and we have an existing entity creation API?

Agree on this, this should also work with minor changes on the back-end side. Also, some of the libraries have decent copy/paste support for excel and similar formats. A good example of a library could be https://github.com/handsontable/ngHandsontable
Any major concerns here?

Can we do something to de-duplicate new entities that come in against those that already exist in the collection? Should this be it's own screen?

Well, we already have a per-entity Possible Duplicates screen. I suggest we either indicate the duplication score or come-up with a separate screen with bulk management support (how often and how big of a problem is this normally?). I'd lean more towards option B, except if this is not a major feature Aleph needs to provide (need your opinion here).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ui Issues related to Aleph’s frontend
Projects
None yet
Development

No branches or pull requests

2 participants