spike: persistence/indexing into elasticsearch (and maybe sqlite) #4

adamdecaf · 2019-01-04T01:42:09Z

A standalone instance of this app needs to download the OFAC files on its own and should refresh that copy after N hours. (N is configurable and likely defaults to 24h) This allows someone to start our app without the need for external dependencies and keeps the information up to date.

We can keep the files in temp storage close to the app. When the app restarts it can check the modification time of the files and if the files are too old download them again. This would help to prevent repeated downloads if the app is in a crash loop.

After reading the flat files we might want to persist the structured data in a database to allow for better queries, full text, etc. I think a SQL solution would be the best and we can start with sqlite since our other apps use that.

The spec for CSV files isn't too bad and can probably be directly mapped to a few tables. ent_num is used to join the tables together.

FORMAT SDN CSV

Main table, text file name SDN.CSV

Column
sequence Column name  Type     Size  Description
-------- ------------ -------  ----  ---------------------
1        ent_num     number          unique record
                                     identifier/unique
                                     listing identifier
2        SDN_Name     text     350   name of SDN
3        SDN_Type     text     12    type of SDN
4        Program      text     50    sanctions program name
5        Title        text     200   title of an individual
6        Call_Sign    text     8     vessel call sign
7        Vess_type    text     25    vessel type
8        Tonnage      text     14    vessel tonnage
9        GRT          text     8     gross registered tonnage
10       Vess_flag    text     40    vessel flag
11       Vess_owner   text     150   vessel owner
12       Remarks      text     1000  remarks on SDN*

Address table, text file name ADD.CSV

Column
sequence Column name  Type     Size  Description
-------- ------------ -------  ----  ---------------------
1        Ent_num      number         link to unique listing
2        Add_num      number         unique record identifier
3        Address      text     750   street address of SDN
4        City/				text     116   city, state/province, zip/postal code
         State/Province/
         Postal Code
5        Country      text     250   country of address
6        Add_remarks  text     200   remarks on address

Alternate identity table, text file name ALT.CSV

Column
sequence Column name  Type     Size  Description
-------- ------------ -------  ----  ---------------------
1        ent_num      number         link to unique listing
2        alt_num      number         unique record identifier
3        alt_type     text     8     type of alternate identity
                                     (aka, fka, nka)
4        alt_name     text     350   alternate identity name
5        alt_remarks  text     200   remarks on alternate identity

The text was updated successfully, but these errors were encountered:

adamdecaf · 2019-01-04T01:47:07Z

A standalone instance of this app needs to download the OFAC files on its own

One other reason for this is to have the minimum steps needed for local dev. Having anyone be able to go run our app (or docker run) is really powerful.

For local dev it'd be nice to just go run a 4th app and have all our services. http://docs.moov.io/en/latest/tutorials/local-dev/ (I have longer term plans to better automate 4+ local Go apps.)

adamdecaf · 2019-01-18T03:57:38Z

Changed the title. Let's store the OFAC records in elasticsearch (ES) to get something going. If we need to store the watches let's use sqlite - I'm not thinking of ES as durable storage right now.

adamdecaf · 2019-01-18T17:01:12Z

I can take on deploying ES if no one else wants to, but I'd like to get people familiar with Kubernetes.

Issue: moov-io#4

adamdecaf · 2019-01-25T02:24:15Z

We won't need ES storage for this, so that simplifies the app.

adamdecaf self-assigned this Jan 18, 2019

adamdecaf changed the title ~~initial storage / persistence / database~~ spike: persistence into elasticsearch (and maybe sqlite) Jan 18, 2019

adamdecaf changed the title ~~spike: persistence into elasticsearch (and maybe sqlite)~~ spike: persistence/indexing into elasticsearch (and maybe sqlite) Jan 18, 2019

adamdecaf mentioned this issue Jan 18, 2019

connect search endpoints to datastore #21

Closed

adamdecaf removed their assignment Jan 18, 2019

adamdecaf self-assigned this Jan 23, 2019

adamdecaf added a commit to adamdecaf/watchman that referenced this issue Jan 23, 2019

cmd/server: drop Elasticsearch and have (basic) inmem search

32f430e

Issue: moov-io#4

adamdecaf added a commit to adamdecaf/watchman that referenced this issue Jan 23, 2019

cmd/server: drop Elasticsearch and have (basic) inmem search

bb33da6

Issue: moov-io#4

adamdecaf mentioned this issue Jan 23, 2019

cmd/server: drop Elasticsearch and add (basic) inmem search #23

Merged

adamdecaf added a commit to adamdecaf/watchman that referenced this issue Jan 23, 2019

cmd/server: drop Elasticsearch and have (basic) inmem search

1c7f367

Issue: moov-io#4

adamdecaf closed this as completed Jan 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spike: persistence/indexing into elasticsearch (and maybe sqlite) #4

spike: persistence/indexing into elasticsearch (and maybe sqlite) #4

adamdecaf commented Jan 4, 2019 •

edited

Loading

adamdecaf commented Jan 4, 2019

adamdecaf commented Jan 18, 2019 •

edited

Loading

adamdecaf commented Jan 18, 2019

adamdecaf commented Jan 25, 2019

spike: persistence/indexing into elasticsearch (and maybe sqlite) #4

spike: persistence/indexing into elasticsearch (and maybe sqlite) #4

Comments

adamdecaf commented Jan 4, 2019 • edited Loading

adamdecaf commented Jan 4, 2019

adamdecaf commented Jan 18, 2019 • edited Loading

adamdecaf commented Jan 18, 2019

adamdecaf commented Jan 25, 2019

adamdecaf commented Jan 4, 2019 •

edited

Loading

adamdecaf commented Jan 18, 2019 •

edited

Loading