This repository has been archived by the owner on Jan 4, 2022. It is now read-only.

Ideas for improving AATAMS' performance #72

Closed
jkburges opened this issue Feb 18, 2014 · 5 comments

Comments

@jkburges
Contributor

Ref: #71

We should be able to speed things up and simplify the code a lot by implementing some of the following recommendations. My initial (but brief) prototyping has demonstrated that the following suggestions are feasible.

  1. Load detections into the DB directly using PostgreSQL's COPY command.
  2. Validate and join to deployments/recoveries on the fly:
    • select from the smaller table first (e.g. receiver), then join to detections - this should be pretty fast, provided appropriate indexes are in place (see the examples in "A couple of SQL examples which demonstrate the performance of querying d..." #71);
    • removes the chance of data inconsistency, e.g. when deployment info is added after detections, there is no need to rescanForDeployment;
    • no need for the materialised view (and the associated refresh, which is putting a lot of load on the DB).
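A minimal sketch of what (2) could look like. The table and column names below are assumed for illustration only (apart from raw_det, defined in a later comment) and may not match the actual AATAMS schema:

```sql
-- Sketch only: receiver_deployment and its columns are assumptions,
-- not taken from the AATAMS schema.
-- Drive the query from the small deployments table, then join each
-- deployment to its slice of the big detections table; an index
-- leading on (receiver_name, "timestamp") keeps each range scan cheap.
SELECT det.*
FROM receiver_deployment dep
JOIN raw_det det
  ON det.receiver_name = dep.receiver_name
 AND det."timestamp" >= dep.deployment_timestamp
 AND det."timestamp" <  dep.recovery_timestamp;
```

Because the deployment window bounds the timestamp range per receiver, detections that arrive before their deployment info simply fall out of (or into) the join result automatically - no rescan step needed.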
@jkburges
Contributor Author

jkburges commented Aug 8, 2014

An outage which just happened was possibly (likely) caused by the materialised view refresh.

@jkburges
Contributor Author

I'm not sure if I've made it clear enough in the past w.r.t. the seriousness of this issue.

There are two main problems occurring as a result of the issues listed above:

  1. data consistency - by having "join" tables and materialised views, the chance of having inconsistent data goes up immensely; this is related to the work @xhoenner is currently doing on reconciliation;
  2. performance and reliability - the materialised view refresh puts an enormous load on the system every night, and is in fact making the app unavailable while it is happening. With continued growth of the DB, this outage period is only going to increase, possibly to the point where it starts overlapping with business hours.

@jkburges
Contributor Author

jkburges commented Dec 1, 2014

I just prototyped (1) from above (loading detections using COPY).

I was able to load ~3.5M detections in around 11s (~300k records/s) - this is around 1500x as fast as the web app currently loads detections (including all the validation and what not).

FWIW, we could re-load every uploaded detection (~90M) in around 5 minutes this way :-)

@jkburges
Contributor Author

jkburges commented Dec 1, 2014

To be fairer, I added a compound index and a primary key constraint. It now takes ~40s to load 3.5M records - still fast enough.

Table definition:

-- Table: raw_det

-- DROP TABLE raw_det;

CREATE TABLE raw_det
(
  "timestamp" timestamp with time zone,
  receiver_name text,
  transmitter_id text,
  id bigint NOT NULL,
  CONSTRAINT raw_det_pkey PRIMARY KEY (id)
)
WITH (
  OIDS=FALSE
);
ALTER TABLE raw_det
  OWNER TO aatams;

-- Index: pagination_index

-- DROP INDEX pagination_index;

CREATE INDEX pagination_index
  ON raw_det
  USING btree
  ("timestamp", receiver_name COLLATE pg_catalog."default", transmitter_id COLLATE pg_catalog."default");

COPY command (including some awk magic to add an id column; we would need to do a similar thing to add a receiver_download_file_id column):

COPY raw_det
FROM PROGRAM 'awk -F, ''{$(NF+1)=++i;}1'' OFS=, /tmp/big_dets.csv'
CSV HEADER DELIMITER ',';
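For reference, the awk stage of that COPY can be exercised on its own. The sample data below is made up; `-v OFS=,` is used in place of the trailing `OFS=,` operand purely so the one-liner reads from stdin:

```shell
# Same awk idea as in the COPY ... FROM PROGRAM call: append an
# incrementing counter as a new last column on every line.
# Note the header line gets a value too (1), which COPY's HEADER
# option then skips, so ids on data rows simply start at 2.
printf 'timestamp,receiver_name,transmitter_id\n2014-12-01T00:00:00Z,VR2W-101,A69-1303-12345\n' \
  | awk -F, -v OFS=, '{$(NF+1)=++i;}1'
```

This prints the two input lines with `,1` and `,2` appended respectively.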

@jkburges
Contributor Author

Note that in 1), the file would have to exist on the same server as the DB. Possibly, rather than storing the file on disk, we could store it in the DB (as a BLOB or some such), and COPY from that.

This would also simplify deployment and backups somewhat (because only the DB has state, not DB + filesystem).
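One possible sketch of that approach, assuming the upload is kept as a PostgreSQL large object referenced from a hypothetical receiver_download_file table (lo_export writes to the DB server's filesystem, which fits COPY's same-server requirement):

```sql
-- Sketch only: receiver_download_file and its columns are assumptions,
-- not part of the current schema.
-- 1. Export the stored upload (a large object) to a server-side temp file.
SELECT lo_export(csv_oid, '/tmp/det_upload.csv')
FROM receiver_download_file
WHERE id = 1;

-- 2. Bulk-load it with COPY, reading from the DB server's filesystem.
COPY raw_det FROM '/tmp/det_upload.csv' CSV HEADER DELIMITER ',';
```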
