
potential db compression settings? #665

Closed
neilh10 opened this issue Jul 11, 2023 · 5 comments


neilh10 commented Jul 11, 2023

It's likely that systems in the field stop transmitting readings and then reconnect at a later date to upload stored readings. #485

It would be helpful to have a model of what historical timeframe might be supported.

I'm just wondering what the current internal settings for TimescaleDB compression are, if any.
https://www.timescale.com/features/compression

Examples:
#658 - three systems were interrupted Mar 12, reconnected May 23rd, and uploaded their stored data when restored.
Salmon Creek - had its solar power cable pulled May 15, was restored June 27th, and transmitted its stored data on reconnection: https://monitormywatershed.org/sites/TUCA_Sa01/
Mill Creek - installed June 12 and stopped a couple of days later: https://monitormywatershed.org/sites/TUCA_Mi06/
Very remote Navarro Creek - stopped connecting June 2: https://monitormywatershed.org/sites/TUCA-Na13/

As I understand it, TimescaleDB will compress data older than some historical timeframe, and if a POST later arrives for that period, it will decompress the rows associated with the device and then insert the new record.
It's likely that the decompression will take time, and could exceed the 10-second timeout that the device waits for an HTTP 201.
In that case the device will retry on the next connection, and continue retrying until an HTTP 201 is received.

I'm then guessing that, on some schedule, TimescaleDB will recompress the data.
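A minimal sketch of that retry behavior (hypothetical function and parameter names; only the 10-second wait for an HTTP 201 and the retry-on-next-connection logic come from the description above):

```python
from collections import deque

def drain_queue(pending, post_fn, timeout_s=10):
    """Attempt to upload queued readings; stop at the first failure.

    pending: deque of stored readings (oldest first)
    post_fn(reading, timeout_s) -> HTTP status code (or a timeout sentinel)
    Returns the number of readings acknowledged with HTTP 201.
    """
    sent = 0
    while pending:
        status = post_fn(pending[0], timeout_s)
        if status == 201:
            pending.popleft()   # acknowledged; safe to drop from the queue
            sent += 1
        else:
            break               # keep the reading; retry on the next connection
    return sent
```

Under this model, a slow decompression that pushes the server past the timeout simply leaves the reading queued, so the same record gets re-POSTed on every subsequent connection until a 201 comes back.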

@neilh10 neilh10 changed the title timedb compression settings? timescaledb compression settings? Jul 11, 2023

neilh10 commented Jul 12, 2023

I guess I'm not clear on whether MMW / ODM2 is using TimescaleDB or InfluxDB. InfluxDB is referenced in the architecture diagram outlined 4 years ago: https://github.com/ODM2/ODM2DataSharingPortal/blob/main/doc/ArchitectureDiagram/Data%20Sharing%20Portal%20Architecture%20with%20Logos%20-%20Copy.png

@aufdenkampe
Member

@neilh10, we migrated from InfluxDB to the PostgreSQL Timescale extension (i.e. TimescaleDB) with our v0.12 release in December 2021. We've unfortunately gotten behind on updating all of our documentation to reflect cumulative changes since then (outside of our release notes).

To answer the original question, we're compressing old data into 90-day chunks that are auto-created every 90 days, with a 1-month buffer, so there is always at least one month of uncompressed data.
See #502 (comment)
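As a rough sketch of what that policy implies for any given timestamp (the chunk origin and exact boundary alignment are assumptions for illustration, not the production configuration):

```python
from datetime import datetime, timedelta

CHUNK = timedelta(days=90)   # chunk interval described above
BUFFER = timedelta(days=30)  # ~1-month uncompressed buffer

def is_likely_compressed(ts, now, chunk_origin):
    """Estimate whether a reading at `ts` lands in a compressed chunk.

    A chunk is treated as compressed once it ended more than BUFFER
    before `now`. `chunk_origin` is an assumed chunk boundary.
    """
    chunks_elapsed = (ts - chunk_origin) // CHUNK
    chunk_end = chunk_origin + (chunks_elapsed + 1) * CHUNK
    return chunk_end < now - BUFFER
```

So a reading backdated a few months lands in a compressed chunk and triggers the expensive decompress/insert path, while anything within roughly the last month goes into uncompressed data.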

Regardless, the server load required to insert historical data should improve substantially when we implement #674 in our planned v0.18 milestone.

@aufdenkampe
Member

Closing to track work on this under:


neilh10 commented Feb 5, 2024

@aufdenkampe, thanks for the update. I'm just wondering what the impact is when a POST is made with a timestamp that falls in a compressed range.
Of course this issue will probably still exist with #688, and I wonder what the effect of it being queued will be.

@aufdenkampe
Member

The effect of a POST into a compressed time chunk is that the server needs substantially more resources to insert the data, because the entire chunk needs to be decompressed, appended to, and then compressed again.

This is only for updates; reading from a compressed chunk is fast.
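A toy cost model of that asymmetry (illustrative units only, not actual TimescaleDB internals):

```python
def insert_cost(rows_in_chunk, compressed):
    """Relative units of work for inserting a single row into a chunk.

    Toy model: writing into a compressed chunk forces the whole chunk to
    be decompressed and later recompressed, so the work scales with the
    chunk size; an uncompressed insert touches only the new row.
    """
    if compressed:
        return rows_in_chunk + 1 + rows_in_chunk  # decompress + append + recompress
    return 1                                      # plain append
```

The point is the scaling: one backdated row into a 90-day chunk costs on the order of the whole chunk's size, not on the order of one row.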

#688 does alleviate the problem because it spreads out the workload. We don't mind our server doing the work; the issue is that too many posts arriving at once overload the server in the minute they all arrive. We have plenty of time when the server CPU is idle, and the point of SQS is to spread the work into that idle time.
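A toy simulation of that smoothing idea (hypothetical numbers; the in-memory queue here stands in for SQS):

```python
from collections import deque

def simulate(burst_sizes, capacity_per_tick):
    """Feed bursty arrivals through a queue drained at a steady rate.

    burst_sizes[i] = posts arriving at tick i.
    Returns the maximum backlog observed; the server is never asked to do
    more than capacity_per_tick units of work in any one tick.
    """
    queue = deque()
    max_backlog, tick = 0, 0
    while tick < len(burst_sizes) or queue:
        arrivals = burst_sizes[tick] if tick < len(burst_sizes) else 0
        queue.extend([tick] * arrivals)          # burst lands in the queue
        for _ in range(min(capacity_per_tick, len(queue))):
            queue.popleft()                      # worker handles one post
        max_backlog = max(max_backlog, len(queue))
        tick += 1
    return max_backlog
```

A burst of 10 posts against a capacity of 3 per tick just builds a backlog that drains over the idle ticks that follow, instead of overloading the server in the minute it arrives.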
