-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stress test schema #1
Comments
Quick question. We will very soon need to be able to filter (and exclude from visualization, or show in different color) points by month and year. This straightforward in this scheme? |
Right now, for our immediate goal, we're looking to quickly support a specific use case: Given a scientific name, quickly map all points. So as it is above, we're not factoring in date information. But I think we can do that another way. Andrew at Vizzuality created a pretty amazing visualization of the VertNet data in CartoDB. Basically it's an animation that shows specimens collected over time. Click the link below and watch. The dates are in the lower right corner: |
Aaron, thanks. Do say which other way. I don’t think animations are going to cut it. I mean interactive ability to filter for key fields. We need that essentially as soon as we put the new points on the ap. |
Totally. So in addition to month and year, what are the other key fields to filter on. |
Acccuracy, Institution. Rob? Is this all useful, or am I complicating things too much? |
Hi guys --- I can see a bunch of use cases we need to support re: I also want us to be able to flag records --- that is to write into Best, Rob On Wed, Oct 31, 2012 at 4:05 PM, Walter Jetz notifications@github.comwrote:
|
Yup, yup, good feedback guys. Let me fold this in and marinate. More thoughts soon. |
Basically for each unique name we'll store a MULTIPOINT of all unique points. We'll also store an array of OccurrenceID strings, one per point. For points with multiple IDs, the value will be a list of CSV IDs. The max points to test is 2 million.
Here's how to create the table on CartoDB:
SELECT AddGeometryColumn('points', 'the_geom_multipoint', 4326, 'MULTIPOINT', 2)
ALTER TABLE points ADD COLUMN occids text[]
Then we need to load in 2 million points like this:
And finally test the performance of this query:
If the performance isn't great, Vizz thinks we might consider unpacking points to a new table once they are uploaded.
The text was updated successfully, but these errors were encountered: