Distributed cluster gather/sort/filter/match operations #44

basaks · 2017-11-21T22:22:27Z

By # 41c83f7, we can process a small number of events, say upto, 5k events. This works on single process and in memory sort/filter/joins using pandas dataframe.

However, we need to process upwards of 500k+ events. Just from ISC and Engdahl we have 300k+ events.

The text was updated successfully, but these errors were encountered:

basaks · 2017-11-21T22:35:43Z

@alexgorb We can test 3d inversion functionality using a small number of events ( a few thousand events). Should we finish this parallelisation now before moving onto other tasks?

basaks · 2017-11-21T22:40:31Z

corresponding jira: https://gajira.atlassian.net/browse/PST-227

basaks · 2017-11-28T06:19:34Z

So far, gather of arrivals is optimally distributed.
Distributed median being difficult to compute, we still have a single process sort and median computation. Improved performance by using pandas throughout and avoiding for loop. For details see Improve median filter performance during 3d travel time inversion input generation #49.
Matching is still single process and very efficient for the size of our data by the matching stage.

Closing.

basaks mentioned this issue Nov 21, 2017

Generate 3D travel time inversion input #35

Closed

basaks self-assigned this Nov 21, 2017

basaks mentioned this issue Nov 25, 2017

Apply travel time ellipticity correction to inversion inputs #48

Closed

basaks closed this as completed Nov 28, 2017

basaks mentioned this issue Dec 20, 2017

Improve median filter performance during 3d travel time inversion input generation #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Distributed cluster gather/sort/filter/match operations #44

Distributed cluster gather/sort/filter/match operations #44

basaks commented Nov 21, 2017 •

edited

Loading

basaks commented Nov 21, 2017

basaks commented Nov 21, 2017

basaks commented Nov 28, 2017 •

edited

Loading

Distributed cluster gather/sort/filter/match operations #44

Distributed cluster gather/sort/filter/match operations #44

Comments

basaks commented Nov 21, 2017 • edited Loading

basaks commented Nov 21, 2017

basaks commented Nov 21, 2017

basaks commented Nov 28, 2017 • edited Loading

basaks commented Nov 21, 2017 •

edited

Loading

basaks commented Nov 28, 2017 •

edited

Loading