Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

global: ES refactoring in aggregations #6

Merged
merged 1 commit into from
Jul 19, 2017

Conversation

dinosk
Copy link
Member

@dinosk dinosk commented Jul 18, 2017

  • FIXES the querying in the aggregation tasks to use elasticsearch
    date range, and aggregate over yearly, monthly or daily intervals

@dinosk dinosk self-assigned this Jul 18, 2017
@dinosk dinosk requested a review from nharraud July 18, 2017 12:31
_source=aggregation_data)
yield tobeindexed
self.agg_query = Search(using=self.client,
index='events-stats-file-download').\
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hard coded event name in the index.

self.agg_query.aggs.bucket('per_{}'.format(self.aggregation_interval),
'date_histogram',
field='timestamp',
interval=self.aggregation_interval)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How does this scale with many files? The result would be huge if you ask a year with aggregation_interval='day'

* FIXES the querying in the aggregation tasks to use elasticsearch
  date range, and aggregate over yearly, monthly or daily intervals

Signed-off-by: Dinos Kousidis <konstantinos.kousidis@cern.ch>
@nharraud nharraud merged commit 58da7bc into inveniosoftware:master Jul 19, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants