Fixes bug 901977 - Store raw crash data into elasticsearch. by adngdb · Pull Request #1647 · mozilla-services/socorro

adngdb · 2013-11-04T14:17:34Z

twobraids · 2013-11-05T16:39:51Z

instead of making the raw crash a branch of the processed crash, could you consider making a two branch tree instead?

raw_and_processed = { 'raw_crash': raw_crash, 'processed_crash': processed_crash }

or would that be too disruptive to all the data that's already in ES?

One of my current initiatives is to unify the fragmentation of the processed_crash format. The current state is that PG, ES, and HB/FS all store the processed crash in a little bit different form. PG/HB/FS are all lossy - the new redaction methods and the saving the json form of the processed crash in PG are all about making them all store exactly the same data.

If you add the 'raw_crash' key to the processed crash, you're making the ES processed crash different from the others. When we eventually document the processed_crash schema, we'll have to make an exception for ES and point out the difference.

Doing that would indeed imply changes to both advanced search and supersearch as well as a full reindexing of our database. We might need to do the reindexing at some point, especially since we will want to have that raw_crash field everywhere. Maybe it is worth putting the effort now.

I'm a bit concerned that this change might break search for a little though. I'm not quite sure what the strategy for data would be here. I expect that we will need to reprocess the last 6 months of crashes (but putting them in elasticsearch only, no need to reindex in postgres and hbase). Reprocessing will be needed because we don't have unredacted processed crashes in HBase yet, and we want PII data to be in elasticsearch.

I would be happy to discuss with you a strategy for reprocessing for elasticsearch only.

…dleware.

adngdb · 2013-11-14T11:04:18Z

Closing for the moment, will reopen when it is ready for review.

Conflicts: docs/middleware.rst

…rian

…remove-deprecated-middleware Fixes bug 891921 - Removed all files related to the old, obsolete middleware.

…sig-hist-doc Fixes bug 938410 - Fixed example in signature_history documentation.

…block Bug 939141 - Annotate the largest free VM block in the processed crash. r=ted

…riencing failure.

Fixes Bug 931147 - tagged logging of transaction failures with name of the resource experiencing failure

…6-non-plotted-graphs-on-topcrasher Bug789526 non plotted graphs on topcrasher

…ches. Updating backfill app.

…eds manual testing.

twobraids reviewed Nov 5, 2013
View reviewed changes

Fixes bug 891921 - Removed all files related to the old, obsolete mid…

f8c1c2f

…dleware.

adngdb closed this Nov 14, 2013

adngdb and others added 16 commits November 14, 2013 15:38

Fixes bug 938410 - Fixed example in signature_history documentation.

1fd64ed

bug 910006 - print nested list options correctly, r=adrian

7885460

(no bug, doc only) Fixed anchors in the middleware documentation.

97663fe

Conflicts: docs/middleware.rst

fixes bug 931242 - more fine grained permissions on data access, r=ad…

e13bd1d

…rian

Merge pull request mozilla-services#1660 from AdrianGaudebert/891921-…

f75ed1e

…remove-deprecated-middleware Fixes bug 891921 - Removed all files related to the old, obsolete middleware.

Merge pull request mozilla-services#1673 from AdrianGaudebert/938410-…

611f1f8

…sig-hist-doc Fixes bug 938410 - Fixed example in signature_history documentation.

Bug 939141 - Annotate the largest free VM block in the processed crash.

7cb6ca1

Merge pull request mozilla-services#1677 from lonnen/largest_free_vm_…

e0d2124

…block Bug 939141 - Annotate the largest free VM block in the processed crash. r=ted

tagged logging of transaction failures with name of the resource expe…

bb0a78b

…riencing failure.

Merge pull request mozilla-services#1637 from twobraids/trans-log

e11e0a9

Fixes Bug 931147 - tagged logging of transaction failures with name of the resource experiencing failure

Merge pull request mozilla-services#1674 from ossreleasefeed/bug78952…

87ff874

…6-non-plotted-graphs-on-topcrasher Bug789526 non plotted graphs on topcrasher

Fixes bug 901977 - Store raw crash data into elasticsearch.

62c33c5

wip - changing the crash document to have both processed and raw bran…

c678cdd

…ches. Updating backfill app.

backfilling works, but no HBase nor raw crash for the moment.

e376244

Wip, pulling the raw crash from HBase, using the new document schema.

b23253c

All elasticsearch based services are fixed, with their unit tests. Ne…

9d6e6e6

…eds manual testing.

adngdb reopened this Nov 19, 2013

adngdb closed this Nov 19, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes bug 901977 - Store raw crash data into elasticsearch.#1647

Fixes bug 901977 - Store raw crash data into elasticsearch.#1647
adngdb wants to merge 17 commits into
mozilla-services:masterfrom
adngdb:901977-raw-crash-json-in-elasticsearch

adngdb commented Nov 4, 2013

Uh oh!

twobraids Nov 5, 2013

Uh oh!

adngdb Nov 6, 2013

Uh oh!

adngdb commented Nov 14, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

adngdb commented Nov 4, 2013

Uh oh!

twobraids Nov 5, 2013

Choose a reason for hiding this comment

Uh oh!

adngdb Nov 6, 2013

Choose a reason for hiding this comment

Uh oh!

adngdb commented Nov 14, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants