Add ability to trace mutation form from each data source during query #11130

tgrabiec · 2022-07-26T09:57:52Z

Knowing the form of the mutation returned by each data source touched by a query would be very helpful when debugging issues related to incorrect query results.

The user wouldn't have to share whole sstables with us, so the process would be easier and faster. Sensitive data could be easily removed. We could add an option to do this automatically by removing cell values and obfuscating keys. The user could keep the obfuscation translation map for reference.

It could be reported via CQL tracing and enabled with a new query syntax (like "bypass cache"), e.g.:

cqlsh> tracing on
cqlsh> select * from my_table where ... TRACE MUTATIONS
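
The redaction idea above (drop cell values, obfuscate keys, keep a translation map) could look roughly like the following. This is only a sketch: the dictionary layout of the mutation, the `redact` helper and the salt are all hypothetical and do not reflect the actual JSON format produced by any Scylla tool.

```python
import hashlib
import json


def redact(mutation, key_map, salt="demo-salt"):
    """Return a redacted copy of a (hypothetical) mutation dictionary.

    Keys are replaced with salted hash tokens, cell values are dropped,
    and write timestamps are kept. The token -> original key mapping is
    recorded in key_map so the user can keep it for reference.
    """
    def obfuscate(key):
        token = hashlib.sha256((salt + key).encode()).hexdigest()[:12]
        key_map[token] = key
        return token

    return {
        "partition_key": obfuscate(mutation["partition_key"]),
        "rows": [
            {
                "clustering_key": obfuscate(row["clustering_key"]),
                # Keep only metadata; the cell value itself is removed.
                "cells": {
                    name: {"timestamp": cell["timestamp"]}
                    for name, cell in row["cells"].items()
                },
            }
            for row in mutation["rows"]
        ],
    }


# Hypothetical mutation as it might be reported by one data source.
mutation = {
    "partition_key": "user42",
    "rows": [
        {
            "clustering_key": "2022-07-26",
            "cells": {"email": {"timestamp": 1658829472, "value": "a@b.c"}},
        }
    ],
}

key_map = {}
redacted = redact(mutation, key_map)
print(json.dumps(redacted, indent=2))
```

Only `redacted` would be shared; `key_map` stays with the user, who can translate tokens back to real keys if a follow-up question comes up.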

cc @bhalevy


bhalevy · 2022-07-26T11:51:27Z

Printing the mutation as JSON seems like a good idea, so that analyzing it could be automated.

denesb · 2022-07-27T14:04:01Z

We already have a mutation -> json converter in https://github.com/scylladb/scylla/blob/81e20ceaab8f54938aa77842ee5c9d9e62639432/tools/scylla-sstable.cc#L324.

avikivity · 2022-07-27T18:47:26Z

When a mutation reader creates subordinate readers, we could create subordinate trace_state objects that name the subordinate and its parent. This would recreate the reader tree at runtime without much effort.
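
To illustrate the parent/subordinate idea, here is a minimal sketch of trace objects that record their parent so the reader tree can be reconstructed afterwards. The `TraceNode` class and the reader names are hypothetical; this is not Scylla's trace_state API.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class TraceNode:
    """Hypothetical stand-in for a trace state that knows its parent."""
    name: str
    # repr=False avoids infinite recursion through the parent/child cycle.
    parent: Optional["TraceNode"] = field(default=None, repr=False)
    children: List["TraceNode"] = field(default_factory=list)

    def make_child(self, name):
        """Create a subordinate trace node that names this node as its parent."""
        child = TraceNode(name, parent=self)
        self.children.append(child)
        return child

    def path(self):
        """Return the chain of names from the root down to this node."""
        parts = []
        node = self
        while node is not None:
            parts.append(node.name)
            node = node.parent
        return "/".join(reversed(parts))

    def render(self, depth=0):
        """Return the subtree as indented lines, one per reader."""
        lines = ["  " * depth + self.name]
        for child in self.children:
            lines.extend(child.render(depth + 1))
        return lines


# Hypothetical reader tree for a single query.
root = TraceNode("combined_reader")
memtable = root.make_child("memtable_reader")
sstable = root.make_child("sstable_reader")
index = sstable.make_child("index_reader")

print("\n".join(root.render()))
```

Each trace event emitted by a subordinate could then be tagged with `path()`, which is enough to attribute a mutation to the data source that produced it.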

denesb · 2023-05-25T11:09:52Z

I'm thinking a virtual table would be a better fit than tracing. The potential amount of data is huge, and it can be duplicated across all the different data sources, or even multiple times within a single data source. A virtual table naturally lends itself to delivering large amounts of data, and if done right, advanced filtering/redacting should be easily possible.

denesb · 2023-05-30T06:31:32Z

I opened an RFC PR using the virtual table approach: #14083.

mykaul · 2023-06-11T13:12:34Z

@bhalevy - this looks like an important improvement to our ability to debug issues - can we push it forward?

tgrabiec added feature/enhancement area/monitoring tracing n00b labels Jul 26, 2022

slivne added this to the 5.x milestone Aug 11, 2022

denesb mentioned this issue May 30, 2023

[RFC] db/system_keyspace: add data_source virtual table #14083

Closed

denesb self-assigned this May 30, 2023

DoronArazii modified the milestones: 5.x, 5.4 May 30, 2023

denesb mentioned this issue Jun 21, 2023

Introduce SELECT MUTATION FRAGMENTS statement #14347

Merged

scylladb-promoter closed this as completed in 460b28d Jul 19, 2023
