Statistics example queries sometimes too long for ranked attributes #349

arildm · 2024-03-20T09:06:07Z

Background

When clicking an attribute value in the statistics table, a sub-query is generated that finds specifically the items that are represented by that statistics row.

Some attributes are ranked, so values like apple:0.6 and apple:0.3 are grouped into one row. The row heading is shown as apple, and the CQP generated when clicking it has contains 'apple:(0\.6|0\.3)'.

Problem

In the example below, the top rows are merged from a lot of values, and the generated CQP queries contain a lot of probability numbers in the parentheses. They even get too long for the backend to handle.

https://spraakbanken.gu.se/korplabb/#?cqp=%3Csentence%3E%20%5B%5D&corpus=suc3&stats_reduce=transformer-neighbour&show_stats&search_tab=1&result_tab=2&search=cqp

Solution?

So, if the statistics are compiled on highest-ranking attribute, can/should we not use highest ranking also in the sub query?

The text was updated successfully, but these errors were encountered:

arildm added the bug label Mar 20, 2024

arildm mentioned this issue Mar 20, 2024

Nicer display of "transformer neighbor", a ranked set attribute from KB-BERT #340

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Statistics example queries sometimes too long for ranked attributes #349

Statistics example queries sometimes too long for ranked attributes #349

arildm commented Mar 20, 2024

Statistics example queries sometimes too long for ranked attributes #349

Statistics example queries sometimes too long for ranked attributes #349

Comments

arildm commented Mar 20, 2024

Background

Problem

Solution?