Test that the shape of scores matches (n_trials, n_observables). Test annotation alignment. Test edge cases: empty results and a single trial.
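The checks listed above could be sketched roughly as follows. This is only an illustration under stated assumptions: Results, collect, make_trials, and OBSERVABLES are hypothetical stand-ins invented for the example, not the real code under test, and "alignment" is assumed here to mean that the i-th annotation describes the i-th row of scores.

```python
from dataclasses import dataclass, field


@dataclass
class Results:
    """Hypothetical container; the real results object may look different."""
    observables: list
    scores: list = field(default_factory=list)       # one row per trial
    annotations: list = field(default_factory=list)  # one entry per trial

    @property
    def shape(self):
        # (n_trials, n_observables); the column count comes from the
        # observable names, so it stays defined when there are no trials.
        return (len(self.scores), len(self.observables))


def collect(trials, observables):
    """Hypothetical stand-in for the code under test."""
    results = Results(observables=list(observables))
    for trial in trials:
        results.scores.append([float(trial["values"][name]) for name in observables])
        results.annotations.append({"trial_id": trial["id"]})
    return results


OBSERVABLES = ["a", "b", "c"]  # assumed names, for illustration only


def make_trials(n):
    # Synthetic trials whose values depend on the trial index,
    # so a misaligned row would be detectable.
    return [{"id": i, "values": {"a": i, "b": 2 * i, "c": 3 * i}} for i in range(n)]


def test_scores_shape():
    results = collect(make_trials(4), OBSERVABLES)
    assert results.shape == (4, 3)
    assert all(len(row) == 3 for row in results.scores)


def test_annotations_alignment():
    results = collect(make_trials(4), OBSERVABLES)
    assert len(results.annotations) == len(results.scores)
    for i, (annotation, row) in enumerate(zip(results.annotations, results.scores)):
        assert annotation["trial_id"] == i
        assert row == [float(i), float(2 * i), float(3 * i)]


def test_empty_results():
    results = collect([], OBSERVABLES)
    assert results.shape == (0, 3)
    assert results.annotations == []


def test_single_trial():
    results = collect(make_trials(1), OBSERVABLES)
    assert results.shape == (1, 3)
    assert len(results.annotations) == 1
```

The functions follow the test_* naming convention, so pytest would discover them if the block were saved as a test module. In a real suite, collect and make_trials would be replaced by the actual function under test and its fixtures.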