Performance optimizations for alignments tracks, particularly those with many short reads #2523

cmdcolin · 2021-11-16T03:46:07Z

Tested on a production build, viewing a largish 100kb region with ~25x coverage of short reads:

http://localhost:3000/?config=test_data%2Fconfig_demo.json&session=share-amxdajf5sI&password=pSD6x

Numbers are from the production build; the dev build is basically the same.
before: 32s
after: 25s

So roughly 20-25% faster on some datasets.

Removes the 'color' configuration variable on the reads, along with some other changes, to avoid calling expensive functions on every read.

see #969

Main standouts in the performance trace now:

layout (the addRect calls)
rxjs filtering (specifically, probably the jexl callbacks inside)
serialization (this one takes up a lot of memory, so it is probably also the most involved in destabilizing/crashes)
drawing

Each of these could be targeted for additional performance improvement.

codecov · 2021-11-16T04:32:48Z

Codecov Report

Merging #2523 (180b1d2) into main (a7bca7d) will increase coverage by 0.16%.
The diff coverage is 88.25%.

@@            Coverage Diff             @@
##             main    #2523      +/-   ##
==========================================
+ Coverage   61.09%   61.26%   +0.16%     
==========================================
  Files         543      543              
  Lines       25141    25311     +170     
  Branches     5900     5942      +42     
==========================================
+ Hits        15361    15506     +145     
- Misses       9457     9482      +25     
  Partials      323      323

Impacted Files	Coverage Δ
packages/core/rpc/WebWorkerRpcDriver.ts	`0.00% <ø> (ø)`
packages/core/util/layouts/PrecomputedLayout.ts	`24.24% <ø> (ø)`
...gins/gff3/src/Gff3TabixAdapter/Gff3TabixAdapter.ts	`88.65% <ø> (ø)`
...s/alignments/src/PileupRenderer/PileupRenderer.tsx	`54.50% <82.53%> (+0.02%)`	⬆️
packages/core/util/layouts/GranularRectLayout.ts	`87.87% <89.37%> (-2.13%)`	⬇️
...pluggableElementTypes/renderers/BoxRendererType.ts	`74.35% <100.00%> (ø)`
packages/core/util/rxjs.ts	`85.71% <100.00%> (ø)`
...lignments/src/BamAdapter/BamSlightlyLazyFeature.ts	`79.24% <100.00%> (ø)`
...lugins/alignments/src/BamAdapter/MismatchParser.ts	`84.61% <100.00%> (ø)`
... and 1 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a7bca7d...180b1d2. Read the comment docs.

cmdcolin · 2021-11-16T19:07:48Z

Note that getting rid of the "color" config slot removes the ability to specify a jexl callback for the alignment feature color. If we want to keep that, it could be restored.

cmdcolin · 2021-11-17T00:44:15Z

This now restores the ability for the user to customize the color using a color callback. The default is a magenta color, and we can check against that default to avoid a readConfObject call on each feature.
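A minimal sketch of the "check against the default" idea, not the actual PileupRenderer code. Every name here (`Feature`, `RendererConfig`, `DEFAULT_COLOR`, `builtinColor`, `makeColorGetter`) and the strand-based color scheme are assumptions made for illustration. The point is that the configured value is compared once per render, so the per-feature config lookup only happens when the user has actually customized the color.

```typescript
// Hypothetical types and names for illustration; not JBrowse's actual API.
type Feature = { strand: number; name: string }
type ColorCallback = (feature: Feature) => string

interface RendererConfig {
  color: string | ColorCallback
}

// Sentinel default: if the configured value is still this string, the user
// has not customized the color and the callback can be skipped entirely.
const DEFAULT_COLOR = 'magenta'

// Built-in coloring that needs no config lookup (assumed scheme: by strand).
function builtinColor(feature: Feature): string {
  return feature.strand === -1 ? '#8F8FD8' : '#EC8B8B'
}

// Decide once per render which coloring function to use, instead of
// evaluating the config slot for every feature.
function makeColorGetter(config: RendererConfig): ColorCallback {
  const { color } = config
  if (color === DEFAULT_COLOR) {
    return builtinColor // fast path: no per-feature config evaluation
  }
  if (typeof color === 'string') {
    return () => color // user picked a fixed color
  }
  return color // user supplied a callback; call it per feature
}
```

With the default config, `makeColorGetter` returns `builtinColor` directly, so rendering many reads never touches the user callback path.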

cmdcolin · 2021-11-17T13:32:48Z

Found a pretty significant performance improvement, especially for viewing many short reads, by restoring the old granular rect layout. It had been replaced with rbush in an attempt to simplify the codebase and address a bug that I thought was caused by layout, but that ended up being related to block observability.

It now seems that rbush has a bad algorithmic characteristic for this use: we query the data structure many times on insert, looking for a place where we can insert a rect for nice layout packing, but the rbush query time is probably at least something like O(log(n)), leading to probably O(n log(n)) for inserting a single feature. Inserting many features is then something like O(n^2 log(n)).
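One way to read that estimate, assuming (this is not measured) that placing the i-th feature may probe up to i candidate positions and that each probe is a tree query costing on the order of log i:

```latex
\sum_{i=1}^{n} i \log i \;\le\; n \cdot n \log n \;=\; n^{2} \log n
```

Under that assumption a single insert is bounded by n log n and n inserts by n^2 log n, which matches the rough figures above.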

So, revert to the granular rect layout, which has essentially O(1) query time and O(n) insert for a single feature.
I tried to see if there was a way to make insert faster for rbush (e.g. not querying repeatedly to find an empty place), because it is a nice data structure, but it didn't work out.
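To illustrate why a grid-style layout avoids the repeated tree queries, here is a rough sketch of the general technique. It is not the actual GranularRectLayout implementation; the class name `GridLayout`, its fields, and the `pitchX`/`maxRows` parameters are assumptions for illustration. Each row is an array of flags, one per pitch-sized bin, so checking whether a rect fits in a row costs time proportional to the rect's width in bins, regardless of how many features are already laid out.

```typescript
// Hypothetical sketch; class and member names are illustrative only.
class GridLayout {
  // rows[r] holds one flag per pitch-sized bin: true means occupied.
  private rows: boolean[][] = []
  private pitchX: number
  private maxRows: number

  constructor(pitchX: number, maxRows: number) {
    this.pitchX = pitchX
    this.maxRows = maxRows
  }

  // Checks bins [startBin, endBin) of one row. The cost depends on the
  // rect's width in bins, not on the number of features already placed.
  private isFree(row: boolean[], startBin: number, endBin: number): boolean {
    for (let b = startBin; b < endBin; b++) {
      if (row[b]) {
        return false
      }
    }
    return true
  }

  // Places the rect in the topmost row where it fits.
  // Returns the row index, or null if every row is blocked.
  addRect(start: number, end: number): number | null {
    const startBin = Math.floor(start / this.pitchX)
    const endBin = Math.max(startBin + 1, Math.ceil(end / this.pitchX))
    for (let r = 0; r < this.maxRows; r++) {
      let row = this.rows[r]
      if (!row) {
        row = []
        this.rows[r] = row
      }
      if (this.isFree(row, startBin, endBin)) {
        for (let b = startBin; b < endBin; b++) {
          row[b] = true
        }
        return r
      }
    }
    return null
  }
}
```

The trade-off in this sketch is memory and resolution: positions are rounded to the pitch, and wide features touch many bins, but no spatial index has to be searched or rebalanced on insert.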

I added a deep sequencing track on volvox to demonstrate:

loading this with the rbush layout: 70s, much of the time taken on layout
loading with the old layout code: ~6s, almost no time taken on layout

rbuels · 2021-11-17T22:07:22Z

Looks good to me, merge if you feel it's ready

cmdcolin added 2 commits November 15, 2021 21:30

Misc

3e6a202

Reduce some readConfObject calls

e35e33c

github-actions bot added the needs label triage label Nov 16, 2021

cmdcolin force-pushed the pileup_optim branch from 36f3f72 to 541012f Compare November 16, 2021 04:19

cmdcolin added performance and removed needs label triage labels Nov 16, 2021

rbuels approved these changes Nov 16, 2021

plugins/alignments/src/BamAdapter/MismatchParser.ts (outdated, resolved)

cmdcolin force-pushed the pileup_optim branch from 541012f to e166a34 Compare November 16, 2021 18:59

cmdcolin force-pushed the pileup_optim branch from e166a34 to 949269f Compare November 17, 2021 00:42

Save time pileup renderer

e230d85

cmdcolin force-pushed the pileup_optim branch from 949269f to e230d85 Compare November 17, 2021 01:11

cmdcolin added 2 commits November 17, 2021 08:22

Small update to avoid a feature.get('seq') if unneeded

ba62503

Revert to old granular rect layout

f47800d

cmdcolin added 2 commits November 17, 2021 08:33

Add deep sequencing track for volvox

8d7d05e

Fix the set feature height ability

180b1d2

cmdcolin force-pushed the pileup_optim branch from 7398f41 to 180b1d2 Compare November 17, 2021 14:00

rbuels changed the title ~~Basic pileup optimizations~~ Performance optimizations for Alignments displays Nov 17, 2021

cmdcolin merged commit 7553aa8 into main Nov 17, 2021

cmdcolin deleted the pileup_optim branch November 17, 2021 22:42

cmdcolin changed the title ~~Performance optimizations for Alignments displays~~ Performance optimizations for alignments tracks, particularly those with many short reads Nov 17, 2021

cmdcolin mentioned this pull request Nov 17, 2021

alignments track performance optimization #969

Closed

cmdcolin added the enhancement label Dec 3, 2021

cmdcolin mentioned this pull request Jan 7, 2022

Fail to click on features in layout in @jbrowse/react-linear-genome-view #2625

Closed
