Working definition

Structural variants in this corpus are genomic alterations typically 50 bp or larger, including deletions, insertions, duplications, inversions, translocations, mobile element insertions, and repeat-mediated events.

What this corpus says

  • Structural variation is the clearest and most consistent advantage zone for long-read WGS.
  • Long reads especially improve discovery of insertions, inversions, repeat-associated events, and rearrangements that short reads undercount or mischaracterize.
  • Population-scale long-read SV resources are becoming useful not only for discovery but also for patient-level filtering.
  • The clinically important shift is not just more calls, but more sequence-resolved and structurally interpretable calls.

Supporting sources

Current takeaways

  • Insertions are disproportionately missed by short-read workflows.
  • Inversions and large complex events still often benefit from orthogonal or graph-aware approaches even within the long-read era.
  • SV reference panels and atlases are becoming a realistic extension of long-read cohort studies.

Open questions

  • Which SV classes now require full multi-platform support, and which are well covered by modern single-platform long-read workflows?
  • How should SV resources be integrated into routine clinical prioritization pipelines?