Vladimír Boža, Eduard Batmendijn, Peter Perešíni, Viktória Hodorová, Hana Lichancová, Rastislav Rabatin, Broňa Brejová, Jozef Nosek, Tomáš Vinař. Precise Nanopore Signal Modeling Improves Unsupervised Single-Molecule Methylation Detection. In Proceedings of the 23rd Conference Information Technologies – Applications and Theory (ITAT 2023), 3498 volume of CEUR Workshop Proceedings, pp. 223-231, Tatranske Matliare, 2023.
Download preprint: not available
Download from publisher: https://ceur-ws.org/Vol-3498/paper29.pdf
Related web page: not available
Bibliography entry: BibTeX
See also: early version
Abstract:
Base calling in nanopore sequencing is a difficult and computationally intensive problem, typically resulting in high error rates. In many applications of nanopore sequencing, direct analysis of raw signal is a viable alternative. Dynamic time warping (DTW) is an important building block for raw signal analysis. In this paper, we propose several improvements to DTW class of algorithms to better account for specifics of nanopore signal modeling. We have implemented these improvements in a new signal-to-reference alignment tool Nadavca. We demonstrate that Nadavca alignments improve unsupervised methylation detection over Tombo. We also demonstrate that by providing additional information about the discriminative power of positions in the signal, an otherwise unsupervised method can approach the accuracy of supervised models. Availability and implementation: Nadavca is available under MIT license at https://github.com/fmfi-compbio/nadavca. Nanopore sequencing data sets are available from ENA bioproject PRJEB64246. Jaminaea angkorensis reference genome assembly is available from Zenodo https://doi.org/10.5281/zenodo.8145315