Paper
6 April 2023 LocPatcH: An efficient long-read hybrid error correction algorithm based on local pHMM
Rongshu Wang, Jianhua Chen, Zhiwen Lu
Author Affiliations +
Proceedings Volume 12615, International Conference on Signal Processing and Communication Technology (SPCT 2022); 126151X (2023) https://doi.org/10.1117/12.2674016
Event: International Conference on Signal Processing and Communication Technology (SPCT 2022), 2022, Harbin, China
Abstract
The length of long reads produced by third-generation sequencing technologies is tens to hundreds of kbps which benefits genomic research. Still, the high error rate of long reads seriously limits the downstream analysis. Only by preserving the length advantage and reducing the error rate of long reads can the effectiveness of the downstream analysis be improved. Here propose LocPatcH: an accurate, efficient, and universal hybrid error correction algorithm based on local machine learning. LocPatcH constructs a profile hidden Markov model for each region in a long read which is aligned with abundant accurate short reads produced by the second-generation sequencing technologies, and then uses the alignment information of the short reads to train the model and finishes the correction. As for the rest of the aligned regions with lower coverage depths, the idea referred to as “patching” is used to complete the correction. The proposed method outperforms mainstream hybrid error correction methods in continuity and memory usage on real Pacbio and Nanopore sequencing datasets.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Rongshu Wang, Jianhua Chen, and Zhiwen Lu "LocPatcH: An efficient long-read hybrid error correction algorithm based on local pHMM", Proc. SPIE 12615, International Conference on Signal Processing and Communication Technology (SPCT 2022), 126151X (6 April 2023); https://doi.org/10.1117/12.2674016
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Error control coding

Error analysis

Image segmentation

Machine learning

Education and training

Data processing

Alignment modeling

Back to Top