ERROR CORRECTION METHOD FOR SEQUENCING DATA WITH INSERTIONS AND DELETIONS
Annotation
Subject of Research.A method for error correction for sequencing reads of a haploid organism with insertions and deletions was developed. It was tested on two libraries: a synthesized dataset for Escherichia coli bacterium and a real dataset of reads for Pseudomonas stutzeri. Method. The method is based on using k-mers but only for finding reads that are close to each other. For the close reads a consensus string is created which is then used for correcting errors in the initial reads. Main Results. The algorithm is implemented as a separated program. The program has been tested on both real and synthesized data. The method performance is higher than that of the other known methods (N50 metric was used as well as total contig length and maximal contig length as metrics for comparison). Practical Relevance. The method can be used together with known genome assembly methods not suitable for application with the reads containing insertion and deletion errors.
Keywords
Постоянный URL
Articles in current issue
- TRENDS IN THE DEVELOPMENT OF DETONATION ENGINES FOR HIGH-SPEED AEROSPACE AIRCRAFTS AND THE PROBLEM OF TRIPLE CONFIGURATIONS OF SHOCK WAVES. Part I. Research of detonation engines
- SPECIAL ASPECTS OF INITIAL OPTICAL SCHEME SELECTION FOR DESIGN OF NON-IMAGING OPTICAL SYSTEMS
- APPLICATION OF CHEMOMETRICS FOR ANALYSIS OF BIOAEROSOLS BY FLOW-OPTICAL METHOD
- APPROACH TO AUTOMATION OF LENS COMPONENTS CENTERING FOR ASSEMBLING OF DIFFERENT DESIGN OBJECTIVES
- METHOD FOR CREATION OF SPHERICAL PANORAMAS FROM IMAGES OBTAINED BY OMNIDIRECTIONAL OPTOELECTRONIC SYSTEMS
- LINE-FIELD SWEPT-SOURCE OPTICAL COHERENCE TOMOGRAPHY SYSTEM FOR NEAR INFRARED SPECTRAL REGION
- BACKSTEPPING ALGORITHM FOR LINEAR SISO PLANTS UNDER STRUCTURAL UNCERTAINTIES
- RESEARCH OF FREE MOTION TRAJECTORIES FEATURES OF CONTINUOUS SYSTEM DEFINED AS A CONSECUTIVE CHAIN OF IDENTICAL FIRST-ORDER APERIODIC LINKS
- SPECTRAL CHARACTERISTICS OF MID-INFRARED LIGHT-EMITTING DIODES BASED ON InAs (Sb,P)
- SIMULATION OF SENSING ELEMENT OF TEMPERATURE SENSOR BASED ON SILICATE GLASS WITH SODIUM NANOPARTICLES
- (IN-) PRIVACY IN MOBILE APPS. CUSTOMER OPPORTUNITIES (in Engl.)
- PRINCIPLES OF INDICATION FOR EN-ROUTE FLIGHT PATHS OF THE AIRCRAFT ON THE SCREEN OF ON-BOARD DISPLAY DEVICES
- ERROR CORRECTION METHOD FOR SEQUENCING DATA WITH INSERTIONS AND DELETIONS
- CHOICE OF OPTION FOR IMPLEMENTATION OF THE MULTILEVEL SECURE ACCESS TO THE EXTERNAL NETWORK
- SYNTHESIS OF THE SECONDARY STRUCTURE OF ALGEBRAIC BAYESIAN NETWORKS: AN INCREMENTAL ALGORITHM AND STATISTICAL ESTIMATION OF ITS COMPLEXITY
- EFFECT OF MICROPHONES NON-IDENTITY ON THE MICROPHONE ARRAYS CHARACTERISTICS
- QUALITY CONTROL AUTOMATED LASER-ULTRASONIC METHOD FOR SOLDER JOINTS OF NOZZLES OF CHAMBERS IN LIQUID ROCKET ENGINES
- ROBUST MODIFICATION OF THE LASSO METHOD FOR GENOME-WIDE ASSOCIATION STUDY IN VIEW OF TARGET PHENOTYPE VALUES
- BENCHMARK SOLUTIONS FOR STOKES EQUATIONS WITH VARIABLE VISCOSITY IN CYLINDRICAL AND SPHERICAL COORDINATES
- INFORMATIONAL MODEL OF MENTAL ROTATION OF FIGURES
- WENO SCHEMES FOR SOLUTION OF UNSTEADY ONE-DIMENSIONAL GAS DYNAMICS TEST PROBLEMS
- AUTOMATED CONTROL SIMULATION OF PROFESSIONAL SKILLS FORMATION FOR PRODUCTION SYSTEM OPERATOR
- STUDY OF OPTICAL AND LUMINESCENT PROPERTIES OF POTASSIUM-ALUMINA-BORATE GLASS DOPED WITH Cr3+ IONS
- SPEAKER-DEPENDENT FEATURES FOR SPONTANEOUS SPEECH RECOGNITION