SPEAKERS' IDENTIFICATION METHOD BASED ON COMPARISON OF PHONEME LENGTHS STATISTICS
Annotation
Subject of research. The paper presents a semi-automatic method of speaker identification based on prosodic features comparison - statistics of phone lengths. Due to the development of speech technologies in recent times, there is an increased interest in searching of expert methods for speaker's voice identification, which supplement existing methods to increase identification reliability and also have low labour intensity. An efficient solution for this problem is necessary for making the reliable decision whether the voices of the speakers in the audio recordings are identical or different. Method description. We present a novel algorithm for calculating the difference of speakers’ voices based on comparing of statistics for phone and allophone lengths. Characteristic feature of the proposed method is the possibility of its application along with the other semi-automatic methods (acoustic, auditive and linguistic) due to the lack of a strong correlation between analyzed features. The advantage of the method is the possibility to carry out rapid analysis of long-duration recordings because of preprocessing automation for data being analyzed. We describe the operation principles of an automatic speech segmentation module used for statistics calculation of sound lengths by acoustic-phonetic labeling. The software has been developed as an instrument of speech data preprocessing for expert analysis. Method approbation. This method was approved on the speech database of 130 speech records, including the Russian speech of the male speakers and female speakers, and showed reliability equal to 71.7% on the database containing female speech records, and 78.4% on the database containing male speech records. Also it was experimentally established that the most informative of all used features are statistics of phone lengths of vowels and sonorant sounds. Practical relevance. Experimental results have shown applicability of the proposed method for the speaker recognition task in the course of phonoscopic examination.
Keywords
Постоянный URL
Articles in current issue
- OPTICAL PULLING FORCES IN “NANOPARTICLES DIMER IN THE STRUCTURED FIELD” SYSTEM
- ADVANTAGES OF DIFFRACTIVE OPTICAL ELEMENTS APPLICATION IN SIMPLE OPTICAL IMAGING SYSTEMS
- COMPARISON OF HOLOGRAPHIC AND ITERATIVE METHODS FOR AMPLITUDE OBJECT RECONSTRUCTION
- STARS IDENTIFICATION AT THE ASTRONOMICAL COORDINATES DETERMINATION BY MEANS OF AN AUTOMATED ZENITH TELESCOPE
- TRANSFORMATION ALGORITHM FOR IMAGES OBTAINED BY OMNIDIRECTIONAL CAMERAS
- ADAPTIVE FLUX OBSERVER FOR PERMANENT MAGNET SYNCHRONOUS MOTORS
- STUDY OF MECHANISMS RESPONSIBLE FOR THE EFFICIENCY DEGRADATION OF THE III-NITRIDES LIGHT EMITTING DIODES
- FORMATION OF LUMINESCENT OPTICAL WAVEGUIDES IN SILICATE GLASS MATRIX BY THE ION-EXCHANGE TECHNIQUE
- INVESTIGATION OF HETEROSTRUCTURES 3C-SIC/15R-SIC
- INFLUENCE OF THE ORTHOGONALLY POLARIZED BACK REFLECTIONS ON THE POWER AND RADIATION SPECTRUM OF SUPERLUMINESCENT DIODES
- MATRIX-VECTOR ALGORITHMS FOR NORMALIZING FACTORS IN ALGEBRAIC BAYESIAN NETWORKS LOCAL POSTERIORI INFERENCE
- STUDY OF BLOCKING EFFECT ELIMINATION METHODS BY MEANS OF INTRAFRAME VIDEO SEQUENCE INTERPOLATION
- ALGORITHM OF RATIONAL PROCESSOR ARCHITECTURE
- RUNTIME BALANCING EFFECT IN DISTRIBUTED SIMULATION MODEL
- STUDY OF SOLUTION REPRESENTATION LANGUAGE INFLUENCE ON EFFICIENCY OF INTEGER SEQUENCES PREDICTION
- NUCLEAR-MAGNETIC MINI-RELAXOMETER FOR LIQUID AND VISCOUS MEDIA CONTROL
- COMPARISON OF VARIOUS APPROACHES TO MULTI-CHANNEL
INFORMATION FUSION IN C-OTDR SYSTEMS FOR REMOTE MONITORING OF EXTENDED OBJECTS - COORDINATION IN MULTILEVEL NETWORK-CENTRIC CONTROL SYSTEMS OF REGIONAL SECURITY: APPROACH AND FORMAL MODEL
- ANALYSIS OF FINITE-DIFFERENCE SCHEMES BASED ON EXACT AND APPROXIMATE SOLUTION OF RIEMANN PROBLEM
- APPLICATION OF MODIFIED CONVERSION METHOD TO A NONLINEAR DYNAMICAL SYSTEM
- SERVICES OF FULL-TEXT SEARCHING IN A DISTRIBUTED INFORMATION ENVIRONMENT (PROJECT HUMANITARIANA)
- ANALYSIS OF THE RELATIONSHIP BETWEEN DEGREE OF BLOOD OXYGENATION AND BACKSCATTERED RADIATION WITH THE USE OF NUMERICAL MODELING
- USING PRECEDENTS FOR REDUCTION OF DECISION TREE BY GRAPH SEARCH
- TEXTS SENTIMENT-ANALYSIS APPLICATION FOR PUBLIC OPINION ASSESSMENT