METHOD OF HIGH-QUALITY SPEECH SYNTHESIS WITH A SMALL DATABASE USAGE
Annotation
We propose an approach to synthesizing high-quality speech in view of a small initial speech database. A robust method for solving this problem is vital for voice restoration (recovery of the lost fragments of recordings based on available speech material of a well-known person, e.g. an actor). The proposed TTS (text-to-speech) system is a hybrid one that combines the advantages of both HMM- and Unit Selection-based TTS systems. The paper deals with the approach based on statistical models of intonation parameters, which makes it possible to preserve the speaker's pronunciation in synthesized speech. We describe the preparation of the database and the solution to the problem of shortage of original speech material for model training. Special algorithms of speech element concatenation and modification are effective to correct parameters according to the requirements, provide overall tonal smoothness and reduce spectral distortion at the boundaries of concatenated elements. Listening tests showed the efficiency of the proposed methods and proved the possibility of highquality speech synthesis even with a small speech database (right up to one hour of speech).
Keywords
Постоянный URL
Articles in current issue
- PHOTONICS AND OPTICAL INFORMATICS IN EUROPE: TRENDS OF 2003–2013
- TWO-DIMENSIONAL LOCALIZATION OF ATOMIC POPULATIONS IN FOUR-LEVEL QUANTUM SYSTEMS
- THE RECURRENT ALGORITHM FOR INTERFEROMETRIC SIGNALS PROCESSING BASED ON MULTI-CLOUD PREDICTION MODEL
- INVESTIGATION OF BIOLOGICAL OBJECTS IN OPTICAL COHERENCE TOMOGRAPHY WITH DATA PROCESSING BY SEQUENTIAL MONTE CARLO METHOD
- AUTOMATIC CALIBRATION METHOD FOR STEREOSCOPIC SYSTEM
- METHOD OF IMAGE QUALITY ENHANCEMENT FOR SPACE OBJECTS
- ROBUST REGULATION FOR SYSTEMS WITH POLYNOMIAL NONLINEARITY APPLIED TO RAPID THERMAL PROCESSES
- NANOSTRUCTURING AS A WAY FOR THERMOELECTRIC EFFICIENCY IMPROVEMENT
- SPECTRAL AND LUMINESCENT PROPERTIES OF CHROMIUM IONS IN FORSTERITE-LIKE NANO-GLASS CERAMICS
- SPECTRAL AND LUMINESCENT PROPERTIES OF FLUOROPHOSPHATE GLASSES DOPED WITH YTTERBIUM AND ERBIUM
- PARAMETERS OPTIMIZATION OF METAL-DIELECTRIC NANOSTRUCTURES FOR SENSOR APPLICATIONS
- HLD-METHODOLOGY APPLICATION FOR RECONFIGURABLE EMBEDDED SYSTEMS DESIGN
- DETECTION OF CLIPPED FRAGMENTS IN ACOUSTIC SIGNALS
- TWO-LEVEL HIERARCHICAL COORDINATION QUEUING METHOD FOR TELECOMMUNICATION NETWORK NODES
- AN APPROACH FOR CLONE DETECTION IN DOCUMENTATION REUSE
- EFFECTIVENESS ASSESSMENT METHODOLOGY OF INFORMATION SECURITY MANAGEMENT SYSTEM THROUGH THE SYSTEM RESPONSE TIME TO INFORMATION SECURITY INCIDENTS
- MOVING PERSON IDENTIFICATION IN VIDEO SURVEILLANCE SYSTEMS
- MULTISENSOR SYSTEM APPLICATION FOR PREPARATIONS BITTERNESS EVALUATION IN TRADITIONAL CHINESE MEDICINE
- ACCURACY EVALUATION FOR THE NON-CONTACT DEFECT AREA MEASUREMENT AT THE COMPLEX-SHAPE SURFACES UNDER VIDEOENDOSCOPIC CONTROL
- COMPARATIVE ANALYSIS OF ENERGY ACCUMULATION SYSTEMS AND DETERMINATION OF OPTIMAL APPLICATION AREAS FOR MODERN SUPER FLYWHEELS
- MULTI-GRID METHOD OF CONVERGENCE SPEEDING-UP FOR THE SOLUTION OF GAS DYNAMICS PROBLEMS ON UNSTRUCTURED MESHES
- EXTENSION OF TENSOR PRODUCT FOR OPERATORS ON THE DIRAC OPERATOR EXAMPLE
- MOLECULAR DYNAMIC SIMULATION OF PEPTIDE POLYELECTROLYTES
- IDENTIFICATION OF NONLINEAR MODEL PARAMETERS FOR RAPID THERMAL PROCESSES