AUTOMATIC SPEECH RECOGNITION – THE MAIN STAGES OVER LAST 50 YEARS
Annotation
The main stages of automatic speech recognition systems over last 50 years are regarded. The attempt is made to evaluate different methods in the context of approaching to functioning of biological systems. The method implementation based on dynamic programming algorithm and done in 1968 is considered as a benchmark. Shortcomings of the method, which make it possible to use it only for command recognition, are considered. The next method considered is based on a formalism of Markov chains. Based on the notion of coarticulation the necessity of applying context dependent triphones and biphones instead of context independent phonemes is shown. The problems of insufficiency of speech databases for triphone training which lead to state tying methods are explained. The importance of model adaptation and feature normalization methods providing better invariance to speakers, communication channels and additive noise are shown. Deep Neural Networks and Recurrent Networks are considered as the most up-to-date methods. The similarity of deep (multilayer) neural networks and biological systems is noted. In conclusion, the problems and drawbacks of the modern systems of automatic speech recognition are described and prognosis of their development is given.
Keywords
Постоянный URL
Articles in current issue
- PHOTOELECTRIC AND PHOTOMAGNETIC RESPONSE OF INDIUM-TIN OXIDE FILMS
- POTENTIALS OF RAMAN BASED SENSOR SYSTEM FOR AN ONLINE ANALYSIS OF HUMAN INHALE AND EXHALE
- ANALYSIS OF PERIODIC NANOSTRUCTURES FORMATION ON A GOLD SURFACE UNDER EXPOSURE TO ULTRASHORT LASER PULSES NEAR THE MELTING THRESHOLD
- PERFORMANCE OPTIMIZATION OF THE DIODE-PUMPED SOLID-STATE LASER FOR SPACE APPLICATIONS
- EXPERIMENTAL COMPARISON OF HOMODYNE DEMODULATION ALGORITHMS FOR PHASE FIBER-OPTIC SENSOR
- DIRECTIVITY PATTERN INVESTIGATION OF DUAL FIBER OPTIC HYDROPHONE
- SUBJOULE DIODE-PUMPED YTTERBIUM-ERBIUM GLASS LASER WITH CAVITY DUMPING FOR CATARACT EXTRACTION
- AUTOMATION OF LENSES CENTERING AT GLUING IN THE FRAME
- СREATION OF CORRELATION FUNCTIONS OF LINEAR CONTINUOUS SYSTEMS BASED ON THEIR FUNDAMENTAL MATRICES
- СONTROL FOR QUADROCOPTER WITH COMPENSATION OF WIND DISTURBANCE
- СHIRAL RECOGNITION OF CYSTEINE MOLECULES BY CHIRAL CdSe AND CdS QUANTUM DOTS
- MICROSTRUCTURING OF SILICON SINGLE CRYSTALS BY FIBER LASER IN HIGH-SPEED SCANNING MODE
- DETERMINATION OF SATURATION VAPOR PRESSURE OF LOW VOLATILE SUBSTANCES THROUGH THE STUDY OF EVAPORATION RATE BY THERMOGRAVIMETRIC ANALYSIS
- STUDY OF CURRENT APPROACHES FOR WEB PUBLISHING OF OPEN SCIENTIFIC DATA
- PERFORMANCE EVALUATION OF SD-CARDS BY "SYSTEM-ON-CHIP" TECHNOLOGY
- MODELS OF LIVE MIGRATION WITH ITERATIVE APPROACH AND MOVE OF VIRTUAL MACHINES
- DOMAIN-DRIVEN DESIGN APPLICATION AND IMPLEMENTATION OF INFORMATION SYSTEMS FOR CLIENTS QUEUING SUBJECT AREAS
- INTEGRITY MONITORING IMPLEMENTATION FOR THE OPERATING SYSTEM IMAGE LOADED THROUGH A NETWORK TO THE THIN CLIENT
- METHOD AND ABSTRACT MODEL FOR CONTROL AND ACCESS RIGHTS BY REQUESTS REDIRECTION
- SMARTPHONE-BASED APPROACH TO ADVANCED DRIVER ASSISTANCE SYSTEM (ADAS) RESEARCH AND DEVELOPMENT
- LTE OFFLOADING THROUGH WiFi NETWORKS
- METHOD OF TRAINING EXAMPLES IN SOLVING INVERSE ILL-POSED PROBLEMS OF SPECTROSCOPY
- ARBITRARY INTERACTION OF PLANE SUPERSONIC FLOWS
- LOGICAL CONDITIONS ANALYSIS METHOD FOR DIAGNOSTIC TEST RESULTS DECODING APPLIED TO COMPETENCE ELEMENTS PROFICIENCY
- SYNTHESIS OF 2,6-DIAMINOPYRIDINE-4-NITROPHENOL (2,6DAP4N) COCRYSTAL NANOPARTICLES BY LASER ABLATION METHOD