Review of deep learning methods for imaging photoplethysmography data processing
Annotation
This paper presents a review of contemporary deep learning methods for processing remote photoplethysmography data. Architectures of convolutional neural networks, transformers, recurrent, and generative models are examined for video signal preprocessing and for extracting physiologically significant parameters under conditions involving artifacts caused by motion, illumination changes, or low video quality. An analysis of the prospects for implementing deep learning algorithms in real-world medical scenarios is conducted based on the proposed criteria, considering existing integration challenges, the demand for such solutions, and issues related to result validation. The study includes a review of existing deep learning approaches that utilize video signals to estimate imaging photoplethysmography. The methods are evaluated using newly proposed criteria, including the multidimensionality of the photoplethysmography output signal, the availability of open-source code, and the reporting of computational time costs, which is essential for their practical real-time application in medical institutions. It is shown that deep learning methods significantly outperform traditional approaches in physiological parameter estimation, cardiovascular disease diagnosis, and video signal preprocessing. However, most existing deep learning-based solutions are limited to one-dimensional output signals due to the complexity of obtaining multidimensional annotations required for supervised learning. Additional analysis revealed a lack of information regarding temporal and computational costs, which restricts the practical real-time implementation of these methods. The proposed systematization clarifies key terms related to photoplethysmography signal processing: contact photoplethysmography, imaging photoplethysmography, remote photoplethysmography, and photoplethysmographic imaging. Approaches to dataset collection are also described, considering the concepts of multidimensionality, multichannel, and multimodal signals. The results may be applied in the development of remote health monitoring systems, including medical and consumer devices. The review will be of interest to specialists in biomedical engineering, medical informatics, and developers of physiological signal analysis solutions.
Keywords
Постоянный URL
Articles in current issue
- Fluorescence studies of natural photosensitizers in oncology and antimicrobial therapy
- Effect of heat treatment on the growth and luminescence of quantum dots CsPbI3 in fluorophosphate glass
- Study of nanopipettes conductivity depending on their shape and size
- Thermal conductivity of multilayer hexagonal boron nitride nanoscrolls
- Integrated control algorithm for obstacle and singularity avoidance in a robotic manipulator
- Method of automatic generation of the informative space for identifying information security events in corporate computer networks
- Spectral-based multi-band recurrent neural networks for black-box modeling of dynamic range compressors (in English)
- Hierarchical multi-task learning for low-complexity models based on task synergy analysis
- Detection of network anomalies in the Internet of Things environment using modified statistical criteria and ensemble methods
- Automatic detection of software design patterns using a language model on transformer architecture (in English)
- Ego-net link prediction with GNN (in English)
- Multi-task human’s psychological profile analysis based on text data using semi-supervised learning
- Modeling and optimization of information flows in electronic document management systems under information security threats
- Series-parallel architecture for the FPGA implementation of neural networks trainable in real-time using the error backpropagation algorithm
- An approach to contextual example mining for DGA domain identification using large language models
- Analysis of the effectiveness of optimizing behavioral descriptions of hardware in logic synthesizers for FPGA
- Spheroidal models of ore deposits in the framework of gravity tomography
- Prediction of maximum stresses in the shaft–insert system using a neural network
- Estimation criterion and method for optimizing the redundancy of video images in surveillance systems
- Generating spatiotemporal network load series in multi-access edge computing tasks using open data
- Application of hybrid artificial intelligence methods to practical industrial tasks under conditions of scarce training data
- Implementation and investigation of a reservoir computer based on a hardware model of three-element spiking neuron
- Analysis of a centerless control scheme for profiles of large-sized shells in the process of their shaping
- Oblivious signature based on the theory of elliptic curve isogeny