![Scientific and Technical Journal
of Information Technologies, Mechanics and Optics](/images/mag-ntv.png)
Clustering in big data analytics: a systematic review and comparative analysis (review article)
![Scientific and Technical Journal
of Information Technologies, Mechanics and Optics](/images/mag-ntv.png)
Annotation
In the modern world, the widespread use of information and communication technology has led to the accumulation of vast and diverse quantities of data, commonly known as Big Data. This necessitates the need for novel concepts and analytical techniques to help individuals extract meaningful insights from rapidly increasing volumes of digital data. Clustering is a fundamental approach used in data mining to retrieve valuable information. Although a wide range of clustering methods have been described and implemented in various fields, the sheer variety complicates the task of keeping up with the latest advancements in the field. This research aims to provide a comprehensive evaluation of the clustering algorithms developed for Big Data highlighting their various features. The study also conducts empirical evaluations on six large datasets, using several validity metrics and computing time to assess the performance of the clustering methods under consideration.
Keywords
Постоянный URL
Articles in current issue
- Development of adaptive laser head for compensating error of beam waist position during processing materials using laser beam spot detection method
- Investigation of changes in the sensitivity of a fiber Bragg grating to temperature and strain using coatings from low-melting metal
- Cross-polarization coupling in polarization maintaining fiber induced by periodic mechanical stress
- Lyapunov function search method for analysis of nonlinear systems stability using genetic algorithm
- Robust disturbances compensation for MIMO linear systems with unmeasured state vector and control delay
- Trajectory tracking control for mobile robots with adaptive gain
- Switching the electrical properties of thin-film memristive elements based on GeTe by sequences of ultrashort laser pulses
- Spectral and kinetic characteristics of ultrathin cadmium selenide nanoscrolls
- Method for optimization of camera installation parameters for video monitoring of arbitrary surveillance zone
- The use of anthropometric points to introduce restrictions into the synthesis of a 3D model of the human body using SMPL
- Method for testing NLP models with text adversarial examples
- A new efficient adaptive rood pattern search motion estimation algorithm
- Segmentation of word gestures in sign language video
- A method for constructing interpretable hidden Markov models for the task of identifying binding cores in sequences
- Job scheduling in a distributed computing system on a chip with power consumption minimization
- System for customers’ routing based on their emotional state and age in public services systems
- Sedentary behavior health outcomes and identifying the uncertain behavior patterns in adult
- Confidence Lipschitz classifiers: an instrument of guaranteed reliability
- Visual programming environment for multidimensional fuzzy interval-logic regulators
- Solving the problem of spatial rotation of 3D surfaces and their mapping on the plane
- Analytical and simulation modeling of flexible joints for mechatronic and robotic systems
- Study of heat and mass transfer processes in the Fe-Sn reaction crucible in the presence of high-density electric current
- Measurement of the refractive index using an autocollimation goniometer