GENERATING DATASETS FOR THE BINARY CLASSIFICATION TASK BASED ON THEIR CHARACTERISTIC DESCRIPTIONS
Abstract
Subject of Study. We present a method for generating instances of the binary classification task from their characteristic descriptions, given as a meta-feature vector. We also propose a naïve method for the same problem to serve as a baseline. We study the characteristic space of binary classification task instances and methods for traversing this space. Method. The proposed method is based on a genetic algorithm whose minimized objective function is the distance in the characteristic space between the description vector of the generated binary classification instance and the specified target vector. We developed crossover and mutation operators for the genetic algorithm. These operators are based on transformations such as adding or removing features and objects in datasets. Main Results. To validate the proposed method, we chose several non-trivial two-dimensional meta-feature spaces built from statistical, information-theoretic and structural characteristics of classification task instances. We used the baseline method to evaluate the relative error of the proposed method. Both methods used the same number of classification task instances. The proposed method outperformed the naïve method, reducing the average error by a factor of 30. Practical Relevance. The proposed method for generating classification task instances from their characteristic descriptions makes it possible to obtain previously unavailable instances, which are required to evaluate classifier performance in specific regions of the meta-feature space when designing automatic algorithm selection systems.
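The search loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The two meta-features (mean per-feature standard deviation and class-label entropy), the operator details, the population settings and all function names are assumptions chosen only to keep the example small. The one element taken from the abstract is the overall scheme: a genetic algorithm that minimizes the distance between a dataset's meta-feature vector and a target vector, with operators that add or remove features and objects.

```python
import math
import random
import statistics

# A dataset is a pair (X, y): X is a list of feature rows, y a list of 0/1 labels.

def meta_features(ds):
    """Hypothetical 2-D description: (mean per-feature std, class entropy)."""
    X, y = ds
    mean_std = statistics.fmean([statistics.pstdev(col) for col in zip(*X)])
    p = sum(y) / len(y)
    entropy = -sum(q * math.log2(q) for q in (p, 1 - p) if q > 0)
    return (mean_std, entropy)

def random_dataset(rng, n_objects=20, n_features=3):
    X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_objects)]
    y = [rng.randint(0, 1) for _ in range(n_objects)]
    return (X, y)

def mutate(ds, rng):
    """Add or remove one object or one feature (assumed operator design)."""
    X, y = [row[:] for row in ds[0]], ds[1][:]
    op = rng.choice(["add_object", "remove_object", "add_feature", "remove_feature"])
    scale = rng.uniform(0.1, 3.0)
    if op == "add_object":
        X.append([rng.gauss(0, scale) for _ in X[0]])
        y.append(rng.randint(0, 1))
    elif op == "remove_object" and len(X) > 4:
        i = rng.randrange(len(X))
        del X[i]
        del y[i]
    elif op == "add_feature":
        for row in X:
            row.append(rng.gauss(0, scale))
    elif op == "remove_feature" and len(X[0]) > 1:
        j = rng.randrange(len(X[0]))
        for row in X:
            del row[j]
    return (X, y)

def crossover(a, b, rng):
    """Child samples objects from both parents, truncated to the shared feature count."""
    k = min(len(a[0][0]), len(b[0][0]))
    pool = [(row[:k], label) for parent in (a, b) for row, label in zip(*parent)]
    size = max(4, len(pool) // 2)
    chosen = rng.sample(pool, size)
    return ([row for row, _ in chosen], [label for _, label in chosen])

def generate(target, pop_size=30, generations=60, seed=0):
    """Return (dataset, error) with meta-features close to `target`."""
    rng = random.Random(seed)

    def error(ds):
        return math.dist(meta_features(ds), target)

    pop = [random_dataset(rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=error)
        elite = pop[: pop_size // 3]  # elitism: the best datasets survive unchanged
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), rng), rng)
            for _ in range(pop_size - len(elite))
        ]
        pop = elite + children
    best = min(pop, key=error)
    return best, error(best)

if __name__ == "__main__":
    dataset, err = generate(target=(2.0, 0.8))
    print(len(dataset[0]), "objects,", len(dataset[0][0]), "features, error", err)
```

Because the elite is carried over unchanged, the best error in this sketch never increases from one generation to the next. A naïve baseline in the spirit of the abstract would draw the same total number of random datasets and keep the one closest to the target.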