![Scientific and Technical Journal of Information Technologies, Mechanics and Optics](/images/mag-ntv.png)
DYNAMIC FEATURE SELECTION FOR WEB USER IDENTIFICATION ON LINGUISTIC AND STYLISTIC FEATURES OF ONLINE TEXTS
![Scientific and Technical Journal of Information Technologies, Mechanics and Optics](/images/mag-ntv.png)
Annotation
The paper deals with identification and authentication of web users participating in the Internet information processes (based on features of online texts).In digital forensics web user identification based on various linguistic features can be used to discover identity of individuals, criminals or terrorists using the Internet to commit cybercrimes. Internet could be used as a tool in different types of cybercrimes (fraud and identity theft, harassment and anonymous threats, terrorist or extremist statements, distribution of illegal content and information warfare). Linguistic identification of web users is a kind of biometric identification, it can be used to narrow down the suspects, identify a criminal and prosecute him. Feature set includes various linguistic and stylistic features extracted from online texts. We propose dynamic feature selection for each web user identification task. Selection is based on calculating Manhattan distance to k-nearest neighbors (Relief-f algorithm). This approach improves the identification accuracy and minimizes the number of features. Experiments were carried out on several datasets with different level of class imbalance. Experiment results showed that features relevance varies in different set of web users (probable authors of some text); features selection for each set of web users improves identification accuracy by 4% at the average that is approximately 1% higher than with the use of static set of features. The proposed approach is most effective for a small number of training samples (messages) per user
Keywords
Постоянный URL
Articles in current issue
- NANOSCALE STRUCTURES GENERATION WITHIN THE SURFACE LAYER OF METALS WITH SHORT UV LASER PULSES
- INKJET PRINTING OF ALUMOOXIDE SOL FOR DEPOSITION OF ANTIREFLECTING COATINGS
- INCREASED IMAGE QUALITY BY SYNTHESIZING SPACE PHOTOS WITH DIFFERENT EXPOSURES
- ROBUST CONTROL ALGORITHM FOR MULTIVARIABLE PLANTS WITH QUANTIZED OUTPUT
- COLLAPSE KINETIC OF COMPOSITES BASED ON COPOLYMERS OF ACRYLIC ACID AND ACRYLAMIDE FILLED WITH BENTONITE IN AQUEOUS SOLUTIONS OF POLYVALENT METALS
- FORMATION OF NANOSTRUCTURED CuO FILM ON FLUOROPHOSPHATE GLASS SURFACE
- VIRTUAL REALITY FOR MANAGEMENT OF SITUATIONAL AWARENESS DURING GLOBAL MASS GATHERINGS
- MUTUAL IMAGE TRANSFORMATION ALGORITHMS FOR VISUAL INFORMATION PROCESSING AND RETRIEVAL
- AUTOMATIC ANALYSIS OF LOCAL ROUTES AND ADJACENT HOUSE TERRITORY FOR URBAN PLANNING SUPPORT
- METHOD OF RARE TERM CONTRASTIVE EXTRACTION FROM NATURAL LANGUAGE TEXTS
- ANALYSIS OF STATISTICAL DATA FROM NETWORK INFRASTRUCTURE MONITORING TO DETECT ABNORMAL BEHAVIOR OF SYSTEM LOCAL SEGMENTS
- ON INFORMATION SECURITY SOLUTIONS APPLICABLE TO D2D COMMUNICATIONS WITHIN THE 5G DOMAIN: ANALYZING THE INFLUENCE OF USER MOBILITY
- PROBABILITY DISTRIBUTION OVER THE SET OF CLASSES IN ARABIC DIALECT CLASSIFICATION TASK
- DYNAMIC FEATURE SELECTION FOR WEB USER IDENTIFICATION ON LINGUISTIC AND STYLISTIC FEATURES OF ONLINE TEXTS
- DEEP LEARNING MODEL FOR BILINGUAL SENTIMENT CLASSIFICATION OF SHORT TEXTS
- INNOVATIVE HEAT FLUX SENSOR
- APPROACH TO SYNTHESIS OF PASSIVE INFRARED DETECTORS BASED ON QUASI-POINT MODEL OF QUALIFIED INTRUDER
- NUMERICAL SIMULATION OF MASS TRANSFER IN CENTRIFUGAL EVAPORATOR
- A CALCULATION OF SEMI-EMPIRICAL ONE-ELECTRON WAVE FUNCTIONS FOR MULTI-ELECTRON ATOMS USED FOR ELEMENTARY PROCESS SIMULATION IN NONLOCAL PLASMA
- PARAMETRICAL IDENTIFICATION OF DIFFERENTIAL-DIFFERENCE HEAT TRANSFER MODEL DURING LIDAR TEMPERATURE MONITORING
- EFFECT OF UV LASER ON SPECTRAL PROPERTIES OF BORATE GLASSES DOPED WITH COPPER CHLORIDE NANOCRYSTALS
- PROJECT ENGINEERING DATA MANAGEMENT AT AUTOMATED PREPARATION OF DESIGN DOCUMENTATION
- CONTROL SYSTEM FOR TILTABLE PLATE WITH TWO DEGREES OF FREEDOM FOR RESEARCH OF DYNAMIC MANIPULATION PROBLEMS
- APPARATUS FOR SURFACE TREATMENT OF FREE-FORM OBJECT BY LASER RADIATION
- AUTOMATED REMOTE MANAGEMENT AND CONTROL SYSTEM OF THE LABORATORY EQUIPMENT