![SCIENTIFIC AND TECHNICAL JOURNAL OF INFORMATION TECHNOLOGIES, MECHANICS AND OPTICS](/images/mag-ntv.png)
CROSS-DOMAIN WEB AUTHOR IDENTIFICATION
![SCIENTIFIC AND TECHNICAL JOURNAL OF INFORMATION TECHNOLOGIES, MECHANICS AND OPTICS](/images/mag-ntv.png)
Annotation
The paper is devoted to the cross-domain web author attribution (identification), where user's messages are obtained from several sources (web-sites). We focused on the problem of one web-site user identification by his messages from another web-site. We found that there is a stylistic difference between the texts of messages created by one user on different web-sites. The possibility of a single feature space forming for texts received from various sources was determined providing sufficient accuracy of linguistic identification. Two subtasks were studied: 1) mixed sources – training and test datasets include messages from mixed sources (web-sites); 2) separated sources – the text messages sources of the training and test datasets do not intersect; training dataset includes texts from one source, test dataset includes texts from another.The experiment results showed that identification accuracy in mixed sources task is 0.82. The accuracy in separated sources task is 0.74. It is concluded that there is a stylistic difference between texts created by one user, but on the various web-sites. But at the same time, it is possible to form a single feature space for text messages received from various web-sites, ensuring sufficient identification accuracy.
Keywords
Постоянный URL
Articles in current issue
- TO THE ANNIVERSARY OF ALEXANDER LVOVICH FRADKOV
- DEPENDENCE OF SPECTRAL CHARACTERISTICS OF SEMICONDUCTOR AND SOLID STATE LASERS OF VISIBLE RANGE ON ACTIVE ENVIRONMENT TEMPERATURE
- SPECTRAL SENSITIVITY STABILITY ESTIMATION OF DIGITAL COLOR CAMERAS
- ENEGETIC EFFICIENCY ASSESSMENT OF SPECTRAL COHERENCE TOMOGRAPH OPTICAL-ELECTRONIC SYSTEM
- METAFILM-BASED BIOSENSOR FOR DETERMINATION OF GLUCOSE CONCENTRATION IN HUMAN BLOOD
- ADAPTIVE ROBUST DISTURBANCE COMPENSATION IN LINEAR SYSTEMS WITH DELAY
- ROBUST STABILIZATION OF TWIN-ROTOR MIMO PLANT
- POROUS STRUCTURE AND FUNCTIONAL PROPERTIES OF HIGHLY-PERMEABLE POLYPROPYLENE FILMS
- FABRICATION OF ACTIVE ADDITIVE TO SHAMPOOS BASED ON DIFFERENT NATURE NANOPARTICLES
- LASER SYNTHESIS OF SELENIUM NANOPARTICLES IN LIQUID MONOMERS
- DESIGN CONCEPTS FOR DIGITAL PROJECT AND PRODUCTION COMPANIES OF INDUSTRY 4.0 STANDARD
- AUDIO-REPLAY ATTACKS SPOOFING DETECTION FOR SPEAKER RECOGNITION SYSTEMS
- MATRIX-ITERATIVE SOLUTION METHOD FOR SYSTEM OF LINEAR EQUATIONS AND ITS APPLICATION IN SPACE TOMOGRAPHY SCANNING USING RADAR
- FACE RECOGNITION SYSTEM FOR PAYMENT PROCESS ON MOBILE DEVICES AND WEB-APPLICATIONS
- MODELING OF ETHERNET NETWORKS IN OMNET ++ INET FRAMEWORK MEDIUM
- QUEUE SYSTEMS WITH POLYMODAL QUERY FLOWS
- VERIFICATION OF INTEGRATED CIRCUIT BEHAVIORAL MODELS BY PROGRAMMABLE LOGIC
- STUDY OF COMPUTER VISION ALGORITHMS FOR SPACE TRACKING SYSTEMS IN TYPICAL MODES OF THEIR FUNCTIONING
- AN ALGORITHM FOR COMPACT FIXED-POINT IMPLEMENTATION OF DIGITAL CONTROLLERS
- SIMULATION MODEL FOR MULTICHANNEL PRIORITY SERVICE OF REDUNDANT DATA TRANSFER SYSTEM
- PLATFORM ARCHITECTURE FOR DEVELOPMENT OF MOBILE APPLICATIONS WITH OUTDOOR-QUESTS
- SHORTCUT ANALYTICAL-STATISTICAL MODELING METHODS FOR TECHNICAL SYSTEMS WITH DISTRIBUTED STRUCTURE
- STABILITY OF VISCOUS FILM ON SURFACE OF SLIGHTLY INCLINED ROTATING VERTICAL CYLINDER
- MATHEMATICAL MODEL OF RESONATOR CHAINS IN EXTERNAL MAGNETIC FIELD
- CREATION OF INDIVIDUAL LEARNING TRAJECTORIES BASED ON STUDENT’S ACHIEVEMENTS AND FUNCTIONAL STATE ANALYSIS
- DEVELOPMENT OF EDUCATIONAL PLATFORM FOR INDUSTRY 4.0 PRODUCTION PROCESS STUDY