ANALYSIS AND ESTIMATION OF THE TRIE MINIMUM LEVEL IN NON-HASH DEDUPLICATION SYSTEM
Abstract
Subject of research. The paper deals with a method for restricting the minimum level of the trie in a non-hash data deduplication system. Method. The essence of the method is to forcibly extend the trie to a specified minimum level. The proposed method increases the performance of the deduplication process by reducing the number of collisions at the lower levels of the trie. The maximum theoretical performance gain corresponds to the share of collisions in the total number of data read operations from the storage medium. Applying the method increases the metadata size by the number of new structures that contain a single element. Main results. The results are supported by a computational experiment with non-hash deduplication on a 528 GB data set. Analysis of the process has shown that 99% of the execution time is spent on hard-drive head positioning, because the blocks are distributed randomly across the storage medium. On the experimental data set, applying the minimum level restriction to the trie in a non-hash data deduplication system can increase performance by up to 16%, while the metadata size increases by 49%. The total amount of metadata is 34% less than with hash-based deduplication using the MD5 algorithm, and 17% less than with the Tiger192 algorithm. These results confirm the effectiveness of the proposed method. Practical relevance. The proposed method increases the performance of the deduplication process by reducing the number of collisions during trie construction. The results are of practical importance for professionals involved in the development of non-hash data deduplication methods.
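The idea of a minimum trie level can be illustrated with a small sketch. The abstract does not describe the data structures, so everything below is an assumption made for illustration: a byte-wise trie keyed on block contents, fixed-size blocks, an in-memory list standing in for the storage medium, and the hypothetical names `TrieDedup`, `add`, and `run`. A "collision" is modelled here as a read of a stored block that turns out to differ from the incoming block. It is not the authors' implementation.

```python
class Node:
    """Trie node: either internal (children) or a leaf (block_id set)."""
    def __init__(self):
        self.children = {}    # next byte value -> Node
        self.block_id = None  # index into the simulated storage, leaves only


class TrieDedup:
    """Toy non-hash deduplication index (illustrative assumption only).

    All blocks must have the same length, and min_level must not exceed it.
    """
    def __init__(self, min_level=1):
        self.min_level = max(1, min_level)
        self.root = Node()
        self.storage = []     # stands in for the storage medium
        self.nodes = 1        # rough proxy for metadata size
        self.reads = 0        # block reads from storage
        self.collisions = 0   # reads that found a different block

    def _store(self, block):
        self.storage.append(block)
        return len(self.storage) - 1

    def _grow(self, node, block, start, stop):
        """Append one node per byte of block[start:stop]; return the last one."""
        for byte in block[start:stop]:
            child = Node()
            node.children[byte] = child
            self.nodes += 1
            node = child
        return node

    def add(self, block):
        """Insert a block; return (block_id, is_duplicate)."""
        node, depth = self.root, 0
        while node.block_id is None:
            child = node.children.get(block[depth])
            if child is None:
                # No stored block shares this prefix: no read is needed.
                # The new leaf is forced down to at least min_level.
                leaf = self._grow(node, block, depth,
                                  max(depth + 1, self.min_level))
                leaf.block_id = self._store(block)
                return leaf.block_id, False
            node, depth = child, depth + 1
        # Reached a leaf: the stored block must be read and compared.
        self.reads += 1
        stored = self.storage[node.block_id]
        if stored == block:
            return node.block_id, True
        # Collision: extend the trie to the first differing byte.
        self.collisions += 1
        i = next(k for k in range(depth, len(block)) if stored[k] != block[k])
        old_id, node.block_id = node.block_id, None
        fork = self._grow(node, block, depth, i)
        self._grow(fork, stored, i, i + 1).block_id = old_id
        new_leaf = self._grow(fork, block, i, i + 1)
        new_leaf.block_id = self._store(block)
        return new_leaf.block_id, False


def run(blocks, min_level):
    """Feed blocks through the index; return (collisions, reads, nodes)."""
    index = TrieDedup(min_level)
    for block in blocks:
        index.add(block)
    return index.collisions, index.reads, index.nodes


blocks = [b"aaaa", b"abaa", b"acaa", b"bxxx", b"aaaa"]
print("min_level=1:", run(blocks, 1))
print("min_level=2:", run(blocks, 2))
```

In this toy model, raising `min_level` removes reads that end in a mismatch, because blocks that differ within the first `min_level` bytes never reach an occupied leaf. The cost is extra single-child nodes, which is one way to read the abstract's "new structures containing one element". Reads that confirm a true duplicate remain in both cases.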