SCALABLE DEDUPLICATION SYSTEM WITH SMALL BLOCKS

Exemplary method, system, and computer program product embodiments for scalable data deduplication working with small data chunk in a computing environment are provided. In one embodiment, by way of example only, for each small data chunk, a signature is generated based on a combination of a represe...

Full description

Saved in:
Bibliographic Details
Main Authors ASHER RON, KLEIN SHMUEL T, ARONOVICH LIOR, MEIRI EHUD, HIRSCH MICHAEL, TOAFF YAIR
Format Patent
LanguageEnglish
Published 08.10.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Exemplary method, system, and computer program product embodiments for scalable data deduplication working with small data chunk in a computing environment are provided. In one embodiment, by way of example only, for each small data chunk, a signature is generated based on a combination of a representation of characters used in selecting data to be deduplicated. A c-spectrum of the small data chunk being a sequence of representations of different characters ordered by a frequency of occurrence in the small data chunk, and an f-spectrum of the small data chunk being a corresponding sequence of frequencies of the different characters in the small data chunk.
Bibliography:Application Number: US201514733507