PAS: A Sampling Based Similarity Identification Algorithm for compression of Unicode data content