A repo that contains Text Classification, Similarity Detection using LSH and Duplicate Detection using XGBoost.