near-duplicate-docs

Simple library for finding duplicate and near-duplicate text documents in massive sets/libraries/databases

Keywords: near-duplicate documents, duplicate documents, near-duplicate pages, near-duplicate texts, similar pages, similar documents, near-duplicate detection, Jaccard similarity index

1.1.13 • Published 3 years ago
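
The listing names the Jaccard similarity index but does not show how the package applies it. As a rough illustration of the general idea only, the sketch below compares documents by the overlap of their word shingles. It is written in Python for brevity and is not this package's API: the function names (`shingles`, `jaccard`, `near_duplicates`), the shingle size `k`, and the `threshold` value are all assumptions made for the example.

```python
def shingles(text, k=3):
    """Return the set of k-word shingles in text (lowercased, whitespace-split)."""
    words = text.lower().split()
    if len(words) < k:
        # Too short for a full shingle: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a, b):
    """Jaccard index |A & B| / |A | B|; taken as 1.0 when both sets are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def near_duplicates(docs, threshold=0.5, k=3):
    """Return (i, j, score) for each document pair scoring at least threshold."""
    sets = [shingles(d, k) for d in docs]
    pairs = []
    for i in range(len(sets)):
        for j in range(i + 1, len(sets)):
            score = jaccard(sets[i], sets[j])
            if score >= threshold:
                pairs.append((i, j, score))
    return pairs


if __name__ == "__main__":
    sample = [
        "the quick brown fox jumps over the lazy dog",
        "the quick brown fox jumps over the lazy cat",
        "completely different text about something else entirely",
    ]
    for i, j, score in near_duplicates(sample):
        print(i, j, score)
```

Comparing every pair is quadratic in the number of documents, so for very large collections this exact comparison is commonly replaced by an approximation such as MinHash with locality-sensitive hashing. The listing does not say which approach the package takes.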