text-dedup

Wheel Details

Project: text-dedup
Version: 0.4.0
Filename: text_dedup-0.4.0-py3-none-any.whl
Size: 48441 bytes
MD5: ab05b505db41bb67ff042925d43e578e
SHA256: b8f8e99343202ee21912069ab0ef9a402dae8f7a0df1512bf57958a3ba6d59cb
Uploaded: 2024-04-17 20:14:00 +0000
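A downloaded copy of the wheel can be checked against the size and SHA256 listed above. The following is a minimal sketch using only the standard library; the function name `matches_listing` is hypothetical and not part of text-dedup.

```python
import hashlib

# Values copied from the Wheel Details listing above.
EXPECTED_SIZE = 48441
EXPECTED_SHA256 = "b8f8e99343202ee21912069ab0ef9a402dae8f7a0df1512bf57958a3ba6d59cb"


def matches_listing(data, expected_size=EXPECTED_SIZE, expected_sha256=EXPECTED_SHA256):
    """Return True if `data` (the raw file bytes) has the expected size and SHA256.

    `matches_listing` is a hypothetical helper name chosen for this sketch.
    """
    if len(data) != expected_size:
        return False
    return hashlib.sha256(data).hexdigest() == expected_sha256
```

To check a local file, read it in binary mode first, for example `matches_listing(open("text_dedup-0.4.0-py3-none-any.whl", "rb").read())`. SHA256 is used here rather than MD5 because MD5 is not collision resistant.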

dist-info

METADATA

Metadata-Version: 2.1
Name: text-dedup
Version: 0.4.0
Author: Chenghao Mou
Author-Email: mouchenghao[at]gmail.com
License: Apache 2.0
Classifier: License :: Other/Proprietary License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Python: >=3.10,<4.0
Requires-Dist: bitarray (>=2.6.2)
Requires-Dist: click (<9.0.0,>=8.1.7)
Requires-Dist: click-option-group (<0.6.0,>=0.5.6)
Requires-Dist: datasets (>=2.17.0)
Requires-Dist: fire (<0.7.0,>=0.6.0)
Requires-Dist: ftfy (>=6.1.1)
Requires-Dist: numpy (>=1.26.4)
Requires-Dist: psutil (>=5.9.8)
Requires-Dist: pybloom-live (>=4.0.0)
Requires-Dist: pyspark (>=3.3.1)
Requires-Dist: regex (>=2023.5.5)
Requires-Dist: rich (<14.0.0,>=13.7.1)
Requires-Dist: scipy (>=1.10.1)
Requires-Dist: setuptools (>=69.1.0)
Requires-Dist: sphinxcontrib-bibtex (>=2.5.0)
Requires-Dist: tqdm (>=4.64.1)
Requires-Dist: unisim (<0.0.2,>=0.0.1)
Requires-Dist: urllib3 (<=2.0)
Requires-Dist: xxhash (>=3.0.0)
Requires-Dist: zstandard (>=0.21.0)
Description-Content-Type: text/markdown
[Description omitted; length: 17899 characters]
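METADATA uses an email-style header format, so the fields above can be read with the standard library's `email` parser. This is a sketch over a few lines copied from the listing; `read_metadata` and `SAMPLE` are hypothetical names, and the full file contains more fields than the sample shows.

```python
from email.parser import Parser

# A few header lines copied from the METADATA listing above.
SAMPLE = """\
Metadata-Version: 2.1
Name: text-dedup
Version: 0.4.0
Requires-Python: >=3.10,<4.0
Requires-Dist: bitarray (>=2.6.2)
Requires-Dist: click (<9.0.0,>=8.1.7)
Requires-Dist: xxhash (>=3.0.0)
"""


def read_metadata(text):
    """Parse METADATA text into a small dict (hypothetical helper)."""
    msg = Parser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist may appear many times, so collect every occurrence.
        "requires_dist": msg.get_all("Requires-Dist") or [],
    }


meta = read_metadata(SAMPLE)
```

Repeated fields such as `Requires-Dist` and `Classifier` need `get_all`, since indexing a message by header name returns only the first match.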

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.0
Root-Is-Purelib: true
Tag: py3-none-any
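The `Tag` value is the same python-abi-platform triple that ends the wheel filename. As an illustration, the filename can be split into its components as sketched below; `parse_wheel_filename` is a hypothetical helper, and the sketch assumes the standard wheel naming layout with an optional build tag.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its named components (hypothetical helper)."""
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel filename: " + filename)
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError("unexpected number of components: " + filename)
    return {
        "name": name,
        "version": version,
        "build": build,
        "tag": "-".join([python_tag, abi_tag, platform_tag]),
    }


wheel_info = parse_wheel_filename("text_dedup-0.4.0-py3-none-any.whl")
```

For this wheel the resulting tag is `py3-none-any`: any Python 3 interpreter, no ABI requirement, any platform, which is consistent with `Root-Is-Purelib: true`.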

RECORD

Path Digest Size
text_dedup/__init__.py sha256=kJYnpIwatWmXK9bDNvSHpPnsv_Ck92vgZkMnzx1EhBk 357
text_dedup/ann_unisim.py sha256=HoFhTuKv6s6YxtCHXGXrkblqjGt8pAo0msHgssDUO94 7037
text_dedup/bloom_filter.py sha256=z5112NvTosEWk1K6AeeXXtZvxA9llWB0nc0bO8OlPw0 2864
text_dedup/ccnet.py sha256=y3LqauTf3AjnYoFCnp_moVXy3ZpDC-WWeV0PI0PQtVU 6172
text_dedup/exact_hash.py sha256=FNhDikFGAis1l7f0E-Q_P33zA2tcLuEeJABFNq3VkVY 3327
text_dedup/minhash.py sha256=Z69qsjLw2Pa1vrSK_AA09RYgthX6o5LicYAuUbw7KrE 12325
text_dedup/minhash_spark.py sha256=jDEhBdsN4qnccNHl_8TCKzNL4xC3T5yqgIXET3M01FY 16774
text_dedup/simhash.py sha256=dNUJ41rsKAjs2zG_BN8Bjd1bW_LnUbU8vG0pYa8QrGs 14267
text_dedup/suffix_array.py sha256=3cCPzLk3ulYvSLn7JP7M9wPcY02DSNxZyUuOpOGg1jI 12192
text_dedup/utils/__init__.py sha256=zSxs1Rs3pWRVuTlHJ0iXa1xpLMFwYT96btMx-E3liaE 2400
text_dedup/utils/analysis.py sha256=EsJS5vnUWAoQzJ-hdxjQCKKIjwgSQoc45nYXsKtb2oU 3229
text_dedup/utils/args.py sha256=RonSV3WZo1XqnL5nyEC0gUFKHh051q3uG0OPUqcgNk8 13017
text_dedup/utils/const.py sha256=nayve1kvy5zVZtUViQPShjQJMkBEgk0KWoCfM5wtQus 58
text_dedup/utils/ftfy_utils.py sha256=CjOkfkljX6r87JVTYkAXJF_78lMteEGekItaRu-h6ck 233
text_dedup/utils/hashfunc.py sha256=vp01Y99Q4NPOt8Zsf-vjHnH_Vs3TJF43HajfGDY0Gjk 6344
text_dedup/utils/inspect.py sha256=TgIAL8OLq2LSrnis8X8atM0n0w3P8LQQtjEdBALszHM 753
text_dedup/utils/load.py sha256=UQw_USrDFjIL2e6d7qOvNm1jzNpjO-P5IF8wFEVJeX4 1368
text_dedup/utils/memory.py sha256=GnbDz1X3puuD-tPSsfIr_rZq7L4mMFUf_v8a9OY2PhI 380
text_dedup/utils/preprocess.py sha256=EBos7nzoNG2oaiUDGtXp73ZpVK_IVy9tC3kRPeewKgM 1439
text_dedup/utils/timer.py sha256=8nZQC8Ju2ypsOpsE69jKKY5smFhsBuKqfMO_vAXkTso 1733
text_dedup/utils/tokenization.py sha256=EdPtz6YdRJjR8_9UnnQBKb7ofSInpFlIQTrbOX0Pskc 1193
text_dedup/utils/union_find.py sha256=anNgbtePEU7MntV5e7mFF6ry_RzVd5nq4lNm7-8cL0M 2903
text_dedup-0.4.0.dist-info/LICENSE sha256=z8d0m5b2O9McPEK1xHG_dWgUBT6EfBDz6wA0F7xSPTA 11358
text_dedup-0.4.0.dist-info/METADATA sha256=3Z3j4dz6O7J-UEYVPmYnt-gWwpy1eXQcFzH1I6IOd5k 19089
text_dedup-0.4.0.dist-info/WHEEL sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg 88
text_dedup-0.4.0.dist-info/RECORD
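Each digest in RECORD is the SHA256 of the file, encoded as URL-safe base64 with the trailing `=` padding removed. The last row, for RECORD itself, carries no digest or size because the file cannot contain its own hash. The sketch below shows how a digest in this form could be recomputed for a file extracted from the wheel; `record_digest` is a hypothetical name.

```python
import base64
import hashlib


def record_digest(data):
    """Return the RECORD-style digest string for raw file bytes.

    `record_digest` is a hypothetical helper name chosen for this sketch.
    """
    raw = hashlib.sha256(data).digest()
    # URL-safe base64 alphabet, with the "=" padding stripped.
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded
```

Comparing `record_digest(contents)` and `len(contents)` against the Digest and Size columns for a given path confirms that the extracted file matches what the wheel recorded.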