clean-text-my

View on PyPIReverse Dependencies (0)

0.1.1 clean_text_my-0.1.1-py2.py3-none-any.whl

Wheel Details

Project: clean-text-my
Version: 0.1.1
Filename: clean_text_my-0.1.1-py2.py3-none-any.whl
Download: [link]
Size: 19694
MD5: 59ec029d5602a4ad1f4e33808546bfa9
SHA256: b9cc7ce2bfc35f38a847550275575b18980dd0a3e6c083cf82a0c8d8d21c6336
Uploaded: 2023-08-25 10:34:21 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: clean-text-my
Version: 0.1.1
Author: Malaysia AI
Author-Email: malaysia.ai2020[at]gmail.com
Home-Page: https://github.com/malaysia-ai/clean_text_my
Requires-Python: >=3.6
Requires-Dist: aiohttp (==3.8.5)
Requires-Dist: aiosignal (==1.3.1)
Requires-Dist: async-timeout (==4.0.3)
Requires-Dist: attrs (==23.1.0)
Requires-Dist: certifi (==2023.7.22)
Requires-Dist: charset-normalizer (==3.2.0)
Requires-Dist: Cython (==3.0.0)
Requires-Dist: datasets (==2.14.4)
Requires-Dist: dill (==0.3.7)
Requires-Dist: filelock (==3.12.2)
Requires-Dist: frozenlist (==1.4.0)
Requires-Dist: fsspec (==2023.6.0)
Requires-Dist: huggingface-hub (==0.16.4)
Requires-Dist: idna (==3.4)
Requires-Dist: multidict (==6.0.4)
Requires-Dist: multiprocess (==0.70.15)
Requires-Dist: numpy (==1.25.2)
Requires-Dist: packaging (==23.1)
Requires-Dist: pandas (==2.0.3)
Requires-Dist: pyarrow (==13.0.0)
Requires-Dist: python-dateutil (==2.8.2)
Requires-Dist: pytz (==2023.3)
Requires-Dist: PyYAML (==6.0.1)
Requires-Dist: requests (==2.31.0)
Requires-Dist: six (==1.16.0)
Requires-Dist: tqdm (==4.66.1)
Requires-Dist: typing-extensions (==4.7.1)
Requires-Dist: tzdata (==2023.3)
Requires-Dist: urllib3 (==2.0.4)
Requires-Dist: xxhash (==3.3.0)
Requires-Dist: yarl (==1.9.2)
Requires-Dist: scipy (==1.11.2)
Requires-Dist: rich (==13.5.2)
Description-Content-Type: text/markdown
[Description omitted; length: 350 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.41.2)
Root-Is-Purelib: true
Tag: py2-none-any
Tag: py3-none-any

RECORD

Path Digest Size
clean_text_my/__init__.py sha256=_CN5C9fJHDzdAC8zNpeA6UFJfH3DCg8VYd2QO3q2anY 179
clean_text_my/deduplication.py sha256=KyTwx1lSQBUXXOS-_oXKs-RmAS9Z6gzNB0fcstdnBCs 3719
clean_text_my/download_dataset.py sha256=Jd5DiftsYUHynoaM7fumBDzG9iqOdqHbsErCh9e94As 631
clean_text_my/logging.py sha256=S3Qu35aeYpJt9orM3kUFpxomma-yHhLVM0d_Sd9LEeo 973
clean_text_my/manipulation.py sha256=CtBdME0231ZGMn6vTag2lH5QJ7zJVYUDiqX7XKYqQ-8 2528
clean_text_my/postprocessing.py sha256=2S_YTPoIILKH_nRt_iJUtz0BNKQ00RZkLDcmjO_I9aU 4114
clean_text_my/utils.py sha256=2JrQH4OjoCmbbEI6wZyKqr-VlvVnqusyrJFLu9jNywI 1127
clean_text_my/text_dedup/__init__.py sha256=vNpo7t6C_3ycPlRzHgGEc6JLyFmSpoRis7IYbIsu7VQ 240
clean_text_my/text_dedup/minhash.py sha256=Lr-n5FpnxvpBTI_EcqKY7vK7npH93lPnfmTF2o5SnBo 13281
clean_text_my/text_dedup/utils/__init__.py sha256=tqMsTWpCx14W6_68Rvids_9o82a8O_PdaXB0f64s1iY 1680
clean_text_my/text_dedup/utils/add_args.py sha256=dvecgcuZzRyTuCa8f0OS8N7begFAJQZ2XZ0xLhRsbCQ 7981
clean_text_my/text_dedup/utils/analysis.py sha256=MGZDr2tJ1yYVTXwD2flRZHI61E9ZCzn5rkBdAraCbyc 3143
clean_text_my/text_dedup/utils/hashfunc.py sha256=p2iz_mtvEsfexnT5EWBKUu1rX0eUWn4KPM0NCH1EOGs 5012
clean_text_my/text_dedup/utils/preprocess.py sha256=EPrcQ894ZB_TL6kuzZcxjKhhmx1fXY_cBwsdFnXi4ZM 809
clean_text_my/text_dedup/utils/timer.py sha256=N2Bh139jGNfCn-lDJtm3jHJ4TtABm9EuU68WdgjEK_A 1489
clean_text_my/text_dedup/utils/tokenization.py sha256=ZMuwB_NSk_gMyTOQAkkPm0Zxv-031oxZcN-q9zd85Y8 1112
clean_text_my/text_dedup/utils/union_find.py sha256=-iNlVsENozOJP3-lIWOlz_4u0gAEaJdnYGhczbtsStg 809
clean_text_my-0.1.1.dist-info/METADATA sha256=fxUYnFnACFIHTN1sXT2vtX9ticIpwkzC2JxSNpyznOI 1655
clean_text_my-0.1.1.dist-info/WHEEL sha256=iYlv5fX357PQyRT2o6tw1bN-YcKFFHKqB_LwHO5wP-g 110
clean_text_my-0.1.1.dist-info/top_level.txt sha256=coFM29PeLMPTcg-uoh29vCDpoEXY0Za_gJjfOL4nNsA 14
clean_text_my-0.1.1.dist-info/RECORD

top_level.txt

clean_text_my