cllm-data-curation

View on PyPIReverse Dependencies (0)

0.1.2 cllm_data_curation-0.1.2-py3-none-any.whl

Wheel Details

Project: cllm-data-curation
Version: 0.1.2
Filename: cllm_data_curation-0.1.2-py3-none-any.whl
Download: [link]
Size: 19510
MD5: 8c77c6079fcc0e80b94948bc41470e5d
SHA256: 5a3c9b464e038775344c7621d7ae390b9affa517bf85e299436cedd93b41f927
Uploaded: 2023-04-08 23:49:20 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: cllm-data-curation
Version: 0.1.2
Summary: A package to visualize tokenization of text using HTML
Author: Darien Schettler
Author-Email: ds08tf[at]gmail.com
Home-Page: https://github.com/ds08tf/cllm-data-curation
Classifier: Development Status :: 3 - Alpha
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Requires-Python: >=3.7
Requires-Dist: huggingface-hub
Requires-Dist: transformers
Requires-Dist: datasets
Requires-Dist: requests
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: tqdm
Requires-Dist: chardet
Requires-Dist: python-magic
Description-Content-Type: text/markdown
[Description omitted; length: 177 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.35.1)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
cllm_data_curation/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
cllm_data_curation/parallel_dl/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
cllm_data_curation/parallel_dl/multiprocessing_utils.py sha256=DdIbAVJM_5zk5I-aQpD2GIgetn0vUgDpZ0x02TLEx4s 2023
cllm_data_curation/parallel_dl/other_utils.py sha256=0qhUX2LrUYjt9WsQ3P_H36Q32LbOsG0IC23I9_FLFag 744
cllm_data_curation/parallel_dl/parallel_dl.py sha256=gtFMhWXUfq1aJjx90edlA51BfssUP4SjpaXJ9axpmb0 1768
cllm_data_curation/parallel_dl/preprocessing_utils.py sha256=G6WmZ5wHHnJyOaeBRQHfKqZKw1NMuaa73AXnXALLN7U 3284
cllm_data_curation/parallel_dl/processing_utils.py sha256=3E3XvVyhBONFD0rXMDzMqBgAteMXCZGvi1GUkWr77Qs 7872
cllm_data_curation/thestack_curation/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
cllm_data_curation/thestack_curation/curation_configs.py sha256=Wx2rCAAHsv3SkHxO92GJt_w47dhI0iZd5CE9KNEBhbk 2736
cllm_data_curation/thestack_curation/curation_utils.py sha256=eG79OBSxhsmmAnTfg3FaCp7RZM481GX3TE_1IHka6NQ 16656
cllm_data_curation/thestack_curation/download.py sha256=Is5RS1wZCsJaxmdtL45icgpJjUoi1Oq_jEvy5RwHNBA 4123
cllm_data_curation/thestack_curation/download_utils.py sha256=72oPrk5-0a6BK-gIGfctezfPT5I1Yv3cFsocU8dq8a4 6188
cllm_data_curation/thestack_curation/general_utils.py sha256=fFrqvSAtl_uwa7plHPWSoK_Oo6UPxi9uTzYqQ6-oLog 2492
cllm_data_curation-0.1.2.dist-info/METADATA sha256=ZSu59C4yaoS93bP503emAOBC8rnFCNZ_z4BggPJyOOU 1037
cllm_data_curation-0.1.2.dist-info/WHEEL sha256=EVRjI69F5qVjm_YgqcTXPnTAv3BfSUr0WVAHuSP3Xoo 92
cllm_data_curation-0.1.2.dist-info/top_level.txt sha256=Cr3OdQA1TboKEP7YjRt_K1ROZ9KWq2y5yxXFW04WnUY 19
cllm_data_curation-0.1.2.dist-info/RECORD

top_level.txt

cllm_data_curation