curated-tokenizers

View on PyPIReverse Dependencies (2)

2.0.0 curated_tokenizers-2.0.0-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
curated_tokenizers-2.0.0-cp39-cp39-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
curated_tokenizers-2.0.0-cp39-cp39-win_amd64.whl
curated_tokenizers-2.0.0-cp39-cp39-macosx_10_9_x86_64.whl
curated_tokenizers-2.0.0-cp39-cp39-macosx_11_0_arm64.whl
curated_tokenizers-2.0.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
curated_tokenizers-2.0.0-cp312-cp312-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
curated_tokenizers-2.0.0-cp312-cp312-win_amd64.whl
curated_tokenizers-2.0.0-cp312-cp312-macosx_10_9_x86_64.whl
curated_tokenizers-2.0.0-cp312-cp312-macosx_11_0_arm64.whl
curated_tokenizers-2.0.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
curated_tokenizers-2.0.0-cp311-cp311-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
curated_tokenizers-2.0.0-cp311-cp311-win_amd64.whl
curated_tokenizers-2.0.0-cp311-cp311-macosx_10_9_x86_64.whl
curated_tokenizers-2.0.0-cp311-cp311-macosx_11_0_arm64.whl
curated_tokenizers-2.0.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
curated_tokenizers-2.0.0-cp310-cp310-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
curated_tokenizers-2.0.0-cp310-cp310-win_amd64.whl
curated_tokenizers-2.0.0-cp310-cp310-macosx_10_9_x86_64.whl
curated_tokenizers-2.0.0-cp310-cp310-macosx_11_0_arm64.whl

Wheel Details

Project: curated-tokenizers
Version: 2.0.0
Filename: curated_tokenizers-2.0.0-cp311-cp311-win_amd64.whl
Download: [link]
Size: 760906
MD5: 62997a2d6da014ee01236faeeefc63d8
SHA256: 6d56eb806a791b5818cbccdf22fffe39c07eff2f2dd1a32b94f11792c646d343
Uploaded: 2024-04-15 17:19:09 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: curated-tokenizers
Version: 2.0.0
Summary: Lightweight piece tokenization library
Author: Explosion
Author-Email: contact[at]explosion.ai
Home-Page: https://github.com/explosion/curated-tokenizers
License: MIT
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Programming Language :: Cython
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering
Requires-Python: >=3.9
Requires-Dist: regex (>=2022)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 927 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.43.0)
Root-Is-Purelib: false
Tag: cp311-cp311-win_amd64

RECORD

Path Digest Size
curated_tokenizers/__init__.py sha256=I5Ylm5wnIZo2FyIk8nSTokNOctBmx37gM1Tw-bdciZ8 123
curated_tokenizers/_bbpe.cp311-win_amd64.pyd sha256=2Iotl1gPoGi_2ldQxVbTVIst6_dku8GveLGfHAwd-4A 125440
curated_tokenizers/_bbpe.cpp sha256=zZot78lYKU0nI_Xg-Q6hgs1xAFTTIA0hcfrBqDSjjis 692324
curated_tokenizers/_bbpe.pxd sha256=n-uYe4FpcCtPS_w2aAZpjMwPkPY86BbFN3vr2BSu-s0 359
curated_tokenizers/_bbpe.pyx sha256=2BwY8sc0orYL494vLtfcLf4WD6g1bT5BFLPU_vl2BD0 7750
curated_tokenizers/_spp.cp311-win_amd64.pyd sha256=rJ8p61f_FAZKOf54eKGGE0Bls9Rwc_NqL8EomBkD4DY 459776
curated_tokenizers/_spp.cpp sha256=azu6sD2-jbzR9QcUtzhGNYs_5uTj_0G13VTEfuNd00s 572709
curated_tokenizers/_spp.pxd sha256=c1v3vDwZw9Y_PmxlGc9owdbn3QGvXYDlldyQnVwmd1Q 2210
curated_tokenizers/_spp.pyx sha256=JIPO2oAqmoHY4nVAuVFnFC9rAme8UneuYdIrFD_US-I 8559
curated_tokenizers/_util.py sha256=mdFbESbeyFArRiwMaLyvhXxtSmgCEZCerZJiMPyxcbs 1066
curated_tokenizers/_wordpiece.cp311-win_amd64.pyd sha256=LEgJsrQh0yYkB7pxO3U9uDdPIfxoZ1fK5wcAMv4gCSc 97280
curated_tokenizers/_wordpiece.cpp sha256=1Ru4NmG5jbNWLWpdQ0XkOpaoKpzMG4W-a9xSFZ-ieS4 546522
curated_tokenizers/_wordpiece.pxd sha256=vUvF1Pd5JqxkYH0aE58WP631ZyRQKismljBxZudhPZ0 678
curated_tokenizers/_wordpiece.pyx sha256=AxB5hHAJaIpKZz_ddndYGixp3OSLh1XL8n5hdQSw2vk 5402
curated_tokenizers/config.h sha256=oqYLY79H-0BDs6gb4lvIlkHpNErL0G-O-v8ogUnO7WQ 165
curated_tokenizers/merges.cc sha256=wBqBr10AvNu7eUoMB5dfq3B9dx1mKYlzAwIXCU8-Mk8 2224
curated_tokenizers/merges.hh sha256=qQJcNB_uhwMGdBhDCPNOwxI7DcZoH4RXJNMoUwczOR0 1348
curated_tokenizers/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
curated_tokenizers/types.py sha256=gwTV4zxFXRynBw2zr_YqUmL1IegZaJQT-GRq0HW6Ppw 163
curated_tokenizers/util.hh sha256=NvGqvq5yxhOSoI6sDIbeNDWF3mrIRbOEg56ESg3gxlk 346
curated_tokenizers/wordpiece.cc sha256=Cun1biGOuh9YaSm-wAJOjs57ibvetlZS2Ml_55vMFcM 1120
curated_tokenizers/wordpiece.hh sha256=fpjoC1eYHf9l1U_owkOJguCau_EWRJhIRqDaauQCIqg 1051
curated_tokenizers/tests/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
curated_tokenizers/tests/compat.py sha256=eR-boC8Jqj192vm6U9VauQxsLEskHUgufz50dQxJGPU 166
curated_tokenizers/tests/conftest.py sha256=9R5OZoQ7LoVyTQ4NQ2gapAMzwm_TAf89YaEJu8WhPQo 1144
curated_tokenizers/tests/incorrect-merges.txt sha256=fnsVuf9z1j0xufXD-pk7Co_84FllzO7nFOnVCJrwcYY 20
curated_tokenizers/tests/robbert-merges-1000.txt sha256=2rMsEPdaED-CmBtoGTO0cnqt-vJeZyesKFg86aWu6t8 7292
curated_tokenizers/tests/robbert-vocab-1000.json sha256=sTR5uy1sMPqy8H-Uz1UuLcR4eBDMx-uz-BiOD4dHthM 17902
curated_tokenizers/tests/test_bbpe_processor.py sha256=qgndvj4YhD6drvD20OJjdMnJH2Gcxwdt_P2Mm3hUkGs 3997
curated_tokenizers/tests/test_sp.py sha256=HgbAdde3bSSnyqaok0cIfJqT5FNJo5lejGvi9Uxtb7E 4602
curated_tokenizers/tests/test_word_piece_processor.py sha256=dcnKNTG582bwnszOhy64sf6Bksq2FZwu4kVYvMK8jk4 2570
curated_tokenizers/tests/toy-word-pieces.txt sha256=sj8CZR_4F20yURvo1IfUmJDz-hAk4g2yeLjXefzcKQg 35
curated_tokenizers/tests/toy.model sha256=aOgs9MnD7wvh8lp8riyDb8r6fTyDgh-5hkEeVnea0eI 253270
curated_tokenizers/tests/troonrede.txt sha256=1IZIbPoCW9E0fGh6BRj3nphpGJFnFQs9LLMR8sFjgK4 69981
curated_tokenizers-2.0.0.dist-info/LICENSE sha256=tTKVA-2oaLtu5MKsNMDQUjVhbPAa0z5X6VB_W06KS7s 2693
curated_tokenizers-2.0.0.dist-info/METADATA sha256=2rH4HKf4M9P5x-Gkz0znw1Nf4xpT0ciw3PT-LGuJEk8 1968
curated_tokenizers-2.0.0.dist-info/WHEEL sha256=nSybvzWlmdJnHiUQSY-d7V1ycwEVUTqXiTvr2eshg44 102
curated_tokenizers-2.0.0.dist-info/top_level.txt sha256=EKBDsRk94HtBI7QQVDtt8uMqpwI--el9tTFO5T99Yjw 19
curated_tokenizers-2.0.0.dist-info/zip-safe sha256=frcCV1k9oG9oKj3dpUqdJg1PxRT2RSN_XKdLCPjaYaY 2
curated_tokenizers-2.0.0.dist-info/RECORD

top_level.txt

curated_tokenizers

zip-safe