galactic-ai

View on PyPIReverse Dependencies (0)

0.2.16 galactic_ai-0.2.16-py3-none-any.whl

Wheel Details

Project: galactic-ai
Version: 0.2.16
Filename: galactic_ai-0.2.16-py3-none-any.whl
Download: [link]
Size: 62470
MD5: 474d97451d5c3ae6c88c30c650d7c711
SHA256: c5089811657307a7ea7fbc6f60991cb291b370c52bbd11d09f77cba33534c9fc
Uploaded: 2023-10-14 23:57:13 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: galactic-ai
Version: 0.2.16
Summary: Curate, annotate, and clean massive unstructured text datasets for machine learning and AI systems.
Author-Email: Benjamin Anderson <ben[at]trytaylor.ai>
License: Apache 2.0
Requires-Python: >=3.7
Requires-Dist: colorama
Requires-Dist: datasets
Requires-Dist: ctranslate2
Requires-Dist: onnxruntime
Requires-Dist: transformers
Requires-Dist: huggingface-hub
Requires-Dist: numpy
Requires-Dist: pandas
Requires-Dist: tqdm
Requires-Dist: pybloom-live
Requires-Dist: networkx
Requires-Dist: scikit-learn (>=1.3.1)
Requires-Dist: scrubadub
Requires-Dist: fasttext
Requires-Dist: pyarrow
Requires-Dist: openai
Requires-Dist: tiktoken
Requires-Dist: jinja2
Requires-Dist: sentencepiece
Requires-Dist: aiohttp
Requires-Dist: protobuf (==3.20.2)
Requires-Dist: joblib
Requires-Dist: datasketch
Requires-Dist: nest-asyncio
Requires-Dist: umap-learn
Requires-Dist: matplotlib
Requires-Dist: seaborn
Requires-Dist: trafilatura
Requires-Dist: requests
Requires-Dist: readabilipy
Requires-Dist: boilerpy3
Requires-Dist: goose3
Requires-Dist: selenium
Requires-Dist: webdriver-manager
Requires-Dist: torch
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 10958 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.41.2)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
galactic/__init__.py sha256=1KWGV0Pboc4-BNzSMmuZK1QU9I2KAQGAY3rZF_w4Szo 38
galactic/async_openai.py sha256=LGLhpSlTQ1JZGkA04CRAY1L4Hq8trHYdDKyUUAcVACY 13621
galactic/augment.py sha256=WuP05S3bPa3k2xx8f8gdjCU_LBv7RbZ8Ca7KXOn-VWY 103
galactic/base.py sha256=Z5esKtG2scF_D-pZwwxDdA7RR4uVNrpOh7yuToGvx_M 12514
galactic/classifiers.py sha256=uNhAtaBxYvOvsJ-LiILJHS8_epMXt3byk3-7ILSUSIc 15858
galactic/clustering.py sha256=Qq-qQMjNWt1G87ddwNjLZerLFfN3rdkSmlUbQKdUavE 23959
galactic/conversations.py sha256=xRnn0Vmvo6bR5drJCAvbISylmWXVxL18dLJLTjg0De0 12027
galactic/embedding.py sha256=BNu6WyLE9mon4J48QlRjuYfTsYa97fpgZL9nvw2gqTY 8785
galactic/extract_doc.py sha256=6MERQNgtSEu0wpS5Ib5EHzfFJT1o6zHzz4Y_9TTeb1g 1597
galactic/filters.py sha256=POnfrwonrJLyvmEKIItZ8WE044NS0pQJrZIdSei_97s 5990
galactic/galactic.py sha256=pkeNmjiUqVdXjFCAP2TrOf8c8BdanwBYd0g-ARfTf-g 7090
galactic/kenlm.py sha256=ZHEZ9DO2hDWB1Vg1HWMKHEHJPP9AaYOj3Y2nh51oWDU 5224
galactic/loaders.py sha256=0irnkfVGfr2OroPk5KfSRXWm-eQTFnDzyKVlHb9n6FI 7236
galactic/logger.py sha256=3ZLsmucUt0U5-jyAYBQ9bleSpOYcz8aOCKGCw5qhkZs 1541
galactic/minhash_lsh.py sha256=820m5RYGYW9E3Vuz7CN1nnv26J_PRK_NO9hO8Dbp_h8 1328
galactic/scraping.py sha256=Ovf0AMm5BmgQkik9eQ38EjwPt_TZZi4Qc-LsaUOTBmY 4007
galactic/taggers.py sha256=FL5Bmiqee4ES8urGaDookNXKp458nI-GU8iY8oxIdCo 13384
galactic/transforms.py sha256=HoAJ8Eand3_UpBfWkK5egfGMGEkwT7TszCIM21zZU7E 6745
galactic/utils.py sha256=qkjrpq3P-UlR9cU_-kKP_QRKtzKeFZ5PZ2Peusgyfm4 1803
galactic/visualize.py sha256=3lPar2rYG_FQKRnMU8FVCrfYGIPXqOuBnhNZjk6TCRY 2142
galactic/embedding_backends/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
galactic/embedding_backends/base.py sha256=vOaOh2EizC8N7NIT7AZfihr9t9b7Ge8f2gGIZAyHkXo 5920
galactic/embedding_backends/ctranslate2_backend.py sha256=gtrW_qi8TaTJy6acUeF843u_fnyp664TO3KTHZEuqnY 5284
galactic/embedding_backends/modal_backend.py sha256=wmYx3PsN_oDutyE0d_gUqD8wLYUIqyrsr7zSjPM7c2c 1369
galactic/embedding_backends/modal_remote.py sha256=8-aWrRR7NHJfDMHVlxWAtpPia5hGrZ7IyZm-GMKvO9M 2624
galactic/embedding_backends/onnx_backend.py sha256=GjItj9M_6CC05f2KxsDBztcfS89XttWp1PbJqSMNWAs 4144
galactic/embedding_backends/openai_backend.py sha256=BOPeU7HPFhyq2a9i7X3IwrA7dZeoAkL2sA5qUqdn-bQ 2005
galactic/embedding_backends/replicate_backend.py sha256=sTOox9A3R32cCNBRC5qJ6U682crmo4JCk-h3n-xq8RQ 12545
galactic_ai-0.2.16.dist-info/LICENSE sha256=h4edwPFk0e9eBkcR-y4GAHRVfBkuVxZrskMlGm8CBgc 9152
galactic_ai-0.2.16.dist-info/METADATA sha256=34gB-O9rcGkfEQSqH2CFFiNMb3xtqu3gGU8II3eXBsQ 12175
galactic_ai-0.2.16.dist-info/WHEEL sha256=yQN5g4mg4AybRjkgi-9yy4iQEFibGQmlz78Pik5Or-A 92
galactic_ai-0.2.16.dist-info/top_level.txt sha256=TtPBMxJH7tlXJu6fvtm8U14hxkuHuKaTBQsydHmPfpc 9
galactic_ai-0.2.16.dist-info/RECORD

top_level.txt

galactic