nougat-ocr

View on PyPIReverse Dependencies (5)

0.1.17 nougat_ocr-0.1.17-py3-none-any.whl

Wheel Details

Project: nougat-ocr
Version: 0.1.17
Filename: nougat_ocr-0.1.17-py3-none-any.whl
Download: [link]
Size: 82497
MD5: b1bd9a7b1b40a768db422a946cbd6ea1
SHA256: f776732c716250972c7de11a47b36e94fa48e271d67045a427f19f12eeeef118
Uploaded: 2023-10-04 09:29:52 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: nougat-ocr
Version: 0.1.17
Summary: Nougat: Neural Optical Understanding for Academic Documents
Author: Lukas Blecher
Author-Email: lblecher[at]meta.com
Home-Page: https://github.com/facebookresearch/nougat
License: MIT
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7
Requires-Dist: transformers (>=4.25.1)
Requires-Dist: timm (==0.5.4)
Requires-Dist: orjson
Requires-Dist: opencv-python-headless
Requires-Dist: datasets[vision]
Requires-Dist: lightning (<2022,>=2.0.0)
Requires-Dist: nltk
Requires-Dist: python-Levenshtein
Requires-Dist: sentencepiece
Requires-Dist: sconf (>=0.2.3)
Requires-Dist: albumentations (>=1.0.0)
Requires-Dist: pypdf (>=3.1.0)
Requires-Dist: pypdfium2
Requires-Dist: fastapi; extra == "api"
Requires-Dist: uvicorn[standard]; extra == "api"
Requires-Dist: python-multipart; extra == "api"
Requires-Dist: pytesseract; extra == "dataset"
Requires-Dist: beautifulsoup4; extra == "dataset"
Requires-Dist: scikit-learn; extra == "dataset"
Requires-Dist: Pebble; extra == "dataset"
Requires-Dist: pylatexenc; extra == "dataset"
Requires-Dist: fuzzysearch; extra == "dataset"
Requires-Dist: unidecode; extra == "dataset"
Requires-Dist: htmlmin; extra == "dataset"
Requires-Dist: pdfminer.six (>=20221105); extra == "dataset"
Provides-Extra: api
Provides-Extra: dataset
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: LICENSE-MODEL.md
License-File: NOTICE
[Description omitted; length: 7995 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.41.2)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
app.py sha256=TX-o3KSRe57j09KEl_OXCWmkZ1H6jz2ZKYhAmYFjW_0 5387
predict.py sha256=Bkhto1ymJazCBDuGCDrInpUbPWw1CmLMpUc5xaogsH0 7650
test.py sha256=FtlahA9uXO_tbVz5jF6eq4d1WvPpkRCK_wyx9o67Awc 3924
train.py sha256=FAYNYYmf6AjyInsDZTkfSzC0Mdt2HyveMglPzjix-Gk 7664
nougat/__init__.py sha256=tRqVzmpthVwEIJ4rR3x39zudNzdemmE_6PIY7xni9Cg 311
nougat/_version.py sha256=3ok2xmBZfyCsRJ2dV_OlRr2yyT_GFlfmwkBSYZ25K-Q 212
nougat/metrics.py sha256=p05pRRvOD4keE4uVBfxV7bh8RL_QNm1UzQIxDgpBaWg 3961
nougat/model.py sha256=sQBa-nyJ537ZW7CsLnUQHVgbPLjFMeU0NhrJkFzpQig 26737
nougat/postprocessing.py sha256=bJE3q8zzb3Sm61yMlG6r5gBpuUlceWTilGYGJ8gdJX4 17307
nougat/transforms.py sha256=7YblXAWDzuY3SeQS-YXL6_uf5twfjV48tdH31y4Ap0Y 6159
nougat/dataset/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
nougat/dataset/create_index.py sha256=9-cxCCOMlyV9m6UcLbh4Hr8YuaXzHro7DCck67zoYwc 5655
nougat/dataset/gen_seek.py sha256=X5AJC6WgHHfTucedxLURTvV7rQgwGebFb6yOypUfRfM 1015
nougat/dataset/pdffigures.py sha256=4iLjZT7N7GSkriSvT6DulURPG9oad3YzXqTr7zcyY5U 2371
nougat/dataset/rasterize.py sha256=G0gL3pZyASrcdFgQZnM6PJ8TcPT2_3yXaFeTTpSRk70 2887
nougat/dataset/split_htmls_to_pages.py sha256=uzH1_fXV6j9ltMcBOFZklYmg3roffyufUVh9k6H7fj4 7936
nougat/dataset/split_md_to_pages.py sha256=EcNKtdhCNrpLLzwEpgdxaT3fKuMcDbkyhyBSHzx0nk8 17817
nougat/dataset/splitter.py sha256=O_jNIu5HO1pAp8IaYsjUOMYfCDfK6hnAXsh4XmjcQEc 15157
nougat/dataset/staircase.py sha256=-JknOn0FQPrzXpsjoSDlD6Mtia1zj3N60nWH9OVBGNs 10800
nougat/dataset/parser/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
nougat/dataset/parser/document.py sha256=3TUnm2MxWxCaYGuxGtlu0vu7R2vEyFK-mCgfFQx5JNw 20486
nougat/dataset/parser/html2md.py sha256=1VSTccGoLzcG3ehONTcLAd2uaU_nrHuWkkF0aWy1t9g 2287
nougat/dataset/parser/latexml_parser.py sha256=oMTzDsPM8l2OjIJh_HzRYyEIS8xdIYEwSq3ZS1mTxU8 18798
nougat/dataset/parser/markdown.py sha256=qsAo06mMBhPCDFVNEFBkD2IovabLpx6ZcX-FcS629W4 15776
nougat/dataset/utils/__init__.py sha256=vOyV1-tdu_jlxSo2q5wTf2JrCA2JjAfCKxFQNh8HkMQ 273
nougat/dataset/utils/latex_conversion.py sha256=6vy9GU7SuDBwuKRhPiraZ85-MaWeZjSKcnvwmAYpE2I 4380
nougat/dataset/utils/pdf_text_extract.py sha256=2vlAua1z4EwAxdX3bztKGUFaaThsNL3WKxy4z6UAzSA 2636
nougat/dataset/utils/utils.py sha256=dnJpoBPE69quGSB-Jo5gS-zFvgOzAJsoulyJ9RNsLhE 528
nougat/utils/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
nougat/utils/checkpoint.py sha256=1NTMePMrSDnYin0srnZfIuoQFhhumMnbt1WbQ_13ozc 4009
nougat/utils/dataset.py sha256=i87lhPKx0Erv3w8pJ1i0jH0JZqj1K7jfd9VLr4q6_MQ 9555
nougat/utils/device.py sha256=X--8MgZWvM1uuOi0oQ0CaGbf7A6hwL0CpEswF3nXrLA 1236
nougat_ocr-0.1.17.dist-info/LICENSE sha256=2m03A-0Ry-Qr0hLHJZV8mNojy_8ZmMBfpLPZdtGljpM 1088
nougat_ocr-0.1.17.dist-info/LICENSE-MODEL.md sha256=DYTN9372eML334c75X41tV8BEomQpj0iULlshU9Duns 13585
nougat_ocr-0.1.17.dist-info/METADATA sha256=Dl_EcabMdEdFB34Ci7CHTz6V5n2Lq-2jDbxkIVHL9PU 10403
nougat_ocr-0.1.17.dist-info/NOTICE sha256=nORN0GXJvm9Q7fV4Ew_5-ozmtXx8yE0AdlyYqT-64ck 9145
nougat_ocr-0.1.17.dist-info/WHEEL sha256=yQN5g4mg4AybRjkgi-9yy4iQEFibGQmlz78Pik5Or-A 92
nougat_ocr-0.1.17.dist-info/entry_points.txt sha256=Ws5XMQODYfb1XJuWFwOK27BwbEVv6ET9jSUZ8tqLVTQ 62
nougat_ocr-0.1.17.dist-info/top_level.txt sha256=6VURQaz4px8H2IAgiSZcagvooYQDQp-k-A5kN-YFdnE 30
nougat_ocr-0.1.17.dist-info/RECORD

top_level.txt

app
nougat
predict
test
train

entry_points.txt

nougat = predict:main
nougat_api = app:main