pdf-language-detector

View on PyPIReverse Dependencies (0)

0.0.11 pdf_language_detector-0.0.11-py3-none-any.whl

Wheel Details

Project: pdf-language-detector
Version: 0.0.11
Filename: pdf_language_detector-0.0.11-py3-none-any.whl
Download: [link]
Size: 9540
MD5: e675bd001687ca977126d8589affa49c
SHA256: 4c624a4fd8664a8e856a39bca1864b55b790111e6a17c023ca6b033bfeacd5fd
Uploaded: 2023-06-20 10:22:01 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: pdf-language-detector
Version: 0.0.11
Summary: A python script to iterate over a list of PDF in a directory and try to guess their language with Tesseract OCR.
Author: ICIJ
Author-Email: engineering[at]icij.org
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Requires-Python: >=3.8.1,<4.0.0
Requires-Dist: langcodes[data] (<4.0.0,>=3.3.0)
Requires-Dist: lingua-language-detector (<2.0.0,>=1.3.2)
Requires-Dist: pytesseract (<0.4.0,>=0.3.10)
Requires-Dist: rich (<14.0.0,>=13.4.2)
Requires-Dist: sh (<3.0.0,>=2.0.4)
Requires-Dist: spytula (==0.0.3)
Requires-Dist: typer[all] (<0.10.0,>=0.9.0)
Description-Content-Type: text/markdown
[Description omitted; length: 3088 characters]

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.4.0
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
src/__init__.py sha256=0Qcfm1E_qN8O8KaiFCXi2VhSpdSysNTlT2BM3g0PwKY 24
src/cli.py sha256=93pcQ6LPx7PReQSAbZlUzfX5FRDWtNbdKmhbSdAbJTs 3982
src/pld.py sha256=6GMPoEy_KfOoV5gHso1SPBGtiep4UDNDzuABWvnnu7s 13198
src/report.py sha256=887Ef7tK61ypVQ7aIDBHpGL-LMDOYIQ9TKhErcI2_pY 7130
pdf_language_detector-0.0.11.dist-info/METADATA sha256=xHg6Xb9nexe3JRLlI1moyssS7YALR3syonm_I4qPqqU 3898
pdf_language_detector-0.0.11.dist-info/entry_points.txt sha256=eyeFvN8tb1DQB6DPUXpIE7iPz_LTLT5b6ErIGshft70 69
pdf_language_detector-0.0.11.dist-info/WHEEL sha256=vVCvjcmxuUltf8cYhJ0sJMRDLr1XsPuxEId8YDzbyCY 88
pdf_language_detector-0.0.11.dist-info/RECORD

entry_points.txt

pdf-language-detector = src.cli:app
pld = src.cli:app