pdf-struct


0.3.4 pdf_struct-0.3.4-py3-none-any.whl

Wheel Details

Project: pdf-struct
Version: 0.3.4
Filename: pdf_struct-0.3.4-py3-none-any.whl
Size: 63255 bytes
MD5: ce56b6cf9dfff94b9528b3bb0c64d46b
SHA256: e91eb0023bd0864e962f6318b4a88a1c4afe07d0d74161b26abc12a8756b9310
Uploaded: 2022-08-20 01:16:10 +0000
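
The size and SHA256 values listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch, not part of pdf-struct itself; the function names `file_sha256` and `verify_wheel` are hypothetical, and the local file path is whatever location the wheel was saved to.

```python
import hashlib
from pathlib import Path

# Values copied from the wheel details listed above.
EXPECTED_SHA256 = "e91eb0023bd0864e962f6318b4a88a1c4afe07d0d74161b26abc12a8756b9310"
EXPECTED_SIZE = 63255  # bytes


def file_sha256(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def verify_wheel(path, expected_sha256=EXPECTED_SHA256, expected_size=EXPECTED_SIZE):
    """Return True only if both the file size and the SHA-256 digest match."""
    p = Path(path)
    return p.stat().st_size == expected_size and file_sha256(p) == expected_sha256
```

For example, `verify_wheel("pdf_struct-0.3.4-py3-none-any.whl")` would be expected to return `True` for an intact download, assuming the file sits in the current directory.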

dist-info

METADATA

Metadata-Version: 2.1
Name: pdf-struct
Version: 0.3.4
Summary: Logical structure analysis of visually structured documents.
Author: Yuta Koreeda
Author-Email: yuta.koreeda[at]hal.hitachi.com
Maintainer: Yuta Koreeda
Maintainer-Email: yuta.koreeda[at]hal.hitachi.com
Home-Page: https://github.com/stanfordnlp/pdf-struct
License: Apache
Classifier: Environment :: Console
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3.8
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Requires-Dist: click (==8.1.3)
Requires-Dist: numpy (==1.23.1)
Requires-Dist: pdfminer.six (==20220524)
Requires-Dist: regex (==2022.7.25)
Requires-Dist: torch (==1.9.0)
Requires-Dist: tqdm (==4.48.0)
Requires-Dist: transformers (==4.9.1)
Requires-Dist: scikit-learn (==1.1.2)
Requires-Dist: joblib (==1.0.0)
Requires-Dist: editdistance (==0.5.3)
Requires-Dist: beautifulsoup4 (==4.11.1)
Requires-Dist: sentencepiece (==0.1.96)
Requires-Dist: wheel
Requires-Dist: twine
Description-Content-Type: text/markdown
[Description omitted; length: 11293 characters]
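
METADATA uses an email-header style layout, so the standard library's email parser can read it. The sketch below collects the `Requires-Dist` entries and extracts exact pins written in the parenthesized form shown above, such as `click (==8.1.3)`. The function name `pinned_requirements` is hypothetical, and the regular expression only handles a bare name or a single `==` pin; it is not a full requirement-specifier parser.

```python
import re
from email.parser import Parser

# Matches "name" or "name (==version)"; anything more complex is skipped.
_PIN = re.compile(r"\s*([A-Za-z0-9._-]+)\s*(?:\(\s*==\s*([^)\s]+)\s*\))?\s*")


def pinned_requirements(metadata_text):
    """Map each Requires-Dist name to its pinned version, or None if unpinned."""
    msg = Parser().parsestr(metadata_text)
    pins = {}
    for req in msg.get_all("Requires-Dist") or []:
        m = _PIN.fullmatch(req)
        if m:
            pins[m.group(1)] = m.group(2)
    return pins
```

Applied to the metadata above, this would map `torch` to `1.9.0` and leave `wheel` and `twine` as `None`, since those two carry no version constraint.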

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.37.1)
Root-Is-Purelib: true
Tag: py3-none-any
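
The tag `py3-none-any` also appears at the end of the wheel filename and is read as Python tag, ABI tag, and platform tag: any Python 3 interpreter, no ABI requirement, any platform. A minimal sketch of splitting the filename into its components follows; `parse_wheel_filename` is a hypothetical helper and does not validate the individual fields.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its dash-separated components.

    Expected layout: name-version[-build]-python-abi-platform.whl
    """
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel filename: %r" % filename)
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python, abi, platform = parts
        build = None
    elif len(parts) == 6:
        name, version, build, python, abi, platform = parts
    else:
        raise ValueError("unexpected number of components: %r" % filename)
    return {
        "name": name,
        "version": version,
        "build": build,
        "python": python,
        "abi": abi,
        "platform": platform,
    }
```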

RECORD

Path Digest Size (bytes)
pdf_struct/__init__.py sha256=0OPDsm1HPjVSKLtHlp9OUmvocsM9loxLcI_bANDQhlo 240
pdf_struct/_version.py sha256=oYLGMpySamd16KLiaBTfRyrAS7_oyp-TOEHmzmeumwg 22
pdf_struct/cli.py sha256=a_tmWOZXyMlB7JZPa5JGp53FIgxYlWepD9nc_px9tlg 8548
pdf_struct/core/__init__.py sha256=QcbHe5bzO4Y4bYhKKyOm1u1SycV0dWbBeoxzWZa-9qA 486
pdf_struct/core/clustering.py sha256=gDCBJx6-A9huFKQNgAQcdAIaJ6Ubpy9ZH35kvJEx-7Y 2446
pdf_struct/core/data_statistics.py sha256=AQQa4K-6yzy47rGpOOPibimJweL5kjUd1LUJNouju80 3265
pdf_struct/core/document.py sha256=cMBTRz5SS92WNIaD6RtiGDx26MJP9tuwohBiCiSWP2k 4185
pdf_struct/core/download.py sha256=LKb-xImNKZ8B1y5n87zW0dtUG0aLKSEwkztkcz5xPcE 3319
pdf_struct/core/evaluation.py sha256=-OVYPesUc5YWSyXx7WycCwtNd2pU2vyUbXH50QhBkVU 2989
pdf_struct/core/export.py sha256=MCoEclT_Y0tfMssORvP3sPGz-DZeNRijHIeRyIIUpx0 4814
pdf_struct/core/feature_extractor.py sha256=mzgwMC0aSKyXmdFYHHveUNSMdePbUvk5OGl2roiyuS8 15459
pdf_struct/core/predictor.py sha256=Aoh6h4cIOWZ_fyDaP1qhmFlfLyedyNsSufLECBLcbuY 5946
pdf_struct/core/preprocessing.py sha256=MDMLFhSXHdZkrE5u3apuZb6Y3KT8A-KMZEE4zdzkZyc 1549
pdf_struct/core/structure_evaluation.py sha256=PFtaytPBZ9hZOeDt0YCKniCF1DHxZxxrL-Gppkdjyb0 8938
pdf_struct/core/transition_labels.py sha256=70FVptah9PwjQLryjkWwJEjKDLbpjF9jhNAHog3qTVM 5930
pdf_struct/core/utils.py sha256=7LLZYPRbLhwEPmwRkjir6CWR1lROFrAWypa5eWkYYxo 1655
pdf_struct/export/__init__.py sha256=HQpnyGf1uuuZOUiZRE5p3Eo05rGHEFxYuZ5ClibwbN0 35
pdf_struct/export/hocr.py sha256=27JGHChT8bV9yety_yXm-N-dxRTl4qQi_7n90AYfn_g 3124
pdf_struct/feature_extractor/__init__.py sha256=BRnpCCKMO7q5Oy5qxLhHCCpf9W9qSiQtRiYG2K2WUfQ 965
pdf_struct/feature_extractor/hocr_balance_sheet_ja.py sha256=D7Q6QQ__dg_ybgOb5BS_tdAkVX__rMQ24a7NB1PwlK4 7849
pdf_struct/feature_extractor/pdf_contract.py sha256=Q0RurfTkDokC0qUvyviZD2X96mlQ7fFoYNNOBck7rk0 10157
pdf_struct/feature_extractor/pdf_contract_ja.py sha256=t_5zJHjEhQJ3ATwMbS1FKwaMMbwc5t92dNUdbdtnrxs 4812
pdf_struct/feature_extractor/text_contract.py sha256=m-gfvEQ-DeAUhtcRH_nUhckJRnxTN8Ujb5kXA-Y-p-o 7048
pdf_struct/features/__init__.py sha256=NiPx40DFvZLcuTk2C3eK5L4smJX0lPYB99iLcExhdWE 115
pdf_struct/features/lexical.py sha256=glBB6tYUbfuMdXlzzVsiwVsMBVnov7ILzYtMezWYJQs 2629
pdf_struct/features/lm.py sha256=hqiGq_i7DDInRrkK_zAlahfUziQ1009Z6o3T-RP1yQo 3189
pdf_struct/features/listing/__init__.py sha256=p-FAu58uakxKB49ptbqA5dZ6gqDnF-vYJR3ynjRyzRY 256
pdf_struct/features/listing/base.py sha256=xy6GSaws_ngUvi7djuwQsxm1uhtGM9YLn1vnGD7Q5Xo 5904
pdf_struct/features/listing/en.py sha256=hMDP41coAnQw6Ib_UDV_c3s3TLgFUGwqBkI2vn0OGGc 3608
pdf_struct/features/listing/ja.py sha256=1TPieCt2CM10eojNKvs2wHu3m7ahdB__Ktc0M09kdZI 7633
pdf_struct/loader/__init__.py sha256=4PMNY4CNpgnZkx6D3OH-lBhKDOzRqqf-Jdrh95fj8cg 169
pdf_struct/loader/hocr.py sha256=7zmgDpPT71778ZacrOuJkpbVyE_Y-axXQkHgb8XNgcM 8432
pdf_struct/loader/pdf.py sha256=Ac2ysIbI5QoF4JZZAyvNeZ0lE-rn0KTzeduXjOyiDv0 7045
pdf_struct/loader/text.py sha256=thc_gzM_5ZzphkZKko-pebxKiiecOJ9iwqk0jKhxdyM 3766
pdf_struct-0.3.4.dist-info/LICENSE sha256=xx0jnfkXJvxRnG63LTGOxlggYnIysveWIZ6H3PNdCrQ 11357
pdf_struct-0.3.4.dist-info/METADATA sha256=fCg1fbVtyEo9zkL1Meg21snbjlaSeKoEkj2CtS6cfek 12665
pdf_struct-0.3.4.dist-info/WHEEL sha256=G16H4A3IeoQmnOrYV4ueZGKSjhipXx8zc8nu9FGlvMA 92
pdf_struct-0.3.4.dist-info/entry_points.txt sha256=05Pf1Cw-BJpqCXjtKfvE4l9uwUay_t3cdtwv_yUZ2z4 51
pdf_struct-0.3.4.dist-info/top_level.txt sha256=KynwyDanKpMzZE2485A9qNlI1rGwzxCTZD1en9TW2Xw 11
pdf_struct-0.3.4.dist-info/RECORD
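
Each digest above follows the wheel format's convention: the hash algorithm name, an equals sign, then the URL-safe base64 encoding of the raw digest with trailing `=` padding removed. Inside the archive, RECORD is stored as CSV (path, digest, size) rather than the space-separated rendering shown here, and the RECORD entry itself carries no digest. The sketch below recomputes the digests for an archive and reports mismatches; `record_digest` and `find_mismatches` are hypothetical names, and only `sha256` entries are handled.

```python
import base64
import csv
import hashlib
import io
import zipfile


def record_digest(data):
    """Return a RECORD-style digest string for the given bytes."""
    digest = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(digest).rstrip(b"=")
    return "sha256=" + encoded.decode("ascii")


def find_mismatches(wheel, record_path):
    """Return archive paths whose digest or size disagrees with RECORD.

    wheel: a filesystem path or a binary file-like object.
    record_path: the archive path of the RECORD file.
    """
    mismatches = []
    with zipfile.ZipFile(wheel) as zf:
        text = zf.read(record_path).decode("utf-8")
        for row in csv.reader(io.StringIO(text)):
            if len(row) != 3:
                continue  # skip blank or malformed rows
            path, digest, size = row
            if not digest:
                continue  # RECORD lists itself without a digest
            data = zf.read(path)
            if record_digest(data) != digest or len(data) != int(size):
                mismatches.append(path)
    return mismatches
```

For this wheel, the call would be `find_mismatches("pdf_struct-0.3.4-py3-none-any.whl", "pdf_struct-0.3.4.dist-info/RECORD")`, and an empty list would indicate that every listed file matches.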

top_level.txt

pdf_struct

entry_points.txt

pdf-struct = pdf_struct.cli:cli
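
The entry above has the form `name = module:attribute`: installing the wheel creates a `pdf-struct` command that imports `pdf_struct.cli` and calls its `cli` object. The sketch below shows how such a line can be split and resolved with the standard library; `parse_entry_point` and `load_entry_point` are hypothetical helpers, and optional extras in square brackets are not handled.

```python
import importlib


def parse_entry_point(line):
    """Split 'name = module:attr' into (name, module, attr)."""
    name, _, target = line.partition("=")
    module, _, attr = target.strip().partition(":")
    return name.strip(), module.strip(), attr.strip()


def load_entry_point(line):
    """Import the module and return the object the entry point refers to."""
    _, module, attr = parse_entry_point(line)
    obj = importlib.import_module(module)
    # The attribute part may be dotted, e.g. "Class.method".
    for part in attr.split("."):
        if part:
            obj = getattr(obj, part)
    return obj
```

With pdf-struct installed, `load_entry_point("pdf-struct = pdf_struct.cli:cli")` would return the `cli` object that the console script invokes.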