pdf2embeddings

View on PyPIReverse Dependencies (0)

0.2.8 pdf2embeddings-0.2.8-py3-none-any.whl

Wheel Details

Project: pdf2embeddings
Version: 0.2.8
Filename: pdf2embeddings-0.2.8-py3-none-any.whl
Download: [link]
Size: 20135
MD5: 1df7758c2316966c037391387ac71bf3
SHA256: 5318d4427ce8b8889e9eee7460a58ee2f38bb4fc43bb05827eceed5c64f67610
Uploaded: 2020-09-28 13:40:44 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: pdf2embeddings
Version: 0.2.8
Summary: NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
Author: moj-analytical-services
Home-Page: https://github.com/moj-analytical-services/airflow-pdf2embeddings
Download-Url: https://pypi.python.org/pypi/pdf2embeddings
License: MIT
Classifier: License :: OSI Approved :: MIT License
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Requires-Python: >=3.6
Requires-Dist: allennlp (==0.9.0)
Requires-Dist: boto3 (~=1.10.34)
Requires-Dist: gensim (==3.8.1)
Requires-Dist: nltk (==3.4.5)
Requires-Dist: numpy (==1.18.2)
Requires-Dist: pandas (==0.25.3)
Requires-Dist: ply (==3.11)
Requires-Dist: pyarrow (==0.16.0)
Requires-Dist: pytest (==5.4.1)
Requires-Dist: scikit-learn (==0.22.1)
Requires-Dist: scipy (==1.4.1)
Requires-Dist: sentence-transformers (==0.2.5.1)
Requires-Dist: slate3k (==0.5.3)
Requires-Dist: typing (==3.7.4.1)
Requires-Dist: tqdm (==4.45.0)
Requires-Dist: s3fs (~=0.4.2)
Description-Content-Type: text/markdown
[Description omitted; length: 16698 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.34.2)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
pdf2embeddings/__init__.py sha256=lh3xoCtnBUjAiGODpcTPfV5RCtoGxK1iJ8PvsJT7rKg 218
pdf2embeddings/arrange_text.py sha256=sfBSn10S-Moqaa4NTjYylE3RwTH0PRit_VO1VmR9y-Y 2243
pdf2embeddings/embedder.py sha256=CKY3QprCZdG3-_KwMnfJ4LvCmHbgGSgIg5dHXp9pEoA 12638
pdf2embeddings/json_creator.py sha256=AEjelQQF7BtQ981EPpCB7cJ0EsKYySweplkTfrjfv-M 1913
pdf2embeddings/logging.yaml sha256=xAp-mbP5CQ7_Lm_t38hmx3b3CkQhCb3MOdQCJcKLPXI 1593
pdf2embeddings/process_user_queries.py sha256=Q7Glik0O06RYITtGN_pdjED2FTH_IKuY5vy06yXWldQ 9199
pdf2embeddings/scraper.py sha256=X8Frphco2zNxrmEO3uf-Zfm4PxYUxrtezyrtJakHR6U 12054
pdf2embeddings-0.2.8.dist-info/LICENSE sha256=ra1L7QajNBpReE424WYu9VoVW_tyaQoghBdz4NDqJxs 1094
pdf2embeddings-0.2.8.dist-info/METADATA sha256=SI_lfch03jvNgxeuEmulXC-5GK0oDvIMxZYx45Eo9xw 18400
pdf2embeddings-0.2.8.dist-info/WHEEL sha256=g4nMs7d-Xl9-xC9XovUrsDHGXt-FT0E17Yqo92DEfvY 92
pdf2embeddings-0.2.8.dist-info/top_level.txt sha256=eQXE48DUrRZCD0wSCVP9jhLWfGV9s1VpbK4Q6uNgDYI 15
pdf2embeddings-0.2.8.dist-info/RECORD

top_level.txt

pdf2embeddings