wildebeest-nlp


0.9.2 wildebeest_nlp-0.9.2-py3-none-any.whl

Wheel Details

Project: wildebeest-nlp
Version: 0.9.2
Filename: wildebeest_nlp-0.9.2-py3-none-any.whl
Size: 283468
MD5: 59a7bb7c00c8732810b7d7c729b0f92e
SHA256: 090754039aee379bc71295512cc642a44758499ce9556c4b2fec4ddd124d2f13
Uploaded: 2022-11-20 05:02:35 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: wildebeest-nlp
Version: 0.9.2
Summary: The wildebeest scripts investigate, repair and normalize a wide range of text file problems at the character level, e.g. encoding errors, normalization of characters into their canonical form, mapping digits and some punctuation to ASCII, deletion of some non-printable characters.
Author: Ulf Hermjakob
Author-Email: ulf[at]isi.edu
Home-Page: https://github.com/uhermjakob/wildebeest
Download-Url: https://github.com/uhermjakob/wildebeest
Keywords: machine translation,datasets,NLP,natural language processing,computational linguistics
Classifier: Development Status :: 5 - Production/Stable
Classifier: Intended Audience :: Developers
Classifier: Topic :: Utilities
Classifier: Topic :: Text Processing
Classifier: Topic :: Text Processing :: General
Classifier: Topic :: Text Processing :: Filters
Classifier: Topic :: Text Processing :: Linguistic
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Programming Language :: Python :: 3 :: Only
Platform: any
Requires-Python: >=3.8
Requires-Dist: regex (>=2021.8.3)
Requires-Dist: tqdm (>=4.40)
Requires-Dist: unicodeblock (>=0.3.1)
Requires-Dist: wheel (>=0.38.4)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 18334 characters]
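
The METADATA file uses an email-header style layout, so the fields above can be read with the Python standard library. The following is a minimal sketch, not part of wildebeest itself: `SAMPLE_METADATA` is an abridged copy of the fields listed above, and `read_core_metadata` is a hypothetical helper name.

```python
from email.parser import HeaderParser

# Abridged copy of the METADATA fields listed above (illustrative only).
SAMPLE_METADATA = """\
Metadata-Version: 2.1
Name: wildebeest-nlp
Version: 0.9.2
Requires-Python: >=3.8
Requires-Dist: regex (>=2021.8.3)
Requires-Dist: tqdm (>=4.40)
Requires-Dist: unicodeblock (>=0.3.1)
Requires-Dist: wheel (>=0.38.4)
"""


def read_core_metadata(text):
    """Parse header-style metadata into a small dictionary (hypothetical helper)."""
    msg = HeaderParser().parsestr(text)
    return {
        "name": msg["Name"],
        "version": msg["Version"],
        "requires_python": msg["Requires-Python"],
        # Requires-Dist may repeat, so collect every occurrence.
        "requires_dist": msg.get_all("Requires-Dist") or [],
    }


info = read_core_metadata(SAMPLE_METADATA)
print(info["name"], info["version"], "needs Python", info["requires_python"])
for requirement in info["requires_dist"]:
    print("  depends on:", requirement)
```

For an installed distribution, `importlib.metadata.metadata("wildebeest-nlp")` returns the same kind of message object without parsing the file by hand.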

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.37.1)
Root-Is-Purelib: true
Tag: py3-none-any
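
The tag `py3-none-any` also appears in the wheel filename, which follows the pattern `{distribution}-{version}(-{build})?-{python tag}-{abi tag}-{platform tag}.whl`. The sketch below splits a filename into those parts; `parse_wheel_filename` is a hypothetical helper written for illustration, not a function from wildebeest or from any packaging library.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its components (hypothetical helper)."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        # The optional build tag sits between the version and the Python tag.
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")
    return {
        "distribution": name,
        "version": version,
        "build": build,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


fields = parse_wheel_filename("wildebeest_nlp-0.9.2-py3-none-any.whl")
for key, value in fields.items():
    print(f"{key}: {value}")
```

Here `py3` means any Python 3 interpreter, `none` means no ABI dependency, and `any` means any platform, which is consistent with `Root-Is-Purelib: true`.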

RECORD

Path Digest Size
venv/bin/activate_this.py sha256=45dnJsdtOWIt5LtVSBmBfB8E7AlKcnhnZe9e3WGclak 1199
wildebeest/__init__.py sha256=-7XOKaIacWBovJk-uJuulO05QK2cGOHs8Ct__fdPp5A 442
wildebeest/__main__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
wildebeest/wb-analysis.pl sha256=m503QPgvcVihJq8S_tOSQX94Tbm1NXG5YPbotZv1t8k 62804
wildebeest/wb_analysis.py sha256=J9uNpvMUb5Akl6WWfRvIvWX6dCCRgZn_jQwm6rFTLO8 60052
wildebeest/wb_normalize.py sha256=zBlQxCsaHg23v4CPFP-nok6tCtiq0DGJwrcrfCugFv8 100465
wildebeest/data/ArabicPresentationFormMappingAnnotated.tsv sha256=qK_KvB57f4EPk1AJpI6vjfit0fxZPXHcskl4WHeCLvs 85420
wildebeest/data/CJKCompatibilityMappingAnnotated.tsv sha256=VRb-9I0FIQUUcrZdWu6Uf0W8LKsiKmp6COZ1eaZCRvs 160125
wildebeest/data/CombiningModifierMappingAnnotated.tsv sha256=0Gwq_9wwN_KK9RrcxElPmWR-TTZdonTY4NSwjGTdTMA 182747
wildebeest/data/CoreCompatibilityMappingAnnotated.tsv sha256=MhhDpZPJX26_2wyjiKSCMEp7DAV_E_dPZr3-Z-0uPxw 16792
wildebeest/data/DigitMappingAnnotated.tsv sha256=ODV59zL9VB27MEf7-O5T9o8tDuhcy1MkAYV1cWzSE9I 32078
wildebeest/data/EnclosureMappingAnnotated.tsv sha256=dc8CdBdlXvZ95zNKbJX8lZ9QabWgkYArAgFRGPfbQfA 69791
wildebeest/data/EncodingRepairMappingAnnotated.tsv sha256=3fEsf3nmNHEQKSH9XzvGnEM_cW-SkldQfEY6YYjkSqc 28029
wildebeest/data/FontSmallVerticalMappingAnnotated.tsv sha256=d1P9sjAgGf1e3TvpL-abkqZmFFgdAOjrDELhUbMdtoE 106882
wildebeest/data/PythonWildebeestMappingAnnotated.tsv sha256=coJ9Dz5mR6HZ23vpb46w4qT2y_TX6guBO2I_s6Gf8po 61697
wildebeest/data/assert-preserve.tsv sha256=8K5zI5UB_NoX8f2Eiw6SaXT7sVmDJKFnBwmAwvBof6o 19087
wildebeest/data/assert.tsv sha256=fvaa5jHTuq3YmNiNhHs5q4Nhq34vlmhJYfmh3AmEUmg 155902
wildebeest/data/look-alikes.txt sha256=bR5LXBPbPJp86Jflei0G_7KQafSP8rw1mQ1oPPG2-y8 790
wildebeest/old/normalize-old.py sha256=mwspkhB0P8M--5AHxh3nHmLdXiD3qmvX8utTFpqFcfo 93762
wildebeest/test/__init__.py sha256=mUkeeYgR3leUWyyYu7BxthG6EiCka8-ZvPKglkVAiE8 60
wildebeest/test/sample_wb_analysis.py sha256=kJ0X7oTrlOYcsQQRDEEs1zEXjEFnknGljZEsPXJQQ6Q 1008
wildebeest/test/test_wb_norm.py sha256=z4zUykqLUKZimCxOEZWeNHxHtl9jL0qucG0Zbegq-I4 978
wildebeest/test/data/corpus.txt sha256=67apVhbO_1kc-ya7lPYDh451h-BX0kmys6g1wSBEhFY 18
wildebeest/test/data/hello.txt sha256=hzHLFJRa3ZUBHABEMYXequ1fhyKc-xnt0pW8_2PdFFU 9
wildebeest/test/data/out.json.ref sha256=Es9sMaI1OIHMSvuA3DClEEreIhpzY2nJprQmBL-1F1o 2003
wildebeest/test/data/out.txt.ref sha256=G7CJhDjiN3BmbITX1cXTml1uEC56iGJ1hv9i1R28B4k 1460
wildebeest/test/data/phrasebook-deu-out.ref sha256=VLA1cm6sFly9RLqElwi1ihzO8GdG5JehGlT9Rwv6jTM 1713
wildebeest/test/data/phrasebook-dir-out.ref sha256=eBRzg1xu9zpqhaqZOmBLCWoLWy9-9GcGlEGJE719ARk 175
wildebeest/test/data/wildebeest-test-analysis-pprint.txt sha256=x7mh4CAD-tk1p7PpjLGYjoIhnZgjHXi8Gomty8a4UKM 54555
wildebeest/test/data/wildebeest-test-invalid-utf8.txt sha256=BvWZptuY9EZSqpVvh-QVbkG5VikBSpaXl7bC179PuX8 7
wildebeest/test/data/wildebeest-test-norm-all.txt sha256=NVtl2ZsMPtqMXYqkjk96QdhFiqoVcTCYoIgJslHdNSA 2366
wildebeest/test/data/wildebeest-test-norm-all.txt.ref sha256=NVtl2ZsMPtqMXYqkjk96QdhFiqoVcTCYoIgJslHdNSA 2366
wildebeest/test/data/wildebeest-test-norm-custom.txt sha256=_5u3i-4YyTPIA0NpGwjvTEayJOgEMOrG5LtD-69hHhc 2576
wildebeest/test/data/wildebeest-test-norm-custom.txt.ref sha256=_5u3i-4YyTPIA0NpGwjvTEayJOgEMOrG5LtD-69hHhc 2576
wildebeest/test/data/wildebeest-test-norm.txt sha256=tG9ldo6I1IwxFGPMqaUBPnEd4Q8QPyGEsERT1QelrtM 2528
wildebeest/test/data/wildebeest-test-norm.txt.ref sha256=tG9ldo6I1IwxFGPMqaUBPnEd4Q8QPyGEsERT1QelrtM 2528
wildebeest/test/data/wildebeest-test-out.ref sha256=x7mh4CAD-tk1p7PpjLGYjoIhnZgjHXi8Gomty8a4UKM 54555
wildebeest/test/data/wildebeest-test.txt sha256=hxxjo_3oEilbRLxHv8AGQQzcMUBGkMCd3RPFYHqoCDE 2628
wildebeest/test/data/phrasebook/deu.txt sha256=9o4CxOZVvYkAXW4M0JfCFTAcPF501eRHN_I8hCFV9Nw 24
wildebeest/test/data/phrasebook/ell.txt sha256=YnWCcohze9KEiXBwBHD-qvnihLQXMHu7fKUVPEdxi38 27
wildebeest/test/data/phrasebook/eng.txt sha256=z5z5lbERESji1Sq2lYI0---xqsG1tTYHqnc6g7_3yos 17
wildebeest_nlp-0.9.2.dist-info/LICENSE sha256=NqE7jAD0Jjfxt0ZS2cLeW9xiulzrEXRNU1W86aHGTa8 1070
wildebeest_nlp-0.9.2.dist-info/METADATA sha256=htYJ7L1GHrU-mLGWv2NTSzHMzoOrCGXezYeYqNH4sFQ 19791
wildebeest_nlp-0.9.2.dist-info/WHEEL sha256=G16H4A3IeoQmnOrYV4ueZGKSjhipXx8zc8nu9FGlvMA 92
wildebeest_nlp-0.9.2.dist-info/entry_points.txt sha256=_iuMZMzIYsJfIbE-oTs_EDJuELX--TDhOL5eFatVYaM 186
wildebeest_nlp-0.9.2.dist-info/top_level.txt sha256=HL4Aem3xTn_qf0XYjdmLhZbwYUz_nW7mnNxxljYdi1U 33
wildebeest_nlp-0.9.2.dist-info/RECORD
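
Each digest in the RECORD listing is the file's SHA-256 hash, encoded as URL-safe base64 with the trailing `=` padding removed. The sketch below reproduces that encoding so an unpacked file can be checked against its entry; `record_digest` and `matches_record` are hypothetical helper names. The zero-byte `wildebeest/__main__.py` entry is a convenient check, since its digest is simply the hash of empty input.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return the RECORD-style digest string for the given file contents."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def matches_record(data: bytes, digest: str, size: int) -> bool:
    """Check file contents against the digest and size from a RECORD entry."""
    return len(data) == size and record_digest(data) == digest


# The wildebeest/__main__.py entry above has size 0, so empty bytes should match.
empty_entry_digest = "sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU"
print(record_digest(b""))
print(matches_record(b"", empty_entry_digest, 0))
```

To check a real file, read it in binary mode and pass the bytes together with the digest and size from its RECORD entry.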

top_level.txt

aux
data
profile
venv
wildebeest

entry_points.txt

wb-ana = wildebeest.wb_analysis:main
wb-norm = wildebeest.wb_normalize:main
wb_analysis.py = wildebeest.wb_analysis:main
wb_normalize.py = wildebeest.wb_normalize:main
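
Each entry above has the form `name = module:attribute`. Assuming these are console-script entries (the group header is not shown in this listing), an installer creates a command called `name` that imports `module` and calls `attribute`. The sketch below illustrates that lookup; `split_entry_point` and `load_target` are hypothetical helpers, and the standard-library target `json:dumps` stands in for `wildebeest.wb_analysis:main` so that the example runs without the package installed.

```python
import importlib


def split_entry_point(line):
    """Split 'name = module:attr' into (name, module, attr)."""
    name, _, target = line.partition("=")
    module, _, attr = target.strip().partition(":")
    return name.strip(), module.strip(), attr.strip()


def load_target(module, attr):
    """Import the module and walk the dotted attribute path to the callable."""
    obj = importlib.import_module(module)
    for part in attr.split("."):
        obj = getattr(obj, part)
    return obj


entries = [
    "wb-ana = wildebeest.wb_analysis:main",
    "wb-norm = wildebeest.wb_normalize:main",
    "wb_analysis.py = wildebeest.wb_analysis:main",
    "wb_normalize.py = wildebeest.wb_normalize:main",
]
for entry in entries:
    command, module, attr = split_entry_point(entry)
    print(f"{command!r} calls {attr}() in {module}")

# Stand-in target from the standard library, resolved the same way.
dumps = load_target("json", "dumps")
print(dumps({"ok": True}))
```

Both `wb-ana` and `wb_analysis.py` resolve to the same function, as do `wb-norm` and `wb_normalize.py`, so either spelling of a command runs the same code.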