courlan

View on PyPIReverse Dependencies (7)

1.1.0 courlan-1.1.0-py3-none-any.whl

Wheel Details

Project: courlan
Version: 1.1.0
Filename: courlan-1.1.0-py3-none-any.whl
Download: [link]
Size: 33433
MD5: 2bdbdacab6bd6a9aa44d05eb9b1fc641
SHA256: 88354c1e50fde84890c13f58924a5ecc5c0527dcf3eb36e80564fe68e330bb03
Uploaded: 2024-04-30 11:20:23 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: courlan
Version: 1.1.0
Summary: Clean, filter and sample URLs to optimize data collection – includes spam, content type and language filters.
Author: Adrien Barbaresi
Author-Email: barbaresi[at]bbaw.de
Home-Page: https://github.com/adbar/courlan
Project-Url: Blog, https://adrien.barbaresi.eu/blog/
Project-Url: Tracker, https://github.com/adbar/courlan/issues
License: Apache-2.0
Keywords: cleaner,crawler,preprocessing,url-parsing,url-manipulation,urls,validation,webcrawling
Classifier: Development Status :: 4 - Beta
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Education
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: MacOS :: MacOS X
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: POSIX :: Linux
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Text Processing :: Filters
Classifier: Topic :: Text Processing :: Linguistic
Requires-Python: >=3.6
Requires-Dist: babel (>=2.11.0)
Requires-Dist: tld (==0.12.6); python_version < "3.7"
Requires-Dist: urllib3 (<2,>=1.26); python_version < "3.7"
Requires-Dist: tld (>=0.13); python_version >= "3.7"
Requires-Dist: urllib3 (<3,>=1.26); python_version >= "3.7"
License-File: LICENSE
[Description omitted; length: 16670 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.43.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
courlan/__init__.py sha256=zuUuzTtqAcc2_X46AGl3qA1IdP6OMhl3XhQiVrHXnwA 687
courlan/clean.py sha256=vrB_jQZSLe_r9UA7bt5VMgRbJkqp9WCS68ss4KoQHGI 6908
courlan/cli.py sha256=DKq9RyXfh3L8V5cMzg1AY9l9SwkHQnr8zqqIRo_zKu8 5673
courlan/core.py sha256=uiOFtkxzRetj4KU-c1h3ViCDVRUSbUY9PFBwhKTLNlc 8171
courlan/filters.py sha256=Yd_blXcQu72EG6AOpGjc3lMmai5i2su6sCv60RIJDlo 8177
courlan/meta.py sha256=lufXPRfrDAZOft9GqWKSRZiikvi35ILFnmJksczTGIM 367
courlan/network.py sha256=oZT5R3oaTNGX4Cw1qhwRlvsLG6nDEUnldY2RI8GJNaQ 1572
courlan/py.typed sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
courlan/sampling.py sha256=8nk-bEOwv98HfcbW_x3c4eWtMJgkjbpe_C2WC1DS_cE 2091
courlan/settings.py sha256=W9Y5qPPZmakA5EJ8UWaKdulzhx1Y7US1770nobrlYQg 1636
courlan/urlstore.py sha256=Z6tHebnvmaduaD122yroTKIVyUvuDKi9CZ5qz2Ys6Gk 19037
courlan/urlutils.py sha256=FCBf4XRw0Fll-r9kYukxRJ2KMdwgy6fsIyLlfTQ2494 5683
courlan-1.1.0.dist-info/LICENSE sha256=psuoW8kuDP96RQsdhzwOqi6fyWv0ct8CR6Jr7He_P_k 10173
courlan-1.1.0.dist-info/METADATA sha256=kVW3Iu2kVYFSU1G93sjm5bNOZjsCghHgNiOedqKu-0s 18625
courlan-1.1.0.dist-info/WHEEL sha256=GJ7t_kWBFywbagK5eo9IoUwLW6oyOeTKmQ-9iHFVNxQ 92
courlan-1.1.0.dist-info/entry_points.txt sha256=wRo8e-AAGQwxhWNeykVfrvCHt3hngLNI8MkyQD15AHY 45
courlan-1.1.0.dist-info/top_level.txt sha256=mNIXZYlTWhDv5JMGmkE35PCU3jWgIWOsweoIYzNxNEs 8
courlan-1.1.0.dist-info/RECORD

top_level.txt

courlan

entry_points.txt

courlan = courlan.cli:main