scrapy-helper

View on PyPIReverse Dependencies (0)

1.5.3 scrapy_helper-1.5.3-py3-none-any.whl

Wheel Details

Project: scrapy-helper
Version: 1.5.3
Filename: scrapy_helper-1.5.3-py3-none-any.whl
Download: [link]
Size: 159971
MD5: eeba12c1afa6162da86d1ffb09dea7ad
SHA256: df8f8e3884c9cbcf72dc89ac14cdef76cd030d33b45d5f028de07e16d82cff3e
Uploaded: 2023-07-15 13:47:40 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: scrapy-helper
Version: 1.5.3
Summary: scrapy helper
Author: Zhou Ping
Author-Email: 231409[at]qq.com
License: Apache 2.0
Keywords: scrapy,spider,helper
Classifier: Framework :: Scrapy
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Console
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: BSD License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Internet :: WWW/HTTP
Classifier: Topic :: Software Development :: Libraries :: Application Frameworks
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Requires-Python: >=3.7
Requires-Dist: PyMySQL (~=1.0.2)
Requires-Dist: Twisted (~=22.8.0)
Requires-Dist: pymongo (~=4.2.0)
Requires-Dist: furl (~=2.1.3)
Requires-Dist: Scrapy (~=2.6.2)
Requires-Dist: websockets (~=10.3)
Requires-Dist: pyppeteer (~=1.0.2)
Requires-Dist: scrapy-splash (~=0.8.0)
Requires-Dist: tldextract (~=3.4.0)
Requires-Dist: lxml (~=4.9.1)
Requires-Dist: bs4 (~=0.0.1)
Requires-Dist: beautifulsoup4 (~=4.11.1)
Requires-Dist: python-dateutil (~=2.8.2)
Requires-Dist: jieba (~=0.42.1)
Requires-Dist: nltk (~=3.7)
Requires-Dist: tinysegmenter (~=0.4)
Requires-Dist: pythainlp (~=3.1.0)
Requires-Dist: requests (~=2.28.1)
Requires-Dist: redis (~=4.3.4)
Requires-Dist: platinum2 (~=1.5.2)
Requires-Dist: itemloaders (~=1.0.6)
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 175 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.40.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
scrapy_helper/__init__.py sha256=zG2cd7WLo5fIizFwE9xNzjpfZPff6uzTZIgsxS6ri_g 21
scrapy_helper/__pycache__/__init__.cpython-310.pyc sha256=_tpDX_kUkexN9X6RxKBPwIQDjnoOG7XyrIOI_nJSngY 190
scrapy_helper/__pycache__/__init__.cpython-311.pyc sha256=2J6bDmNZSA749q0uCSFlfp99qq3oSj4WFe_9KysBrhY 205
scrapy_helper/core/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
scrapy_helper/core/dict_path.py sha256=YkPT1pZzGxknZc5_6EdNS20emLzALJqlwJAXNdxtw9c 4523
scrapy_helper/core/fast_item.py sha256=TMCwqRxEDnePmMQRprcBtOXdIy-xjvrIr7nT0MMxBO4 502
scrapy_helper/core/urls.py sha256=3IvwmV6I0xhhwOt-cx4FYEgjF7teKWde0hRaULcNT_4 8617
scrapy_helper/core/utils.py sha256=0rvRWv5u34W_wRhXvIWeYsDN8kErUBMUhQgDnZsB6v4 2180
scrapy_helper/core/__pycache__/__init__.cpython-310.pyc sha256=CPum7kQF4Qs_UbMHBEWmbG3v1TfcVVFOJeT6HF8wnc4 174
scrapy_helper/core/__pycache__/dict_path.cpython-310.pyc sha256=-FFVZNkNrFHNpp0SCsnUeOZHmmcparuLOIA6Tj8wvR8 4366
scrapy_helper/core/__pycache__/fast_item.cpython-310.pyc sha256=jVm7QXcBny_n_7r21tj_Q0_1sUNpB-ubRRk4F92aL54 1053
scrapy_helper/core/__pycache__/urls.cpython-310.pyc sha256=QygOcAoqddKF5x0LvRE5O6wsRWMTljq3z-pw2v59gAI 6520
scrapy_helper/core/__pycache__/utils.cpython-310.pyc sha256=j9Jmvr734K54x6O0HmA_syH4W5haRqHosU_Use0HvFc 2264
scrapy_helper/linkextractors/__init__.py sha256=oRE7-jvLHQG8koilPMQB_bn6YLEmFXa8UN19X1_VG_M 42
scrapy_helper/linkextractors/article.py sha256=90oG-K3birZOZVGCHW9WO92EQb_GAdgdwIcUfbSEgRQ 341
scrapy_helper/linkextractors/__pycache__/__init__.cpython-310.pyc sha256=dMOBNAusKalHsqFNwgusWTPCy2rmN-S3LTqIp8u_R6k 184
scrapy_helper/linkextractors/__pycache__/article.cpython-310.pyc sha256=fxpY0ZUKjRVvZHVTsID6bueIIWFVIuyarrwkXh_rvYs 728
scrapy_helper/middlewares/__init__.py sha256=7JkmsG117s7xqaMRijezF5H9Okkjt9R3yirTXvzwyOw 279
scrapy_helper/middlewares/__pycache__/__init__.cpython-310.pyc sha256=gFYfAdfVfKvhmJf4NRzS16d4IgFC8D4FUikgH74Mb7c 510
scrapy_helper/middlewares/downloader/__init__.py sha256=mbzAghpVb2Xh2uvKle3ZFBysU1ijI_in40r5EEJjfcU 180
scrapy_helper/middlewares/downloader/cookies.py sha256=6KyPqi1Znp_trxUziDEThaSVjnybx1JIdOqg7fYL440 966
scrapy_helper/middlewares/downloader/persist_filter.py sha256=i6rkjOcZHYHVwSIzjA9_MBzX0y8df_Oc19T7-Ct5j_s 1811
scrapy_helper/middlewares/downloader/proxy.py sha256=FEw8oSTKMT2E47FV6if1ARtkYSR-41loQVbLmyOwPsY 1132
scrapy_helper/middlewares/downloader/pyppeteer_middleware.py sha256=fRz3ZJ-qAH_LjgczFCNqvvtfsSiMadpLg4Qihp3z8aw 4381
scrapy_helper/middlewares/downloader/user_agent.py sha256=9K83cB6q5I2UrSBz6Y9m9U4yEPEIIH18b4YaqwVIXys 1209
scrapy_helper/middlewares/downloader/__pycache__/__init__.cpython-310.pyc sha256=rDVz3khMfa1SWicVq19EUGls4_G5JtjPqW4n4NdhHKY 412
scrapy_helper/middlewares/downloader/__pycache__/cookies.cpython-310.pyc sha256=q6P2ntPoh9r6-xjzQ1M0bIdLYX6s_F-4eSygwD5aAh8 1465
scrapy_helper/middlewares/downloader/__pycache__/persist_filter.cpython-310.pyc sha256=-NGXoIbMso9_3jaITFOW7fhovBs2j4TuaMgEGqwC3Ns 2344
scrapy_helper/middlewares/downloader/__pycache__/proxy.cpython-310.pyc sha256=4SYv5VD95M9luFqj5tmu9725Qe1vc3xI5RiSmMQ6mlw 1522
scrapy_helper/middlewares/downloader/__pycache__/pyppeteer_middleware.cpython-310.pyc sha256=IOST9r1o52JzLyGzN2FY-nM-vdIscXwxvtpEfQ9ZlTQ 3812
scrapy_helper/middlewares/downloader/__pycache__/user_agent.cpython-310.pyc sha256=gvl9LQMPjGwSNl7b47AMJqBNnu6oyB1GYJcBq9K7jAk 1587
scrapy_helper/middlewares/spider/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
scrapy_helper/newspaper/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
scrapy_helper/newspaper/article.py sha256=37YvBMt-0JCCiudMUkVnxoYiGCLEx_5QVBepfajj7-s 8959
scrapy_helper/newspaper/cleaners.py sha256=y6xeboA6OHJkVmeFlRnYTxFHZ1butN0MowKhDQvFkcg 10419
scrapy_helper/newspaper/configuration.py sha256=p90vzDL8o6k-Szr-j8YDPpoLxuFdC89GZ5bm8L38naw 2657
scrapy_helper/newspaper/extractors.py sha256=E7xCx22CpakDxW203LDAFuBAYeEOGt9ZQ5z1G2SBtCQ 41623
scrapy_helper/newspaper/outputformatters.py sha256=Z3VYijz1HTGCJWk83ndVB_iCVWh1pYChyXFaxF1_5D0 5850
scrapy_helper/newspaper/parsers.py sha256=vaHGElYR5lgqNo_Ng1RLTiSc1hPAwGdzJSIY05_bbyo 7854
scrapy_helper/newspaper/settings.py sha256=ILID5G1AuJF6f_RcM5B6nisKTYo35c_z09ZHwRs4N_M 138
scrapy_helper/newspaper/text.py sha256=aTp6-N33vjkxNWfEmjkqUwDmcIlXtErdk8lBTG2iU-0 5850
scrapy_helper/newspaper/utils.py sha256=nlT9rsI-m5er7HdEQZC1XHvxUkx-SpMr43U49iR4LIs 2451
scrapy_helper/newspaper/__pycache__/__init__.cpython-310.pyc sha256=e2fjyJXAKpTXUjwvfadG2GESr7Om1X8fqSyyKT4Jq_8 179
scrapy_helper/newspaper/__pycache__/article.cpython-310.pyc sha256=05f4u77S6pu8WOaFGA4xrkEWcrMxnELnv4jNXxFj7-4 7138
scrapy_helper/newspaper/__pycache__/cleaners.cpython-310.pyc sha256=Uj20RMBPmHBwjdEkHY8YggvBvWzGP_BXvhFU6yAEt1M 7831
scrapy_helper/newspaper/__pycache__/configuration.cpython-310.pyc sha256=vUMZ5L3juz9qWBWlZ-MSbqn8ZjY1H-Vc3eWIAIJLOTI 2553
scrapy_helper/newspaper/__pycache__/extractors.cpython-310.pyc sha256=w7rqANzDcoNnW1nz2tlhUgRnCOEJ1HVlDp79F-OXIOU 28163
scrapy_helper/newspaper/__pycache__/outputformatters.cpython-310.pyc sha256=41S3rlNsBZdtYCIZD1E75iHdzdmxIUD9FfQj5z2Ar7g 6154
scrapy_helper/newspaper/__pycache__/parsers.cpython-310.pyc sha256=-R74GCtw1ZnwwF7n5YrJyW1BRDjjhpoxNISvqMk1jmI 8535
scrapy_helper/newspaper/__pycache__/settings.cpython-310.pyc sha256=wnmJtYsu48udHnj90NYdH9uyXbCX99CPkfDQIE7Jr4I 320
scrapy_helper/newspaper/__pycache__/text.cpython-310.pyc sha256=l781XL2a8IapDCavGSLNt3L53L5hT-TuMe_1v67zhmY 6820
scrapy_helper/newspaper/__pycache__/utils.cpython-310.pyc sha256=2Pq2Khu7V8KDeLGJ2oJCgllRQhwEHYewCszixT2mtEk 3565
scrapy_helper/newspaper/resources/text/stopwords-ar.txt sha256=0qf4yw00lcv-LIVs5WLQSxbken3GYcLDfU5ES_EYn5U 1450
scrapy_helper/newspaper/resources/text/stopwords-be.txt sha256=Sa08G0yBPKJhz2Om1kQ1BQCMc9XvPWQv5enRH1kC858 936
scrapy_helper/newspaper/resources/text/stopwords-bg.txt sha256=eiIwYk1TU8YcYYPbMPjUzZSZlgd7gl5o7d0LIthzqHQ 2409
scrapy_helper/newspaper/resources/text/stopwords-da.txt sha256=A1tQ6LutIdwsN09YSqZM33DcWFbX6-RndIA29lSEW7M 484
scrapy_helper/newspaper/resources/text/stopwords-de.txt sha256=HzP6O_gyasMIqF6ShZ6uSiVrOLdTsDUCBFy5u9xv9U0 5967
scrapy_helper/newspaper/resources/text/stopwords-el.txt sha256=MrNPZccmGguHmadO9WC762dhS2uGJnhBj8kdQoQ-jvU 13903
scrapy_helper/newspaper/resources/text/stopwords-en.txt sha256=taCMo75agEB4ZWZjws5J6Q6kVD9Z6Bmkm-cMGKYpOBU 3585
scrapy_helper/newspaper/resources/text/stopwords-es.txt sha256=g1uQlrf5_Sk_3oyzxPkA8p1gWOYB-5LvSmoiE91yHMI 2185
scrapy_helper/newspaper/resources/text/stopwords-et.txt sha256=TQBb3Q388dQ_ZVRjlPQqnrfONORKIC2cBB7UM3mYGa8 189
scrapy_helper/newspaper/resources/text/stopwords-fa.txt sha256=ulOfl-FQQ2nDBE1bdD5JF7QNCvfAMV_DcODT9eu9Ajk 7710
scrapy_helper/newspaper/resources/text/stopwords-fi.txt sha256=NH7nDTJ5u-hAIKGSe-NzAbXkb-No5uDSqvY3UND_fvo 464
scrapy_helper/newspaper/resources/text/stopwords-fr.txt sha256=I_rwIrFt7h2I8nE0yWS4QKB1OoOujpx0XFH40mx5Q04 2002
scrapy_helper/newspaper/resources/text/stopwords-he.txt sha256=qlKybwk2NjDbpyXs62qDSL6jDAW-aaCE7CtY15urPFo 1836
scrapy_helper/newspaper/resources/text/stopwords-hi.txt sha256=_fzU97E_l-n6TIjacLSfBSHokQq6YAa5Sv87bJs6TZk 2790
scrapy_helper/newspaper/resources/text/stopwords-hr.txt sha256=p5xbz63lUuH6EB1hOlQKnMAcFwHvJJz6MFMbj4eRfz0 870
scrapy_helper/newspaper/resources/text/stopwords-hu.txt sha256=qe89sgnGd6gn8LYXNLLnSNM73Lm-uUI5BwW4GTUoCB4 2337
scrapy_helper/newspaper/resources/text/stopwords-id.txt sha256=XU2IceR3UIIHqb5J67G5UmLdfa2TtQuJS8VAo_alAVs 10499
scrapy_helper/newspaper/resources/text/stopwords-it.txt sha256=ykDV7p7nQxu1R3z81FWlpe_BMgF1V6Gjgvw7mF8pYQU 1696
scrapy_helper/newspaper/resources/text/stopwords-ja.txt sha256=TEUMcdO5sEjRn-01vYNYFkkKntOXGgjiQkMqiQ9R6Uc 1006
scrapy_helper/newspaper/resources/text/stopwords-ko.txt sha256=W7uVb8xvJj8DeXludsDSU82UqvIHfaMYq-YekCzLzJ4 459
scrapy_helper/newspaper/resources/text/stopwords-lt.txt sha256=sDGDkr40JsK6graPDtmAlxJ6rSm9EbIVGAvEM5nRpKE 763
scrapy_helper/newspaper/resources/text/stopwords-mk.txt sha256=CKEzV8NJDb4jq3C97P7MuSd8qwFmcYjODGXwC3HxS78 1504
scrapy_helper/newspaper/resources/text/stopwords-nb.txt sha256=VuZbxq0aq66b4MkwezWQqtdPMxGOs8kO7IF2NSqPTSk 587
scrapy_helper/newspaper/resources/text/stopwords-nl.txt sha256=GfMWt-rO7i3IcHRCsPvZUXARVNkor3_dZrz0Tqadrkk 177
scrapy_helper/newspaper/resources/text/stopwords-no.txt sha256=jP8KBEQv6DqsZCRBSgEpieTv8QzX3YKXwE7fD5HhewY 513
scrapy_helper/newspaper/resources/text/stopwords-pl.txt sha256=ltOBOjV8JG3DgcA3xBR7t3Aad9ZngQpheGd8FoGhPPI 2015
scrapy_helper/newspaper/resources/text/stopwords-pt.txt sha256=I1xLWCygpgkd9ZZ8hYSjQ7jr4shuj-3ySSf2hkddwsc 3610
scrapy_helper/newspaper/resources/text/stopwords-ro.txt sha256=f1w8ji43rK0ptjazunOYUM2yERl7JD2ECsSiQnBXf4g 1915
scrapy_helper/newspaper/resources/text/stopwords-ru.txt sha256=soQOPcfR18HOcSoZWzWFkajvSrLG9pj-A4yP1pmrneo 4958
scrapy_helper/newspaper/resources/text/stopwords-sl.txt sha256=jPDRGLNBvW7gI9mBsv2jiMt39PKxtcm6RrAVibY2ih4 2435
scrapy_helper/newspaper/resources/text/stopwords-sr.txt sha256=ykQ0q6Qrfr_32RchZzDTZZU9dZWceQFbb_xmDR6biRM 776
scrapy_helper/newspaper/resources/text/stopwords-sv.txt sha256=j99ousYmploqqSZJt-x7BCOstegufTMg2d_ccJ5oXPY 3956
scrapy_helper/newspaper/resources/text/stopwords-sw.txt sha256=pHuLPf47kDGCjRsImLD-ush8c2nafOci1TID2GAzCxM 407
scrapy_helper/newspaper/resources/text/stopwords-th.txt sha256=ZzOqQCVYz6sCTRuY7OaJ5eMbZiF0vFFZRmTpCLGwVtI 1420
scrapy_helper/newspaper/resources/text/stopwords-tr.txt sha256=CQLIhb35bsYDvcLa-UpXNI3FuzvvQRGI6VqiGC7fe5k 1368
scrapy_helper/newspaper/resources/text/stopwords-uk.txt sha256=X0WqdluNz_g2MiLIEoPTDniqtSA93WIMkCIS8RyMn2U 4029
scrapy_helper/newspaper/resources/text/stopwords-vi.txt sha256=038u06SJb4rzOIPZAgCkSzvK8t9LH8_clDWluHUEJVo 724
scrapy_helper/newspaper/resources/text/stopwords-zh.txt sha256=v3mSCwIJx8pKSHetc8Gy8fudL3KYtyRq0mlNMWT0yYA 623
scrapy_helper/newspaper/videos/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
scrapy_helper/newspaper/videos/extractors.py sha256=sEmwlE1P4osYPziUFCuaoE5dHMWGa0fOUXiOdO1on4o 3793
scrapy_helper/newspaper/videos/videos.py sha256=SnFPZwgGSm2LN0sK29KTWjJvADqHwrnKn0Np6NS3i0U 421
scrapy_helper/newspaper/videos/__pycache__/__init__.cpython-310.pyc sha256=nC7rSGRmhyXQe3q11QIb_HW1S4Xi5Laiu2RF-ZAl2-M 186
scrapy_helper/newspaper/videos/__pycache__/extractors.cpython-310.pyc sha256=ISLJCxmJj1-O1NRp1XN7W-JCc3RdxHcPEUBBTgrtvpQ 3724
scrapy_helper/newspaper/videos/__pycache__/videos.cpython-310.pyc sha256=Qj1KQwaYXqB5xigjJaD9Zsj07aDD_Btsgoa2VFAVLSE 589
scrapy_helper/pipelines/__init__.py sha256=E01St2RpX9pbmA3mpJSBQu64x_WjjUpb7AynniISDcw 70
scrapy_helper/pipelines/mongodb.py sha256=O2lYhti1T5YwmdJk671bzUHdvKKZ9DheFiAZeBskEHM 1454
scrapy_helper/pipelines/mysql.py sha256=0DbHDWBOkmhV1vmBmWm3tN7d6GSqXPkuVFOHNtdwFe8 3085
scrapy_helper/pipelines/__pycache__/__init__.cpython-310.pyc sha256=eN7-VUce94IFvY3ToWaqJyw2-IC4mJbtVUztnSpuWpI 269
scrapy_helper/pipelines/__pycache__/mongodb.cpython-310.pyc sha256=_xrf7SIpk5oqztkeI5eQ9f-5Y1-LJ-4Mys6l-JlXKZY 1855
scrapy_helper/pipelines/__pycache__/mysql.cpython-310.pyc sha256=da84hua0lu32b4Z3MV3RTu_rDjJH3MwP7_l5s9OO5I8 2017
scrapy_helper/spiders/__init__.py sha256=oX0DBT8OMxnYmHedaLUbCf4x1oDCFQ_wnAgD_pdp2j4 55
scrapy_helper/spiders/crawl.py sha256=SFoO7gC_mU8NE4hmaEMqP9bu3HVVub7d8LNcxs4KzmY 7409
scrapy_helper/spiders/__pycache__/__init__.cpython-310.pyc sha256=Mct8Z6eTUiPX9QsqgJOT_Qgh5M4J9x_sbua0OiuKgy8 245
scrapy_helper/spiders/__pycache__/crawl.cpython-310.pyc sha256=PMaOgKBxOR2OYuFnRuet_C9WZzykYw_hstsi2oqj89I 6931
scrapy_helper-1.5.3.dist-info/LICENSE sha256=2XpMeic_OsDy6gMww2sr_7zpHO0cyPORcbZIOZJLpZo 11343
scrapy_helper-1.5.3.dist-info/METADATA sha256=iHJSCC977BM1zLGjsYlXgdI4GRFHk26bOm9qBFR3TL8 2115
scrapy_helper-1.5.3.dist-info/WHEEL sha256=pkctZYzUS4AYVn6dJ-7367OJZivF2e8RA9b_ZBjif18 92
scrapy_helper-1.5.3.dist-info/top_level.txt sha256=e_L0o1ovs_VowjhzLmtsjQMOeF_KjfS9TqSRljIobMs 14
scrapy_helper-1.5.3.dist-info/RECORD

top_level.txt

scrapy_helper