Spark-Matcher


0.3.2 Spark_Matcher-0.3.2-py3-none-any.whl

Wheel Details

Project: Spark-Matcher
Version: 0.3.2
Filename: Spark_Matcher-0.3.2-py3-none-any.whl
Size: 290643 bytes
MD5: 9078c3a110e0bab515065b5fd1234d42
SHA256: 91be9da27e4ebe6114cc81f8924d14f56b00f15b21f3a5f2e09b5e0d4d6f4e35
Uploaded: 2023-10-31 10:01:44 +0000
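
The size and digests above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the Python standard library; the function name `verify_digests` is hypothetical and not part of Spark-Matcher or any packaging tool.

```python
import hashlib


def verify_digests(data: bytes, size: int, md5_hex: str, sha256_hex: str) -> bool:
    """Return True if the byte length and both hex digests match `data`.

    Hypothetical helper for illustration only.
    """
    return (
        len(data) == size
        and hashlib.md5(data).hexdigest() == md5_hex.lower()
        and hashlib.sha256(data).hexdigest() == sha256_hex.lower()
    )


# Usage against a local copy of the wheel (assumes the file has already
# been downloaded into the current directory):
#
#   from pathlib import Path
#   data = Path("Spark_Matcher-0.3.2-py3-none-any.whl").read_bytes()
#   ok = verify_digests(
#       data,
#       290643,
#       "9078c3a110e0bab515065b5fd1234d42",
#       "91be9da27e4ebe6114cc81f8924d14f56b00f15b21f3a5f2e09b5e0d4d6f4e35",
#   )
```

MD5 is included only because the listing reports it; the SHA256 comparison is the meaningful integrity check.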

dist-info

METADATA

Metadata-Version: 2.1
Name: Spark-Matcher
Version: 0.3.2
Summary: Record matching and entity resolution at scale in Spark
Author: Ahmet Bayraktar, Stan Leisink, Frits Hermans
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.7
Requires-Dist: pandas
Requires-Dist: numpy
Requires-Dist: scikit-learn
Requires-Dist: python-Levenshtein
Requires-Dist: thefuzz
Requires-Dist: modAL-python
Requires-Dist: pytest
Requires-Dist: multipledispatch
Requires-Dist: dill
Requires-Dist: graphframes
Requires-Dist: scipy
Requires-Dist: pandas; extra == "base"
Requires-Dist: numpy; extra == "base"
Requires-Dist: scikit-learn; extra == "base"
Requires-Dist: python-Levenshtein; extra == "base"
Requires-Dist: thefuzz; extra == "base"
Requires-Dist: modAL-python; extra == "base"
Requires-Dist: pytest; extra == "base"
Requires-Dist: multipledispatch; extra == "base"
Requires-Dist: dill; extra == "base"
Requires-Dist: graphframes; extra == "base"
Requires-Dist: scipy; extra == "base"
Requires-Dist: pandas; extra == "dev"
Requires-Dist: numpy; extra == "dev"
Requires-Dist: scikit-learn; extra == "dev"
Requires-Dist: python-Levenshtein; extra == "dev"
Requires-Dist: thefuzz; extra == "dev"
Requires-Dist: modAL-python; extra == "dev"
Requires-Dist: pytest; extra == "dev"
Requires-Dist: multipledispatch; extra == "dev"
Requires-Dist: dill; extra == "dev"
Requires-Dist: graphframes; extra == "dev"
Requires-Dist: scipy; extra == "dev"
Requires-Dist: sphinx; extra == "dev"
Requires-Dist: nbsphinx; extra == "dev"
Requires-Dist: sphinx-rtd-theme; extra == "dev"
Requires-Dist: pyspark; extra == "dev"
Requires-Dist: pyarrow; extra == "dev"
Requires-Dist: jupyterlab; extra == "dev"
Requires-Dist: pandas; extra == "doc"
Requires-Dist: numpy; extra == "doc"
Requires-Dist: scikit-learn; extra == "doc"
Requires-Dist: python-Levenshtein; extra == "doc"
Requires-Dist: thefuzz; extra == "doc"
Requires-Dist: modAL-python; extra == "doc"
Requires-Dist: pytest; extra == "doc"
Requires-Dist: multipledispatch; extra == "doc"
Requires-Dist: dill; extra == "doc"
Requires-Dist: graphframes; extra == "doc"
Requires-Dist: scipy; extra == "doc"
Requires-Dist: sphinx; extra == "doc"
Requires-Dist: nbsphinx; extra == "doc"
Requires-Dist: sphinx-rtd-theme; extra == "doc"
Provides-Extra: base
Provides-Extra: dev
Provides-Extra: doc
Description-Content-Type: text/markdown
License-File: LICENSE
[Description omitted; length: 4065 characters]
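
The Requires-Dist entries above repeat the same core packages under each extra, which makes the differences hard to see. METADATA uses email-style headers, so the standard library can group the requirements by extra. This is a rough sketch: the function name `requirements_by_extra` is hypothetical, the sample text is an abbreviated excerpt of the fields listed above, and only simple markers of the form `extra == "name"` are handled.

```python
from collections import defaultdict
from email.parser import Parser


def requirements_by_extra(metadata_text: str) -> dict:
    """Group Requires-Dist names by extra; unconditional ones go under None.

    Hypothetical helper; handles only simple `extra == "name"` markers.
    """
    msg = Parser().parsestr(metadata_text)
    grouped = defaultdict(list)
    for req in msg.get_all("Requires-Dist", []):
        name, _, marker = req.partition(";")
        marker = marker.strip()
        extra = None
        if marker.startswith("extra =="):
            # Take the text after "==" and strip the surrounding quotes.
            extra = marker.split("==", 1)[1].strip().strip("\"'")
        grouped[extra].append(name.strip())
    return dict(grouped)


# Abbreviated excerpt of the METADATA fields shown above.
sample = """\
Metadata-Version: 2.1
Name: Spark-Matcher
Version: 0.3.2
Requires-Dist: pandas
Requires-Dist: graphframes
Requires-Dist: sphinx; extra == "doc"
Requires-Dist: pyspark; extra == "dev"
Provides-Extra: dev
Provides-Extra: doc
"""

grouped = requirements_by_extra(sample)
for extra, names in grouped.items():
    print(extra, names)
```

Run against the full METADATA file, a grouping like this makes it easier to see that, in the listing above, pyspark, pyarrow, and jupyterlab appear only under the dev extra.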

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.41.3)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
spark_matcher/__init__.py sha256=vNiWJ14r_cw5t_7UDqDQIVZvladKFGyHH2avsLpN7Vg 22
spark_matcher/config.py sha256=E53Cz2bXjp5h7y_DMrDMApWnG33ESUYmRGAwsxTgUdw 57
spark_matcher/table_checkpointer.py sha256=qlalHsUZWzKR2qHxmnlEy6idJJBWUC2nuuHaMHNYwKY 3752
spark_matcher/utils.py sha256=-H83j9eK8HKl_B_8k77TFHa7QkQkeprfpYahbvmkBDk 2880
spark_matcher/activelearner/__init__.py sha256=hhFjehCn2ZMr4IGbsttnUo4AanJHbYNDrg28zCvfvW8 72
spark_matcher/activelearner/active_learner.py sha256=_H6d9mu4Ef8vlTjt_uXa-SU_zCIEFIv-Dnwgm8NaWFA 9637
spark_matcher/blocker/__init__.py sha256=Hz-BWl1sJI9OvcGJuV9IPLFIMaKZJXtaTSAqcuHZPro 124
spark_matcher/blocker/block_learner.py sha256=xXiU7_Tgd_tM6AKjruUTlNm3TnTL2a4Qf-vzJt7ZHn4 6497
spark_matcher/blocker/blocking_rules.py sha256=e1VN0GUh6BVn6E589milqyyJD_XU4F5RRurdNv6WSGo 12577
spark_matcher/data/__init__.py sha256=K8B-8iWH1EY3VBrnS8s9SPfA1rnwTV0wyHX0deiunZ8 32
spark_matcher/data/acm.csv sha256=qltQuP7I2szv3cS7lN9JYVmQkGMy4CDCysDrMFoUMMY 340434
spark_matcher/data/datasets.py sha256=NFcWP42yo9kphhYSEAX8ND8XhnOLtJofuGgaSdEBDTs 4049
spark_matcher/data/dblp.csv sha256=-k9EfRUTAqdrPGmWt2vLYSO_Ae8hXg6LgCRbFHMMwhY 325443
spark_matcher/data/stoxx50.csv sha256=Cp6p_3n5XIWK-aess-m3cenabLYa2DvvqfLqRo9nfzs 6370
spark_matcher/data/voters_1.csv sha256=0juSvxzakSOQfcVjtOyhLM8-UDT4Mjjr-cUW110NUYw 30053
spark_matcher/data/voters_2.csv sha256=SGrwdCW04wTvIFtOp5cYATKeGTsQsL6Cc0xrvEv-b8M 30042
spark_matcher/deduplicator/__init__.py sha256=6_rNjPgH8RQO_BKsKSE4EMRZBGHcjAzl7_h3pbWXr2s 66
spark_matcher/deduplicator/connected_components_calculator.py sha256=Agf2AlzTfDfw7XsmEE6P9VLBjOWYwz8PbPM03aegRWM 15191
spark_matcher/deduplicator/deduplicator.py sha256=o2T7gr-QxEJTJaGox18c_8qR3C1my6yEeGdLsMSgeVo 14013
spark_matcher/deduplicator/hierarchical_clustering.py sha256=vSUbxIWdWoy4RPuETVfsE4AuFSKON8Uu22X8TSn3Itc 5781
spark_matcher/matcher/__init__.py sha256=4N3BV95iITGHfGV54s8RsrfevEU5ipKSqV0gGJf8wwQ 51
spark_matcher/matcher/matcher.py sha256=73I8wyWXSHYwJjsmXLbspxILoND6CEk08dOldfhEgE0 5883
spark_matcher/matching_base/__init__.py sha256=HbUK0MzCLyDQXfS3zoEte_J9epgRc73bXxFOgN3CHig 67
spark_matcher/matching_base/matching_base.py sha256=aKSuc_jjf7oHpspY2zglxaEqrlm-nFPoCbG0tT0aohY 10835
spark_matcher/sampler/__init__.py sha256=ivwpojpu_qmIKCggiTH1TNkzvUd3-qCPVM05yEzauFM 100
spark_matcher/sampler/training_sampler.py sha256=Vz70c4qsGjkfkMO7tl7dbCIJAD5e8JAR_3AME73YLK0 14962
spark_matcher/scorer/__init__.py sha256=aIgA_zb-R7JJc5c3e42QfWJj6OLMMOcT661_jmfdMSk 48
spark_matcher/scorer/scorer.py sha256=mG8pCPH9q09ko4_SrafPvdTxOg2H5ctn7PlsHDd3QhM 3097
spark_matcher/similarity_metrics/__init__.py sha256=01HdVVp7RTHUHamfq4VmuPWtAtTyTphOavtyHcUkc-s 83
spark_matcher/similarity_metrics/similarity_metrics.py sha256=OV8BGc5wQSdYt3paRI97iDiUkqL_DxReAforKbA3phA 3929
Spark_Matcher-0.3.2.dist-info/LICENSE sha256=-cN1ob5KQfe3AwHdg8kcuJ5BVnR4hZt37vN1pS14JQU 18091
Spark_Matcher-0.3.2.dist-info/METADATA sha256=HFC7h0akJhZieBWePRbIOoIFzh5i6TWs8tfVEUmxNhg 6598
Spark_Matcher-0.3.2.dist-info/WHEEL sha256=Xo9-1PvkuimrydujYJAjF7pCkriuXBpUPEjma1nZyJ0 92
Spark_Matcher-0.3.2.dist-info/top_level.txt sha256=poz-JGcZfCmZLXFw86H7tpaElmr35bGzSHLJX6RzBOA 14
Spark_Matcher-0.3.2.dist-info/RECORD
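
Each digest in the RECORD listing above is the file's SHA-256 hash encoded as URL-safe base64 with the trailing `=` padding removed, followed by the file size in bytes. The RECORD file itself is listed without a digest because it cannot contain its own hash. A minimal sketch of recomputing an entry for comparison, using only the standard library (the function names `record_digest` and `matches_record` are hypothetical):

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return a RECORD-style digest string: sha256=<urlsafe base64, no padding>."""
    raw = hashlib.sha256(data).digest()
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return "sha256=" + encoded


def matches_record(data: bytes, digest: str, size: int) -> bool:
    """Check file contents against the digest and size from one RECORD row."""
    return len(data) == size and record_digest(data) == digest


# Usage against an extracted file (assumes the wheel has been unpacked
# into the current directory):
#
#   from pathlib import Path
#   data = Path("spark_matcher/config.py").read_bytes()
#   ok = matches_record(
#       data, "sha256=E53Cz2bXjp5h7y_DMrDMApWnG33ESUYmRGAwsxTgUdw", 57
#   )
```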

top_level.txt

spark_matcher