emm

View on PyPIReverse Dependencies (0)

2.1.1 emm-2.1.1-py3-none-any.whl

Wheel Details

Project: emm
Version: 2.1.1
Filename: emm-2.1.1-py3-none-any.whl
Download: [link]
Size: 183488
MD5: ccfd9ed9f9dbf9e950847e381b18bc66
SHA256: c7a862b3efc6e3bb6f320f9718eb41e4f362860f7e71cb6342710c213c44a09b
Uploaded: 2024-04-23 11:27:54 +0000

dist-info

METADATA

Metadata-Version: 2.1
Name: emm
Version: 2.1.1
Summary: Entity Matching Model package
Author-Email: Max Baak <max.baak[at]ing.com>, Stephane Collot <stephane.collot[at]gmail.com>, Apoorva Mahajan <apoorva.mahajan[at]ing.com>, Tomasz Waleń <tomasz.walen[at]ing.com>, Simon Brugman <simon.brugman[at]ing.com>
Requires-Python: >=3.6
Requires-Dist: numpy (>=1.20.1)
Requires-Dist: scipy
Requires-Dist: scikit-learn (>=1.0.0)
Requires-Dist: pandas (!=1.5.0,>=1.1.0)
Requires-Dist: jinja2
Requires-Dist: rapidfuzz (<3.0.0)
Requires-Dist: regex
Requires-Dist: urllib3
Requires-Dist: recordlinkage
Requires-Dist: cleanco (>=2.2)
Requires-Dist: xgboost
Requires-Dist: sparse-dot-topn (>=1.1.1)
Requires-Dist: joblib
Requires-Dist: pyarrow (>=6.0.1)
Requires-Dist: requests
Requires-Dist: unidecode
Requires-Dist: pre-commit; extra == "dev"
Requires-Dist: gitpython; extra == "dev"
Requires-Dist: nbconvert; extra == "dev"
Requires-Dist: jupyter-client (>=5.2.3); extra == "dev"
Requires-Dist: ipykernel (>=5.1.3); extra == "dev"
Requires-Dist: matplotlib; extra == "dev"
Requires-Dist: pygments; extra == "dev"
Requires-Dist: pandoc; extra == "dev"
Requires-Dist: pympler; extra == "dev"
Requires-Dist: matplotlib; extra == "doc"
Requires-Dist: seaborn; extra == "doc"
Requires-Dist: sphinx; extra == "doc"
Requires-Dist: sphinx-material; extra == "doc"
Requires-Dist: furo; extra == "doc"
Requires-Dist: sphinx-copybutton; extra == "doc"
Requires-Dist: sphinx-autodoc-typehints; extra == "doc"
Requires-Dist: jupyter-contrib-nbextensions; extra == "doc"
Requires-Dist: nbstripout; extra == "doc"
Requires-Dist: nbsphinx; extra == "doc"
Requires-Dist: nbsphinx-link; extra == "doc"
Requires-Dist: ipywidgets; extra == "doc"
Requires-Dist: jinja2; extra == "doc"
Requires-Dist: jinja-cli; extra == "doc"
Requires-Dist: markupsafe; extra == "doc"
Requires-Dist: pandoc; extra == "doc"
Requires-Dist: jupyter-client (>=5.2.3); extra == "doc"
Requires-Dist: myst-parser; extra == "doc"
Requires-Dist: numpy (<1.24.0); extra == "spark"
Requires-Dist: pyspark (>=3.1); python_version < "3.11" and extra == "spark"
Requires-Dist: pytest; extra == "test"
Requires-Dist: pytest-ordering; extra == "test"
Requires-Dist: virtualenv; extra == "test"
Requires-Dist: pytest-benchmark; extra == "test-bench"
Requires-Dist: coverage; extra == "test-cov"
Requires-Dist: pytest-cov; extra == "test-cov"
Requires-Dist: pytest-notebook (>=0.6.1); extra == "test-notebook"
Requires-Dist: ipykernel (>=5.1.3); extra == "test-notebook"
Requires-Dist: matplotlib; extra == "test-notebook"
Requires-Dist: nbdime (<4); extra == "test-notebook"
Provides-Extra: dev
Provides-Extra: doc
Provides-Extra: spark
Provides-Extra: test
Provides-Extra: test-bench
Provides-Extra: test-cov
Provides-Extra: test-notebook
Description-Content-Type: text/markdown
License-File: LICENSE
License-File: NOTICE
[Description omitted; length: 6747 characters]

WHEEL

Wheel-Version: 1.0
Generator: bdist_wheel (0.43.0)
Root-Is-Purelib: true
Tag: py3-none-any

RECORD

Path Digest Size
emm/__init__.py sha256=KG43kP9M43Mj_1Z7y3aal8rbSCh05ZrICy6vSIEGjdY 1531
emm/parameters.py sha256=ycOO6UGcRPYPepZo3UYDidS4JODaDeLvhJwkZnkTlpU 7621
emm/resources.py sha256=mmamyzAjl-nSSEw9FyWIN55YQAKxAEUn0YJiBnuvSow 2766
emm/version.py sha256=2tY228JCMRd7C_C6MlYjIqpW5W1JMUQ86KVK439JWwQ 1150
emm/aggregation/__init__.py sha256=2oNBcoRsfNfRort5QXocr4bvF30d4t48qYxAKk37R0o 1409
emm/aggregation/base_entity_aggregation.py sha256=Vp69pxkeWtHPBD9SQBpI_kE45VcvUxgj4D49KcQY17U 8197
emm/aggregation/pandas_entity_aggregation.py sha256=w0HeZtzCAiSX0BEWhOeOwRH5HXHH9s66CHTciYzSp2s 8175
emm/aggregation/spark_entity_aggregation.py sha256=9uekI5mB5P7a9aRcW63cIF50gCyOVqVPj4uQUGBMyXY 9062
emm/base/__init__.py sha256=KSr93yM2D0PtVaBogKzZfsCPZdRaj8CAMg9JcZf5AfQ 1108
emm/base/module.py sha256=rwPKl4yjQuzhv1LC0a7WW5Te3_RCp1WIiOaHMTpk6QQ 1195
emm/base/pipeline.py sha256=lqv8QKNisg9kW9WEFtSjxE2uUgK9n8x1eGB8jEHWU4M 1255
emm/data/README.md sha256=Q7R8PpVla9Uoq8UM9ogSsUJUGiGNIKOT3G9Vq15bedc 225
emm/data/__init__.py sha256=Tqvv823woHDzVJAwyTEKVIpugkvllV6CGUNB-0WjaF4 1250
emm/data/create_data.py sha256=SDyyhFj9xyI2KWcazoJqJzpJXrTwsNvKRGBRwXNilXk 20996
emm/data/negative_data_creation.py sha256=KncDTaSV0SfAn8j5LfpeOvB3qvaxM1IvUhIPM0_XmeI 12331
emm/data/noiser.py sha256=Gh5rVXaNqS9EIrKgcFnFHO37_bCq1eC3Z1S11gClF6E 6553
emm/data/prepare_name_pairs.py sha256=nWl3bWDKnyIXxoi4xdlRefqxMw2fDounYz7QPchyJ8o 8814
emm/data/unittest_sample_namepairs.csv.gz sha256=pnSnbV-F1hDjDg1wOMXmiOyGJhpYmBSOWbstVj_hybY 7038
emm/features/__init__.py sha256=j-wPs-Yo6zNjNrE35Mm-3_a6FL0fEKkAL6ehX9X2Eq8 1220
emm/features/base_feature_extractor.py sha256=vtBcP9yZHLq_JKspubRbhGj5Ixrl86i7Cy0NnJcng1E 1241
emm/features/features_extra.py sha256=WEY9DP4e0Ej2e7eMRapZJS82mSIpLGKCzNhOtDHArIs 2533
emm/features/features_lef.py sha256=e1SvJpwZIonyV3TXycohYDKcZZ7izN8RPUjIM2RWnzc 9446
emm/features/features_name.py sha256=HVTO8A-4vRlYs76eG8h2BYtc9L87cJQ3d2KiJmo4at4 5825
emm/features/features_rank.py sha256=jFXhjmE60oOvumQClwSU5idmEOvrtg9m178OousTmW0 3940
emm/features/features_vocabulary.py sha256=dcs8lQEI3go3v51P-OW3P96t72_UkxCuURt6FIhqFCE 6269
emm/features/pandas_feature_extractor.py sha256=bH48a82J-jJtdjKEVvb4gcpr0rzciKbf66n1Q4MNnEM 8654
emm/helper/__init__.py sha256=zCvUErP7I8EJnJCU30r-tx51ukTsT0Xmp-qIa8u-Vqo 1293
emm/helper/blocking_functions.py sha256=Uos3rOw0qZPh-ozmdYJGFFF9Xv8kQ9T58z9xRCUI9nQ 2342
emm/helper/custom_path.py sha256=PMVgE7plXIEZrcTvNnRJ093CFhkuoMW0o6UJ3wYm7Q4 3744
emm/helper/io.py sha256=z8p3Rct6rAU4EU4UoyDBtj0xU4kdkHmvUAZqBWo9OkY 5483
emm/helper/sklearn_pipeline.py sha256=mMNo6_vp9fZokFONoXdZV4gPF61Zfe1ycKU8v-UXGKA 1463
emm/helper/spark_custom_reader_writer.py sha256=I7mrcSxTm19mnNWCr8VpbHOTY7GjMbFVAVAyNixYL-I 13083
emm/helper/spark_ml_pipeline.py sha256=J-l0PSZWr2BZZPLM1fV67Ozg64vm2wO5HS2H02XDiH8 3345
emm/helper/spark_utils.py sha256=B-fTwVAdqWA_JI8o12zEXhLsOhgxG1y2MD6ieBih1xk 6382
emm/helper/util.py sha256=D-kE-EgJ44OxTNA8Nkpo_uXA9tvR7QmSoaQVOaNH2-4 6205
emm/indexing/__init__.py sha256=YCXZ_Bo40K8JuEoJ0M_sAKgdhIh47swo24yzfXN85v4 1679
emm/indexing/base_indexer.py sha256=TqrAY00xJEyXM3_8SRZT0NRg_kXnYqtUmHcKw_sofQU 3447
emm/indexing/pandas_candidate_selection.py sha256=n-96pMO0Lq0Nz76uRbKmeomMTwHaXOGVA9S3IxBLx1Y 11503
emm/indexing/pandas_cos_sim_matcher.py sha256=SZMwF1j6qtBnDElOQT54s2-3SB4xJIjNkaLTaOYgID0 13277
emm/indexing/pandas_naive_indexer.py sha256=NMbiod4eg-0FNfoSglQW3Z1g9RtaGKzT0LBB2FYI98Y 2623
emm/indexing/pandas_normalized_tfidf.py sha256=uRcaUqfBQfWvHZEM19gZSe3F0RBnnPQfWbJdzu6EwdM 6588
emm/indexing/pandas_sni.py sha256=FWg23C-nAjnpdp_345NLOjgqK5OVGcB7uoj1s0V-P1g 6418
emm/indexing/spark_candidate_selection.py sha256=oSkaK_wqmLItgSincToa-qJjJCGYOeOhGf9WcvsORJQ 15490
emm/indexing/spark_character_tokenizer.py sha256=ynFdy8bsKPExmH5GyNVnnVZQliLr5k9DKsuDUoYvx_8 1444
emm/indexing/spark_cos_sim_matcher.py sha256=TgVt9G1WFnCTRKuHTqggMJSApeqZb9D-RptljLlJg-4 31867
emm/indexing/spark_indexing_utils.py sha256=slEyglVsZTSORH8vycVVnqb8UzycWMr8LIsBuVBusUM 8472
emm/indexing/spark_normalized_tfidf.py sha256=Q6hOsd76Czd4LErFdp_PS07uMoAT2eCkiW9RNGRwzMw 6534
emm/indexing/spark_sni.py sha256=I0lAAz7u40uRLgwYI62cxRCfrzcWV00gg42yBNhg5n0 11996
emm/indexing/spark_word_tokenizer.py sha256=qEbRZUqFRNDo9CfHDVBLTWTPWPZc2W4NqILWr3RujPE 1466
emm/loggers/__init__.py sha256=GgYSAUXGfYhfCFly97nK6PQIJ-ZUeNlUT0I0dg_DxM8 1166
emm/loggers/logger.py sha256=u1YZzVtQqIuB-1rpkkJ_0cvsltafSaXdDiwgr0ilVSs 2667
emm/loggers/timer.py sha256=f72d7xfV-4N_vZMzptGKlQ6PCyNuh-WNslOrVbAyR4w 3867
emm/pipeline/__init__.py sha256=5LvGD69dNWw0-0zKrjkcX6qwy-3px6nG2zLjqEPWfrg 1385
emm/pipeline/base_entity_matching.py sha256=a7GnHbX8cyNSqSfFBKDeqyol5Ka9U2Er9ASkULIujtY 14514
emm/pipeline/pandas_entity_matching.py sha256=nWZBD3ImAbihgVnxt4ENgyMeZ3QXHv3a7H9wiwv8C0E 38661
emm/pipeline/spark_entity_matching.py sha256=5JG-kHDq0YjjiTDTRwk1Q795Wq5V5Jb1OuJ_XLKGJBw 39097
emm/preprocessing/__init__.py sha256=3az-gTCmiVDxQ9wottHPJiH_3ZXzJvQkrnXiH41gnmc 1382
emm/preprocessing/abbreviation_util.py sha256=MtX3HFD4FIPF_XURUNqgqKLRiEp0Dcwab-GTOIzKP0Q 5636
emm/preprocessing/base_name_preprocessor.py sha256=IxcUNA95uonu2e0OJbM9Vk4h2FffaFYZ7OzdiQqIVmQ 4857
emm/preprocessing/functions.py sha256=i-TG7OOLMSLNEjtl3OclXIlkE9_grZhGo-Jr9JjPI1s 5011
emm/preprocessing/pandas_functions.py sha256=OSS0_3NPz5mztToOQxHF9wUmUsNNnjR9g3HWJRx2hsY 1840
emm/preprocessing/pandas_preprocessor.py sha256=NOIOWOwcktQ0S2uwNTSOtN3gIoC9h6xYWCGTALpZdLk 8336
emm/preprocessing/spark_functions.py sha256=r9q0N3uyLEEFoZgWhsWzYs7cmKMJ-k7ntmi46iRQAmg 1814
emm/preprocessing/spark_preprocessor.py sha256=gM8bqJsk6Hp_rwzA7ucdfykexiXpBB-v6LaB_-hSeLU 5505
emm/supervised_model/__init__.py sha256=LcI-HtSh1QLB2toJTorvbRXHM365B6YoNwjUTD8fsso 1567
emm/supervised_model/base_supervised_model.py sha256=FQ6iKJDVm8i1BYXD3yNJyf2pn1T9RJt2ML9N7DQLRbE 12681
emm/supervised_model/pandas_supervised_model.py sha256=5UnNKuwHWFroqFT6CdFGhE0yuB_u1GOtam9rPUueM18 11491
emm/supervised_model/spark_supervised_model.py sha256=XawgXxQpTAgObSwlG1l6dNPjXbIlPMm5tCNxtMa5ZrQ 14939
emm/threshold/__init__.py sha256=KSr93yM2D0PtVaBogKzZfsCPZdRaj8CAMg9JcZf5AfQ 1108
emm/threshold/threshold_decision.py sha256=VVt0gnHceRIiKUwEotgGFt0Ta2v2Vjy_JQMrXfawMME 6069
emm-2.1.1.dist-info/LICENSE sha256=CA9-11d3VoTsfRN7SI6iZ4S0j7J9qkOecsEQR2zhsZQ 1071
emm-2.1.1.dist-info/METADATA sha256=HhzBWJ0gZqOIifMLcA7wocwz5jnUqsst5nQHC0_V6-8 9602
emm-2.1.1.dist-info/NOTICE sha256=RVd8l2FPCjs__mpvCVxc_DjvJNDrLg5G-LIvZdWa5TA 1242
emm-2.1.1.dist-info/WHEEL sha256=GJ7t_kWBFywbagK5eo9IoUwLW6oyOeTKmQ-9iHFVNxQ 92
emm-2.1.1.dist-info/top_level.txt sha256=mP3tykwXey_RZ9z1pG_BE7EmgutTx1gWu4IKQhcCqxc 4
emm-2.1.1.dist-info/RECORD

top_level.txt

emm