modelbench

0.5.1 modelbench-0.5.1-py3-none-any.whl

Wheel Details

Project: modelbench
Version: 0.5.1
Filename: modelbench-0.5.1-py3-none-any.whl
Size: 76256 bytes
MD5: 6f01a511f236f0e0f6145975e9e7f036
SHA256: bd3cd24be507559a3c5d66deb867597234a98fa067adebbbbefcdfef31d34427
Uploaded: 2024-04-29 14:13:53 +0000
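
The SHA256 value listed above can be used to check a downloaded copy of the wheel. The following is a minimal sketch using only the standard library; the helper names `file_sha256` and `matches_listing` are hypothetical, and the path passed in is assumed to be wherever the wheel was saved locally.

```python
import hashlib

# SHA256 digest copied from the Wheel Details listing above.
EXPECTED_SHA256 = "bd3cd24be507559a3c5d66deb867597234a98fa067adebbbbefcdfef31d34427"


def file_sha256(path, chunk_size=1 << 16):
    """Return the hex SHA-256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals the expected hex digest."""
    return file_sha256(path) == expected.lower()
```

Assuming the file was saved under its listed name, `matches_listing("modelbench-0.5.1-py3-none-any.whl")` should return True for an intact download. The MD5 value is listed as well, but SHA256 is the stronger check.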

dist-info

METADATA

Metadata-Version: 2.1
Name: modelbench
Version: 0.5.1
Summary: Run benchmarks and generate reports measuring the behavior of many AI Systems.
Author: MLCommons AI Safety
Author-Email: ai-safety-engineering[at]mlcommons.org
Home-Page: https://github.com/mlcommons/modelbench
Project-Url: Repository, https://github.com/mlcommons/modelbench
License: Apache-2.0
Keywords: AI,GenAI,LLM,NLP,evaluate,measure,quality,testing,prompt,safety,compare,artificial,intelligence,Large,Language,Models
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: Intended Audience :: Information Technology
Classifier: Intended Audience :: Science/Research
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Classifier: Topic :: Scientific/Engineering
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: System :: Benchmark
Classifier: Typing :: Typed
Requires-Python: >=3.10,<3.13
Requires-Dist: casefy (<0.2.0,>=0.1.7)
Requires-Dist: click (<9.0.0,>=8.1.7)
Requires-Dist: jinja2 (<4.0.0,>=3.1.3)
Requires-Dist: jq (<2.0.0,>=1.6.0)
Requires-Dist: modelgauge[all-plugins] (>=0.5.1)
Requires-Dist: pip (<25.0,>=24.0)
Requires-Dist: retry (<0.10.0,>=0.9.2)
Requires-Dist: scipy (<2.0.0,>=1.12.0)
Requires-Dist: tabulate (<0.10.0,>=0.9.0)
Requires-Dist: termcolor (<3.0.0,>=2.4.0)
Description-Content-Type: text/markdown
[Description omitted; length: 6889 characters]
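
Each Requires-Dist line above has the shape `name[extras] (version specifiers)`. As a rough illustration of how such a line breaks down, here is a simplified sketch; the function name `parse_requires_dist` is hypothetical, and the regular expression handles only the shape seen in this listing, not the full requirement grammar (environment markers and URLs are ignored). For real use, `packaging.requirements.Requirement` from the third-party `packaging` library is the more robust choice.

```python
import re

# Matches "name", an optional "[extra1,extra2]", and an optional "(specifiers)".
_REQUIREMENT = re.compile(
    r"^\s*([A-Za-z0-9._-]+)\s*(?:\[([^\]]*)\])?\s*(?:\(([^)]*)\))?\s*$"
)


def parse_requires_dist(value):
    """Split a Requires-Dist value into (name, extras, specifiers)."""
    match = _REQUIREMENT.match(value)
    if match is None:
        raise ValueError(f"unsupported requirement format: {value!r}")
    name, extras, specifiers = match.groups()
    extras_list = [e.strip() for e in (extras or "").split(",") if e.strip()]
    return name, extras_list, (specifiers or "").strip()


print(parse_requires_dist("modelgauge[all-plugins] (>=0.5.1)"))
print(parse_requires_dist("scipy (<2.0.0,>=1.12.0)"))
```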

WHEEL

Wheel-Version: 1.0
Generator: poetry-core 1.9.0
Root-Is-Purelib: true
Tag: py3-none-any
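
The tag `py3-none-any` also appears in the wheel filename, which follows the pattern distribution-version(-build)-python-abi-platform. The sketch below splits a filename on that pattern; `parse_wheel_filename` is a hypothetical helper, and it assumes a well-formed name with no stray hyphens inside any component.

```python
def parse_wheel_filename(filename):
    """Split a wheel filename into its hyphen-separated components."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    if len(parts) == 5:
        distribution, version, python, abi, platform = parts
        build = None  # The build tag is optional and absent here.
    elif len(parts) == 6:
        distribution, version, build, python, abi, platform = parts
    else:
        raise ValueError(f"unexpected number of components: {filename!r}")
    return {
        "distribution": distribution,
        "version": version,
        "build": build,
        "python": python,
        "abi": abi,
        "platform": platform,
    }


info = parse_wheel_filename("modelbench-0.5.1-py3-none-any.whl")
print(info["python"], info["abi"], info["platform"])
```

Read this way, the tag says the wheel targets any Python 3 interpreter, requires no particular ABI, and runs on any platform, which is consistent with `Root-Is-Purelib: true`.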

RECORD

Path Digest Size (bytes)
modelbench/__init__.py sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU 0
modelbench/benchmarks.py sha256=kl-9Ub1-HmZfvahS_uijiuJwLKF7v8sWgtn7kODbtJQ 2306
modelbench/hazards.py sha256=vE8vbvlzTSYKJQGdKAe9KDC2xAOK0xmoDgPQ6FUiPok 4351
modelbench/modelgauge_runner.py sha256=isbDRHoqnPRi4RAUq2kuh3Aiydkl0EaA4UFrKD3CKOc 2661
modelbench/run.py sha256=GhoNB4FdBUau3Cz6UmOECkSDtDwhnCV_WUttLpvMGMc 13504
modelbench/scoring.py sha256=PM-7Ft-YxKAwyBjX-Smrl1nj7w0nbSHwmJmdahCQXWI 2788
modelbench/standards.json sha256=PWUPAzbcq2KAh9Sl6wbEfy98LXI7mMX8XfNCIDENvB4 1366
modelbench/static_site_generator.py sha256=oG2knrCq0DyEXEcXWjVfgFkCAnp5uFM3boYOsK_1d-Q 10293
modelbench/templates/_provisional.html sha256=zD5uU21MlUtGyVP7fDPc8n1kZ5OZpqu6N1u3Td1bTj4 947
modelbench/templates/_test_runs_legend.html sha256=VNwTyBX5BiEexLwa2IMkpVt83rQlEKR7Xy7puxoQ6YQ 481
modelbench/templates/base.html sha256=QIyHKMpbQaoQOx5kg4MgbqTR1wTqFJEyUn9ieMQqDEg 648
modelbench/templates/benchmark.html sha256=-60flSJGKaxIby6Y_vPIgIEdhTqbtaYHbSClVBsh2zA 2876
modelbench/templates/benchmarks.html sha256=smbBZy1s8pWqi9fHa2YdYQlSTA_Ha_2ZDw35dPZkULo 1047
modelbench/templates/content/general.toml sha256=OgMGFDvcZjfOHJCyTBbJRVewQW7pOT92HMTvxL_jYlA 3369
modelbench/templates/content/general_purpose_ai_chat_benchmark.toml sha256=BPdEiFFpp9qUqKlKfNbs2Bk42jmlsfX3u3UUoqu7kxM 2810
modelbench/templates/content/grades.toml sha256=9vR0ZJp4wujiFiBUSothdRCF40Kvzsys0lX4u6fK3gk 997
modelbench/templates/content/hazards.toml sha256=SrNryRXU1zB8ELZWMo10Yp8gu2nJrjWgAAPjq5GEzbQ 1041
modelbench/templates/content/suts.toml sha256=uN36NARE2po8ZgLIlq_feblnXKEqJmU9LNzbrCyak4E 732
modelbench/templates/content/tests/bbq.toml sha256=sYqtj3jTMvtcCXlWxcPr5aI6UUriZsNLgW8GTFjx9AY 70
modelbench/templates/content/tests/real_toxicity_prompts.toml sha256=cAWIXmqS5NuF2iL3IdO7h6VRwlptQqTfgQ_giiLZ51o 77
modelbench/templates/content/tests/safe-cae-benign.toml sha256=S20y4gXRFwQMnD12YeL-52NDwIw8GQeaocklZoSaKPg 121
modelbench/templates/content/tests/safe-cae.toml sha256=9qvwKE4xVfm4Th3jrmQ3GB4LT17LFvZmgUk0uAp5njE 127
modelbench/templates/content/tests/safe-cbr.toml sha256=lKd_CDrjzl0q9AVCAHVCPIKNHCXUfrPV7wKDQqqRgKU 176
modelbench/templates/content/tests/safe-gra.toml sha256=ErYYXrEXlyVJb9XSX-Vp_MTrAikH5pR3_JxkTwjhIo0 128
modelbench/templates/content/tests/safe-ssh-benign.toml sha256=rD5KOJnmjGkL3Bg3Hp845feQwM5Z4wYjTfjsencc9-c 132
modelbench/templates/content/tests/safe-ssh.toml sha256=l4Bz-x-qVM0A1ZdGvUeSO2dEH_SBO5oG9rd4PuJFLII 138
modelbench/templates/content/tests/safe-ter-benign.toml sha256=H6cOcZ56qO5pWKj5QPRbTNRptn3Q1jJ9yxZ7mAhC07Q 102
modelbench/templates/content/tests/safe-ter.toml sha256=q71-8P7urTjt1XZEAWfLK3YV5MKLgl9bcNBgo7sloO0 108
modelbench/templates/content/tests/simple_safety_tests.toml sha256=mDtOHAEGMJZB9tzbWw7sFvonKgP6YYc2An5wx3T-PZ4 83
modelbench/templates/content/tests/xstest.toml sha256=Fr_zCOeRyelY5FBkzub-6w6hBfBX7vxEpUpac4RHEO8 67
modelbench/templates/index.html sha256=GYJP6c8ZhPnh9k5jg1O5a8qPvC7sucv57wkD-EYTB6U 247
modelbench/templates/macros/benchmark_card.html sha256=wFX4VUBJYpQ94FG5D087esiK1yBhFOwJ8ktQopYIKNQ 538
modelbench/templates/macros/breadcrumb.html sha256=o3vy4AEyuN0S_ofQauVgwmc1Qk7GUramFpu3cSyIkK4 996
modelbench/templates/macros/interpret_safety_ratings.html sha256=qvpai4MpgTIX5JdO2nlupH7DkZxOQAiprGkRIa_VK5s 1872
modelbench/templates/macros/sut_card.html sha256=EN9G0LX6aHv3ZWw2OOhCxJLNCO10FvEZa_o0WUEkl78 1951
modelbench/templates/macros/test_runs.html sha256=jqWMV1HRsSD1OPRmxN5Z-hB66c5Pnf4_XvrjxJYlNmo 3160
modelbench/templates/macros/use_hazards_limitations.html sha256=5LFd6IP9FIpPTaZgQofpgc9RDFOsXd-D8vdEEEQW_S0 1316
modelbench/templates/static/images/ml_commons_logo.png sha256=FOoZC66i2hGE6PVKA6hSpyzC2A81qDrcJ-vcRCXo54Y 33565
modelbench/templates/static/style.css sha256=Mx36Q-yH_JOp0cV66gYKlGQYdeeV9FYMTz26t1ZDtyQ 29613
modelbench/templates/test_report.html sha256=nIqmtek0QVBBcGvKoH95d2ka3Qvnu4HPtiomk64Au5c 2646
modelbench/utilities.py sha256=dKw_1bXy84WHto3MN-HNwOiMqM5xP3pG_jnyBQXQ_eY 318
modelbench-0.5.1.dist-info/LICENSE.md sha256=DVQuDIgE45qn836wDaWnYhSdxoLXgpRRKH4RuTjpRZQ 10174
modelbench-0.5.1.dist-info/METADATA sha256=SugyBN-pHX9nO9RlQAvVGZoBMhH47wTtKPCFITbwwTM 8641
modelbench-0.5.1.dist-info/WHEEL sha256=sP946D7jFCHeNz5Iq4fL4Lu-PrWrFsgfLXbbkciIZwg 88
modelbench-0.5.1.dist-info/entry_points.txt sha256=I4hxcFOVRR1G8A3RzM-CGfHOxMSByGJ1XRGQ5sSDnf8 49
modelbench-0.5.1.dist-info/RECORD
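
The digests in the RECORD listing are SHA-256 hashes encoded as URL-safe base64 with the trailing `=` padding removed. A minimal sketch of how one such value can be recomputed from file contents follows; `record_digest` is a hypothetical helper name. The zero-byte `modelbench/__init__.py` entry makes a convenient check, since its digest is the hash of empty input.

```python
import base64
import hashlib


def record_digest(data: bytes) -> str:
    """Return the RECORD-style digest string for the given file contents."""
    raw = hashlib.sha256(data).digest()
    # URL-safe base64 alphabet, with the "=" padding stripped.
    encoded = base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")
    return f"sha256={encoded}"


# The empty modelbench/__init__.py (size 0) hashes to the value in the listing.
print(record_digest(b""))  # sha256=47DEQpj8HBSa-_TImW-5JCeuQeRkm5NMpJWZG3hSuFU
```

To check any other entry, read the corresponding file from the unpacked wheel in binary mode and compare the result with the Digest column. The RECORD file itself carries no digest, which is why its row lists only a path.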

entry_points.txt

modelbench = modelbench.run:cli
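
The entry reads as "name = module:attribute": installing the wheel creates a `modelbench` command that calls the `cli` object in the `modelbench.run` module. The sketch below models this with the standard library's `importlib.metadata.EntryPoint`. The group name `console_scripts` is an assumption, since the listing above does not show the section header of entry_points.txt.

```python
from importlib.metadata import EntryPoint

# Group name is assumed; the listing shows only the "name = value" line.
entry = EntryPoint(
    name="modelbench",
    value="modelbench.run:cli",
    group="console_scripts",
)

print(entry.module)  # module that is imported: modelbench.run
print(entry.attr)    # attribute looked up in it: cli

# With modelbench installed, entry.load() would import modelbench.run and
# return the cli object; it is not called here so the sketch runs without it.
```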