
Sync master #1993

Merged
merged 24 commits into dev on Dec 20, 2024

Conversation

arjunsuresh
Contributor

No description provided.

nvzhihanj and others added 24 commits December 9, 2024 14:32
* Fix loadgen build for version numbers having "0"

* Update test-resnet50.yml

* Update test-retinanet.yml

* Update test-bert.yml
* Fix submission checker for v5.0 rgat

* Update submission_checker.py | Updates for v5.0

* [Automated Commit] Format Codebase

* Update submission_checker.py | Fixes latency_constraints for v5.0

* [Automated Commit] Format Codebase

---------

Co-authored-by: mlcommons-bot <[email protected]>
* Fix llama3-405B docker workflow

* Fix the performance sample count from 8312 to 8313

* More fixes
* Fix submission checker for v5.0 rgat

* Fix accuracy pattern for rgat, report-generator for v5.0
* More minor fixes

* Fix indentation for stats report
* Require equal issue mode for R-GAT

* Add equal issue note in readme

---------

Co-authored-by: Miro <[email protected]>
* Fixes #1648, restrict loadgen uncommitted error message to within the loadgen directory

* Update test-rnnt.yml (#1688)

Stopping the github action for rnnt

* Added docs init

Added github action for website publish

Update benchmark documentation

Update publish.yaml

Update publish.yaml

Update benchmark documentation

Improved the submission documentation

Fix taskname

Removed unused images

* Fix benchmark URLs

* Fix links

* Add _full variation to run commands

* Added script flow diagram

* Added docker setup command for CM, extra run options

* Added support for docker options in the docs

* Added --quiet to the CM run_cmds in docs

* Fix the test query count for cm commands

* Support ctuning-cpp implementation

* Added commands for mobilenet models

* Docs cleanup

* Docs cleanup

* Added separate files for dataset and models in the docs

* Remove redundant tab in the docs

* Fixes some WIP models in the docs

* Use the official docs page for CM installation

* Fix the dead link in docs

* Fix indentation issue in docs

* Added dockerinfo for nvidia implementation

* Added run options for gptj

* Added execution environment tabs

* Cleanup of the docs

* Cleanup of the docs

* Reordered the sections of the docs page

* Removed an unnecessary heading in the docs

* Fixes the commands for datacenter

* Fix the build --sdist for loadgen

* Fixes #1761, llama2 and mixtral runtime error on CPU systems

* Added mixtral to the benchmark list, improved benchmark docs

* Update docs for MLPerf inference v4.1

* Update docs for MLPerf inference v4.1

* Fix typo

* Gave direct link to implementation readmes

* Added tables detailing implementations

* Update vision README.md, split the frameworks into separate rows

* Update README.md

* pointed links to specific frameworks

* pointed links to specific frameworks

* Update Submission_Guidelines.md

* Update Submission_Guidelines.md

* Update Submission_Guidelines.md

* api support llama2

* Added request module and reduced max token len

* Fix for llama2 api server

* Update SUT_API offline to work for OpenAI

* Update SUT_API.py

* Minor fixes

* Fix json import in SUT_API.py

* Fix llama2 token length

* Added model name verification with server

* clean temp files

* support num_workers in LLAMA2 SUTs

* Remove batching from Offline SUT_API.py

* Update SUT_API.py

* Minor fixes for llama2 API

* Fix for llama2 API

* removed table of contents

* enabled llama2-nvidia + vllm-NM : WIP

* enabled dlrm for intel

* lower cased implementation

* added raw data input

* corrected data download commands

* renamed filename

* changes for bert and vllm

* documentation to work on custom repo and branch

* benchmark index page update

* enabled sdxl for nvidia and intel

* updated vllm server run cmd

* benchmark page information addition

* fix indentation issue

* Added submission categories

* update submission page - generate submission with or w/o using CM for benchmarking

* Updated kits dataset documentation

* Updated model parameters

* updated information

* updated non cm based benchmark

* added info about hf password

* added links to model and access tokens

* Updated reference results structure tree

* submission docs cleanup

* Some cleanups for benchmark info

* Some cleanups for benchmark info

* Some cleanups for benchmark info

* added generic stubs deepsparse

* Some cleanups for benchmark info

* Some cleanups for benchmark info

* Some cleanups for benchmark info

* Some cleanups for benchmark info (FID and CLIP data added)

* typo fix for bert deepsparse framework

* added min system requirements for models

* fixed code version

* changes for displaying reference and intel implementation tip

* added reference to installation page

* updated neural magic documentation

* Added links to the install page, redirect benchmarks page

* added tips about batch size and dataset for nvidia llama2

* fix conditions logic

* modified tips and additional run cmds

* sentence corrections

* Minor fix for the documentation

* fixed bug in deepsparse generic model stubs + styling

* added more information to stubs

* Added SCC24 readme, support reproducibility in the docs

* Made clear the custom CM repo URL format

* Support conditional implementation, setup and run tips

* Support rocm for sdxl

* Fix _short tag support

* Fix install URL

* Expose bfloat16 and float16 options for sdxl

* Expose download model to host option for sdxl

* IndySCC24 documentation added

* Improve the SCC24 docs

* Improve the support of short variation

* Improved the indyscc24 documentation

* Updated scc run commands

* removed test_query_count option for scc

* Remove scc24 in the main docs

* Remove scc24 in the main docs

* Fix docs: indentation issue on the submission page

* generalised code for skipping test query count

* Fixes for SCC24 docs

* Fix scenario text in main.py

* Fix links for scc24

* Fix links for scc24

* Improve the general docs

* Fix links for scc24

* Use float16 in scc24 doc

* Improve scc24 docs

* Improve scc24 docs

* Use float16 in scc24 doc

* fixed command bug

* Fix typo in docs

* Fix typo in docs

* Remove unnecessary indentation in docs

* initial commit for tip - native run CUDA

* Updated tip

* added docker_cm_repo_branch to more run option - docker

* Update docs for IndySCC24

* Support custom repo branch and owner for final report generation

* enabled amd implementation for llama2

* updates for amd - docs

* Fix scenarios in docs page

* formatted the files to pass the gh action

* scenarios -> fixed_scenarios in docs

* [Automated Commit] Format Codebase

* Update indyscc24-bert.md

* Update scc24.md

* updated tip for reference implementation (#1912)

* [Automated Commit] Format Codebase

* fix for run suffix (#1913)

* [Automated Commit] Format Codebase

* Update to add submission flow diagram

* Added submission flow diagram

* Update scc24.md

* changes in submission documentation (#1946)

* update results category (#1947)

* changes for adding rgat to docs (#1965)

* Update index.md | Added R-GAT details (WIP)

* Update index.md

* Create system_requirements.yml

* Update system_requirements.yml

* Update system_requirements.yml

* Update system_requirements.yml

---------

Co-authored-by: anandhu-eng <[email protected]>
Co-authored-by: ANANDHU S <[email protected]>
Co-authored-by: Michael Goin <[email protected]>
Co-authored-by: arjunsuresh <[email protected]>
Co-authored-by: Pablo Gonzalez <[email protected]>
Co-authored-by: Mitchelle Rasquinha <[email protected]>
Co-authored-by: Miro <[email protected]>
* Update automated run command section

* add cm commands for model and dataset downloads

* Update README.md

* Update cm run cmds

---------

Co-authored-by: Miro <[email protected]>
* Unify llama3 names to llama3.1-405b

* Set mlperf.conf name to llama3_1-405b
* Create test-rgat.yml

* Update test-rgat.yml

* Update test-rgat.yml

---------

Co-authored-by: Miro <[email protected]>
* Create benchmark-checklist.md for r-gat

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

* Update benchmark-checklist.md

---------

Co-authored-by: Miro <[email protected]>
@arjunsuresh arjunsuresh requested a review from a team as a code owner December 20, 2024 19:09

MLCommons CLA bot: All contributors have signed the MLCommons CLA ✍️ ✅

@arjunsuresh arjunsuresh merged commit 9309ef7 into dev Dec 20, 2024
34 of 35 checks passed
@github-actions github-actions bot locked and limited conversation to collaborators Dec 20, 2024
6 participants