Comparing changes

base repository: mlcommons/inference
base: v5.1.1
...
head repository: mlcommons/inference
compare: master
  • 4 commits
  • 350 files changed
  • 5 contributors

Commits on Nov 11, 2025

  1. [VLM] Offline scenario, performance-only mode of the reference implementation (#2381)
    
    * Initial commit.
    
    * WIP
    
    * [Automated Commit] Format Codebase
    
    * misc
    
    * adding pydantic_typer
    
    * offline WIP
    
    * [Automated Commit] Format Codebase
    
    * [Automated Commit] Format Codebase
    
    * rename the notebook
    
    * clean-up
    
    * [Automated Commit] Format Codebase
    
    * Downgrade from 3.13 to 3.12
    
    * [Automated Commit] Format Codebase
    
    * send the response back to LoadGen one at a time
    
    * Move the ownership of the AsyncOpenAI client into Task, and clean up the client, event loop and the event loop thread
    
    * [Automated Commit] Format Codebase
    
    * fixing typo
    
    * [Automated Commit] Format Codebase
    
    * allowing --settings.min_duration to take in float or int as seconds
    
    * fix lint
    
    * [Automated Commit] Format Codebase
    
    * Parametrize use_token_latencies
    
    * [Automated Commit] Format Codebase
    
    * fix typos
    
    * [Automated Commit] Format Codebase
    
    * update README
    
    ---------
    
    Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
    wangshangsam and github-actions[bot] authored Nov 11, 2025
    808e2d7

Commits on Nov 15, 2025

  1. ceafe86

Commits on Nov 18, 2025

  1. [VLM] Server Scenario, performance and accuracy (#2388)

    * initial server version
    
    * change description for parameter use_token_latencies
    
    * [Automated Commit] Format Codebase
    
    * changes based on PR comments
    johncalesp authored Nov 18, 2025
    32b6d66

Commits on Nov 25, 2025

  1. Submission dir update (#2366)

    * Update submission checker
    
    * Add submission structure description in markdown file
    
    * Add description of placeholders in submission structure
    
    * Update preprocess submission
    
    * Add directory structure samples
    
    * Structure documentation fixes
    
    * Add power to submission structure documentation
    
    * Fixes for preprocessing submission
    
    * To be reverted: point to custom inference branch
    
    * To be reverted: use custom mlc automation branch
    
    * To be reverted: use custom branch for automation and inference repo
    
    * To be reverted: use custom branch for mlcflow and inference repo
    
    * Turn off the action
    
    * To be reverted: custom branches for mlcflow and inference repo
    
    ---------
    
    Co-authored-by: ANANDHU S
    pgmpablo157321 and anandhu-eng authored Nov 25, 2025
    ee809ab