Comparing changes

base repository: ggml-org/llama.cpp
base: master
head repository: ggml-org/llama.cpp
compare: sync-ggml-25-04-03-try-fix
  • 6 commits
  • 15 files changed
  • 4 contributors

Commits on Apr 7, 2025

  1. cpu: move all the operators into a separate c++ file (except mul_mat) (ggml/1167)
    
    * cpu: refactor SIMD mappings and vectorized op functions into separate files
    
    * Fix warning for ggml_float to float
    
    * Fix warnings
    
    * cpu: move all the operations (except mul_mat) to a separate c++ file
    
    * fix whitespace
    
    * Update ggml/src/ggml-cpu/vec.h
    
    Co-authored-by: Diego Devesa <[email protected]>
    
    * Fix PR comments - use GGML_UNUSED, use cassert in ops.cpp
    
    * Reverse the order of import for ops.h and vec.h, to match what was present in ggml-cpu.c previously
    
    ---------
    
    Co-authored-by: Diego Devesa <[email protected]>
    2 people authored and ggerganov committed Apr 7, 2025
    53cb49e
  2. CUDA: don't convert BF16 weights to FP32 (ggml/1174)

    * add bf16 support
    
    * use convert_from_bf16_cuda instead of convert_unary_cuda for f32
    
    * revert 7ec5085
    
    * move functionality into convert_unary with constexpr
    CISC authored and ggerganov committed Apr 7, 2025
    4683cb4
  3. ggml : simplify Arm fp16 CPU logic (ggml/1177)

    * ggml : simplify Arm fp16 CPU logic
    
    ggml-ci
    
    * cont : bring back CUDA/MUSA checks
    
    ggml-ci
    ggerganov committed Apr 7, 2025
    e638450
  4. sync : ggml

    ggml-ci
    ggerganov committed Apr 7, 2025
    6232cee
  5. test

    ggerganov committed Apr 7, 2025
    5ef588b
  6. cont

    ggerganov committed Apr 7, 2025
    ced2648