Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

llama : ggml-backend integration #4766

Merged
merged 39 commits into from
Jan 12, 2024
Merged

llama : ggml-backend integration #4766

merged 39 commits into from
Jan 12, 2024

Commits on Jan 6, 2024

  1. Configuration menu
    Copy the full SHA
    33f0761 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6483328 View commit details
    Browse the repository at this point in the history
  3. fix unmap after loading

    slaren committed Jan 6, 2024
    Configuration menu
    Copy the full SHA
    a1ab35c View commit details
    Browse the repository at this point in the history
  4. batched-bench : add tensor_split param

    ggerganov authored and slaren committed Jan 6, 2024
    Configuration menu
    Copy the full SHA
    1fa7ee2 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    863ef45 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    d107459 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    ece0b0d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    2f2c367 View commit details
    Browse the repository at this point in the history
  9. cuda : do not create buffer types for devices that don't exist (fixes…

    … usage without CUDA devices available)
    slaren committed Jan 6, 2024
    Configuration menu
    Copy the full SHA
    72b74f3 View commit details
    Browse the repository at this point in the history

Commits on Jan 7, 2024

  1. ggml : fix null backend dereference (#4807)

    * ggml : fix null backend dereference
    
    * ggml : also check ggml_backend_is_cpu
    ggerganov committed Jan 7, 2024
    Configuration menu
    Copy the full SHA
    f77c72f View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7c16cf1 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    87c8207 View commit details
    Browse the repository at this point in the history
  4. llama : add cparam (split_mode) and command line argument (--split-mo…

    …de, -sm) to configure the split mode (none, layer or row)
    slaren committed Jan 7, 2024
    Configuration menu
    Copy the full SHA
    5e879c9 View commit details
    Browse the repository at this point in the history

Commits on Jan 8, 2024

  1. Configuration menu
    Copy the full SHA
    ac145fd View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    444b975 View commit details
    Browse the repository at this point in the history
  3. minor

    slaren committed Jan 8, 2024
    Configuration menu
    Copy the full SHA
    d41cef9 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5a62db3 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    4813e17 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    11583c1 View commit details
    Browse the repository at this point in the history
  7. llama : only map to a backend buffer the region of the file mapping c…

    …ontaining the tensors used in the buffer
    slaren committed Jan 8, 2024
    Configuration menu
    Copy the full SHA
    4ed5f62 View commit details
    Browse the repository at this point in the history

Commits on Jan 9, 2024

  1. Configuration menu
    Copy the full SHA
    fa76201 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2e7814a View commit details
    Browse the repository at this point in the history

Commits on Jan 10, 2024

  1. Configuration menu
    Copy the full SHA
    5d2dffc View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3cb1c1f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    07a1b05 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    3cd0cbb View commit details
    Browse the repository at this point in the history
  5. Apply suggestions from code review

    Co-authored-by: Johannes Gäßler <[email protected]>
    slaren and JohannesGaessler committed Jan 10, 2024
    Configuration menu
    Copy the full SHA
    74066f8 View commit details
    Browse the repository at this point in the history
  6. cuda : fix split buffer free

    slaren committed Jan 10, 2024
    Configuration menu
    Copy the full SHA
    c522c11 View commit details
    Browse the repository at this point in the history

Commits on Jan 11, 2024

  1. address review comments

    slaren committed Jan 11, 2024
    Configuration menu
    Copy the full SHA
    9d4ba6e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d83c084 View commit details
    Browse the repository at this point in the history
  3. fix whitespace

    slaren committed Jan 11, 2024
    Configuration menu
    Copy the full SHA
    6dcc42b View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    42aa835 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    c3681af View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    c486719 View commit details
    Browse the repository at this point in the history

Commits on Jan 12, 2024

  1. Configuration menu
    Copy the full SHA
    23c14ef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e73009e View commit details
    Browse the repository at this point in the history
  3. fix opencl

    slaren committed Jan 12, 2024
    Configuration menu
    Copy the full SHA
    1e7694e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    458674c View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    53ae0dd View commit details
    Browse the repository at this point in the history