Skip to content

Bug: imatrix is now encapsulated in a GGUF file in mainline. #664

@whatever1983

Description

@whatever1983

What happened?

Mainline llama.cpp just wrapped the imatrix.dat file in a gguf format, which means Bartowski's imatrix and mradermacher's imatrix.gguf file can't be used to quantize low bit ik_llama.cpp GGUFs. What a below the belt hit on you @ikawrakow

mradermacher/Llama-3_3-Nemotron-Super-49B-v1_5-i1-GGUF/blob/main/Llama-3_3-Nemotron-Super-49B-v1_5.imatrix.gguf

Probably need to merge the imatrix GGUF patch from mainline to maintain compatibility.

Name and Version

llama.cpp

What operating system are you seeing the problem on?

No response

Relevant log output

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions