Fix types of key_length and value_length

ggerganov · Jan 5, 2024 · Jan 2, 2024 · Jan 3, 2024
commit 7e9d67ca19d05ba4785f8f50952c834302b4e65a
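
The change below narrows the documented type of the two optional head-size keys from `uint64` to `uint32`; the fallback `n_embd / n_head` is unchanged. As a rough illustration of how a reader might resolve these keys, here is a minimal sketch. It assumes the metadata has already been parsed into a plain Python dict, and that `n_embd` and `n_head` come from keys named `[llm].embedding_length` and `[llm].attention.head_count`; the dict representation, those two key names, and the helper `head_dims` are assumptions for illustration and are not part of this diff.

```python
UINT32_MAX = 2**32 - 1  # largest value representable as uint32


def head_dims(metadata, arch):
    """Return (key_length, value_length) for the architecture `arch`.

    `metadata` is a hypothetical dict of already-parsed key/value pairs.
    """
    # Assumed key names for n_embd and n_head (not shown in this diff).
    n_embd = metadata[f"{arch}.embedding_length"]
    n_head = metadata[f"{arch}.attention.head_count"]

    # Documented fallback: if a key is absent, use n_embd / n_head.
    default = n_embd // n_head
    d_k = metadata.get(f"{arch}.attention.key_length", default)
    d_v = metadata.get(f"{arch}.attention.value_length", default)

    # Both values are documented as uint32, so reject anything out of range.
    for name, value in (("key_length", d_k), ("value_length", d_v)):
        if not 0 <= value <= UINT32_MAX:
            raise ValueError(f"{name}={value} does not fit in uint32")
    return d_k, d_v


meta = {"llama.embedding_length": 4096, "llama.attention.head_count": 32}
print(head_dims(meta, "llama"))  # both keys absent, so both fall back to 4096 // 32
```

If only one of the two keys is present, the other still falls back to `n_embd / n_head` independently.
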
diff --git a/docs/gguf.md b/docs/gguf.md
@@ -296,8 +296,8 @@ In the following, `[llm]` is used to fill in for the name of a specific LLM arch
 - `[llm].attention.clamp_kqv: float32`: Value (`C`) to clamp the values of the `Q`, `K`, and `V` tensors between (`[-C, C]`).
 - `[llm].attention.layer_norm_epsilon: float32`: Layer normalization epsilon.
 - `[llm].attention.layer_norm_rms_epsilon: float32`: Layer RMS normalization epsilon.
-- `[llm].attention.key_length: uint64`: The optional size of a key head, $d_k$. If not specified, it will be `n_embd / n_head`.
-- `[llm].attention.value_length: uint64`: The optional size of a value head, $d_v$. If not specified, it will be `n_embd / n_head`.
+- `[llm].attention.key_length: uint32`: The optional size of a key head, $d_k$. If not specified, it will be `n_embd / n_head`.
+- `[llm].attention.value_length: uint32`: The optional size of a value head, $d_v$. If not specified, it will be `n_embd / n_head`.
 
 #### RoPE