Confirmation about the order of tensor dimensions #500

Closed
mwmercury opened this issue Sep 1, 2023 · 5 comments

@mwmercury

Hello! Thank you so much for developing and sharing this awesome library!
Can I ask a silly question?

I'm investigating the source code of gpt-neox and I see these lines in the convert-h5-to-ggml.py file:

    # the dimension sizes are written starting from the last PyTorch dimension
    for i in range(n_dims):
        fout.write(struct.pack("i", data.shape[n_dims - 1 - i]))
    fout.write(str)

Could you please tell me why we need to store the dimensions in reverse order?
Thank you in advance!

@YavorGIvanov
Collaborator

YavorGIvanov commented Sep 3, 2023

This is done because the dimension order in GGML is the reverse of the dimension order used in PyTorch. In PyTorch the order is N x C x H x W; in GGML it is W x H x C x N.

N is the batch dimension, C the channel dimension, H the number of rows, and W the number of columns.

In GGML, W x H x C x N corresponds to the ne[0] x ne[1] x ne[2] x ne[3] members of a tensor.

Say we have a 4-dimensional tensor named "t" in both GGML and PyTorch.
Here is the correspondence:

GGML        PyTorch
t->ne[0]    t.shape[3]
t->ne[1]    t.shape[2]
t->ne[2]    t.shape[1]
t->ne[3]    t.shape[0]
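
To make the correspondence concrete, here is a minimal Python sketch (assuming PyTorch is available; the shape 2 x 3 x 4 x 5 is just an example, not something taken from the library) showing that ne[i] matches t.shape[n_dims - 1 - i]:

    import torch

    t = torch.zeros(2, 3, 4, 5)        # PyTorch order: N x C x H x W = 2 x 3 x 4 x 5
    ne = list(reversed(t.shape))       # GGML order:    W x H x C x N = 5 x 4 x 3 x 2

    # ne[i] equals t.shape[n_dims - 1 - i], exactly the mapping in the table above
    for i in range(t.dim()):
        assert ne[i] == t.shape[t.dim() - 1 - i]

    print(ne)  # [5, 4, 3, 2]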

@mwmercury
Author

@YavorGIvanov
Thank you very much for your kind response!
I have another question: why was the decision made to reverse the dimension order relative to PyTorch? Is there a specific reason for this choice, such as better memory management or performance?

@YavorGIvanov
Collaborator

YavorGIvanov commented Sep 3, 2023

ggerganov will be able to provide the best answer here, but I don't think the library was intended to match any other deep learning library, and the initial version was written relatively quickly.

When you design the data type representing a tensor, you may decide to limit the number of dimensions to a fixed upper bound and then use a static C++ array of integers to store the size of each dimension, plus an integer for the dimension count. This avoids dynamically allocating the dimension array, making it more memory/cache friendly. It also has the advantage that every tensor has the same dimension array (ne[4]) and count. However, to make the dimension array easy to use, you need to store the dimensions in a consistent order.

E.g. this makes it easy to compare the width dimension of a 2D, 3D, and 4D tensor, because you know that the width is always stored at ne[0], rather than at ne[1], ne[2], and ne[3] respectively.
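
To illustrate the idea, here is a minimal Python sketch (not GGML's actual C implementation; the fixed-length array and the padding of unused dimensions with 1 are assumptions made for the illustration):

    MAX_DIMS = 4

    def make_ne(*dims):
        # Fixed-length "ne" array: unused trailing dimensions are padded with 1.
        return list(dims) + [1] * (MAX_DIMS - len(dims))

    ne_2d = make_ne(8, 4)        # W x H         -> [8, 4, 1, 1]
    ne_3d = make_ne(8, 4, 3)     # W x H x C     -> [8, 4, 3, 1]
    ne_4d = make_ne(8, 4, 3, 2)  # W x H x C x N -> [8, 4, 3, 2]

    # The width of every tensor is at ne[0], regardless of how many dimensions it has.
    assert ne_2d[0] == ne_3d[0] == ne_4d[0] == 8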

@YavorGIvanov
Collaborator

If you have any additional questions, you can reopen this issue or open a new one with the "question" label.

@mwmercury
Author

@YavorGIvanov
I'm sorry for my late response.
That was a very detailed and helpful explanation. Thank you so much!
