YaRN : correction to GPT-NeoX implementation #4093

cebtenzzre · 2023-11-15T22:19:50Z

At one point I was struggling to understand what the Metal kernel was doing with GPT-NeoX RoPE, and I think I got it wrong. I got halfway there - the comment makes it fairly obvious what is going on. But the rotation amount should be an integer and should not be multiplied by inv_ndims - inv_ndims should only be part of theta.

@jquesnelle does this seem like the right thing to do?

I learned from my mistakes, this is running on ggml-ci so I don't have to worry about error-prone manual testing across several machines.

ggml-ci

ggerganov · 2023-11-17T15:31:33Z

Is there some way to compare the results with a reference implementation. I'm confused as well at this point

maddes8cht · 2023-12-02T21:09:37Z

As far as i can see, the only reference implementation of gpt-neox is the pre-gguf implemementation in
https://github.com/ggerganov/ggml/tree/master/examples
Is this still correct?

cebtenzzre · 2023-12-02T22:45:41Z

As far as i can see, the only reference implementation of gpt-neox is the pre-gguf implemementation in

I believe he means a reference implementation of YaRN for GPT-NeoX... but there is none. There is only one for LLaMA.

ggerganov · 2024-02-19T14:31:58Z

Is this still relevant? I think RoPE is computed correctly across all backends now

cebtenzzre · 2024-03-12T19:45:52Z

Is this still relevant? I think RoPE is computed correctly across all backends now

The changes in this PR only affect RoPE when using the YaRN scaling options. I believe one should see a perplexity difference between this PR and master while using YaRN with Falcon or any other model using GPT-NeoX RoPE.

ggerganov · 2024-05-29T17:17:44Z

superseded by #7617

YaRN : correction to GPT-NeoX implementation

f824902

ggml-ci

cebtenzzre mentioned this pull request Nov 21, 2023

ggml-cuda : support stablelm rope #4156

Merged

ggerganov mentioned this pull request May 17, 2024

llama : add DeepSeek-v2-Chat support #7118

Closed

ggerganov mentioned this pull request May 29, 2024

ggml : fix YARN + add tests + add asserts #7617

Merged

ggerganov closed this May 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

YaRN : correction to GPT-NeoX implementation #4093

YaRN : correction to GPT-NeoX implementation #4093

cebtenzzre commented Nov 15, 2023

ggerganov commented Nov 17, 2023

maddes8cht commented Dec 2, 2023

cebtenzzre commented Dec 2, 2023

ggerganov commented Feb 19, 2024

cebtenzzre commented Mar 12, 2024

ggerganov commented May 29, 2024

YaRN : correction to GPT-NeoX implementation #4093

YaRN : correction to GPT-NeoX implementation #4093

Conversation

cebtenzzre commented Nov 15, 2023

ggerganov commented Nov 17, 2023

maddes8cht commented Dec 2, 2023

cebtenzzre commented Dec 2, 2023

ggerganov commented Feb 19, 2024

cebtenzzre commented Mar 12, 2024

ggerganov commented May 29, 2024