-
Notifications
You must be signed in to change notification settings - Fork 966
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Are derived shapes possible? #227
Comments
Actually, closing this because I can destroy and rebuild a graph when needed (they are cheap). |
Yes, for now I think we should just rebuild the graph in such cases. |
I mean the recursive find-parents that takes upwards of 2 seconds would like to have a word, but yeah other than that fine |
As a part of implementing RWKV sequence mode, I want to do what basically amounts to
ggml_get_rows(ctx, model.emb, tokens)
, but wheretokens
could possibly be any length.ggml stops there; this isn't allowed. If the length of the tensor at computation time is different than the length at creation time, ggml throws an assert. This means that I need a new computation graph for each length of sequence, because I need to call
ggml_get_rows
again.Is this something that
ggml
plans to, or even can, support? I know with its style of allocation, this might not be possible without bailing each tensor out into its own allocation, but still.The text was updated successfully, but these errors were encountered: