hedrergudene
  • Credicorp

2 stars written in C

Inference Llama 2 in one file of pure C

C · 17,165 stars · 2,039 forks · Updated Aug 6, 2024

fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.

C · 408 stars · 27 forks · Updated Jun 2, 2023