I'm just a generic research assistant who likes to program for fun. I'm curious about acceleration, specifically the effective and efficient use of LLMs on the edge. Think Mixtral 8x7B in your pocket.
Popular repositories
qwen (Public, forked from QwenLM/Qwen)
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Python
pytorch (Public, forked from pytorch/pytorch)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python
mistral (Public, forked from mistralai/mistral-inference)
Reference implementation of Mistral AI 7B v0.1 model.
Jupyter Notebook