Skip to content
View cedricblondeau's full-sized avatar
💻✈️📷🍻🐱⚽️🥾🏕️⛰️
💻✈️📷🍻🐱⚽️🥾🏕️⛰️

Organizations

@bitbearstudio
Block or Report

Block or report cedricblondeau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🦍 Kong is a Jira CLI at terminal velocity

Go 10 Updated Jul 3, 2023

Workflow Engine for Kubernetes

Go 14,587 3,125 Updated Jul 11, 2024

AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kubernetes Engine

HCL 186 141 Updated Jul 11, 2024

an R-Tree library for Go

Go 604 122 Updated Feb 17, 2024

Build Conversational AI in minutes ⚡️

TypeScript 6,146 784 Updated Jul 11, 2024

A lightning-fast search API that fits effortlessly into your apps, websites, and workflow

Rust 45,233 1,707 Updated Jul 11, 2024

The Kubernetes Package Manager

Go 26,450 7,009 Updated Jul 11, 2024

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 46,229 4,455 Updated Jul 10, 2024

Open source codebase powering the HuggingChat app

TypeScript 6,792 958 Updated Jul 11, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,122 152 Updated Jul 11, 2024

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 1,439 163 Updated Jul 2, 2024

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,669 213 Updated Sep 30, 2023

The Triton TensorRT-LLM Backend

Python 593 84 Updated Jul 9, 2024

Fast inference engine for Transformer models

C++ 3,044 271 Updated Jul 11, 2024

LLMPerf is a library for validating and benchmarking LLMs

Python 480 71 Updated Jul 8, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7,445 803 Updated Jul 11, 2024

Django Channels HTTP/WebSocket server

Python 2,317 256 Updated Jul 2, 2024

An ASGI web server, for Python. 🦄

Python 8,132 704 Updated Jul 11, 2024

Declarative Continuous Deployment for Kubernetes

Go 16,778 5,078 Updated Jul 11, 2024

Progressive Delivery for Kubernetes

Go 2,575 817 Updated Jul 11, 2024

Netflix's Hystrix latency and fault tolerance library, for Go

Go 4,190 476 Updated Feb 24, 2024

The first real AI developer

Python 29,115 2,907 Updated Jul 10, 2024

✨ Textbase is a simple framework for building AI chatbots. ✨

Python 1,275 357 Updated Nov 27, 2023

Rich is a Python library for rich text and beautiful formatting in the terminal.

Python 48,220 1,695 Updated Jul 11, 2024

The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.

Python 24,176 744 Updated Jul 11, 2024

🤖 The free, Open Source OpenAI alternative. Self-hosted, community-driven and local-first. Drop-in replacement for OpenAI running on consumer-grade hardware. No GPU required. Runs gguf, transformer…

C++ 21,799 1,667 Updated Jul 11, 2024

Run your favourite LLMs locally on macOS from Swift

Swift 77 Updated Jun 8, 2023

Chat with your favourite LLaMA models in a native macOS app

Swift 1,430 52 Updated Jun 9, 2023

A Gradio web UI for Large Language Models.

Python 38,301 5,076 Updated Jul 11, 2024

French instruction-following and chat models

Jupyter Notebook 495 47 Updated Oct 23, 2023
Next