Skip to content
View venkatkalluru's full-sized avatar
Block or Report

Block or report venkatkalluru

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,061 3,267 Updated Jul 17, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,040 688 Updated May 31, 2024

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

Python 5,858 210 Updated Jul 2, 2024

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Go 78,858 6,002 Updated Jul 17, 2024

Sampling profiler for Python programs

Rust 12,207 401 Updated Jul 14, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 33,900 3,974 Updated Jul 17, 2024

Cost monitoring for Kubernetes workloads and cloud costs

Go 4,892 527 Updated Jul 16, 2024

πŸ’« Industrial-strength Natural Language Processing (NLP) in Python

Python 29,301 4,325 Updated Jul 12, 2024

πŸ’₯ Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,721 753 Updated Jul 16, 2024

Podman: A tool for managing OCI containers and pods.

Go 22,497 2,309 Updated Jul 17, 2024

A command-line benchmarking tool

Rust 20,839 339 Updated Jul 17, 2024

Bandit is a tool designed to find common security issues in Python code.

Python 6,163 593 Updated Jul 8, 2024

⚑ A Fast, Extensible Progress Bar for Python and CLI

Python 27,983 1,340 Updated Jul 14, 2024

LlamaIndex is a data framework for your LLM applications

Python 33,660 4,728 Updated Jul 17, 2024

18 Lessons, Get Started Building with Generative AI πŸ”— https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 56,635 29,089 Updated Jul 11, 2024

πŸ€— The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 18,767 2,591 Updated Jul 16, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,806 548 Updated Jul 16, 2024

the AI-native open-source embedding database

Rust 13,653 1,151 Updated Jul 17, 2024

OpenAPI Generator allows generation of API client libraries (SDK generation), server stubs, documentation and configuration automatically given an OpenAPI Spec (v2, v3)

Java 20,708 6,315 Updated Jul 17, 2024

Multilingual Sentence & Image Embeddings with BERT

Python 14,445 2,399 Updated Jul 14, 2024

Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Rust 18,889 1,297 Updated Jul 17, 2024

Examples and guides for using the OpenAI API

MDX 57,673 9,101 Updated Jul 17, 2024

Kubernetes IN Docker - local clusters for testing Kubernetes

Go 13,068 1,507 Updated Jul 15, 2024

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 7,782 1,423 Updated Jul 17, 2024

Declarative Continuous Deployment for Kubernetes

Go 16,814 5,088 Updated Jul 17, 2024

Numbers every LLM developer should know

3,994 138 Updated Jan 16, 2024

TensorFlow code and pre-trained models for BERT

Python 37,507 9,540 Updated Jul 16, 2024

An extremely fast Python linter and code formatter, written in Rust.

Rust 29,109 947 Updated Jul 17, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 21,985 5,439 Updated Jun 11, 2024

Python packaging and dependency management made easy

Python 30,468 2,233 Updated Jul 16, 2024
Next