Skip to content
View wajeehulhassanvii's full-sized avatar
Block or Report

Block or report wajeehulhassanvii

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Pretrain, finetune and serve LLMs on Intel platforms with Ray

Python 57 27 Updated Jul 2, 2024

The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!

Python 6,773 767 Updated Jul 2, 2024
Python 7 1 Updated Jun 20, 2024

KServe models web UI

TypeScript 30 39 Updated Jun 28, 2024

A toolkit to run Ray applications on Kubernetes

Go 972 330 Updated Jul 2, 2024

Dynamic RAG for enterprise. Ready to run with Docker,⚡in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.

3,354 192 Updated Jul 1, 2024

Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.

Python 2,841 98 Updated Jul 2, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,285 422 Updated May 3, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 22,034 3,110 Updated Jul 2, 2024
Jupyter Notebook 6 2 Updated Feb 3, 2021

🤗 AutoTrain Advanced

Python 3,615 434 Updated Jun 28, 2024

Create GNOME Shell extensions in seconds

JavaScript 24 1 Updated Apr 13, 2022

🥧 Fly-Pie is an innovative marking menu written as a GNOME Shell extension.

JavaScript 1,181 27 Updated Jun 3, 2024

Create GNOME Shell extensions in seconds

JavaScript 1,653 112 Updated Apr 3, 2024

A browser extension that generates and runs LLM prompts based on templates and user input such as selected text and the contents of the current page

JavaScript 13 Updated Apr 3, 2024

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of …

Java 2,679 573 Updated Jul 1, 2024

Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution for valuable insights.

Shell 19 1 Updated Sep 11, 2023

Memory for AI agents

Python 8,908 1,130 Updated Jun 30, 2024

Quick Guides from Dremio on Several topics

51 13 Updated Apr 25, 2024
Go 88 44 Updated Jul 2, 2024

This book is a comprehensive manual designed to empower professionals to harness the potential of AI technologies responsibly and innovatively. The book addresses the technical, ethical, and practi…

HTML 36 12 Updated Mar 15, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 10,178 696 Updated Jul 2, 2024

Create markdown-backed Kanban boards in Obsidian.

TypeScript 2,982 173 Updated Jul 2, 2024

Apache Kafka® running on Kubernetes

Java 4,622 1,256 Updated Jul 2, 2024

Kubeflow on OKE

Jupyter Notebook 6 5 Updated Jun 4, 2024

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your d…

Python 14,486 1,704 Updated Jul 2, 2024

The GUI for Milvus

TypeScript 1,029 106 Updated Jul 1, 2024

Open-source vector similarity search for Postgres

C 10,416 471 Updated Jun 30, 2024
Python 11 4 Updated Sep 15, 2023

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

Jupyter Notebook 315 55 Updated Feb 13, 2024
Next