Skip to content
View LLMFocus's full-sized avatar
Block or Report

Block or report LLMFocus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation based on webscraped customer use-cases.

Python 10 2 Updated Jan 23, 2024

A unified codebase for finetuning (full, lora) large multimodal models, supporting llava-1.5, qwen-vl, llava-interleave, llava-next-video, phi3-v etc.

Python 47 2 Updated Jul 23, 2024

Implementation of Paint-with-words with Stable Diffusion : method from eDiff-I that let you generate image from text-labeled segmentation map.

Jupyter Notebook 633 51 Updated Mar 24, 2023

Stable Diffusion Painting

Python 1,617 114 Updated Apr 25, 2024

online image editor

JavaScript 2,617 610 Updated Jun 30, 2024

Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources. It listens on a dedicated port for ea…

Go 43 3 Updated Jul 24, 2024

EfficientViT is a new family of vision models for efficient high-resolution vision.

Python 1,659 147 Updated Jul 11, 2024

An open source implementation of CLIP.

Python 9,308 928 Updated Jul 23, 2024

Lightweight, performant, deep table extraction

Python 50 4 Updated Jul 12, 2024

UniTable: Towards a Unified Table Foundation Model

Jupyter Notebook 298 18 Updated Jun 4, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 8,516 546 Updated Apr 16, 2024

Code for Text2Performer. Paper: Text2Performer: Text-Driven Human Video Generation

Python 312 18 Updated Sep 29, 2023

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

HTML 7,776 618 Updated Jul 24, 2024

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

Python 1,461 234 Updated Apr 14, 2024

Claude Plus is an advanced AI-powered development assistant that combines the capabilities of Anthropic's Claude AI with a suite of development tools.

Python 5 Updated Jul 20, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,485 355 Updated Jul 10, 2024

🎙️ Speak with AI - Run locally using ollama or OpenAI - XTTS or OpenAI Speech or ElevenLabs

Python 30 8 Updated Jul 24, 2024

An app that blurs faces in realtime using VisionCamera, Skia and MLKit 😷

TypeScript 77 5 Updated May 10, 2024

Video anonymization by face detection

Python 590 83 Updated May 23, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,164 161 Updated Jul 16, 2024

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Python 1,336 154 Updated Jun 9, 2024

Multi-Aspect Vision Language Pretraining - CVPR2024

Python 39 1 Updated Jul 1, 2024

[Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Python 103 7 Updated Feb 7, 2024

A collection of resources on applications of multi-modal learning in medical imaging.

404 41 Updated Jul 18, 2024

"Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"

Python 11 1 Updated Jun 24, 2024

BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks

Python 322 31 Updated Apr 24, 2024
Python 370 31 Updated Aug 23, 2023

Agent benchmark for medical diagnosis

Python 56 5 Updated Jun 28, 2024

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python 53,081 7,126 Updated Jul 23, 2024

distributed trainer for LLMs

Python 503 73 Updated May 20, 2024
Next