Stars
Sample code from the Neural Networks from Scratch book.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Convert PDF to markdown quickly with high accuracy
Tesseract Open Source OCR Engine (main repository)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
GraphRAG-Ollama-UI + GraphRAG4OpenWebUI 融合版(有gradio webui配置生成RAG索引,有fastapi提供RAG API服务)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
A modular graph-based Retrieval-Augmented Generation (RAG) system
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Translation plugin for IntelliJ based IDEs/Android Studio.
An article about information extraction from text based documents such as PDF documents using neural networks.
基于关键词的无监督文本分类;Implementation for paper "Text Classification by Bootstrapping with Keywords, EM and Shrinkage" https://www.cs.cmu.edu/~knigam/papers/keywordcat-aclws99.pdf
https://arxiv.org/pdf/1909.04054
Python code for classification of documents into different classes using machine learning
Next generation face swapper and enhancer
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Build AI Assistants with memory, knowledge and tools.
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
A Gradio web UI for Large Language Models.
Real time interactive streaming digital human
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".