Skip to content
View bravery's full-sized avatar
Block or Report

Block or report bravery

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OCR, layout analysis, reading order, line detection in 90+ languages

Python 9,243 582 Updated Jul 18, 2024

Math OCR model that outputs LaTeX and markdown

Python 637 51 Updated Jun 30, 2024

HF🤗每日简报机器人

Python 32 1 Updated Jul 18, 2024

Data processing with ML and LLM

Python 2,335 290 Updated Jul 18, 2024

🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Python 5,063 441 Updated Jul 19, 2024

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Python 1,634 109 Updated Jun 20, 2024

Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

Python 4,991 528 Updated Jul 18, 2024

CodiumAI Cover-Agent: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞

Python 4,044 275 Updated Jul 17, 2024

SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

2,854 92 Updated May 23, 2024

Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

Python 54,902 6,691 Updated Jul 16, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 12,326 1,200 Updated Jul 19, 2024

A multi-level tensor algebra superoptimizer

C++ 270 17 Updated Jul 18, 2024

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Python 349 11 Updated Jul 19, 2024

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines

Python 5,827 551 Updated Jul 19, 2024

Question and Answer based on Anything.

Python 10,728 1,032 Updated Jul 11, 2024

Forward-Looking Active REtrieval-augmented generation (FLARE)

Python 555 50 Updated Nov 20, 2023

GPT based autonomous agent that does online comprehensive research on any given topic

Python 13,243 1,665 Updated Jul 19, 2024

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,715 499 Updated Jun 14, 2024

Retrieval and Retrieval-augmented LLMs

Python 6,115 438 Updated Jul 14, 2024

Ace interviews with AI practice. Our agent role-plays personalized interview tailored to your background, listening and replying like a real interviewer. Train across personas for any situation.

Python 94 17 Updated Jun 9, 2024

Accessible large language models via k-bit quantization for PyTorch.

Python 5,782 586 Updated Jul 17, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,101 277 Updated May 4, 2024

Summarize existing representative LLMs text datasets.

737 64 Updated Jun 15, 2024

Tools to bulk download arxiv data

Python 115 17 Updated Oct 29, 2018
Python 1,442 123 Updated Apr 27, 2023

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

Python 149 22 Updated Jun 18, 2024

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python 498 50 Updated Jul 6, 2024

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Python 561 41 Updated Apr 30, 2024

State-of-the-art LLM-based translation models.

Ruby 362 26 Updated Jun 20, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,238 223 Updated Jul 8, 2024
Next