Skip to content
View shicript's full-sized avatar
🎮
🎮

Block or report shicript

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Official JAX implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 334 23 Updated Aug 11, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 1,182 97 Updated Aug 22, 2024
Python 56 6 Updated Aug 18, 2024

Rephrasing Language Model for CSC (AAAI 2024)

Python 33 3 Updated May 14, 2024

CoreNet: A library for training deep neural networks

Python 6,898 536 Updated May 28, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,383 148 Updated Aug 26, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,327 231 Updated Aug 19, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,417 1,617 Updated Aug 25, 2024

The official Meta Llama 3 GitHub site

Python 25,748 2,865 Updated Aug 12, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

Python 13,128 1,282 Updated Aug 26, 2024

Python Library for Accessing the Cohere API

Python 280 60 Updated Aug 20, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,079 78 Updated Aug 8, 2024

LLM training in simple, raw C/CUDA

Cuda 22,944 2,553 Updated Aug 26, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 15,454 1,566 Updated Aug 26, 2024

The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

Python 819 115 Updated Jul 25, 2024

A programming framework for agentic AI 🤖

Jupyter Notebook 29,982 4,372 Updated Aug 26, 2024

Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".

Python 172 29 Updated Jul 16, 2024

Giant Language Model Test Room

TypeScript 450 106 Updated Jan 18, 2024
Jupyter Notebook 9,253 645 Updated Jul 29, 2024

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Python 20,410 1,559 Updated Aug 18, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,031 80 Updated Aug 20, 2024

A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current issues and future directions.

55 16 Updated Jan 18, 2024

Question and Answer based on Anything.

Python 11,158 1,070 Updated Aug 26, 2024

Netease Youdao's open-source embedding and reranker models for RAG products.

Python 1,318 87 Updated Aug 2, 2024

Grok open release

Python 49,376 8,332 Updated Aug 7, 2024

An elegent pytorch implement of transformers

Python 1,197 152 Updated Aug 26, 2024

MLNLP社区用来更好进行论文搜索的工具。Fully-automated scripts for collecting AI-related papers

Python 1,115 118 Updated Dec 16, 2023

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 63,450 7,861 Updated Aug 21, 2024

中文Mixtral混合专家大模型(Chinese Mixtral MoE LLMs)

Python 571 41 Updated Apr 30, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,615 348 Updated Aug 7, 2024
Next