
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,207 661 Updated Jun 30, 2024

Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"

Python 8 Updated Jul 5, 2024

Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset"

Python 51 3 Updated May 6, 2024

[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.

Python 338 32 Updated Jul 11, 2024

[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models

Python 217 14 Updated May 5, 2024

An open-source project dedicated to building a foundational large language model for natural science, mainly physics, chemistry, and materials science.

Jupyter Notebook 167 23 Updated Feb 14, 2024

Web-scraping tool for downloading content from the following publishers: Elsevier, RSC, Web of Science, Springer Nature, Wiley.

Python 13 1 Updated Dec 2, 2023

Large Language Model Text Generation Inference

Python 8,374 950 Updated Jul 12, 2024

Due to the restrictions on LLaMA, we try to reimplement BLOOM-LoRA (under the much less restrictive BLOOM license, see https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Jupyter Notebook 184 39 Updated Jun 18, 2023

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 965 101 Updated Nov 21, 2023

Finetuning a small BLOOMZ model (bloomz-560m) on a small dataset and with limited resources.

Jupyter Notebook 16 4 Updated May 10, 2023

Tencent Pre-training framework in PyTorch & Pre-trained Model Zoo

Python 1,002 140 Updated Jun 8, 2024

TianGong-AI-Unstructure

Python 43 26 Updated Jul 7, 2024

NLTK Data

Python 1,386 1,022 Updated Jul 12, 2024

An Open-sourced Knowledgeable Large Language Model Framework.

Python 1,126 118 Updated Jun 26, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 21,951 5,433 Updated Jun 11, 2024

LLM training in simple, raw C/CUDA

Cuda 21,628 2,353 Updated Jul 12, 2024

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 5,725 437 Updated Jul 12, 2024

The official implementation of MarineGPT

Python 24 1 Updated Mar 2, 2024

An application for interacting with different LLMs, with the option to provide PDF, web, and CSV links for context.

Python 12 2 Updated Apr 3, 2024

Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"

Python 103 4 Updated Jun 5, 2024

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

642 41 Updated May 8, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,591 741 Updated May 19, 2024

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

Python 1,632 141 Updated May 25, 2024

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,011 688 Updated May 31, 2024

Code and packages for the paper "Evaluating Retrieval Quality in Retrieval-Augmented Generation".

Python 7 Updated May 2, 2024

Measuring Massive Multitask Language Understanding | ICLR 2021

Python 1,063 82 Updated May 28, 2023

Continual Learning of Large Language Models: A Comprehensive Survey

163 11 Updated Jul 2, 2024