Skip to content
View Starshipping's full-sized avatar
📮
When you code something you execute or you fail. There is no middle ground.
📮
When you code something you execute or you fail. There is no middle ground.
  • Chicago
  • 18:35 (UTC -06:00)

Block or report Starshipping

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
63 results for source starred repositories
Clear filter

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 8,865 1,197 Updated Sep 3, 2024
Go 108 41 Updated May 28, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 16,320 1,259 Updated Sep 3, 2024

[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton

Python 179 20 Updated Apr 6, 2024

深度学习经典、新论文逐段精读

26,017 2,375 Updated Aug 8, 2024

中国大模型

5,191 430 Updated Jun 7, 2024

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

JavaScript 19,529 2,129 Updated Aug 30, 2024

RAG LLM Ops App for easy deployment and testing

Python 353 44 Updated Jul 23, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 547 67 Updated Sep 4, 2024

Transformer related optimization, including BERT, GPT

C++ 5,749 884 Updated Mar 27, 2024

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Python 97 19 Updated Jul 8, 2021

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Svelte 5,634 401 Updated Sep 4, 2024

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu , Terra Blevins…

Python 197 22 Updated Nov 3, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,362 685 Updated Jul 11, 2024

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 262 18 Updated Feb 26, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,497 124 Updated Aug 4, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,616 921 Updated Sep 2, 2024

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

Python 943 115 Updated Aug 28, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 1,132 59 Updated Sep 3, 2024

A Paper List for Open-Domain Dialogue Generation, and related datasets.

203 29 Updated May 24, 2020

A Mini Gradient Descent library.

Python 5 Updated Oct 11, 2023

Train transformer language models with reinforcement learning.

Python 9,193 1,152 Updated Sep 3, 2024

A preliminary evaluation of ChatGPT/GPT-4 for machine translation.

Python 239 16 Updated Nov 3, 2023

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,206 111 Updated Apr 3, 2024

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Python 203 13 Updated Aug 16, 2024

A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.

2,358 235 Updated Aug 8, 2024

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

Python 59 12 Updated Jun 15, 2021

A paper & resource list of large language models, including course, paper, demo, figures

182 8 Updated Aug 8, 2023
Next