Skip to content
View Starshipping's full-sized avatar
📮
When you code something you execute or you fail. There is no middle ground.
📮
When you code something you execute or you fail. There is no middle ground.
  • Chicago
  • 01:56 (UTC -06:00)
Block or Report

Block or report Starshipping

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 7,519 977 Updated Jul 4, 2024
Go 108 41 Updated May 28, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 14,913 1,145 Updated Jul 26, 2024

[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton

Python 177 20 Updated Apr 6, 2024

深度学习经典、新论文逐段精读

25,177 2,340 Updated Mar 30, 2023

中国大模型

4,968 424 Updated Jun 7, 2024

The all-in-one Desktop & Docker AI application with full RAG and AI Agent capabilities.

JavaScript 17,988 1,947 Updated Jul 28, 2024

RAG LLM Ops App for easy deployment and testing

Python 329 40 Updated Jul 23, 2024

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Python 497 60 Updated Jul 28, 2024

Transformer related optimization, including BERT, GPT

C++ 5,690 878 Updated Mar 27, 2024

Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021

Python 96 19 Updated Jul 8, 2021

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Svelte 5,606 401 Updated Jul 29, 2024

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu , Terra Blevins…

Python 195 21 Updated Nov 3, 2023

ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

Python 9,348 684 Updated Jul 11, 2024

[EMNLP 2023] Adapting Language Models to Compress Long Contexts

Python 252 17 Updated Feb 26, 2024

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,453 122 Updated Apr 22, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 7,438 887 Updated Jul 29, 2024

Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone

Python 930 113 Updated Jun 17, 2024

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Python 784 45 Updated Jul 29, 2024

A Paper List for Open-Domain Dialogue Generation, and related datasets.

204 29 Updated May 24, 2020

A Mini Gradient Descent library.

Python 5 Updated Oct 11, 2023

Train transformer language models with reinforcement learning.

Python 8,890 1,091 Updated Jul 28, 2024

A preliminary evaluation of ChatGPT/GPT-4 for machine translation.

Python 236 16 Updated Nov 3, 2023

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,184 109 Updated Apr 3, 2024

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Python 196 12 Updated Jun 3, 2024

A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.

2,292 232 Updated Apr 6, 2024

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

Python 59 12 Updated Jun 15, 2021

A paper & resource list of large language models, including course, paper, demo, figures

175 8 Updated Aug 8, 2023
Next