Skip to content
View shixiangsong's full-sized avatar
  • Shanghai Jiao Tong Univerisity
  • Shanghai
  • 04:37 (UTC +08:00)

Highlights

  • Pro
Block or Report

Block or report shixiangsong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A native PyTorch Library for large model training

Python 1,477 133 Updated Aug 17, 2024

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,130 5,462 Updated Aug 14, 2024

Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount of time on any token

Python 48 Updated Oct 22, 2023

Train Models Contrastively in Pytorch

Python 493 36 Updated Aug 9, 2024

上海交通大学 Beamer 模版 | Beamer template for Shanghai Jiao Tong University

TeX 559 62 Updated Dec 27, 2023

NO TIME TO SLEEP

Python 636 24 Updated May 26, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 668 76 Updated Aug 14, 2024

A light proxy solution for HuggingFace hub.

Python 43 5 Updated Nov 6, 2023

Retrieval and Retrieval-augmented LLMs

Python 6,494 462 Updated Aug 17, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,270 2,032 Updated Aug 9, 2024

Grok open release

Python 49,334 8,320 Updated Aug 7, 2024

【升级版-Electron】Check how many CEFs are on your computer. 检测你电脑上有几个CEF.

JavaScript 1,916 27 Updated Jul 3, 2023

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Python 4,631 329 Updated Jul 31, 2024

A framework for few-shot evaluation of language models.

Python 6,160 1,631 Updated Aug 17, 2024
Jupyter Notebook 43 5 Updated Jul 13, 2024

文言文編程語言 A programming language for the ancient Chinese.

TypeScript 19,542 1,098 Updated Oct 20, 2023

This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.

Python 364 22 Updated Feb 12, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 35,699 5,530 Updated Aug 2, 2024

Artistic Fusion:Revolutionizing Mural Style Transfer with Combined GAN and Diffusion Model Techniques

Python 4 2 Updated Jan 8, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,780 399 Updated Jul 15, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,229 3,173 Updated Jul 23, 2024

2022 Chcore Lab

C 41 18 Updated Jun 7, 2022

A curated list of language modeling researches for code and related datasets.

1,217 85 Updated Aug 18, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,023 6,348 Updated Jul 26, 2024

本仓库包含上海交通大学IPADS实验室设计的操作系统课程系列实验。

C 204 50 Updated Aug 17, 2024

CHCore Lab for CS3601, SJTU

C 8 Updated Dec 24, 2023

Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。

Python 20,158 1,540 Updated Aug 18, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,093 643 Updated Aug 14, 2024
Next