  • Shanghai Jiao Tong University
  • shanghai


Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 46 5 Updated Sep 7, 2024

LongMIT: Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets

Python 30 Updated Sep 30, 2024

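A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It can use GPT3.5/GPT-4o/GPT-o1/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer-service assistants on top of a private knowledge base.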

Python 30,257 7,962 Updated Sep 26, 2024

DataComp for Language Models

HTML 1,121 99 Updated Oct 1, 2024

[ICASSP 2024] A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Python 5 Updated Nov 27, 2023

Brand new TTS solution

Python 12,743 956 Updated Sep 30, 2024

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

Python 81 5 Updated Aug 15, 2023

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,798 302 Updated Sep 29, 2024

Parse files for optimal RAG

Python 2,714 263 Updated Sep 30, 2024

Mamba SSM architecture

Python 12,707 1,066 Updated Sep 26, 2024
C++ 98 11 Updated Apr 2, 2024

A summary of existing representative LLM text datasets.

838 83 Updated Sep 4, 2024

VideoSys: An easy and efficient system for video generation

Python 1,673 112 Updated Oct 1, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,464 446 Updated Oct 1, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 928 100 Updated Sep 30, 2024

DISC-LawLLM, an intelligent legal system utilizing large language models (LLMs) to provide a wide range of legal services

Python 518 59 Updated Apr 24, 2024

All-in-one text de-duplication

Python 593 69 Updated May 21, 2024

[ICLR 2024] Lemur: Open Foundation Models for Language Agents

Python 531 34 Updated Oct 28, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,532 346 Updated Aug 8, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,588 245 Updated Dec 12, 2023

Yuan 2.0 Large Language Model

Python 680 85 Updated Jul 11, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,311 119 Updated Jun 13, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 31,783 3,898 Updated Oct 1, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,795 1,038 Updated Sep 28, 2024

A simplified demo similar to ChatPDF

Python 192 33 Updated Mar 10, 2023

LLM inference in C/C++

C++ 65,684 9,426 Updated Oct 1, 2024

Easily train a good VC model with voice data <= 10 mins!

Python 23,444 3,491 Updated Sep 5, 2024

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…

Python 1,212 112 Updated Apr 3, 2024

Research on evaluating and aligning the values of Chinese large language models

Python 469 20 Updated Jul 20, 2023

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Get your own cross-platform ChatGPT/Gemini app in one click.

TypeScript 75,380 58,877 Updated Sep 30, 2024