Skip to content
View varuy322's full-sized avatar
  • Shanghai Artificial Intelligence Laboratory
  • Shanghai, PRC
  • 12:30 (UTC +08:00)
Block or Report

Block or report varuy322

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

The hub for EleutherAI's work on interpretability and learning dynamics

Jupyter Notebook 2,150 157 Updated Jul 12, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 1,479 85 Updated Jul 18, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 6,719 981 Updated Jul 12, 2024

Making big AI models cheaper, easier, and more scalable

Python 1 Updated Feb 23, 2023

Out-of-distribution detection, robustness, and generalization resources. The repository contains a professionally curated list of papers, tutorials, books, videos, articles and open-source librarie…

615 62 Updated Jun 26, 2024

Ongoing research training transformer models at scale

Python 9,443 2,126 Updated Jul 18, 2024
Python 7,018 540 Updated Jul 13, 2024

Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models

Python 29 3 Updated Jul 17, 2024
Jupyter Notebook 14 Updated Feb 21, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 681 68 Updated Jul 19, 2024

A unified evaluation framework for large language models

Python 2,282 178 Updated Jun 29, 2024

DataComp for Language Models

HTML 186 10 Updated Jul 18, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Python 4,227 319 Updated Jul 12, 2024

Bring portraits to life!

Python 7,021 619 Updated Jul 17, 2024

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 105 5 Updated Jun 21, 2024

A simple program to view MIT-BIH waveform data and annotations.

Python 5 Updated Jul 12, 2024

RAG-GPT, leveraging LLM and RAG technology, learns from user-customized knowledge bases to provide contextually relevant answers for a wide range of queries, ensuring rapid and accurate information…

Python 230 26 Updated Jul 19, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

2,594 213 Updated Jul 3, 2024

A collection of AWESOME things about mixture-of-experts

847 62 Updated Jun 25, 2024

DeepSeek LLM: Let there be answers

Makefile 1,337 89 Updated Feb 4, 2024

中文医学NLP公开资源整理:术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc

2,055 352 Updated Jan 17, 2024

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 2,522 221 Updated Jul 19, 2024

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

622 26 Updated Jul 17, 2024

The code and data for the paper JiuZhang3.0

Python 28 1 Updated May 26, 2024

Open, Multi-modal Catalog for Data & AI

Java 1,971 280 Updated Jul 18, 2024

Audio generation using diffusion models, in PyTorch.

Python 1,878 163 Updated Jun 12, 2023

A repository for research on medium sized language models.

Python 350 47 Updated Jul 17, 2024

Repository for analysis and experiments in the BigCode project.

Jupyter Notebook 110 20 Updated Mar 20, 2024

Instruction Tuning with GPT-4

HTML 4,098 295 Updated Jun 11, 2023

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

Jupyter Notebook 6,530 803 Updated Jul 18, 2024
Next