Skip to content
View CCCarloooo's full-sized avatar

Highlights

  • Pro

Block or report CCCarloooo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Minimalistic large language model 3D-parallelism training

Python 1,163 109 Updated Oct 9, 2024

The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Jupyter Notebook 490 46 Updated Sep 30, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 586 24 Updated Sep 21, 2024

The LLM Evaluation Framework

Python 3,211 249 Updated Oct 11, 2024

Chat first code editor. To download the packaged app:

TypeScript 4,799 313 Updated Oct 2, 2024

Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.

Python 2,083 133 Updated Aug 21, 2024

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 33,379 5,656 Updated Oct 11, 2024

machine learning and deep learning tutorials, articles and other resources

15,448 3,786 Updated Jun 12, 2024

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,827 485 Updated Oct 10, 2024
Python 58 2 Updated Sep 18, 2024

(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)

Python 123 7 Updated May 9, 2024

A full Python Implementation of the ROUGE Metric (not a wrapper)

Python 669 100 Updated Mar 26, 2023

The repository is about 100+ python programming exercise problem discussed, explained, and solved in different ways

Jupyter Notebook 2,765 1,449 Updated Jul 27, 2024

The official evaluation suite and dynamic data release for MixEval.

Python 211 32 Updated Sep 29, 2024

复旦大学自然语言处理组发布的自然语言入门练习项目的代码与报告

Jupyter Notebook 15 2 Updated Feb 25, 2022

an intro to retrieval augmented large language model

266 21 Updated Sep 9, 2023

​ 李白 👤 作为唐代杰出诗人,其诗歌作品在中国文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在数字化、智能化普及方面仍存在不足。因此,本项目旨在通过构建李白知识图谱,结合大模型训练出专业的AI智能体,以生成式对话应用的形式,推动李白文化的普及与推广。

Python 1,179 140 Updated Sep 1, 2024

Instruction Tuning with GPT-4

HTML 4,180 302 Updated Jun 11, 2023

利用LLM构建应用实践笔记

Python 613 40 Updated Apr 12, 2024

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,387 1,072 Updated May 23, 2024

本项目旨在分享大模型相关技术原理以及实战经验。

HTML 9,523 931 Updated Sep 22, 2024

The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.

Python 22 2 Updated Apr 9, 2024

本项目是一个面向小白开发者的大模型应用开发教程,在线阅读地址:https://datawhalechina.github.io/llm-universe/

Jupyter Notebook 4,534 562 Updated Sep 22, 2024

算法竞赛模板库 by 灵茶山艾府 💭💡🎈

Go 5,055 553 Updated Oct 10, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,467 287 Updated Oct 9, 2024

An advanced guide to learn English which might benefit you a lot 🎉 . 离谱的英语学习指南/英语学习教程。

HTML 36,932 4,119 Updated Jul 13, 2024

114北京挂号

Python 66 37 Updated Jun 14, 2022
Next