🏸
For fun
  • Shopee
  • Singapore


Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,118 3,695 Updated Nov 6, 2024

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 269 28 Updated Apr 14, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Python 5,946 461 Updated Oct 29, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,009 378 Updated Aug 7, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,497 879 Updated Oct 22, 2024

Learn large models through diagrams

Python 175 12 Updated Jul 30, 2024

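A summary of the knowledge that natural language processing (NLP) engineers need to build up, including interview questions, fundamentals, and engineering skills, to strengthen core competitiveness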

Python 6,879 1,177 Updated Aug 24, 2022

Structured Text Generation

Python 9,189 469 Updated Nov 6, 2024

Fuzzy-JSON is a compact Python package with no dependencies, designed to address the JSONDecodeError that sometimes occurs when using OpenAI's function calling.

Python 31 5 Updated Nov 1, 2024

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,514 67 Updated Oct 16, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,023 144 Updated Oct 31, 2024

⚡FlashRAG: A Python Toolkit for Efficient RAG Research

Python 1,287 105 Updated Nov 7, 2024

A lightning fast Finite State machine and REgular expression manipulation library.

C++ 1,831 128 Updated Oct 24, 2023

WikiSP, a semantic parser for Wikidata. WikiWebQuestions, a SPARQL-annotated dataset on Wikidata

Python 83 8 Updated Oct 21, 2024

Wikipedia / Wikidata search project for knowledge base RAG systems.

Python 3 Updated Jun 21, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,425 1,443 Updated Oct 21, 2024

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Python 483 33 Updated Nov 2, 2024

unified embedding model

Python 828 64 Updated Sep 1, 2023

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 13,676 1,093 Updated May 23, 2024

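Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are aggregated in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.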

Python 13,954 1,253 Updated Sep 5, 2024

Tevatron - A flexible toolkit for neural retrieval research and development.

Python 516 100 Updated Oct 20, 2024

LLM (Large Language Model) FineTuning

Jupyter Notebook 464 110 Updated May 19, 2024

Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.

Python 159 11 Updated Oct 4, 2023

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python 33,806 4,160 Updated Nov 4, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 561 40 Updated Nov 6, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,255 93 Updated Oct 8, 2024

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 335 42 Updated Nov 3, 2024

Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]

Python 524 48 Updated Mar 10, 2024

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

738 42 Updated Oct 31, 2024