Skip to content
View ZeningLin's full-sized avatar
  • South China University of Technology
  • Guangzhou, China
  • 18:08 (UTC +08:00)

Highlights

  • Pro

Organizations

@SCUT-DLVCLab

Block or report ZeningLin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

TongGu, a classical Chinese language model.

4 Updated Sep 28, 2024

本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。

422 34 Updated Apr 22, 2024

🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites

TypeScript 6,775 2,459 Updated Sep 28, 2024

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 2,367 132 Updated Sep 24, 2024

OCR Annotations from Amazon Textract for Industry Documents Library

Python 99 6 Updated Aug 20, 2022

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 2,861 334 Updated Aug 19, 2024

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

1,396 99 Updated Mar 31, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,070 838 Updated Jul 1, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 67,995 14,426 Updated May 10, 2024
Python 2 Updated Aug 5, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,102 846 Updated Sep 13, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,627 438 Updated Sep 19, 2024
Python 57 9 Updated Aug 27, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

3,387 296 Updated Sep 20, 2024

Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations

Python 49 1 Updated Jul 15, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 4,931 335 Updated Sep 20, 2024

CV算法岗知识点及面试问答汇总,主要分为计算机视觉、机器学习、图像处理和 C++基础四大块,一起努力向offers发起冲击!

1,582 266 Updated Nov 2, 2021

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,732 385 Updated Sep 26, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 37,588 3,951 Updated Jul 28, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 3,826 406 Updated Sep 29, 2024

Official implementation of UPOCR: Towards unified pixel-level OCR interface (ICML 2024)

Python 37 2 Updated Jun 6, 2024

A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。

JavaScript 34,621 4,234 Updated Sep 27, 2024

Pure Pytorch Docker Images.

Shell 408 68 Updated Mar 13, 2024

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 295 28 Updated Sep 24, 2024

Repository for the KVP10k dataset

Python 11 2 Updated Sep 4, 2024

VimTS: A Unified Video and Image Text Spotter

Python 71 6 Updated Jun 13, 2024

深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)

7,499 1,314 Updated Apr 24, 2024

The official Meta Llama 3 GitHub site

Python 26,371 2,978 Updated Aug 12, 2024

Document Artifical Intelligence

116 4 Updated Sep 29, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,788 303 Updated Sep 29, 2024
Next