Skip to content
View yjc11's full-sized avatar

Highlights

  • Pro

Block or report yjc11

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 12,644 1,031 Updated Jul 5, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,511 246 Updated Nov 2, 2024

复盘所有NLP比赛的TOP方案,只关注NLP比赛,持续更新中!

2,669 420 Updated Oct 9, 2024

新浪微博爬虫,用python爬取新浪微博数据,并下载微博图片和微博视频

Python 3,454 768 Updated Oct 31, 2024

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

Python 8,830 1,616 Updated Nov 12, 2024

Build resilient language agents as graphs.

Python 6,601 1,056 Updated Nov 12, 2024

An all-in-one LLMs Chat UI for Apple Silicon Mac using MLX Framework.

Python 1,479 134 Updated Sep 6, 2024

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,300 255 Updated Jun 24, 2024

A repository that contains models, datasets, and fine-tuning techniques for DB-GPT, with the purpose of enhancing model performance in Text-to-SQL

Python 1,438 185 Updated Sep 1, 2024

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL、Text2API、Text2Vis and more.

1,854 141 Updated Oct 28, 2024

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,554 684 Updated Nov 3, 2024

Fast Segment Anything

Python 7,488 708 Updated Jul 30, 2024

A curated list of resources dedicated to table recognition

374 51 Updated Jan 28, 2024
Python 2 1 Updated Mar 18, 2024

A large scale camera-taken table detection and recognition dataset.

Python 112 8 Updated Oct 12, 2023

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,472 1,932 Updated Apr 4, 2024

Must-read papers on graph neural networks (GNN)

16,017 2,992 Updated Dec 20, 2023

Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习

Python 13,458 3,810 Updated Nov 1, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 69,039 14,503 Updated May 10, 2024

AI Code Completions

Shell 10,631 498 Updated Jul 3, 2024

deep learning for image processing including classification and object-detection etc.

Python 23,115 7,992 Updated Jul 25, 2024