jiangnanboy

Key information extraction from images with an LLM (large language model). It can extract key information from all kinds of bills and documents.

Python 2 Updated Apr 12, 2024

Uses Swin-Unet (Swin Transformer Unet) to recognize table structure in document images.

Python 12 Updated Feb 23, 2024

Cleaning and quality evaluation of Chinese corpora for large-model pre-training.

Java 10 2 Updated Jan 19, 2024

Table detection in images and OCR to CSV with Java.

Java 8 4 Updated Jul 18, 2023

Chinese layout detection: java-yolov8 is used to detect the layout of Chinese document images.

Java 18 7 Updated May 5, 2023

Chinese layout detection: yolov8 is used to detect the layout of Chinese document images.

Python 39 3 Updated Apr 28, 2023

Intelligent text automatic processing tool. AutoText's main functions include text error correction, image OCR, layout detection, and table structure recognition, among others. …

Java 19 2 Updated May 17, 2023

MacBERT for Chinese spelling correction.

Java 12 4 Updated May 23, 2022

Learning to rank with LightGBM, including data processing, model training, model decision visualization, model interpretability, and prediction. …

Python 249 72 Updated Sep 11, 2022

This project uses JNI to load the DLL libraries compiled from paddle-ocr's C++ code, and uses springboot for web deployment and access.

Java 25 5 Updated Dec 9, 2022

This project uses Java to load the exe file compiled from paddle-ocr's C++ code, and uses springboot for web deployment and access.

Java 55 19 Updated Dec 7, 2022

Grammatical correction: templates for Chinese grammatical error correction.

Java 5 1 Updated Jul 27, 2022

A micro scalar-valued autograd engine developed in Java, with a neural net library on top of it.

Java 4 2 Updated May 31, 2022

K12 education subject knowledge graph: graph display, knowledge point tracking, intelligent question answering, and knowledge point prediction for questions.

JavaScript 75 18 Updated Sep 11, 2022

Intelligent medical care, including disease search, related recommendations, medical question answering about diseases, and intelligent disease diagnosis.

Java 53 18 Updated Jul 9, 2023

Java for NLP (natural language processing).

Java 3 1 Updated Mar 25, 2022

jcorrector: a Chinese text error correction tool with spelling check.

Java 46 14 Updated Jan 18, 2023

GNN (graph neural network) for link prediction.

Python 34 5 Updated Sep 11, 2022

albert-fc for RE (relation extraction) on Chinese text.

Python 17 1 Updated Apr 24, 2023

Movie knowledge graph, mainly covering entity recognition, entity query, relation query, graph display, and intelligent question answering.

JavaScript 114 27 Updated Sep 11, 2022

Chinese Information Extraction Toolkit. Uses various CNN variants for entity extraction.

Python 7 2 Updated Oct 23, 2021

Joint model for intent detection and slot filling.

Jupyter Notebook 36 11 Updated Sep 11, 2022

Chinese chatbot for neural machine translation in PyTorch, including basic seq2seq, seq2seq with attention, pointer generator, seq2seq with CNN, and so on.

PLSQL 8 1 Updated Jan 22, 2021

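Spark tutorial for big data mining. Covers app traffic operations analysis, ALS recommendation, SMOTE sampling, RFM customer value segmentation, AHP (analytic hierarchy process) customer value scoring, business district mining from mobile phone location data, Markov-based intelligent email prediction, time series forecasting, association rules, movie and friend recommendation, and more.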

Java 37 15 Updated Sep 10, 2022

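Intelligent movie question answering based on a knowledge graph. neo4j is used to build the movie graph, spark ml classifies question intent, and each question is converted into a cypher query for matching.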

Java 34 9 Updated Oct 16, 2022

Text de-duplication.

4 1 Updated Jun 6, 2020

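Text Content Grapher based on key information extraction using NLP methods. Given an input document, it extracts the key information, structures it, and organizes it into a graph, giving a graph-based display of the article's semantic information.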

Python 1,320 358 Updated Oct 20, 2021

Uses Java to analyze an article and display it as a graph (mainly extracting keywords, entities, dependency parses, etc.).

Java 12 5 Updated Apr 14, 2023

Entropy is used to calculate the relevance of a query to a document. This program is mainly based on "Content-based relevance estimation on the web using inter-document similarities"…

Java 2 Updated Oct 13, 2020

Calculates the relevance between words and displays it as a graph.

Python 2 2 Updated Aug 30, 2017