Stars

devtools

65 repositories

A PyTorch Library for Multi-Task Learning

Python 2,009 186 Updated Oct 18, 2024

A Python Implementation of Simhash Algorithm

Python 978 223 Updated Mar 24, 2022

Compute the similarity of Chinese texts using TF feature vectors and simhash fingerprints

Python 211 74 Updated Aug 12, 2016

Commonly used Chinese stopword lists (HIT stopword list, Baidu stopword list, etc.)

4,654 2,219 Updated Jan 25, 2024

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 2,995 528 Updated May 9, 2024

Open source annotation tool for machine learning practitioners.

Python 9,532 1,722 Updated Oct 29, 2024

EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

Python 2,056 252 Updated Mar 18, 2024

Mimix: A Text Generation Tool and Pretrained Chinese Models

Python 152 17 Updated Oct 31, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,529 95 Updated Feb 16, 2024

A general-purpose neural semantic parser for mapping natural language queries into machine executable code

Python 460 111 Updated Nov 12, 2022

A Deep-Learning-Based Chinese Speech Recognition System

Python 7,819 1,896 Updated Sep 26, 2024

TextGen: Implementation of text generation models, including LLaMA, ChatGLM, BLOOM, GPT2, Seq2Seq, BART, T5, SongNet, UDA, and others, with training and prediction that work out of the box.

Python 932 108 Updated Sep 14, 2024

CRF, Partial CRF and Marginal CRF in PyTorch

Python 31 5 Updated Dec 8, 2022

Automated Phrase Mining from Massive Text Corpora in Python.

Python 168 37 Updated May 23, 2021

Baidu NLP: word segmentation, part-of-speech tagging, named entity recognition, word importance

C++ 3,870 597 Updated May 25, 2021

PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2

Python 17,946 1,439 Updated Oct 31, 2024

Language Technology Platform

Python 4,954 1,039 Updated Oct 12, 2024

A Unified Semi-Supervised Learning Codebase (NeurIPS'22)

Python 1,345 178 Updated Sep 15, 2024

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

1,603 127 Updated Sep 19, 2023

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 8,479 352 Updated Oct 13, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 37,038 5,873 Updated Aug 19, 2024

ModelScope: bring the notion of Model-as-a-Service to life.

Python 6,969 717 Updated Oct 31, 2024

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,520 682 Updated Oct 30, 2024

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Python 4,484 471 Updated Jan 8, 2024

Streamlit — A faster way to build and share data apps.

Python 35,434 3,080 Updated Oct 31, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,236 1,172 Updated Oct 1, 2024

High-accuracy NLP parser with models for 11 languages.

Python 868 153 Updated Jan 10, 2022