Stars
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
Chinese version of GPT2 training code, using BERT tokenizer.
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Large World Model -- Modeling Text and Video with Millions Context
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。
Wide and Deep Learning for CTR Prediction in tensorflow
自然语言处理工具Macropodus,基于Albert+BiLSTM+CRF深度学习网络架构,中文分词,词性标注,命名实体识别,新词发现,关键词,文本摘要,文本相似度,科学计算器,中文数字阿拉伯数字(罗马数字)转换,中文繁简转换,拼音转换。tookit(tool) of NLP,CWS(chinese word segnment),POS(Part-Of-Speech Tagging),NE…
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Source code for Twitter's Recommendation Algorithm
基于向量数据库与GPT3.5的通用本地知识库方案(A universal local knowledge base solution based on vector database and GPT3.5)
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
Code and data for Price Revelation from Insider Trading: Evidence from Hacked Earnings News
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
The convertor/conversion of deep learning models for different deep learning frameworks/softwares.
NLP方面的一些小的demo,包括文本生成,文本分类,文本聚类等等,使用tensorflow实现,长期更新,欢迎指正,交流
Statistical Machine Translation implementation with Python: especially IBM Model1, 2, and phrase-based machine translation.
pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Natural Language Processing Tutorial for Deep Learning Researchers
🛒 Simple recommender with matrix factorization, graph, and NLP. Beating the regular collaborative filtering baseline.
flink learning blog. https://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去…