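Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments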
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
This is a CoNLL formatted version of the OntoNotes 5.0 release.
EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
📘 Intermediate Python, Chinese version (《Python进阶》)
Telegram Bot port of the "Bullshit Article Generator" ("狗屁不通文章生成器")
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, MNN, and TNN inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); total model size is only 4.7M
A full Python Implementation of the ROUGE Metric (not a wrapper)
Calculating ROUGE score between two files (line-by-line)
These files are for hunting multiple bumps in graphs
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Natural Language Processing notes and implementations.
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
Topic-Aware Convolutional Neural Networks for Extreme Summarization
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.