Stars
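A curated collection of open-source Chinese large language models, focusing on smaller models that can be deployed privately and trained at low cost, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.
A Python web-scraping tutorial series for learning scraping from 0 to 1, covering browser and mobile app packet capture (e.g. fiddler, mitmproxy); the modules used in scraping, such as requests, beautifulSoup, selenium, appium, and scrapy; IP proxies; CAPTCHA recognition; using MySQL and MongoDB from Python; multithreaded and multiprocess scrapers; cracking CSS-based anti-scraping encryption; JS reverse engineering for scrapers,…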
OnionScan is a free and open source tool for investigating the Dark Web.
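[Large models] Train a small 26M-parameter GPT completely from scratch in 3 hours; a GPU with as little as 2 GB of memory is enough for inference and training!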
Benchmark datasets, data loaders, and evaluators for graph machine learning
freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.
Classic Space Invaders game written in JavaScript as a learning exercise.
GAN Lab: An Interactive, Visual Experimentation Tool for Generative Adversarial Networks
Workflow for creating and analyzing the Open Catalyst Dataset
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Graphormer is a general-purpose deep learning backbone for molecular modeling.
Datasets, Transforms and Models specific to Computer Vision
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Prov-GigaPath: A whole-slide foundation model for digital pathology from real-world data
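A tiny, lightweight key-value store based on a skip list, written in C++ 🔥🔥 🚀
Chinese-language repository for Llama3 and Llama3.1 (being written alongside a book... interesting weights from fine-tuned and heavily modified versions by community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)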
💻📖 Laws, Theories, Principles and Patterns that developers will find useful. #hackerlaws
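Covers C++ Primer 5th, Effective C++, STL API and demos, C++ fundamentals and theory, smart pointers, C++11, a Git tutorial, Linux commands, Unix operating systems (processes, threads, memory management, signals), computer networking, data structures (sorting, searching), databases, the C++ object model, design patterns, and algorithms (《剑指offer》, leetcode, lintcode, hihocoder, 《王道程序员求职…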
120+ interactive Python coding interview challenges (algorithms and data structures). Includes Anki flashcards.
Source code for the second edition of 《剑指Offer:名企面试官精讲典型编程面试题》