Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva / DataDome / Cloudflare IUAM)

Python 9,073 1,091 Updated Jun 25, 2024

A collection of books, papers, and docs.

2,119 1,025 Updated Jun 1, 2023

A collection of some books.

310 118 Updated Jul 8, 2022

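"A Consumer's Guide to Open-Source Large Models": a tutorial for quickly deploying open-source large models in a Linux environment, tailored to beginners in China.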

Jupyter Notebook 6,147 756 Updated Jul 8, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 11,058 750 Updated Jul 2, 2024

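A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are relatively cheap to train, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.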

13,099 1,210 Updated Jun 8, 2024

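A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B). Open-sources all code for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.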

Python 982 115 Updated Apr 20, 2024

MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs

Python 836 50 Updated May 15, 2023

Build a MiniLLM from scratch.

Python 248 26 Updated Jun 19, 2024

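A repository for pre-training from scratch and then SFT-ing a small-parameter Chinese LLaMa2; it runs on a single 24 GB GPU and yields a chat-llama2 with basic Chinese question-answering ability.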

Python 2,318 284 Updated May 21, 2024

This is a repository used by individuals to experiment with and reproduce the pre-training process of LLMs.

Python 275 45 Updated Apr 24, 2024

Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for plugging into langchain to load a local knowledge base for retrieval-augmented generation (RAG).

Jupyter Notebook 427 46 Updated Feb 19, 2024

Sample of using proxies to crawl Baidu search results.

Python 118 63 Updated Mar 10, 2018

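BaiduSpider, a crawler for Baidu search results. It currently supports Baidu web search, Baidu Images, Baidu Zhidao, Baidu Video, Baidu News, Baidu Wenku, Baidu Jingyan, and Baidu Baike searches.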

Python 963 206 Updated Jun 14, 2024

A hand-written wrapper around the Baidu search interface, installable via pip, with command-line support. Baidu Search unofficial API for Python with no external dependencies.

Python 52 8 Updated Oct 30, 2019

Official release of InternLM2.5 7B base and chat models. 1M context support

Python 5,716 410 Updated Jul 4, 2024

Flutter codelab examples

C 1,790 1,322 Updated Jul 5, 2024

Code for Stanford CS224u

Jupyter Notebook 2,077 892 Updated Apr 7, 2024

The official PyTorch implementation of Google's Gemma models

Python 5,155 490 Updated Jul 4, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 8,705 792 Updated Jul 1, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 33,921 3,538 Updated Jun 11, 2024
Jupyter Notebook 543 223 Updated Jun 15, 2024

Modeling, training, eval, and inference code for OLMo

Python 4,191 390 Updated Jul 7, 2024
Jupyter Notebook 707 396 Updated May 31, 2024

Code for the book Deep Learning with PyTorch by Eli Stevens, Luca Antiga, and Thomas Viehmann.

Jupyter Notebook 4,596 1,956 Updated Jul 4, 2024

Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"

Python 623 185 Updated Jan 11, 2024

Metric depth estimation from a single image

Jupyter Notebook 2,099 195 Updated May 3, 2024
Python 924 120 Updated Jan 19, 2023

Official implementation of "Neuralangelo: High-Fidelity Neural Surface Reconstruction" (CVPR 2023)

Python 4,259 382 Updated Apr 14, 2024