Starred repositories
Utilities intended for use with Llama models.
Firefly: a large-model training tool that supports training Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
GLM-4 series: Open Multilingual Multimodal Chat LMs
ChatGLM3 series: Open Bilingual Chat LLMs
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. A commercially usable open-source multimodal chat model with performance approaching GPT-4o
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Fine-tuning of the ChatGLM-6B, ChatGLM2-6B, and ChatGLM3-6B models for specific downstream tasks, covering Freeze, LoRA, P-tuning, full-parameter fine-tuning, and more
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
The official Python client for the Huggingface Hub.
YOLOv10: Real-Time End-to-End Object Detection
The official GitHub page for the survey paper "A Survey of Large Language Models".
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
[CVPR2024] OneFormer3D: One Transformer for Unified Point Cloud Segmentation
[arXiv preprint] The official code of paper "Open-Vocabulary SAM".