Stars
A topic-centric list of HQ open datasets.
🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native l…
High accuracy RAG for answering questions from scientific documents with citations
🎓 Path to a free self-taught education in Computer Science!
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A curated list of awesome open-source libraries for production LLMs
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding papers in more depth. ⭐⭐⭐
《Hello 算法》 (Hello Algo): an animated, illustrated data structures and algorithms tutorial with one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.
Information about job offers and interview preparation
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Practical course about Large Language Models.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
A quick guide to trending datasets, especially for instruction fine-tuning
LLM Finetuning with peft
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …
An Awesome List of Open-Source Data Engineering Projects
Python library and CLI tool to interface with Google Translate's text-to-speech API
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory
Instruct-tune LLaMA on consumer hardware
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
High-quality datasets, tools, and concepts for LLM fine-tuning.
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning