Lists (12)
Sort Name ascending (A-Z)
Stars
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Things you can do with the token embeddings of an LLM
Fast and memory-efficient exact attention
Neural Network Compression Framework for enhanced OpenVINO™ inference
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Jupyter notebooks for the Natural Language Processing with Transformers book
This repository contains demos I made with the Transformers library by HuggingFace.
Python client library for Google Maps API Web Services
Google Colaboratory useful notebooks
This sample has the full End2End process of creating RAG application with Prompty and AI Studio. It includes GPT 3.5 Turbo LLM application code, evaluations, deployment automation with AZD CLI, Git…
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
A package that can be locally executed to generate minutes in Japanese
Ignite Japan 2023 でデモを行った Azure Communication Service Call Automation と Azure OpenAI を連携させた POC シナリオです。
書籍:「Azure OpenAI Service実践ガイド ~ LLMを組み込んだシステム構築」のリポジトリ
Sample code for the Microsoft Cognitive Services Speech SDK
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
AirLLM 70B inference with single 4GB GPU
Drag & drop UI to build your customized LLM flow
🦜🔗 Build context-aware reasoning applications
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)