- Oracle
- Australia
- https://daiquocnguyen.github.io
- @daiqng
Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
YaRN: Efficient Context Window Extension of Large Language Models
A PyTorch-based knowledge distillation toolkit for natural language processing
LLM training code for Databricks foundation models
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
GPT4All: Chat with Local LLMs on Any Device
A Database of Real Faults and an Experimental Infrastructure to Enable Controlled Experiments in Software Engineering Research
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A curated list of Knowledge Graph related learning materials, databases, tools and other resources
Python bindings to the Tree-sitter parsing library
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
JEMMA: An Extensible Java dataset for Many ML4Code Applications
Macaron is an extensible supply-chain security analysis framework from Oracle Labs that supports a wide range of build systems and CI/CD services. It can be used to prevent supply chain attacks or …
A game theoretic approach to explain the output of any machine learning model.
Joint Multilingual Knowledge Graph Completion and Alignment (Findings of EMNLP 2022) (PyTorch)
Graph Neural Network Library for PyTorch
JDK main-line development https://openjdk.org/projects/jdk
[ICML 2021] Break-It-Fix-It: Unsupervised Learning for Program Repair
An Open-Source Framework for Prompt-Learning.
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
Repository accompanying the paper "SeSaMe: A Data Set of Semantically Similar Java Methods"