-
Iowa State University
- Ames, Iowa
-
20:45
(UTC -05:00) - https://mingdianliu.github.io/
- @MingdianLiu
- in/mingdian-liu-205804110
- https://scholar.google.com/citations?user=I0_tNbsAAAAJ&hl=en
Block or Report
Block or report mingdianliu
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (1)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
A high-throughput and memory-efficient inference and serving engine for LLMs
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
This is a collection of Python scripts for implementing ASTRA Toolbox for cone-beam X-ray CT reconstruction.
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Companion repo to Hypetrigger, providing extensibility and modding support.
🦋 A personal research and development (R&D) lab that facilitates the sharing of knowledge.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A generative speech model for daily dialogue.
The official repository of our ICRA 2024 paper "Stereo-NEC: Enhancing Stereo Visual-Inertial SLAM Initialization with Normal Epipolar Constraints".
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Turn any glasses into AI-powered smart glasses
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
SATO: Stable Text-to-Motion Framework
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152, I…
Official repository for "AM-RADIO: Reduce All Domains Into One"
Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation"
Fast, scalable, accessible photonic simulation