Starred repositories
[NeurIPS 2024] Dual-Perspective Activation: Efficient Channel Denoising via Joint Forward-Backward Criterion for Artificial Neural Networks
LOTUS: A semantic query engine - process data with LLMs as easily as writing pandas code
A Declarative System for Optimizing AI Workloads
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data P…
A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
DSPy: The framework for programming—not prompting—foundation models
GANs are well known for their success in realistic image generation. However, they can also be applied to tabular data generation. We review and examine some recent papers on tabular GANs in action.
We collect papers about "large language models (LLM) for table-related tasks", e.g., using LLMs for the Table QA task.
Awesome-LLM-Tabular: a curated list of Large Language Models applied to Tabular Data
A suite of auto-regressive and Seq2Seq (sequence-to-sequence) transformer models for tabular and relational synthetic data generation.
Discovering Data-driven Hypotheses in the Wild
A novel approach for synthesizing tabular data using pretrained large language models
Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space"
Repository for collecting and categorizing papers outlined in our survey paper: "Large Language Models on Tabular Data -- A Survey".
Smart Automation Tool for building modern Data Lakes and Data Pipelines
PyTorch implementation of our paper "OvSW: Overcoming Silent Weights for Accurate Binary Neural Networks", accepted by ECCV 2024.
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Creative interactive views of any dataset.
[WWW2024] The official code for the paper "Distributionally Robust Graph-based Recommendation System"
PyTorch implementation of our paper "MaxQ: Multi-Axis Query for N:M Sparsity Network", accepted by CVPR 2024.