Skip to content
View yueyu1030's full-sized avatar
🏠
Working from home
🏠
Working from home
Block or Report

Block or report yueyu1030

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

Python 63 2 Updated Jul 21, 2024

[ACL 2024] This is the code for our paper ”RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records“.

Python 7 Updated Jun 23, 2024

Source code of MOLLEO

Python 24 Updated Jun 26, 2024

A task generation and model evaluation system.

Python 47 3 Updated Jul 17, 2024

This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".

Python 12 2 Updated Jul 22, 2024

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 355 122 Updated Jul 4, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 121 9 Updated Jun 20, 2024

Train Models Contrastively in Pytorch

Python 478 36 Updated Jul 18, 2024
Python 66 2 Updated Dec 22, 2023

[preprint'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records

Python 50 5 Updated Jul 22, 2024

MoraBench (Model Ranking Benchmark)

Python 5 Updated Mar 2, 2024
Python 11 Updated Jan 26, 2024

[ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models".

Python 28 1 Updated Jun 23, 2024

EcoAssistant: using LLM assistant more affordably and accurately

Python 124 7 Updated Jun 30, 2024

面试高频算法题总结,个人博客

C++ 1,105 276 Updated Dec 16, 2023

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Jupyter Notebook 28,664 4,182 Updated Jul 23, 2024

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

365 19 Updated Feb 7, 2024
Jupyter Notebook 329 31 Updated Jan 3, 2024

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Python 214 22 Updated Nov 9, 2023

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut …

Python 833 72 Updated Apr 29, 2024

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Jupyter Notebook 222 7 Updated Aug 19, 2023

MUBen: Benchmarking the Uncertainty of Molecular Representation Models

Python 7 Updated Apr 17, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 4,415 412 Updated Jun 22, 2024

[SIGIR 2023] This is the code for our short paper `Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training'.

Python 10 Updated Aug 30, 2023

Seamlessly integrate LLMs into scikit-learn.

Python 3,010 235 Updated Jul 22, 2024

Code for paper "A Single Vector Is Not Enough: Taxonomy Expansion via Box Embeddings"

Python 12 2 Updated May 28, 2023

PaL: Program-Aided Language Models (ICML 2023)

Python 452 54 Updated Jun 30, 2023

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,649 671 Updated Jan 14, 2024

Let ChatGPT teach your own chatbot in hours with a single GPU!

Python 3,148 277 Updated Mar 17, 2024
Python 3,420 399 Updated May 17, 2024
Next