Stars
Multimodal rumor detection model using an evidence-based dataset. Current version uses CLIP embeddings for both text and image inputs.
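Since the description above mentions CLIP embeddings for both modalities, here is a minimal sketch of extracting paired text and image embeddings with Hugging Face's CLIPModel; the checkpoint name, input file, and the concatenation step are illustrative assumptions, not details from the repo.

```python
# Sketch: shared CLIP embedding space for text and image inputs
# (checkpoint name and example inputs are illustrative, not from the repo).
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("claim_image.jpg")  # hypothetical input file
inputs = processor(text=["breaking: the dam has collapsed"], images=image,
                   return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

text_emb = outputs.text_embeds    # (1, 512) text embedding
image_emb = outputs.image_embeds  # (1, 512) image embedding
# A downstream rumor classifier could consume the concatenated pair:
features = torch.cat([text_emb, image_emb], dim=-1)
```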
🤖 A WeChat bot built on WeChaty and AI services such as OpenAI ChatGPT, Kimi, and iFlytek, which can auto-reply to WeChat messages, manage WeChat groups and friends, detect zombie followers, and more.
A taxonomy tree that lets you create models tuned to your data
DSPy: The framework for programming—not prompting—foundation models
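For flavor, a minimal sketch of DSPy's declarative style, where you state a signature instead of writing a prompt; the model identifier is an assumption, and the exact configuration call may differ across DSPy versions.

```python
# Sketch of DSPy's programming-not-prompting style: declare a signature
# and let the framework handle the prompt. Model id is illustrative.
import dspy

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))  # hypothetical model id
qa = dspy.Predict("question -> answer")           # string signature
print(qa(question="What does RAG stand for?").answer)
```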
🐙 Guides, papers, lectures, notebooks, and resources for prompt engineering
Awesome LLM Self-Consistency: a curated list of resources on self-consistency in large language models
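Self-consistency boils down to sampling several reasoning paths at a nonzero temperature and majority-voting over the final answers. A minimal sketch, with `generate` as a hypothetical stand-in for an LLM call:

```python
# Sketch of self-consistency decoding: sample N reasoning paths,
# then majority-vote over the extracted final answers.
from collections import Counter

def generate(prompt: str, temperature: float) -> str:
    """Hypothetical stand-in for an LLM call returning a final answer."""
    raise NotImplementedError

def self_consistent_answer(prompt: str, n_samples: int = 10) -> str:
    answers = [generate(prompt, temperature=0.7) for _ in range(n_samples)]
    # The most frequent answer across sampled reasoning paths wins.
    return Counter(answers).most_common(1)[0][0]
```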
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
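That generate-critique-revise loop fits in a few lines. A sketch in which `draft`, `critique`, and `revise` are hypothetical stand-ins for LLM calls, and the stop criterion is an assumption:

```python
# Sketch of iterative self-refinement: draft an output, generate
# feedback on it, and revise until the feedback signals a stop.
# All three helpers are hypothetical stand-ins for LLM calls.
def draft(task: str) -> str: ...
def critique(task: str, output: str) -> str: ...
def revise(task: str, output: str, feedback: str) -> str: ...

def self_refine(task: str, max_iters: int = 3) -> str:
    output = draft(task)
    for _ in range(max_iters):
        feedback = critique(task, output)
        if "no issues" in feedback.lower():  # assumed stop criterion
            break
        output = revise(task, output, feedback)
    return output
```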
The data and code of "KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions".
Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"
Soft Prompt Tuning for Augmenting Dense Retrieval with Large Language Models
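The core idea of soft prompt tuning is to prepend a small matrix of learnable "virtual token" embeddings to the frozen model's input embeddings and train only that matrix. A minimal PyTorch sketch with illustrative dimensions:

```python
# Sketch of soft prompt tuning: a learnable matrix of "virtual token"
# embeddings is prepended to a frozen model's input embeddings.
# Dimensions are illustrative assumptions.
import torch
import torch.nn as nn

class SoftPrompt(nn.Module):
    def __init__(self, n_virtual_tokens: int = 20, d_model: int = 768):
        super().__init__()
        self.prompt = nn.Parameter(torch.randn(n_virtual_tokens, d_model) * 0.02)

    def forward(self, input_embeds: torch.Tensor) -> torch.Tensor:
        # input_embeds: (batch, seq_len, d_model) from a frozen embedding layer
        batch = input_embeds.size(0)
        prompt = self.prompt.unsqueeze(0).expand(batch, -1, -1)
        return torch.cat([prompt, input_embeds], dim=1)

# Only SoftPrompt.parameters() goes to the optimizer; the backbone stays frozen.
```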
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
Anserini is a Lucene toolkit for reproducible information retrieval research
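Anserini itself is Java; its Python companion Pyserini exposes the same Lucene indexes. A minimal BM25 search sketch, where the prebuilt index name and query are illustrative:

```python
# Sketch of BM25 search over a prebuilt Lucene index via Pyserini,
# Anserini's Python companion. Index name and query are illustrative.
from pyserini.search.lucene import LuceneSearcher

searcher = LuceneSearcher.from_prebuilt_index("msmarco-v1-passage")
hits = searcher.search("what is dense retrieval", k=5)
for hit in hits:
    print(f"{hit.docid}\t{hit.score:.3f}")
```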
[ICCV 2021 - Oral] Official PyTorch implementation for Generic Attention-model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers, a novel method to visualize any Transformer-…
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
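Late interaction scores a query-document pair with MaxSim: each query token embedding takes its maximum similarity over the document's token embeddings, and the maxima are summed. A minimal sketch with illustrative shapes:

```python
# Sketch of ColBERT-style late-interaction (MaxSim) scoring: every query
# token embedding matches its most similar document token embedding,
# and the per-token maxima are summed. Shapes are illustrative.
import torch
import torch.nn.functional as F

def maxsim_score(q: torch.Tensor, d: torch.Tensor) -> torch.Tensor:
    # q: (n_query_tokens, dim), d: (n_doc_tokens, dim), both L2-normalized
    sim = q @ d.T                       # (n_query_tokens, n_doc_tokens)
    return sim.max(dim=1).values.sum()  # max over doc tokens, sum over query

q = F.normalize(torch.randn(8, 128), dim=-1)
d = F.normalize(torch.randn(100, 128), dim=-1)
print(maxsim_score(q, d))
```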
Reference implementation for DPO (Direct Preference Optimization)
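At its core, the DPO objective rewards the policy for widening the log-probability margin of the chosen response over the rejected one, relative to a frozen reference model. A sketch of the loss, with sequence log-probs assumed to be precomputed:

```python
# Sketch of the DPO loss: -log sigmoid(beta * margin difference), where
# each margin is the policy-vs-reference log-prob gap on a response.
# Inputs are summed sequence log-probs, assumed precomputed elsewhere.
import torch.nn.functional as F

def dpo_loss(pol_chosen_lp, pol_rejected_lp,
             ref_chosen_lp, ref_rejected_lp, beta: float = 0.1):
    chosen_margin = pol_chosen_lp - ref_chosen_lp
    rejected_margin = pol_rejected_lp - ref_rejected_lp
    return -F.logsigmoid(beta * (chosen_margin - rejected_margin)).mean()
```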
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
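A typical PEFT workflow wraps a frozen base model with low-rank adapters (LoRA) and trains only those. A minimal sketch; the base model and hyperparameters are illustrative:

```python
# Sketch of parameter-efficient fine-tuning with LoRA via the peft
# library; base model and hyperparameters are illustrative.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("gpt2")
config = LoraConfig(r=8, lora_alpha=16, lora_dropout=0.05,
                    target_modules=["c_attn"])  # gpt2's fused attention proj
model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the low-rank adapters train
```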
Build ChatGPT over your data, all with natural language
The source code of paper "CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking"
EMNLP 2021 - Pre-training architectures for dense retrieval
An annotated implementation of the Transformer paper.
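The centerpiece of that implementation is scaled dot-product attention, softmax(QK^T / sqrt(d_k))V. A minimal sketch along the same lines:

```python
# Sketch of scaled dot-product attention, the core op of the Transformer:
# softmax(Q K^T / sqrt(d_k)) V, with an optional attention mask.
import math
import torch

def attention(q, k, v, mask=None):
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v
```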
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
TensorFlow code and pre-trained models for BERT
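The released checkpoints are also loadable through Hugging Face transformers, an alternative interface to the original TensorFlow code. A minimal sketch of extracting a [CLS] embedding:

```python
# Sketch of loading the released BERT weights via Hugging Face
# transformers (an alternative to the original TensorFlow code).
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
inputs = tokenizer("Dense retrieval pairs queries with passages.",
                   return_tensors="pt")
outputs = model(**inputs)
cls_embedding = outputs.last_hidden_state[:, 0]  # [CLS] token vector
```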
text2vec, text to vector. A text-embedding toolkit that converts text into vector matrices, implementing text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT; ready to use out of the box.
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus, and leaderboard
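A minimal sketch of encoding two sentences and computing cosine similarity; the SentenceModel interface and checkpoint name follow the text2vec README as best I recall, so treat both as assumptions:

```python
# Sketch of text-to-vector encoding plus cosine similarity; the
# SentenceModel interface and model name are assumptions based on
# the text2vec README, not verified against the current release.
import numpy as np
from text2vec import SentenceModel

model = SentenceModel("shibing624/text2vec-base-chinese")
a, b = model.encode(["如何更换花呗绑定银行卡", "花呗更改绑定银行卡"])
cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
print(cos)  # similarity in [-1, 1]
```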