Deep1994
🎯
Focusing
  • NJU(Nanjing University)
  • Nanjing, China

Chat Templates for 🤗 HuggingFace Large Language Models

Jinja 481 47 Updated Sep 15, 2024

Official style files for papers submitted to venues of the Association for Computational Linguistics

TeX 703 173 Updated May 20, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 998 89 Updated May 8, 2024

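Chinese LLM capability evaluation leaderboard: currently covers 115 large models, including commercial models such as chatgpt, gpt4o, Baidu ERNIE Bot (文心一言), Alibaba Tongyi Qianwen (通义千问), iFlytek Spark (讯飞星火), SenseTime senseChat, and minimax, as well as open-source models such as Baichuan (百川), qwen2, glm4, yi, InternLM2 (书生), and llama3, evaluated across multiple capability dimensions. It provides not only a ranking of capability scores but also the raw outputs of all models!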

2,498 119 Updated Sep 29, 2024

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

391 10 Updated Oct 3, 2024

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 698 81 Updated Aug 14, 2024

[DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift

Python 31 Updated Jan 25, 2024
Python 5 1 Updated Jul 19, 2024

The code and datasets of our ACM MM 2024 paper "Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs".

7 Updated Sep 27, 2024

📰 Must-read papers and blogs on Speculative Decoding ⚡️

382 15 Updated Sep 26, 2024

3D Visualization of a GPT-style LLM

TypeScript 3,899 424 Updated Aug 24, 2024

A novel approach to improve the safety of large language models, enabling them to transition effectively from an unsafe state to a safe state.

Python 50 Updated Aug 27, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,267 7,263 Updated Oct 7, 2024
Jupyter Notebook 29 2 Updated Jun 13, 2024

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

HTML 239 15 Updated Apr 8, 2024

Train transformer language models with reinforcement learning.

Python 9,619 1,207 Updated Oct 6, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 37,822 3,975 Updated Jul 28, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,818 1,054 Updated Aug 15, 2024

Doing simple retrieval from LLMs at various context lengths to measure accuracy

Jupyter Notebook 1,482 152 Updated Aug 17, 2024

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

797 54 Updated Oct 4, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,559 241 Updated Mar 5, 2024

🐙 Guides, papers, lectures, notebooks and resources for prompt engineering

MDX 49,104 4,756 Updated Sep 19, 2024

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 8,958 661 Updated Oct 7, 2024
JavaScript 2,401 850 Updated Jun 21, 2024

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 1,918 283 Updated Sep 10, 2024

Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting

Jupyter Notebook 9 Updated Mar 19, 2024

The implementation of the ACL 2024 paper "MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization"

Python 28 4 Updated Jun 15, 2024

Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting".

Jupyter Notebook 9 Updated Feb 22, 2024

An Attentive Neural Sequence Labeling Model for Adverse Drug Reactions Mentions Extraction

Python 17 11 Updated Jan 17, 2020

[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey

66 6 Updated Aug 7, 2024