Starred repositories
Multimodal sentiment analysis
LLM, Fine Tuning, Llama 2, Gemma, Mixtral, vLLM, LangChain, RAG, ChromaDB, FAISS
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
TriageAI is an operator dispatch assistance that helps streamline the 911 call center processes by extracting key details and triaging calls until an operator becomes available, ensuring efficient …
A consolidated dataset of 911 call for response data for 5 US cities
This Project is part of Data Science Nanodegree Program by Udacity in collaboration with Figure Eight. The dataset contains pre-labelled tweet and messages from real-life disaster events. The proje…
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)
CodeUp: A Multilingual Code Generation Llama2 Model with Parameter-Efficient Instruction-Tuning on a Single RTX 3090
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Home of CodeT5: Open Code LLMs for Code Understanding and Generation
The unofficial python package that returns response of Google Bard through cookie value.
ESWA - Expert Systems with Applications latex template
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Can we use explanations to improve hate speech models? Our paper accepted at AAAI 2021 tries to explore that question.
Hate speech detection using Naive Bayes Classifier
Code and documentation to train Stanford's Alpaca models, and generate the data.
Code for "Learning to summarize from human feedback"
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Train transformer language models with reinforcement learning.
This project was developed as part of the AIC Competition sponsored by MTC Egypt. The goal of the project is to create an efficient text summarization system for Arabic language text documents.
This demo repository illustrates how to use Python to scrape news articles from Google based on a given keyword. The scraped articles are then processed by Azure OpenAI Service (AOAI)'s GPT-3 model…
Multiple implementations for abstractive text summurization , using google colab
A Large Scale Text Summarization Dataset
Arabic cleaning, normalization and segmentation library.
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Co…