Skip to content
View NoUnique's full-sized avatar

Highlights

  • Pro

Block or report NoUnique

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A machine learning software for extracting information from scholarly documents

Java 3,396 443 Updated Aug 30, 2024

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Python 226 8 Updated Sep 2, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 6,615 595 Updated Sep 2, 2024

Bring projects, wikis, and teams together with AI. AppFlowy is an AI collaborative workspace where you achieve more without losing control of your data. The best open source alternative to Notion.

Dart 54,260 3,559 Updated Sep 2, 2024

🕷️ The pipeline for the OSCAR corpus

Rust 159 14 Updated Dec 18, 2023

Official repository for KoMT-Bench built by LG AI Research

Python 44 Updated Aug 8, 2024

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,091 44 Updated Aug 9, 2024

Topic Modelling for Humans

Python 15,539 4,374 Updated Sep 1, 2024

Tool to detect the ISO-15924 code of the script used in a text

Python 6 3 Updated Oct 19, 2016

C source that once compiled can generate a TECkit map from a Unihan_Variants.txt file to map traditional Chinese characters to simplified Chinese characters

C 1 Updated Sep 13, 2022

Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.

Python 43 5 Updated Jul 14, 2022

Data and tools for generating and inspecting OLMo pre-training data.

Python 896 90 Updated Sep 2, 2024

Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

Python 56 2 Updated Jun 19, 2024

Editing Models with Task Arithmetic

Python 398 34 Updated Jan 11, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,328 139 Updated Jun 3, 2024

A library for easily merging multiple LLM experts, and efficiently train the merged LLM.

Python 384 23 Updated Aug 26, 2024

Code associated with the Don't Stop Pretraining ACL 2020 paper

Python 525 73 Updated Nov 15, 2021

Tools for merging pretrained large language models.

Python 4,423 389 Updated Aug 31, 2024
Python 69 6 Updated Apr 29, 2024

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Python 231 5 Updated Aug 29, 2024

Zstandard - Fast real-time compression algorithm

C 23,134 2,058 Updated Sep 2, 2024

[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation

Python 5,076 420 Updated Jun 1, 2024

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 971 100 Updated Jul 29, 2024
Python 280 15 Updated Jun 9, 2024

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Jupyter Notebook 423 37 Updated Apr 21, 2024

A modern JavaScript library for handling Hangul characters.

TypeScript 1,232 79 Updated Aug 20, 2024
Python 86 16 Updated Jul 16, 2022

Mediapipe-based library to redact faces from videos and images

C++ 437 16 Updated Sep 29, 2023

Infinity is a high-throughput, low-latency REST API for serving text-embeddings, reranking models and clip

Python 1,214 82 Updated Aug 31, 2024
Next