View yanaiela's full-sized avatar

A more intuitive version of du in Rust

Rust 8,721 186 Updated Sep 16, 2024

This is an extension of the popular 21cmFAST code that interfaces with CLASS to generate initial conditions at recombination that are consistent with the input cosmological model

Jupyter Notebook 2 2 Updated Jul 14, 2024

💤 A utility tool powered by fzf for using git interactively.

Shell 4,396 137 Updated Oct 1, 2024

Tool for interactive embeddings visualization

Python 291 21 Updated Aug 20, 2024

🚢 Data Toolkit for Sailor Language Models

Python 79 7 Updated Jul 11, 2024

Extract structured text from PDFs quickly

Python 294 27 Updated Oct 8, 2024

Data and tools for generating and inspecting OLMo pre-training data.

Python 941 105 Updated Oct 8, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,597 157 Updated Aug 18, 2024

A latent text-to-image diffusion model

Jupyter Notebook 67,814 10,111 Updated Jun 18, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,026 4,138 Updated Oct 9, 2024

A Survey on Data Selection for Language Models

157 9 Updated Jun 4, 2024

Lightweight clipboard manager for macOS

Swift 12,516 531 Updated Oct 4, 2024

A general representation model across vision, audio, language modalities. Paper: ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities

Python 945 59 Updated Oct 6, 2024

✨ Build AI interfaces that spark joy

Python 5,202 329 Updated Oct 7, 2024

What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets

Python 181 19 Updated Sep 9, 2024

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Python 1,196 50 Updated Oct 7, 2024

A library to manipulate font files from Python.

Python 4,308 453 Updated Oct 8, 2024

Microsoft.Recognizers.Text provides recognition and resolution of numbers, units, date/time, etc. in multiple languages (ZH, EN, FR, ES, PT, DE, IT, TR, HI, NL. Partial support for JA, KO, AR, SV).…

C# 1,664 429 Updated Oct 3, 2024

BookNLP, a natural language processing pipeline for books

Python 785 92 Updated Jul 31, 2024

Creative interactive views of any dataset.

Python 824 43 Updated Feb 25, 2024

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Scala 137 33 Updated Feb 27, 2024

Lexical Generalization Improves with Larger Models and Longer Training (EMNLP 2022)

Python 3 Updated Feb 26, 2023

Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).

Python 1,835 210 Updated Aug 27, 2024

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

Python 730 101 Updated Apr 17, 2024

Python script that prints a summary of the free slots in your Google calendar(s) so you can paste it into a scheduling email.

Python 41 4 Updated Oct 28, 2022

A template repo for Python packages

Python 398 68 Updated Jul 19, 2024

Tools for checking ACL paper submissions

Python 588 47 Updated May 15, 2024

Open-Source Neural Machine Translation in TensorFlow

Python 797 269 Updated Dec 9, 2022