Skip to content
View yotamnahum's full-sized avatar

Block or report yotamnahum

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Language Modelling Tasks as Objects (LaMoTO) treats the pretraining and finetuning of causal and masked language models as classes themselves, not just the models.

Python 1 Updated Aug 18, 2024

Cross-Attention based Unsupervised Contrastive Learning for Sentence Embedding

Python 1 Updated Aug 19, 2024

Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contrastive learning framework.

Python 23 1 Updated Aug 11, 2024

Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, implementing, and testing new search methods. Baguetter support…

Python 112 4 Updated Aug 23, 2024

A compact LLM pretrained in 9 days by using high quality data

Python 136 12 Updated Aug 21, 2024
Python 9 Updated Aug 18, 2024

Code2Prompt is a powerful command-line tool that simplifies the process of providing context to Large Language Models (LLMs) by generating a comprehensive Markdown file containing the content of yo…

Python 465 18 Updated Jul 28, 2024

Token Alignment via Character Matching for Subword Completion (ACL Findings 2024)

Python 6 Updated Aug 5, 2024

Robust and fast topic models with sentence-transformers.

Python 12 4 Updated Aug 19, 2024

πŸ” An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,049 392 Updated Aug 20, 2024
Python 4 Updated Jun 14, 2024

Open-source Python toolkit focused on deep learning with ordinal methodologies

Python 31 2 Updated Jul 25, 2024

DOM to Semantic-Markdown for use in LLMs

TypeScript 503 8 Updated Aug 7, 2024

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 431 49 Updated Aug 21, 2024

Correctly generate plurals, ordinals, indefinite articles; convert numbers to words

Python 949 104 Updated Aug 21, 2024

DataComp for Language Models

HTML 1,073 94 Updated Aug 19, 2024

E5-V: Universal Embeddings with Multimodal Large Language Models

Python 138 4 Updated Jul 17, 2024

A Jupyter widget using sigma.js to render interactive networks.

Jupyter Notebook 189 16 Updated Oct 20, 2023

A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal vector embeddings.

Jupyter Notebook 458 19 Updated Aug 22, 2024

Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"

Python 80 3 Updated Aug 3, 2024

πŸš€ The best source for dashboard icons.

Python 4,599 502 Updated Aug 17, 2024

A curated list for Efficient Large Language Models

Python 1,063 75 Updated Aug 22, 2024

Fast Open-Source Search & Clustering engine Γ— for Vectors & πŸ”œ Strings Γ— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram πŸ”

C++ 2,062 121 Updated Aug 23, 2024

Source Code for FINCH Clustering Algorithm

Jupyter Notebook 322 58 Updated Aug 24, 2023

Fast lexical search library implementing BM25 in Python using Numpy and Scipy

Python 730 25 Updated Aug 23, 2024
Python 32 1 Updated Jun 21, 2024

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 661 39 Updated Aug 7, 2024
Next