Stars
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…
python parser for human readable dates
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM
🦜⛏️ Did you say you like data?
bpcreech / PyMiniRacer
Forked from sqreen/PyMiniRacerPyMiniRacer is a V8 bridge in Python.
A shell power-up for working with the file system and running subprocess commands
AI Prediction api of the MusicLang package
Unimarc to MARC xslts MarcEdit uses as part of it's transformations
Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper
A static site generator for data apps, dashboards, reports, and more. Observable Framework combines JavaScript on the front-end for interactive graphics with any language on the back-end for data a…
PyAirbyte brings the power of Airbyte to every Python developer.
An efficient implementation of a rate limiter for asyncio.
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
re_data - fix data issues before your users & CEO would discover them 😊
Build AI Agents with memory, knowledge, tools and reasoning
A Gradio web UI for Large Language Models.
Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
Rapid fuzzy string matching in Python using various string metrics
O!My Models (omymodels) is a library to generate Pydantic, Dataclasses, GinoORM Models, SqlAlchemy ORM, SqlAlchemy Core Table, Models from SQL DDL. And convert one models to another.
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Run any open-source LLMs, such as Llama 3.1, Gemma, as OpenAI compatible API endpoint in the cloud.
Distribute and run LLMs with a single file.