Starred repositories
A collection of design patterns/idioms in Python
LlamaIndex is a data framework for your LLM applications
Streamlit — A faster way to build and share data apps.
Jupyter metapackage for installation, docs and chat
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
📚 Parameterize, execute, and analyze notebooks
Python module that makes working with XML feel like you are working with JSON
Voilà turns Jupyter notebooks into standalone web applications
CKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. It powers catalog.data.gov, open.canada.ca/data, data…
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features…
A semantic diff utility and library for tree-like files such as JSON, JSON5, XML, HTML, YAML, and CSV.
Convert xlsx to csv. It is fast and works for huge xlsx files.
A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.
A command-line tool (and Python library) for archiving Twitter JSON
🦙 Integrating LLMs into structured NLP pipelines
Jupyter handsontable integration
Quickly generate HTML documentation from a JSON schema
Common Workflow Language reference implementation
JSON-LD parser and serializer plugins for RDFLib
Python tools for creating Merkle trees, generating Merkle proofs, and verifying Merkle proofs
Turn a Git repo into a collection of interactive notebooks. This is Binder's user documentation repository.