Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Patient Rule Induction Method implementation on Python
The primary objective of this project is to predict the likelihood of a visitor making a purchase during a subsequent visit to the Google Merchandise Store.
Bump Hunting by Patient Rule Induction Method for Survival, Regression and Classification in a multivariate setting and in high-dimensional data
Holds the code for "SeqScout: Using a Bandit Model to Discover Interesting Subgroups in Labeled Sequences"
Patient Rule Induction Method (PRIM) for Python
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Always know what to expect from your data.
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Demo code exploring Python's memory models and collection algorithms from the Talk Python Training course.
A Python memory profiler for data processing and scientific computing applications
A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal vector embeddings.
Using a combination of Google BigQuery and Looker Studio, I delved into a dataset of 4 csv files from a fictional online game. Each file contains data from a tracking event of users.
A Visual Studio Code plugin for running BigQuery queries.
dbt-bigquery contains all of the code required to make dbt operate on a BigQuery database.
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
🛍 A real-world e-commerce dataset for session-based recommender systems research.
A Python vector database you just need - no more, no less.
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
RAG with knowledge graphs implemented from scratch
This repo is for the Linkedin Learning course: Hands-On Introduction: Data Engineering
A curated list of data engineering tools for software developers
Machine Learning Engineering Open Book