Skip to content
View jeromebanks's full-sized avatar

Block or report jeromebanks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applications with advanced filtering capabilities. It seamlessly inte…

Python 214 24 Updated Oct 4, 2024

The Next Generation Hadoop Scheduler

JavaScript 7 3 Updated May 29, 2015

This repository contains two Python scripts that demonstrate how to create a chatbot using Streamlit, OpenAI GPT-3.5-turbo, and Activeloop's Deep Lake.

Python 1,133 169 Updated May 20, 2024

A highly efficient daemon for streaming data from Kafka into Delta Lake

Rust 360 79 Updated Sep 18, 2024

A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.

Scala 342 52 Updated May 31, 2024

Spark Streaming application with enhanced Kafka Streaming consumer metrics exposed using Spark 3 PrometheusServlet

Scala 3 1 Updated Jan 29, 2023

Amplify your team's potential with customizable and secure AI assistants.

TypeScript 949 107 Updated Oct 15, 2024

Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.

JavaScript 12,497 616 Updated Aug 14, 2024

Go/gRPC service designed to enable generic rate limit scenarios from different types of applications.

Go 2,272 446 Updated Oct 14, 2024

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 794 232 Updated Oct 14, 2024

Distributed database specialized in exporting key/value data from Hadoop

Java 558 53 Updated Jun 27, 2014

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Scala 186 34 Updated Feb 12, 2023

Spark releases with AWS Glue support

Dockerfile 8 4 Updated Aug 1, 2024

Spark releases with AWS Glue support

Dockerfile 7 6 Updated Oct 1, 2020

Apache Hive

Java 5,521 4,673 Updated Oct 15, 2024

The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational m…

Java 203 119 Updated May 10, 2024

☁️Amazon S3-based resolver for sbt

Scala 117 29 Updated Nov 5, 2018

metrics-datadog

Java 187 105 Updated Jan 23, 2024

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,238 393 Updated Sep 29, 2023

Base classes to use when writing tests with Spark

Scala 1,516 358 Updated Sep 30, 2024

Datadog API client for Scala.

Scala 31 4 Updated Aug 13, 2024

Docker image for Spark history server on Kubernetes

Shell 15 31 Updated Mar 13, 2020

Mirror of Apache livy (Incubating)

Scala 14 33 Updated Feb 8, 2024

Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.

Scala 881 600 Updated Sep 13, 2024

Livy is an open source REST interface for interacting with Apache Spark from anywhere

Scala 1,009 314 Updated Oct 5, 2022

Redshift Auto Schema is a Python library that takes a delimited flat file or parquet file as input, parses it, and provides a variety of functions that allow for the creation and validation of tabl…

Python 29 5 Updated Jun 21, 2021

Spark on Kubernetes infrastructure Helm charts repo

Mustache 199 75 Updated Oct 20, 2022

Spark on Kubernetes infrastructure Docker images repo

Shell 37 43 Updated Oct 20, 2022

Create and modify Tableau workbook and datasource files

Python 330 178 Updated Jun 19, 2024

A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)

Java 79 41 Updated Jun 14, 2023
Next