gerashegalov

21 results for source starred repositories

New file format for storage of large columnar datasets.

C++ 450 31 Updated Nov 9, 2024

CUDA Core Compute Libraries

C++ 1,252 161 Updated Nov 10, 2024

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 806 234 Updated Nov 8, 2024

cuDF - GPU DataFrame Library

C++ 8,426 903 Updated Nov 10, 2024

Filesystem in Userspace (FUSE) for Rust

Rust 825 113 Updated Nov 8, 2024

Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs

Jupyter Notebook 67 30 Updated Nov 9, 2024

Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark

Python 37 27 Updated Oct 23, 2024

User tools for Spark RAPIDS

Scala 54 37 Updated Nov 8, 2024

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …

Cuda 767 194 Updated Nov 8, 2024

Andy Grove GitHub Profile Page

1 Updated Apr 23, 2024

RAPIDS Accelerator JNI For Apache Spark

Cuda 37 65 Updated Nov 6, 2024

Generate simple index ranges in C++ and CUDA C++

C++ 39 3 Updated Jun 14, 2023

Tools for generating TPC-* datasets

Rust 26 5 Updated Jun 23, 2024

A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.

Jupyter Notebook 126 51 Updated Nov 7, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 39,846 28,306 Updated Nov 10, 2024

Apache Hadoop

Java 14,768 8,863 Updated Nov 9, 2024

Intelligent object mapping

Java 2,286 349 Updated Sep 2, 2024

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Scala 2,243 395 Updated Sep 29, 2023

YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources

Java 729 85 Updated Jul 5, 2022

The official home of the Presto distributed SQL query engine for big data

Java 16,038 5,375 Updated Nov 9, 2024

A Scala API for Cascading

Scala 3,500 706 Updated May 28, 2023