- San Francisco, CA
- https://www.linkedin.com/in/gerashegalov/
- @gerashegalov
Stars
New file format for storage of large columnar datasets.
Spark RAPIDS plugin - accelerate Apache Spark with GPUs
Spark RAPIDS MLlib – accelerate Apache Spark MLlib with GPUs
Spark RAPIDS Benchmarks – benchmark sets and utilities for the RAPIDS Accelerator for Apache Spark
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing …
A repo for all spark examples using Rapids Accelerator including ETL, ML/DL, etc.
Apache Spark - A unified analytics engine for large-scale data processing
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources
The official home of the Presto distributed SQL query engine for big data