anoopj

Anoop Johnson anoopj

Software Engineer at Google

46 followers · 0 following

Stars

simd-lite / simd-json

Rust port of simdjson

Rust 1,082 84 Updated Aug 13, 2024

google / highway

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 4,042 308 Updated Aug 15, 2024

unitycatalog / unitycatalog

Open, Multi-modal Catalog for Data & AI

Java 2,100 308 Updated Aug 15, 2024

apache / incubator-xtable

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 815 136 Updated Aug 15, 2024

maxi-k / btrblocks

BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)

C++ 208 16 Updated May 7, 2024

google / fuzztest

C++ 676 66 Updated Aug 15, 2024

delta-incubator / deltaray

Delta reader for the Ray open-source toolkit for building ML applications

Python 40 11 Updated Jan 27, 2024

resource-disaggregation / snowset

Snowflake dataset containing statistics for 70 million queries over a 14-day period

Jupyter Notebook 100 21 Updated Sep 27, 2021

coiled / dask-bigquery

Python 40 12 Updated Jul 25, 2024

awslabs / aws-glue-catalog-sync-agent-for-hive

Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog

Java 32 13 Updated Dec 5, 2023

lhbench / lhbench

Lakehouse storage system benchmark

Scala 62 9 Updated Feb 22, 2023

FurcyPin / bigquery-frame

Python 49 2 Updated Jul 4, 2024

weggli-rs / weggli

weggli is a fast and robust semantic search tool for C and C++ codebases. It is designed to help security researchers identify interesting functionality in large codebases.

Rust 2,313 129 Updated Jul 12, 2024

priyankavergadia / GCPSketchnote

If you are looking to become a Google Cloud Engineer, then you are at the right place. GCPSketchnote is a series where I share Google Cloud concepts in a quick and easy-to-learn format.

4,692 767 Updated Jun 9, 2023

substrait-io / substrait

A cross platform way to express data transformation, relational algebra, standardized record expression and plans.

Python 1,131 148 Updated Aug 15, 2024

ClickHouse / ClickHouse

ClickHouse® is a real-time analytics DBMS

C++ 36,220 6,719 Updated Aug 15, 2024

google / supersonic

Supersonic is an ultra-fast, column oriented query engine library written in C++

C++ 204 43 Updated Oct 2, 2020

abseil / abseil-cpp

Abseil Common Libraries (C++)

C++ 14,606 2,566 Updated Aug 15, 2024

wey068 / Facebook-Interview-Coding

Personally compiled solutions to Facebook internship interview questions, covering August 2016 to March 2017

Java 53 188 Updated Oct 13, 2019

kubeflow / spark-operator

Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

Go 2,724 1,357 Updated Aug 15, 2024

apache / incubator-crail

Mirror of Apache crail (Incubating)

Java 147 47 Updated Jul 3, 2022

apache / hudi

Upserts, Deletes And Incremental Processing on Big Data.

Java 5,284 2,400 Updated Aug 15, 2024

awslabs / emr-dynamodb-connector

Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB

Java 215 133 Updated Aug 8, 2024

JerryLead / SparkInternals

Notes talking about the design and implementation of Apache Spark

5,251 1,840 Updated Apr 2, 2024

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,361 1,658 Updated Aug 15, 2024

firecracker-microvm / firecracker

Secure and fast microVMs for serverless computing.

Rust 24,879 1,744 Updated Aug 15, 2024

microsoft / SPTAG

A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distributed online serving toolkits for large scale vector search sc…

C++ 4,760 579 Updated Aug 10, 2024

trinodb / trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Java 10,044 2,901 Updated Aug 15, 2024

mlflow / mlflow

Open source platform for the machine learning lifecycle

Python 18,152 4,102 Updated Aug 15, 2024

yugabyte / yugabyte-db

YugabyteDB - the cloud native distributed SQL database for mission-critical applications.

C 8,716 1,045 Updated Aug 15, 2024
