- Bangalore
- in/sagar-sumit
Stars
A native Rust library for Apache Hudi, with bindings into Python
A repository where sample Hudi code, tips/tricks, etc. will be hosted.
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
A C++ vectorized database acceleration library aimed at optimizing query engines and data processing systems.
All the things about TPC-DS in Apache Spark
The official home of the Presto distributed SQL query engine for big data
Notes on books I read, talks I watch, articles I study, and papers I love
Upserts, Deletes And Incremental Processing on Big Data.
Open-source, privacy-focused client-side library for the creation and monetisation of online audiences.
Papers & presentation materials from Hugging Face's internal science day
A list of NLP (Natural Language Processing) tutorials
Companion webpage to the book "Mathematics For Machine Learning"
Google Cloud Platform Certification resources.
NLP 101: a resource repository for Deep Learning and Natural Language Processing
Thinking in tensors, writing in PyTorch (a hands-on deep learning intro)
Financial Sentiment Analysis with BERT
The "Python Machine Learning (1st edition)" book code repository and info resource
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Comparing stand-up comedians using natural language processing
Data science teaching materials
A curated list of engineering blogs
Roadmap (mind map) and keywords for students interested in learning NLP