Skip to content
View iamwonseokchoi's full-sized avatar
Block or Report

Block or report iamwonseokchoi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Ollama Python library

Python 3,414 278 Updated Aug 2, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 6,514 505 Updated Aug 4, 2024

Maestro: Netflix’s Workflow Orchestrator

Java 2,685 164 Updated Aug 4, 2024
Jupyter Notebook 5 2 Updated Jul 16, 2024

aider is AI pair programming in your terminal

Python 15,453 1,450 Updated Aug 5, 2024

Jupyter Notebooks to help you get hands-on with Pinecone vector databases

Jupyter Notebook 2,609 972 Updated Jul 29, 2024

PySpark test helper methods with beautiful error messages

Python 553 64 Updated Jul 31, 2024

Your AI second brain. Get answers to your questions, whether they be online or in your own notes. Use online AI models (e.g gpt4) or private, local LLMs (e.g llama3). Self-host locally or use our c…

Python 12,146 607 Updated Aug 5, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,084 2,222 Updated Aug 1, 2024

Keyword Extraction and Analysis Pipeline & Application with KeyBERT and Taipy

Python 12 3 Updated Apr 18, 2023

Building smart Big Data pipelines with Dask & Taipy (DEMO)

Jupyter Notebook 3 1 Updated Sep 11, 2023

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

Jupyter Notebook 1,926 341 Updated Jul 31, 2024

Full data pipeline and data engineering project using AWS services (MSK, Kafka, Spark Stream/SQL, Elastic Stack, Iceberg, Glue, Athena, Streamlit, etc.)

Python 1 Updated Oct 23, 2023

Data replication and lineage management mini-project using Azure Databricks

Python 3 Updated Aug 16, 2023

Stock predictor app for NASDAQ stocks served on Streamlit. Data engineering side uses publicly available APIs to curate and form data, data science side offers a myriad of models.

Python 2 4 Updated Oct 15, 2023

Using Spark Vectorized UDFs and AI tools on stock price data

Jupyter Notebook 2 1 Updated Oct 10, 2023

Integrated end-to-end data project using lambda and data lakehouse architecture to compile financial data

Python 3 1 Updated Oct 4, 2023

Data lakehouse and lambda architecture mini-project deployed onto non-managed Kubernetes using Kafka and Pyspark

Python 1 1 Updated Aug 20, 2023

Spark structured streaming mini-project for IoT devices using Databricks

Python 1 Updated Aug 16, 2023