Skip to content
View XinChenDSteam's full-sized avatar
🏠
Working from home
🏠
Working from home
Block or Report

Block or report XinChenDSteam

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is a repo with links to everything you'd ever want to learn about data engineering

9,881 1,330 Updated Jul 10, 2024

Patent analysis using the Google Patents Public Datasets on BigQuery

Jupyter Notebook 528 162 Updated Jun 17, 2024

PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT

Jupyter Notebook 60 14 Updated Jul 25, 2023

Parse files for optimal RAG

Python 1,932 180 Updated Jul 19, 2024

Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA

Jupyter Notebook 1,202 239 Updated Feb 18, 2024

A curated list of ontology things

251 20 Updated Feb 19, 2024

Experimental library integrating LLM capabilities to support causal analyses

Jupyter Notebook 63 9 Updated Jul 8, 2024

Notebooks, code samples, sample apps, and other resources that demonstrate how to use, develop and manage machine learning and generative AI workflows using Google Cloud Vertex AI.

Jupyter Notebook 1,542 780 Updated Jul 20, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 13,634 2,777 Updated Jul 20, 2024

Public runnable examples of using John Snow Labs' NLP for Apache Spark.

Jupyter Notebook 1,017 594 Updated Jul 18, 2024

Bandit is a tool designed to find common security issues in Python code.

Python 6,169 593 Updated Jul 18, 2024
Jupyter Notebook 975 185 Updated Jul 19, 2024

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Python 706 99 Updated Jun 15, 2024

📈 Awesome resources related to GNNs for Time Series Analysis (GNN4TS) 🔥 https://arxiv.org/abs/2307.03759

488 48 Updated Aug 10, 2023

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

Python 1,188 143 Updated Jan 16, 2024

The software used to extract structured data from Wikipedia

Scala 841 271 Updated Jun 20, 2024

:octocat: Curated list of GitHub Issues and Pull Requests templates

2,058 300 Updated Jul 3, 2023

A list of recent papers about Graph Neural Network methods applied in NLP areas.

889 140 Updated May 9, 2023

An extremely fast Python linter and code formatter, written in Rust.

Rust 29,183 950 Updated Jul 20, 2024

B21621 - Polars Cookbook

Jupyter Notebook 146 18 Updated Jul 18, 2024

Recipes for using Python's polars library

Jupyter Notebook 239 12 Updated Jul 1, 2023

Awesome papers about unifying LLMs and KGs

1,742 130 Updated May 16, 2024

Fuzzy String Matching in Python

Python 2,652 132 Updated Feb 27, 2024

Fuzzy String Matching in Python

Python 9,182 879 Updated Feb 24, 2023

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23,241 3,301 Updated Jul 20, 2024

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 27,059 11,081 Updated Jul 20, 2024

Large Language Model Text Generation Inference

Python 8,419 959 Updated Jul 20, 2024

Clustering sentence embeddings to extract message intent

Jupyter Notebook 164 24 Updated Oct 19, 2021

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 15,107 1,447 Updated Jul 19, 2024

Papers & presentation materials from Hugging Face's internal science day

2,020 117 Updated Oct 31, 2020
Next