Skip to content
View caiogomide's full-sized avatar

Sponsoring

@freeCodeCamp

Highlights

  • Pro

Block or report caiogomide

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 13,444 985 Updated Aug 21, 2024

pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

Python 6,074 379 Updated Jul 6, 2024

Pythonic AI generation of images and videos

Python 7,896 431 Updated Apr 18, 2024

A library of translation-based text similarity measures

Python 25 5 Updated Dec 11, 2023

State of the Art Natural Language Processing

Scala 3,797 706 Updated Aug 28, 2024

🤗 Evaluate: A library for easily evaluating machine learning models and datasets.

Python 1,941 243 Updated Aug 14, 2024

List of Python API Wrappers and Libraries

2,075 406 Updated Aug 29, 2023

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Python 18,920 2,615 Updated Aug 26, 2024

VSCode extension that generates docstrings for python files

TypeScript 655 157 Updated Aug 13, 2024

Py4J enables Python programs to dynamically access arbitrary Java objects

Java 1,175 216 Updated Jun 20, 2024

An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conv…

Jupyter Notebook 1,722 172 Updated Aug 19, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 11,645 957 Updated Jul 5, 2024

A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Python 8,008 1,382 Updated Aug 28, 2024

Fine-tuning Mistral LLM for Adaptive Machine Translation

Jupyter Notebook 61 13 Updated Jun 29, 2024

A Hierarchically-Labeled Portuguese Hate Speech Dataset

31 6 Updated Jun 25, 2019

PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment

Go 8 5 Updated Jan 15, 2020

Portuguese translation of the GLUE benchmark and Scitail dataset

Python 26 5 Updated Jun 27, 2022

Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.

Python 12 2 Updated Feb 10, 2021

List of resources and tools developed with focus on Portuguese.

217 24 Updated Jun 30, 2024

Apache Hive

Java 5,478 4,660 Updated Aug 26, 2024

Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks

Python 236 35 Updated Jul 25, 2023

Apache Hadoop

Java 14,616 8,820 Updated Aug 28, 2024

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 10,619 730 Updated Aug 28, 2024

Python client for Apache Kafka

Python 5,577 1,400 Updated Jul 23, 2024

The Hugging Face course on Transformers

MDX 2,138 692 Updated Aug 23, 2024

Trax — Deep Learning with Clear Code and Speed

Python 8,029 813 Updated Aug 21, 2024

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 15,273 3,464 Updated Jun 2, 2023

Python job scheduling for humans.

Python 11,704 959 Updated May 25, 2024

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 6,699 2,246 Updated Jun 27, 2024

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 10,738 1,939 Updated Aug 26, 2024
Next