Skip to content
View Mimino666's full-sized avatar

Block or report Mimino666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

Python 4,081 270 Updated Oct 10, 2024

Web Scraping Framework

Python 2,387 276 Updated Mar 12, 2024

DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any sc…

Go 809 23 Updated Dec 5, 2021

Python books free to read online or download

4,723 655 Updated Mar 18, 2024

A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups

25,743 1,530 Updated Mar 24, 2024

Příručka pro děti a rodiče o výuce programování dětí na prvním stupni (tj. věk 6 až 11 let).

163 22 Updated Jun 17, 2024

API, CLI, and Web App for analyzing and finding a person's profile in 1000 social media \ websites

JavaScript 11,511 909 Updated Mar 16, 2024

A simple annotation component.

TypeScript 64 57 Updated Jan 26, 2023

A computer algebra system written in pure Python

Python 12,865 4,409 Updated Oct 10, 2024

A standalone version of the readability lib

JavaScript 8,839 601 Updated Oct 10, 2024

A Python library for reading and writing PDF, powered by QPDF

Python 2,147 191 Updated Oct 11, 2024

PDF parser and converter to HTML

Java 83 14 Updated Oct 3, 2024

Port of Google's language-detection library to Python.

Python 1,719 197 Updated Jan 24, 2024

Programmatically collect normalized news from (almost) any website.

Python 2,931 283 Updated Oct 30, 2020

Elasticsearch lemmatizer for 15 languages

Java 104 27 Updated May 29, 2024

Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin

13 4 Updated Dec 11, 2018

Use pyppeteer from a Scrapy spider

Python 60 12 Updated Feb 5, 2020

Turn your API made with Django REST Framework(DRF) into a GraphQL like API.

Python 617 43 Updated Aug 9, 2024

Master the command line, in one page

153,184 14,542 Updated Jun 25, 2024

Open-source cloud-environment inspector. Supporting AWS, GCP, Azure, and more! Your cloud resources will have nowhere to hide!

Go 3,949 433 Updated Oct 11, 2024

Exponent Server SDK

Python 145 42 Updated Mar 21, 2024

A lightning fast Finite State machine and REgular expression manipulation library.

C++ 1,824 128 Updated Oct 24, 2023

Headless chrome/chromium automation library (unofficial port of puppeteer)

Python 3,565 370 Updated Aug 5, 2021

Static Type Checker for Python

Python 13,240 1,427 Updated Oct 12, 2024

Český tvarotvorný slovník

13 2 Updated Feb 4, 2019

Make website screenshots and mobile emulations from the command line.

JavaScript 1,673 85 Updated Jul 25, 2021

Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

C++ 13,162 1,163 Updated Jul 29, 2024

Walk through an infinite, procedurally generated city

C# 4,556 520 Updated Mar 28, 2021

A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.

JavaScript 4,271 165 Updated Sep 2, 2024

Tools of The Trade, from Hacker News.

16,545 1,276 Updated Aug 3, 2024
Next