Skip to content

Scrape smart contract data off Etherscan and import into BlockScout

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT
Notifications You must be signed in to change notification settings

alincode/etherscan-scraper

Repository files navigation

Etherscan Contract Scraper

Verified Contract Stats

Verified Failed Total Contracts Checked
56,251 19,030 227,715

Project Goals

The goal of this project is to scrape verified contracts from Etherscan.io and import them onto Blockscout.com. Without the ability to scrape more than 1,000 records from the verified smart contracts page (https://etherscan.io/contractsVerified) on Etherscan, we need to check every contract address from every block.

After the initial catch-up period, the script will check https://etherscan.io/contractsVerified and monitor this page every hour looking for new verified contracts.

The script will also check all new incoming blocks for any verified contracts.

Requirements

  • MYSQL (brew install mysql)
  • NODEJS 10+ (brew install node)

Installation

  • Setup MYSQL Database
    • From the command line, CREATE DATABASE etherscan_contracts;
    • USE etherscan_contracts;
    • Create the addresses and blocks tables
    • Edit the MYSQL connection configuration located here
  • Run the script, node app.js

How it Works

  1. First, the script will check the DB for any pending addresses.
  • Checks Etherscan.io to obtain the source code. If the address is not verified, the checked column in the DB is updated to 1. This address will not be checked again (can be changed in the future).
  • If the contract is verified on Etherscan, the source code, name, compiler version, and optimizations are extracted from the page.
  • The verified contract is then imported into BlockScout (https://github.com/acravenho/etherscan-scraper/issues/2)
  1. The script will do an initial indexing from the last block that was indexed. All smart contract addresses are entered into the DB by each block.
  2. All 40 pages of the Verified Contracts pages are indexed
  3. Every hour the Verified Smart Contracts page is monitored
  4. Every minute new blocks are checked for smart contracts.

About

Scrape smart contract data off Etherscan and import into BlockScout

Resources

License

Apache-2.0, MIT licenses found

Licenses found

Apache-2.0
LICENSE-APACHE
MIT
LICENSE-MIT

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published