Skip to content

ttomasz/tt-osm-changeset-analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is a simple Streamlit App showing some simple queries run by DuckDB over Parquet files that I prepared by converting OpenStreetMap changeset dump from XML to Parquet and uploaded to AWS S3.

I described data preparation process here: https://ttomasz.github.io/2023-01-30/spark-read-xml

Running locally:

# close repository then run
python -m venv venv
# for linux
source ./venv/bin/activate
# for windows
cmd ./venv/bin/activate.bat

pip install -r requirements.txt
streamlit run main.py

If you downloaded the files from S3 you can override URL template by setting environment variable url_template.