Skip to content

ikreymer/pywb-samples

Repository files navigation

PyWb Samples

A live demo of this pywb sample project can be found at: https://pywb.herokuapp.com/

This repository contains a collection of sample archived content (WARC and CDX) files, for use with pywb wayback replay.

The files are a bit larger (~30M) and are not included in the core pywb distribution.

(The pywb distribution does include a few smaller archive files necessary for unit testing)

Installation

  1. Clone Repo: git clone https://github.com/ikreymer/pywb-samples.git pywb-samples

  2. Create (or using existing) Virtualenv: virtualenv pywb-venv; source pywb-venv/bin/activate

  3. Install reqs (this includes pywb and uWSGI) pip install -r requirements.txt

  4. Run uwsgi uwsgi.ini

For best results, uWSGI is started 10 workers on port 9080. Feel free to adjust the settings as needed in uwsgi.ini If startup is successful, the log will contain:

... [INFO]: *** pywb app inited with config from "create_wb_router"

Samples

The following archived content is available in this collection:

Basic Samples:

Social Media:

Proxy Mode Viewing

Some replay material (eg. Flickr) may work best in proxy mode. To try proxy mode:

  1. Set your browser settings (easiest in Firefox) Automatic Proxy configuration URL to: https://localhost:9080/proxy.pac

  2. Try loading:

Capture Info

All samples were captured manually using browser based crawling, mostly via warcprox

About

Sample Archived Content for pywb

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published