Depicted—dpxdt

Make continuous deployment safe by comparing before and after webpage screenshots for each release. Depicted shows when any visual, perceptual differences are found. This is the ultimate, automated end-to-end test.

View the test instance here

Depicted is:

An API server for capturing webpage screenshots and automatically generating visual, perceptual difference images ("pdiffs").
A workflow for teams to coordinate new releases using pdiffs.
A client library for integrating with existing continuous integration processes.
Built for portability; API server runs on App Engine, behind the firewall, etc.
A wrapper of PhantomJS for screenshots.
Open source, Apache 2.0 licensed.
Not a framework, not a religion.

Depicted is not finished! Please let us know if you have feedback or find bugs.

See this video for a presentation about how perceptual diffs have made continuous deployment safe.

Overview

Here are the steps to making Depicted useful to you:

Establish a baseline release with an initial set of screenshots of your site.
Create a new release with a new set of screenshots of your new version.
Manually approve or reject each difference the tool finds.
Manually mark the new release as good or bad.
Repeat. Your approved release will become the baseline for the next one.

Depicted organizes your releases by a build ID. You can create a build through the API server's UI. A build is usually synonymous with a binary that's pushed to production. But it could also be a unique view into one product, like one build for desktop web and another for mobile web.

Within a build are releases with names. I recommend naming a release as the day it was branched in your source repository, and maybe an attempt number, like "06-16-r01" for June 16th, release 1. If you use codenames for your releases, like "bumblebee", that works too.

Each release may be attempted many times. The full history of each release attempt is saved in the UI. Releases can be manually marked as good or bad in the UI. When results for a new release are uploaded, they will automatically be compared to the last known-good version within that build.

A release consists of many separate test runs. A test run is a single screenshot of a single page. A test run has a name that is used to pair it up with a baseline test run from the known-good, previous release. Usually the test run is named as the path of the URL being tested (like /foo?bar=meep). This lets the baseline release and new release serve on different hostnames.

The life-cycle of a release:

Created: A new release is created with a specific name. The system gives it a release number.
Receiving: The release is waiting for all test runs to be requested or reported.
Processing: All test runs have been reported, but additional processing (like screenshotting or pdiffing) is required.
Reviewing: All test runs have been processed. Now the build admin should review any pdiffs that were found and approve the release.

Final release states:

Bad: The build admin has marked the release and all its test runs as bad. It will never be used as a baseline.
Good: The build admin has marked the release and all of its test runs as passing. The next release created for this build will use this just-approved release as the new baseline.

Getting started

Depicted is written in portable Python. It uses Flask and SQLAlchemy to make it easy to run in your environment. It works with SQLite out of the box. The API server runs on App Engine. The workers run ImageMagick and PhantomJS as subprocesses. I like to run the worker on a cloud VM, but you could run it on your laptop behind a firewall if that's important to you. See deployment below for details.

Running the server locally

Have a version of Python 2.7 installed.
Download PhantomJS for your machine.
Download ImageMagick for your machine.

Clone this git repo in your terminal:

 git clone https://github.com/bslatkin/dpxdt.git

cd to the repo directory:

Update all git submodules in the repo:

 git submodule update --init --recursive

Modify common.sh to match your enviornment:

 # Edit variables such as ...
 export PHANTOMJS_BINARY=/Users/yourname/Downloads/phantomjs-1.9.0-macosx/bin/phantomjs

Write a secrets.py file to the root directory:

 SECRET_KEY = 'insert random string of characters here'

Execute ./run_shell.sh and run these commands to initialize your DB:
```
 server.db.drop_all()
 server.db.create_all()
```
Run the combined server/worker with ./run_combined.sh.
Navigate to https://localhost:5000.
Login and create a new build.

Execute the ./run_url_pair_diff.sh tool to verify everything is working:

 ./run_url_pair_diff.sh \
     --upload_build_id=1 \
     https://www.google.com \
     https://www.yahoo.com

Follow the URL the tool writes to the terminal and verify screenshots are present. Any errors will be printed to the log in the terminal where you are running the server process.

Other scripts

To run the "site diff" script, use:

./run_site_diff.sh \
    --upload_build_id=1 \
    --crawl_depth=1 \
    https://www.example.com

To run the tests to make sure you haven't broken the world:

./run_tests.sh

To run the API server locally, without any worker threads:

./run_server.sh

To run the background workers independently against the local API server:

./run_worker.sh

To run in the App Engine development environment (see the section on deployment for config details):

./appengine_run.sh

API

You can try out the API on the test instance of Depicted located at https://dpxdt-test.appspot.com. This instance's database will be dropped from time to time, so please don't rely on it.

The API is really simple. All requests are POSTs with parameters that are URL encoded. All responses are JSON. All requests should be over HTTPS. The API server uses HTTP Basic Authentication to verify your client has access to your builds. You can provision API keys for a build on its homepage (at the bottom).

Here's an example request to the API server using curl. Pretty easy.

curl -v \
    -u api_key:api_password \
    -F build_id=1 \
    -F 'run_name=/static/dummy/dummy_page1.html' \
    -F 'release_number=1' \
    -F 'log=906d3259c103f6fcba4e8164a4dc3ae0d1a685d9' \
    -F 'release_name=2013-06-16 17:35:03.327710' \
    'https://localhost:5000/api/report_run'

Example tools that use the API

An example client tool that exercises the whole workflow is available in the repo. It's called "Site Diff". It will crawl a webpage, follow all links with the same prefix path, then create a new release that screenshots all the URLs. Running the tool multiple times lets you diff your entire site with little effort. Site Diff is very helpful, for example, when you have a blog with a lot of content and want to make a change to your base template and be sure you haven't broken any pages.

Here's an example invocation of Site Diff:

./dpxdt/tools/site_diff.py \
    --upload_build_id=1234 \
    --release_server_prefix=https://my-dpxdt-apiserver.example.com/api \
    --release_client_id=<your api key> \
    --release_client_secret=<your api secret> \
    --crawl_depth=1 \
    https://www.example.com/my/website/here

Another example tool is available in the repo called Pair Diff. Unlike Site Diff, which establishes a baseline on each subsequent run, Pair Diff takes two live URLs and compares them. This is useful when you have a live version and staging version of your site both available at the same time and can do screenshots of both independently.

Here's an example run of Pair Diff:

./dpxdt/tools/url_pair_diff.py \
    --upload_build_id=1234 \
    --release_server_prefix=https://my-dpxdt-apiserver.example.com/api \
    --release_client_id=<your api key> \
    --release_client_secret=<your api secret> \
    https://www.example.com/my/before/page \
    https://www.example.com/my/after/page

API Reference

All of these requests are POSTs with URL-encoded or multipart/form-data bodies and require HTTP Basic Authentication using your API key as the username and secret as the password. All responses are JSON. The 'success' key will be present in all responses and true if the request was successful. If 'success' isn't present, a human-readable error message may be present in the response under the key 'error'.

/api/create_release

Creates a new release candidate for a build.

Parameters

build_id: ID of the build.
release_name: Name of the new release.
url: URL of the homepage of the new release. Only present for humans who need to understand what a release is for.

Returns

build_id: ID of the build.
release_name: Name of the release that was just created.
release_number: Number assigned to the new release by the system.
url: URL of the release's homepage.

/api/find_run

Finds the last good run of the given name for a release. Returns an error if no run previous good release exists.

Parameters

build_id: ID of the build.
run_name: Name of the run to find the last known-good version of.

Returns

build_id: ID of the build.
release_name: Name of the last known-good release for the run.
release_number: Number of the last known-good release for the run.
run_name: Name of the run that was found. May be null if a run could not be found.
url: URL of the last known-good release for the run. May be null if a run could not be found.
image: Artifact ID (SHA1 hash) of the screenshot image associated with the run. May be null if a run could not be found.
log: Artifact ID (SHA1 hash) of the log file from the screenshot process associated with the run. May be null if a run could not be found.
config: Artifact ID (SHA1 hash) of the config file used for the screenshot process associated with the run. May be null if a run could not be found.

/api/request_run

Requests a new run for a release candidate. Causes the API system to take screenshots and do pdiffs. When ref_url and ref_config are supplied, the system will run two sets of captures (one for the baseline, one for the new release) and then compare them. When rel_url and ref_config are not specified, the last good run for this build is found and used for comparison.

Parameters

build_id: ID of the build.
release_name: Name of the release.
release_number: Number of the release.
url: URL to request as a run.
config: JSON data that is the config for the new run.
ref_url: URL of the baseline to request as a run.
ref_config: JSON data that is the config for the baseline of the new run.

Format of `config`

The config passed to the request_run function may have any or all of these fields. All fields are optional and have reasonably sane defaults.

{
    "viewportSize": {
        "width": 1024,
        "height": 768
    },
    "injectCss": ".my-css-rules-here { display: none; }",
    "injectJs": "document.getElementById('foobar').innerText = 'foo';"
}

Returns

build_id: ID of the build.
release_name: Name of the release.
release_number: Number of the release.
run_name: Name of the run that was created.
url: URL that was requested for the run.
config: Artifact ID (SHA1 hash) of the config file that will be used for the screenshot process associated with the run.
ref_url: URL that was requested for the baseline reference for the run.
ref_config: Artifact ID (SHA1 hash) of the config file used for the baseline screenshot process of the run.

/api/upload

Uploads an artifact referenced by a run.

Parameters

build_id: ID of the build.
(a single file in the multipart/form-data): Data of the file being uploaded. Should have a filename in the mime headers so the system can infer the content type of the uploaded asset.

Returns

build_id: ID of the build.
sha1sum: Artifact ID (SHA1 hash) of the file that was uploaded.
content_type: Content type of the artifact that was uploaded.

/api/report_run

Reports data for a run for a release candidate. May be called multiple times as progress is made for a run. No longer callable once the screenshot image for the run has been assigned.

Parameters

build_id: ID of the build.
release_name: Name of the release.
release_number: Number of the release.
run_name: Name of the run.
url: URL associated with the run.
image: Artifact ID (SHA1 hash) of the screenshot image associated with the run.
log: Artifact ID (SHA1 hash) of the log file from the screenshot process associated with the run.
config: Artifact ID (SHA1 hash) of the config file used for the screenshot process associated with the run.
ref_url: URL associated with the run's baseline release.
ref_image: Artifact ID (SHA1 hash) of the screenshot image associated with the run's baseline release.
ref_log: Artifact ID (SHA1 hash) of the log file from the screenshot process associated with the run's baseline release.
ref_config: Artifact ID (SHA1 hash) of the config file used for the screenshot process associated with the run's baseline release.
diff_image: Artifact ID (SHA1 hash) of the perceptual diff image associated with the run.
diff_log: Artifact ID (SHA1 hash) of the log file from the perceptual diff process associated with the run.
diff_success: Present and non-empty string when the diff process ran successfully. May be missing when diff ran and reported a log but may need to retry for this run.

Returns

Nothing but success on success.

/api/runs_done

Marks a release candidate as having all runs reported.

Parameters

build_id: ID of the build.
release_name: Name of the release.
release_number: Number of the release.

Returns

results_url: URL where a release candidates run status can be viewed in a web browser by a build admin.

Deployment

This is still kinda rough. It primarily explains how to deploy to App Engine / CloudSQL / Google Compute Engine.

Provision a CloudSQL DB for your project and initialize it:

 ./google_sql.sh dpxdt-cloud:test
 sql> create database test;

Go to the Google API console and provision a new project and "API Access". This will give you the OAuth client ID and secret you need to make auth work properly. Update config.py with your values.
Go to the Google Cloud Console and find the Google Cloud Storage bucket you've created for your deployment. In the App Engine admin console, go to "Application Settings" and find your "Service Account Name". Copy that name and in the Cloud Console add it as a team member (this gives your app access to the bucket). Update config.py with your bucket.
Go to the deployment/appengine directory. Update app.yaml with your parameters. Create the secrets.py file as explained for development.
Deploy the app:
```
 ./appengine_deploy.sh
```
Navigate to /admin on your app and run in the interactive console:
```
 from dpxdt import server
 server.db.create_all()
```

Navigate to / on your app and see the homepage. Create a new build. Provision an API key. Then set your user and API key as superusers using the SQL tool:

 select * from user;
 update user set superuser = 1 where user.id = 'foo';
 select * from api_key;
 update api_key set superuser = 1 where id = 'foo';

Now create the background workers package to deploy:
```
 ./worker_deploy.sh
```
Follow the commands it prints out to deploy the worker to a VM.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Depicted—dpxdt

Overview

Getting started

Running the server locally

Other scripts

API

Example tools that use the API

API Reference

/api/create_release

Parameters

Returns

/api/find_run

Parameters

Returns

/api/request_run

Parameters

Format of `config`

Returns

/api/upload

Parameters

Returns

/api/report_run

Parameters

Returns

/api/runs_done

Parameters

Returns

Deployment

About

Releases

Packages

Name		Name	Last commit message	Last commit date
Latest commit History 370 Commits
dependencies		dependencies
deployment		deployment
dpxdt		dpxdt
tests		tests
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
appengine_deploy.sh		appengine_deploy.sh
appengine_run.sh		appengine_run.sh
common.sh		common.sh
config.py		config.py
run_combined.sh		run_combined.sh
run_server.sh		run_server.sh
run_shell.sh		run_shell.sh
run_site_diff.sh		run_site_diff.sh
run_tests.sh		run_tests.sh
run_url_pair_diff.sh		run_url_pair_diff.sh
run_worker.sh		run_worker.sh
worker_deploy.sh		worker_deploy.sh

License

nickdengler/dpxdt

Folders and files

Latest commit

History

Repository files navigation

Depicted—dpxdt

Overview

Getting started

Running the server locally

Other scripts

API

Example tools that use the API

API Reference

/api/create_release

Parameters

Returns

/api/find_run

Parameters

Returns

/api/request_run

Parameters

Format of config

Returns

/api/upload

Parameters

Returns

/api/report_run

Parameters

Returns

/api/runs_done

Parameters

Returns

Deployment

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Format of `config`

Packages