Skip to content

Commit

Permalink
doc: add auto-scaling docs
Browse files Browse the repository at this point in the history
  • Loading branch information
Ricardo Dodds committed Apr 4, 2022
1 parent 00e31ca commit 47b8374
Showing 1 changed file with 10 additions and 0 deletions.
10 changes: 10 additions & 0 deletions docs/cloud/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -60,3 +60,13 @@ for name, img in data['images'].items():

print(json.dumps(data['scoring'], indent=2))
```

## Auto-scaling the service

DeepLIIFs underlying infrastructure is completely defined using [Pulumi](https://www.pulumi.com) stacks.
Behind the scenes, we use containers to deploy both the web application and the API on top of an
ECS cluster with an auto-scaling group that runs on G4dns (GPU) machines.

Under stress, the system will autoscale both the compute capacity and the service availability to accommodate
the incoming requests without affecting the overall performance. The current auto-scaling policy monitors the
number of requests per target on the application load balancer.

0 comments on commit 47b8374

Please sign in to comment.