
Make concurrent connections respect limits #581

Merged: 2 commits merged into aio-libs:master on Nov 1, 2015

Conversation

jpwatts (Contributor) commented Oct 22, 2015

When I tried to use aiohttp.ClientSession to make many concurrent requests to the same server, I found that I ended up with more connections than I expected.

Using the current master branch, the script below makes 3 connections for 3 requests, despite setting a limit of 1 on the connector. The problem seems to be that connections aren't counted against the limit until after the underlying transport has been created. This creates a race condition: connect yields from _create_connection before recording anywhere that one of the available connections has been consumed, so subsequent calls to connect that begin before the earlier _create_connection call has returned can create an unlimited number of connections.

# This script should make only 1 connection, but it actually makes 3.

import asyncio
import logging

import aiohttp


logger = logging.getLogger(__name__)


class LoggingTCPConnector(aiohttp.TCPConnector):
    async def _create_connection(self, req):
        try:
            connection_id = self._connection_id
        except AttributeError:
            connection_id = 0
        self._connection_id = connection_id + 1
        logger.debug("CREATING CONNECTION %s", connection_id)
        transport, protocol = await super()._create_connection(req)
        transport.connection_id = connection_id
        logger.info("CREATED CONNECTION %s", connection_id)
        return transport, protocol

    def _release(self, key, req, transport, protocol, *, should_close=False):
        connection_id = transport.connection_id
        logger.debug("RELEASING CONNECTION %s", connection_id)
        # Forward should_close instead of hard-coding False.
        super()._release(key, req, transport, protocol, should_close=should_close)
        logger.info("RELEASED CONNECTION %s", connection_id)


async def make_many_requests(url, num_connections, num_requests):
    http = aiohttp.ClientSession(
        connector=LoggingTCPConnector(limit=num_connections),
    )

    async def make_one_request(request_id):
        logger.debug("MAKING REQUEST %s", request_id)
        response = await http.request("GET", url)
        await response.release()
        logger.info("MADE REQUEST %s", request_id)

    with http:
        tasks = [
            asyncio.ensure_future(make_one_request(request_id))
            for request_id in range(num_requests)
        ]
        await asyncio.wait(tasks)


logging.basicConfig(level=logging.INFO)
loop = asyncio.get_event_loop()
try:
    loop.run_until_complete(
        make_many_requests("https://example.com/", 1, 3)
    )
except KeyboardInterrupt:
    loop.stop()
finally:
    loop.close()

This pull request includes a new test that fails on the current master branch but passes with my changes to the connector.

I'm also including the first version of the test I wrote below, because I was surprised when it passed on current master. I want to highlight its use of a mock that returns a done future in place of a coroutine, in this case for _create_connection. I copied this pattern from test_connect_with_limit, and it broke my test: while a function that returns a done future can be yielded from as if it were a coroutine, it never actually yields control back to the event loop. As a result, all the tasks ran sequentially rather than concurrently, as they would in real use. When I replaced the mock with a real coroutine, the test failed as expected. I mention all this because I'm concerned that this pattern may be masking other problems elsewhere in the tests.

# This test passes, but it shouldn't.

def test_connect_with_limit_concurrent(self):

    @asyncio.coroutine
    def go():
        tr, proto = unittest.mock.Mock(), unittest.mock.Mock()
        proto.is_connected.return_value = True

        class Req:
            host = 'host'
            port = 80
            ssl = False
            response = unittest.mock.Mock(_should_close=False)

        conn = aiohttp.BaseConnector(loop=self.loop, limit=1)
        conn._create_connection = unittest.mock.Mock()
        conn._create_connection.return_value = asyncio.Future(
            loop=self.loop)
        conn._create_connection.return_value.set_result((tr, proto))

        @asyncio.coroutine
        def f():
            connection = yield from conn.connect(Req())
            connection.release()

        tasks = [asyncio.async(f(), loop=self.loop) for i in range(10)]
        yield from asyncio.wait(tasks, loop=self.loop)
        self.assertEqual(1, conn._create_connection.call_count)

    self.loop.run_until_complete(go())
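
For illustration, here is a minimal sketch of that kind of replacement inside go() above; the asyncio.sleep(0) is only a stand-in for something that yields control back to the event loop, not necessarily the exact coroutine the final test uses:

calls = []

@asyncio.coroutine
def create_connection(req):
    calls.append(req)
    # Unlike the done-future mock, a real coroutine suspends here, so
    # the other tasks get a chance to call connect() concurrently.
    yield from asyncio.sleep(0, loop=self.loop)
    return tr, proto

conn._create_connection = create_connection
# The assertion then checks len(calls) instead of the mock's call_count.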

# The limit defines the maximum number of concurrent connections
# for a key. Waiters must be counted against the limit, even before
# the underlying connection is created.
available = limit - len(waiters)
asvetlov (Member) commented on the diff:

I suspect you should respect self._acquired too.

jpwatts (Contributor, Author) commented Oct 29, 2015

@asvetlov I updated the check for available connections based on your feedback.

My original test didn't catch the problem because I was only checking the case where all connections were requested up front. I've now updated it to make more than limit connections up front and then also follow up with more connections after the initial ones are released.
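
As a rough outline (illustrative counts, not the exact test in the PR), reusing the f() coroutine from the snippet above, the scenario now looks something like this:

# First round: request more connections than the limit allows at once.
first = [asyncio.async(f(), loop=self.loop) for _ in range(3)]
yield from asyncio.wait(first, loop=self.loop)

# Second round: after the first connections have been released, issue
# more requests and check that they also stay within the limit.
second = [asyncio.async(f(), loop=self.loop) for _ in range(3)]
yield from asyncio.wait(second, loop=self.loop)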

Sorry it took me a while to get back to this. I had prepared the changes, but then our new baby decided it was time to arrive. I got a little distracted for a few days. :-)

asvetlov (Member) commented Nov 1, 2015

I wish your baby a long and happy life.

available = limit - len(waiters) - len(self._acquired[key])

# Don't wait if there are connections available.
if available > 0:
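
As a rough sketch of the intent behind this check (using the names from the diff above; simplified, not the exact merged code), every connect() call registers itself as a waiter before any transport exists, so concurrent calls see each other in the count:

fut = asyncio.Future(loop=self._loop)
waiters = self._waiters[key]

available = limit - len(waiters) - len(self._acquired[key])

# Don't wait if there are connections available.
if available > 0:
    fut.set_result(None)

# Either way, this caller now counts against the limit until its
# connection is released.
waiters.append(fut)

yield from fut
transport, proto = yield from self._create_connection(req)
self._acquired[key].add(transport)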
asvetlov (Member) commented on the diff:

Can we short-circuit here?
If no connections are available, create the future and wait for it; otherwise skip the following lines.

asvetlov (Member) replied:

Nevermind.

asvetlov merged commit 2c4b1f0 into aio-libs:master on Nov 1, 2015
asvetlov (Member) commented Nov 1, 2015

Thanks a lot!

lock bot commented Oct 29, 2019

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

If you feel there are important points made in this discussion, please include those excerpts in the new issue.

lock bot added the outdated label Oct 29, 2019
lock bot locked as resolved and limited conversation to collaborators Oct 29, 2019