feat: adding the ability to use Ray Serve async functionality #3769

zoltan-fedor · 2022-12-27T01:29:58Z

As I have explained in issue #2968 , a deployment on Ray Serve can be called async, which can make a lot of sense, as Ray Serve is typically used to serve large ML models - which means it takes a while for each request sent to a cluster of Ray Serve Deployment to return (from a few ms up to about a 100ms depending on the size of the model used).

As such, it makes sense to build a concurrent (eg async) application using an async web framework like FastAPI (or recently even Flask started supporting async`) and then send the requests to Ray Serve in an async fashion (alias without blocking), which means that while the Ray Serve Deployment is working on the request, the application is not blocked, but can work on other requests.

Unfortunately until now Haystack has only supported the calling of Ray Serve deployments in a sync fashion.
This PR is to add the ability to call the same Ray Serve deployments in an async fashion too.

This means that what is deployed into Ray Serve does NOT change, the only thing changes is how we are calling the deployment from the Ray client in Haystack.

Unfortunately making async calls require the methods containing the async call (typically await [call]) code to be marked as async and then that method to be called with await, so unfortunately some duplication of methods was required (see pipelines/base.py::Pipeline.run() and pipelines/base.py::Pipeline.run_async()) , but tried to minimize it as much as possible.

Now you can call a Ray Serve deployment async the following way.

The original (sync) call:

from haystack.pipelines import RayPipeline
raypipeline = RayPipeline.load_from_yaml(...)
prediction = raypipeline.run(query="Why I didn't receive my package?", params={...})

The new, async option was made to be very similar:

from haystack.pipelines import RayPipeline
raypipeline = RayPipeline.load_from_yaml(...)
prediction = await raypipeline.run_async(query="Why I didn't receive my package?", params={...})

Related Issues

fixes Adding the ability to use Ray Serve async functionality #2968

Proposed Changes:

I have added the ability to call a Ray Pipeline Deployment from Haystack in an async fashion in addition to the old sync one.

How did you test it?

I have added a dedicated test to the new async method.
I have also tested it on a live Ray Serve cluster.

Notes for the reviewer

@ZanSara, this is what we have discussed earlier - some code repetition was unfortunately required, but managed to keep it to the minimum, see pipelines/base.py::Pipeline.run() and pipelines/base.py::Pipeline.run_async().
The run_async() method is very much the same as run(), the only difference is the await call in it.

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added tests that demonstrate the correct behavior of the change
I've used the conventional commit convention for my PR title
I documented my code
I ran pre-commit hooks and fixed any issue

… async This is to fix deepset-ai#2968

zoltan-fedor · 2023-01-12T18:43:57Z

Hi @vblagoje,
Have you had a chance to look into this? I would hope to get this merged before 1.13 - if possible.
Thanks

vblagoje · 2023-01-12T19:18:53Z

Hey @zoltan-fedor this is not my area of expertise but I'll ping a colleague and he'll get back to you

zoltan-fedor · 2023-01-12T19:30:52Z

Thanks @vblagoje, much appreciated.
Yes, the original issue was discussed with @ZanSara, so I was a bit surprised that the PR got assigned to you for review.

ZanSara · 2023-01-13T16:36:09Z

Hey @zoltan-fedor! I've been on holiday, that's why the PR went on @vblagoje in the meantime 🙂

I'll review the Python side of this PR shortly, but before merging I hope we can get someone more expert with Ray from the rest of the team to look over these changes 👍

ZanSara

A couple of question marks but otherwise really nice PR 😊 It's way simpler than I imagined!

haystack/pipelines/base.py

zoltan-fedor · 2023-01-15T00:39:11Z

Hi @ZanSara,
Welcome back from your holiday! I thought that you might be an holiday... :-) I hope you had a good time!

Yes, I too thought that adding async will be more complicated and had to spent quite some time on figuring out how to make it the least invasive.
In the end - as you have seen - I think it came together pretty nice, with minimal code duplication required and with a decent user experience. Really the only thing is that from now on both the sync and async version of the run() method (the run() and run_async() methods) will need to be maintained and kept in sync, so their logic stays aligned.

vblagoje · 2023-01-16T08:14:12Z

My colleague left the following comments: There is 99% code duplication between Pipeline.run and Pipeline.run_async , not sure however whether this can be implemented in a cleaner way.
What can be improved is:

There's no _run_node_async method in Pipeline, although it's called by Pipeline.run_async
There's no batch counterpart to _run_async
If it's really just about ray, then I'd suggest to move te run_async method to RayPipeline .
BTW calling ray in async pattern looks good, but I haven't used it so far.

zoltan-fedor · 2023-01-16T13:50:19Z

@vblagoje, @ZanSara,

Responding to the comments from @vblagoje's colleague:

"There is 99% code duplication between Pipeline.run and Pipeline.run_async , not sure however whether this can be implemented in a cleaner way." >> Yes, by definition there is always code duplication with having both sync and async versions of the same methods. The only way to minimize that is by taking "utility methods" out of the main methods - in this case by taking chunks out of Pipeline.run and call those same (synchronous) utility methods both from Pipeline.run and Pipeline.run_async - for which I don't see any good candidates in Pipeline.run. But if you see any sections in Pipeline.run` that could be taken out into another method, let me know.
"There's no _run_node_async method in Pipeline, although it's called by Pipeline.run_async" and "If it's really just about ray, then I'd suggest to move te run_async method to RayPipeline ." >> Both of these have already been addressed by moving the Pipeline.run_async into RayPipeline.run_async based on @ZanSara's recommendation.
"There's no batch counterpart to _run_async" >> That is correct, I was debating whether create one or not. It could have been done, but it would have required LOTS of code duplication (to have both sync and async methods of a chain of methods which are handling the batch calls) and with little benefit. Once you have the async run call, there is little benefit to wanting to make batch calls with async. You typically use async in a quick-fire application scenario - which is the opposite of where the batch would be used. As such, considering the cost of it (lots of code duplication), I think we shouldn't add async batch run.

ZanSara · 2023-01-17T09:24:28Z

Thank you @vblagoje for getting the reviewer's comment! My two cents on code duplication: I agree that in the general case I would not like this amount of duplication, but here I believe we should tolerate it.

That part of Pipeline is rock stable lately, and as we are planning to revisit how Pipeline works from the foundations, no point touching it now for a small optimization. Luckily this is a very simple change that will give us a good example how to run pipelines async and is immediately useful to @zoltan-fedor. So I'm ok for merging it as it is. If you don't agree, let's discuss 🙂

vblagoje · 2023-01-17T09:38:45Z

I am ok since all three of you agree that it is good to go. I don't know this part of the codebase well, but since three engineers who know the subject matter agree - I am approving.

zoltan-fedor · 2023-01-17T13:30:44Z

Thank you both for reviewing and approving.
I hope this can be merged soon!
Thanks!

…sync2

Adding the ability to call the Ray pipeline from concurrent apps with…

33877ee

… async This is to fix deepset-ai#2968

zoltan-fedor requested a review from a team as a code owner December 27, 2022 01:29

zoltan-fedor requested review from vblagoje and removed request for a team December 27, 2022 01:29

zoltan-fedor added 2 commits December 26, 2022 20:43

Fixes: mype + pylint (invalid-overridden-method)

e541645

Simplifying - no real need for an AsyncRayPipeline anymore

9ea3338

zoltan-fedor changed the title ~~Adding the ability to use Ray Serve async functionality~~ feat: adding the ability to use Ray Serve async functionality Dec 27, 2022

Merge branch 'deepset-ai:main' into feature-ray-serve-async2

6f1e1d2

ZanSara reviewed Jan 13, 2023

View reviewed changes

haystack/pipelines/base.py Outdated Show resolved Hide resolved

haystack/pipelines/base.py Outdated Show resolved Hide resolved

zoltan-fedor and others added 4 commits January 14, 2023 17:22

Moving the new run_async method to the RayPipeline

9325c99

Cleanup

977142f

Merge branch 'deepset-ai:main' into feature-ray-serve-async2

bc65088

[EMPTY] Re-trigger CI

20ee7da

vblagoje approved these changes Jan 17, 2023

View reviewed changes

zoltan-fedor and others added 3 commits January 18, 2023 11:32

Merge branch 'deepset-ai:main' into feature-ray-serve-async2

afc3514

Merge remote-tracking branch 'upstream/main' into feature-ray-serve-a…

df8279b

…sync2

Merge branch 'deepset-ai:main' into feature-ray-serve-async2

3a0f2d4

ZanSara added type:feature New feature or request topic:speed topic:pipeline labels Jan 23, 2023

ZanSara merged commit e447bd7 into deepset-ai:main Jan 23, 2023

julian-risch mentioned this pull request Feb 1, 2023

proposal: Add Agents for extended LLM support #3925

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: adding the ability to use Ray Serve async functionality #3769

feat: adding the ability to use Ray Serve async functionality #3769

zoltan-fedor commented Dec 27, 2022 •

edited

Loading

zoltan-fedor commented Jan 12, 2023

vblagoje commented Jan 12, 2023

zoltan-fedor commented Jan 12, 2023

ZanSara commented Jan 13, 2023

ZanSara left a comment

zoltan-fedor commented Jan 15, 2023

vblagoje commented Jan 16, 2023

zoltan-fedor commented Jan 16, 2023 •

edited

Loading

ZanSara commented Jan 17, 2023

vblagoje commented Jan 17, 2023

zoltan-fedor commented Jan 17, 2023

feat: adding the ability to use Ray Serve async functionality #3769

feat: adding the ability to use Ray Serve async functionality #3769

Conversation

zoltan-fedor commented Dec 27, 2022 • edited Loading

Related Issues

Proposed Changes:

How did you test it?

Notes for the reviewer

Checklist

zoltan-fedor commented Jan 12, 2023

vblagoje commented Jan 12, 2023

zoltan-fedor commented Jan 12, 2023

ZanSara commented Jan 13, 2023

ZanSara left a comment

Choose a reason for hiding this comment

zoltan-fedor commented Jan 15, 2023

vblagoje commented Jan 16, 2023

zoltan-fedor commented Jan 16, 2023 • edited Loading

ZanSara commented Jan 17, 2023

vblagoje commented Jan 17, 2023

zoltan-fedor commented Jan 17, 2023

zoltan-fedor commented Dec 27, 2022 •

edited

Loading

zoltan-fedor commented Jan 16, 2023 •

edited

Loading