first commit #205

izzbizz · 2023-09-13T06:49:35Z

No description provided.

vercel · 2023-09-13T06:49:38Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
haystack-home	✅ Ready (Inspect)	Visit Preview	💬 Add feedback	Sep 13, 2023 11:47am

content/blog/rag-deployment/index.md

TuanaCelik · 2023-09-13T08:04:10Z

content/blog/rag-deployment/index.md

+While the nitty-gritty technical details of scaling are handled by Kubernetes, we have the ability to tweak it based on the type of pipeline. To do this, it’s useful to think about the following questions:
+
+
+


And 1 question: What's the role of model deployment services here? Does it make sense to mention them? E.g.: If I go with Sagemaker, will sagemaker handle scaling model requests? That incurs a cost but maybe it's worth it?
@izzbizz @ArzelaAscoIi

Yes. We might want to be a bit careful with not create confusion for the reader, but what do you think about adding a comment about: "Considering an additional model hosting services like sagemaker (or hf inference) might be helpful to spearately scale model inference"

not 100% sure about the wording

How about:
The nitty-gritty technical details of scaling are handled by our orchestration tool. Additionally, model hosting services like SageMaker or Hugging Face Inference can be helpful to scale model inference separately. Aside from these automated solutions, we have the ability to tweak the scaling of our pipelines ourselves.

ArzelaAscoIi · 2023-09-13T09:15:08Z

content/blog/rag-deployment/index.md

+While the nitty-gritty technical details of scaling are handled by Kubernetes, we have the ability to tweak it based on the type of pipeline. To do this, it’s useful to think about the following questions:
+
+
+


Yes. We might want to be a bit careful with not create confusion for the reader, but what do you think about adding a comment about: "Considering an additional model hosting services like sagemaker (or hf inference) might be helpful to spearately scale model inference"

ArzelaAscoIi · 2023-09-13T09:15:18Z

content/blog/rag-deployment/index.md

+While the nitty-gritty technical details of scaling are handled by Kubernetes, we have the ability to tweak it based on the type of pipeline. To do this, it’s useful to think about the following questions:
+
+
+


not 100% sure about the wording

first commit

b519ba9

izzbizz requested a review from TuanaCelik September 13, 2023 06:49

izzbizz self-assigned this Sep 13, 2023

fix headers

91fabcc

vercel bot deployed to Preview September 13, 2023 06:52 View deployment

correct last name Kristof

c33cfc5

vercel bot deployed to Preview September 13, 2023 07:10 View deployment

update author name

f27249e

vercel bot deployed to Preview September 13, 2023 07:21 View deployment

Update index.md

4b7ea74

vercel bot deployed to Preview September 13, 2023 07:25 View deployment

fixing Kristof author name

540ab37

vercel bot deployed to Preview September 13, 2023 07:56 View deployment

TuanaCelik reviewed Sep 13, 2023

View reviewed changes

extend DB list

aa9fa1d

vercel bot deployed to Preview September 13, 2023 09:05 View deployment

ArzelaAscoIi approved these changes Sep 13, 2023

View reviewed changes

add sentence about hosted inference scaling

235b3d4

vercel bot deployed to Preview September 13, 2023 11:47 View deployment

TuanaCelik approved these changes Sep 13, 2023

View reviewed changes

TuanaCelik merged commit 674c149 into main Sep 13, 2023
2 checks passed

TuanaCelik deleted the rag-deployment branch September 13, 2023 11:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

first commit #205

first commit #205

izzbizz commented Sep 13, 2023

vercel bot commented Sep 13, 2023 •

edited

Loading

TuanaCelik Sep 13, 2023

ArzelaAscoIi Sep 13, 2023

ArzelaAscoIi Sep 13, 2023

izzbizz Sep 13, 2023

ArzelaAscoIi Sep 13, 2023

ArzelaAscoIi Sep 13, 2023

		While the nitty-gritty technical details of scaling are handled by Kubernetes, we have the ability to tweak it based on the type of pipeline. To do this, it’s useful to think about the following questions:

first commit #205

first commit #205

Conversation

izzbizz commented Sep 13, 2023

vercel bot commented Sep 13, 2023 • edited Loading

TuanaCelik Sep 13, 2023

Choose a reason for hiding this comment

ArzelaAscoIi Sep 13, 2023

Choose a reason for hiding this comment

ArzelaAscoIi Sep 13, 2023

Choose a reason for hiding this comment

izzbizz Sep 13, 2023

Choose a reason for hiding this comment

ArzelaAscoIi Sep 13, 2023

Choose a reason for hiding this comment

ArzelaAscoIi Sep 13, 2023

Choose a reason for hiding this comment

vercel bot commented Sep 13, 2023 •

edited

Loading