[ML] Missing chunkers for AzureOpenAIService, AzureAIStudioService and HuggingFace embeddings #109875

davidkyle · 2024-06-18T15:17:31Z

Adds chunking for AzureOpenAIService, AzureAIStudioService and the HuggingFace text embedding service. HuggingFace ELSER is not chunked.

elasticsearchmachine · 2024-06-18T16:13:32Z

Pinging @elastic/ml-core (Team:ML)

davidkyle · 2024-06-18T17:02:08Z

@elasticmachine update branch

davidkyle · 2024-06-18T17:02:22Z

@elasticmachine update branch

maxhniebergall

LGTM

maxhniebergall · 2024-06-18T18:09:14Z

...main/java/org/elasticsearch/xpack/inference/services/azureaistudio/AzureAiStudioService.java

- return List.of(new ErrorChunkedInferenceResults(error.getException()));
+ if (model instanceof AzureAiStudioModel baseAzureAiStudioModel) {
+ var actionCreator = new AzureAiStudioActionCreator(getSender(), getServiceComponents());
+ var batchedRequests = new EmbeddingRequestChunker(input, EMBEDDING_MAX_BATCH_SIZE, EmbeddingRequestChunker.EmbeddingType.FLOAT)


how do we know that the AzureAiStudioModel only uses FLOAT types?

Looks like the response format is the same as openai which I believe supports float and base64. Elasticsearch only supports float and byte at the moment though.

maxhniebergall · 2024-06-18T18:13:58Z

...main/java/org/elasticsearch/xpack/inference/services/huggingface/HuggingFaceBaseService.java

+ * For HuggingFace use a conservatively small max batch size as it is
+ * unknown how the model is deployed
+ */
+ static final int EMBEDDING_MAX_BATCH_SIZE = 20;


If the optimal batch size depends on the hardware, then I think we should make this user configurable. I think we can do that in a future PR if you want to get this one out now.

davidkyle added 2 commits June 18, 2024 11:40

Add chunking

8d0f0c7

Adapt for upstream changes

c9e7eed

davidkyle added >non-issue :ml Machine learning v8.15.0 labels Jun 18, 2024

reinstate tests

78373b8

davidkyle marked this pull request as ready for review June 18, 2024 16:13

elasticsearchmachine added the Team:ML Meta label for the ML team label Jun 18, 2024

Merge branch 'main' into missing-chunkers

fc968f6

Merge branch 'main' into missing-chunkers

da75850

maxhniebergall self-requested a review June 18, 2024 18:06

maxhniebergall approved these changes Jun 18, 2024

View reviewed changes

jonathan-buttner approved these changes Jun 18, 2024

View reviewed changes

davidkyle merged commit b60d77e into elastic:main Jun 19, 2024
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Missing chunkers for AzureOpenAIService, AzureAIStudioService and HuggingFace embeddings #109875

[ML] Missing chunkers for AzureOpenAIService, AzureAIStudioService and HuggingFace embeddings #109875

davidkyle commented Jun 18, 2024

elasticsearchmachine commented Jun 18, 2024

davidkyle commented Jun 18, 2024

davidkyle commented Jun 18, 2024

maxhniebergall left a comment

maxhniebergall Jun 18, 2024

jonathan-buttner Jun 18, 2024 •

edited

Loading

maxhniebergall Jun 18, 2024

[ML] Missing chunkers for AzureOpenAIService, AzureAIStudioService and HuggingFace embeddings #109875

[ML] Missing chunkers for AzureOpenAIService, AzureAIStudioService and HuggingFace embeddings #109875

Conversation

davidkyle commented Jun 18, 2024

elasticsearchmachine commented Jun 18, 2024

davidkyle commented Jun 18, 2024

davidkyle commented Jun 18, 2024

maxhniebergall left a comment

Choose a reason for hiding this comment

maxhniebergall Jun 18, 2024

Choose a reason for hiding this comment

jonathan-buttner Jun 18, 2024 • edited Loading

Choose a reason for hiding this comment

maxhniebergall Jun 18, 2024

Choose a reason for hiding this comment

jonathan-buttner Jun 18, 2024 •

edited

Loading