Add support for HF summarization endpoint in the websearch #319
If the user has set an `HF_ACCESS_TOKEN`, we use it to call an inference endpoint trained for summarization. If the user didn't set their token, we fall back to their LLM endpoint (which could be self-hosted with no HF token) to generate the summary the old way. In my local testing this returns more accurate and faster summaries, and it would also help reduce load on the LLM endpoint (summarization requires a huge context window, much larger than most conversations).
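The fallback described above could look roughly like the sketch below. This is an illustrative assumption, not the PR's actual code: the function names, the `facebook/bart-large-cnn` model choice, and the response shape are hypothetical stand-ins for whatever the PR actually uses.

```typescript
// Sketch of the routing logic: prefer a summarization-tuned HF Inference
// API model when a token is available, otherwise fall back to prompting
// the user's configured LLM endpoint. Names here are illustrative.

type SummarizerBackend = "hf-inference" | "llm-endpoint";

function pickSummarizer(hfAccessToken: string | undefined): SummarizerBackend {
  // With a token we can call the dedicated summarization endpoint;
  // without one, only the user's own LLM endpoint is guaranteed to work.
  return hfAccessToken ? "hf-inference" : "llm-endpoint";
}

// Hypothetical call to the HF Inference API for a summarization model.
async function summarizeWithHF(text: string, token: string): Promise<string> {
  const res = await fetch(
    "https://api-inference.huggingface.co/models/facebook/bart-large-cnn",
    {
      method: "POST",
      headers: { Authorization: `Bearer ${token}` },
      body: JSON.stringify({ inputs: text }),
    }
  );
  // Summarization models on the Inference API typically return
  // an array like [{ summary_text: "..." }].
  const json = await res.json();
  return json[0].summary_text;
}
```

The routing itself is trivial; the interesting trade-off is that the HF path is faster and cheaper on context, while the LLM path keeps fully self-hosted setups working.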
I'm not sure the model I chose is optimal, since multiple models support summarization. I'm also not sure whether we could get rate limited by the API, since the calls for all users would come from one server using the prod HF token.