🥰 Feature Description
Please consider adding the ability to display the inference speed for each interaction with the AI model.
🧐 Proposed Solution
This could be presented in a format similar to "Round trip time: 2.52s" or a more detailed breakdown like the example below:
|                    | Input | Output | Total |
|--------------------|-------|--------|-------|
| Speed (T/s)        | 868   | 723    | 731   |
| Tokens             | 33    | 480    | 513   |
| Inference Time (s) | 0.04  | 0.66   | 0.70  |
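As a rough sketch of how such a breakdown could be computed (this is not LobeChat's actual API; the types and function names below are hypothetical), the per-phase speed is just tokens divided by wall-clock time, with the total row derived from the sums:

```typescript
// Hypothetical helper: derives the speed breakdown shown above from
// token counts and measured durations (in seconds) for each phase.
interface UsageSample {
  tokens: number;  // tokens processed in this phase
  seconds: number; // wall-clock time spent in this phase
}

interface SpeedRow extends UsageSample {
  tokensPerSecond: number;
}

function speedRow({ tokens, seconds }: UsageSample): SpeedRow {
  // Guard against division by zero for instantaneous phases.
  return { tokens, seconds, tokensPerSecond: seconds > 0 ? tokens / seconds : 0 };
}

function speedBreakdown(input: UsageSample, output: UsageSample) {
  const total: UsageSample = {
    tokens: input.tokens + output.tokens,
    seconds: input.seconds + output.seconds,
  };
  return {
    input: speedRow(input),
    output: speedRow(output),
    total: speedRow(total),
  };
}

// Example using the rounded figures from the table above; the exact
// T/s values differ slightly because the table was computed from
// unrounded timings.
const breakdown = speedBreakdown(
  { tokens: 33, seconds: 0.04 },
  { tokens: 480, seconds: 0.66 },
);
console.log(`Total: ${breakdown.total.tokensPerSecond.toFixed(0)} T/s`);
```

The UI would then only need the token counts already returned by most model providers plus two timestamps per request.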
Displaying the inference speed would allow users to better understand the responsiveness of the AI model and help them gauge the performance of their queries. This information could also be useful for developers and researchers to optimize their models and improve the overall efficiency of LobeChat.
📝 Additional Information
No response
Thank you for raising an issue. We will investigate the matter and get back to you as soon as possible.
Please make sure you have given us as much context as possible.