Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Display Inference Speed #2129

Open
domfahey opened this issue Apr 21, 2024 · 1 comment
Open

[Feature Request] Display Inference Speed #2129

domfahey opened this issue Apr 21, 2024 · 1 comment
Labels
🌠 Feature Request New feature or request | 特性与建议 Inactive No response in 30 days | 超过 30 天未活跃

Comments

@domfahey
Copy link

domfahey commented Apr 21, 2024

🥰 Feature Description

2024-04-21_10-11-01

Please consider adding the ability to display the inference speed for each interaction with the AI model.

🧐 Proposed Solution

This could be presented in a format similar to "Round trip time: 2.52s" or a more detailed breakdown like the example below:

Input Output Total
Speed (T/s) 868 723 731
Tokens 33 480 513
Inference Time (s) 0.04 0.66 0.70

Displaying the inference speed would allow users to better understand the responsiveness of the AI model and help them gauge the performance of their queries. This information could also be useful for developers and researchers to optimize their models and improve the overall efficiency of LobeChat.

📝 Additional Information

No response

@domfahey domfahey added the 🌠 Feature Request New feature or request | 特性与建议 label Apr 21, 2024
@lobehubbot
Copy link
Member

👀 @domfahey

Thank you for raising an issue. We will investigate into the matter and get back to you as soon as possible.
Please make sure you have given us as much context as possible.
非常感谢您提交 issue。我们会尽快调查此事,并尽快回复您。 请确保您已经提供了尽可能多的背景信息。

@lobehubbot lobehubbot added the Inactive No response in 30 days | 超过 30 天未活跃 label Jun 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
🌠 Feature Request New feature or request | 特性与建议 Inactive No response in 30 days | 超过 30 天未活跃
Projects
None yet
Development

No branches or pull requests

2 participants