-
-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance Issue On Android #12
Comments
Hi there, No, I did not use a specific model to achieve this. The one you suggested should work just as fast, but I remember that one of my commits (where I updated llama.cpp) slowed down the speed a lot to an unusable state. Later one, I did update binaries for llama.cpp again and that I believe fixed it. Are you using this git branch directly (bad instructions, but more up-to-date) or the pub.dev package (easy to install, but outdated)? Side note: |
@mcmah309 Hello, I have the same problem as you, can you please provide information if you managed to solve this problem ? Thank you in advance |
Hello,
First I'll say, really impressed by this library and looking forward to TTS!
I ran the example project on my android pixel 7 (Same one you used) and I am not seeing the same performance that was presented in the video here https://www.youtube.com/watch?v=SBaSpwXRz94 . I am getting about 1 word every 20 seconds.
I used the
tinyllama-1.1b-chat-v1.0.Q2_K.gguf
model found here.https://huggingface.co/TheBloke/TinyLlama-1.1B-Chat-v1.0-GGUF/tree/main
I tried on a few more models and got the same issue.
Was there a specific model needed to achieved this? Or any specific configuration?
Run command:
flutter run -d 28301FDH200MY4 --release
Device Info:
The text was updated successfully, but these errors were encountered: