unable to communicate with mistral models deployed in EC2 by using gradio_webserver API #3409

Open
kambalaashok4 opened this issue Jun 19, 2024 · 0 comments


kambalaashok4 commented Jun 19, 2024

I have deployed the Mistral 7B and 8x7B models on two AWS EC2 servers using vLLM and FastChat. Now I want to communicate with them through a web interface (gradio_web_server), and I am using an api_endpoints.json file to register the models.

api_endpoints.json

{
  "Mistral": {
    "model_name": "mistral7b",
    "api_type": "openai",
    "api_base": "https://10.0.0.0/v1",
    "api_key": "sk-******",
    "anony_only": false,
    "recommended_config": {
      "temperature": 0.7,
      "top_p": 1.0
    },
    "text-arena": true,
    "vision-arena": false
  }
}
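A registration file like the one above can be sanity-checked before launching the web server. The following is a minimal sketch, not part of FastChat: `check_endpoints` and `REQUIRED_KEYS` are hypothetical names, and the list of required keys is an assumption taken from the entry shown here. It reports JSON syntax errors (a trailing comma after the last key is a common one) and missing fields.

```python
import json

# Assumption: these are the keys the entry above relies on; this is not an
# official schema from FastChat.
REQUIRED_KEYS = ("model_name", "api_type", "api_base", "api_key")


def check_endpoints(text):
    """Return a list of problems found in the text of an api_endpoints.json file."""
    try:
        data = json.loads(text)
    except json.JSONDecodeError as exc:
        # Strict JSON rejects things like a trailing comma before a closing brace.
        return [f"invalid JSON at line {exc.lineno}, column {exc.colno}: {exc.msg}"]

    if not isinstance(data, dict):
        return ["top-level value should be a JSON object"]

    problems = []
    for name, entry in data.items():
        if not isinstance(entry, dict):
            problems.append(f"{name}: entry should be a JSON object")
            continue
        for key in REQUIRED_KEYS:
            if key not in entry:
                problems.append(f"{name}: missing '{key}'")
    return problems


# Example usage (hypothetical path):
#     with open("api_endpoints.json") as f:
#         for problem in check_endpoints(f.read()):
#             print(problem)
```

An empty list means the file parsed and every entry carries the assumed keys; it does not prove the endpoint itself is reachable.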

Error from the interface:
[screenshot of the error, image not preserved]

Command used to launch the web UI:

python3 -m fastchat.serve.gradio_web_server --host 0.0.0.0 --port 8080 --controller-url "" --share --register api_endpoints.json
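To tell a web UI problem apart from a backend problem, it may help to query the OpenAI-compatible server directly, without Gradio in between. The sketch below assumes the server exposes the standard `/models` route under `api_base` and returns the usual `{"data": [{"id": ...}]}` payload; `build_models_request` and `list_models` are hypothetical helper names, and the address and key are the placeholders from the config above.

```python
import json
import urllib.request


def build_models_request(api_base, api_key):
    """Build a GET request for the model list of an OpenAI-compatible server."""
    url = api_base.rstrip("/") + "/models"
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {api_key}"})


def list_models(api_base, api_key, timeout=10):
    """Return the model ids the server reports (performs a network call)."""
    request = build_models_request(api_base, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    # Assumption: the response follows the OpenAI list format.
    return [model["id"] for model in payload.get("data", [])]


# Example usage, with the placeholder values from api_endpoints.json:
#     print(list_models("https://10.0.0.0/v1", "sk-******"))
```

If this call fails with a connection or TLS error, the scheme, host, or port in `api_base` is the first thing to check. If it succeeds, comparing the returned ids with the `model_name` value in the registration file is a reasonable next step.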
