-
Notifications
You must be signed in to change notification settings - Fork 426
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
echo cancellation when running locally - the output audio is looping back into the input #16
Comments
This is an issue for me also, the mic is picking up the assistants voice, which is then being transcribed and replied to, resulting in awkward conversation with it self. |
Same here. Is there a way we can temporarily pause the input stream when the output stream is being played? |
hey folks!! Indeed there is no echo cancellation enabled in system audio conversation, few things:
|
I think this is a hard problem. Amazon has a few articles on the topic. [1] Two ideas:
|
Headphones helped me with the choppy audio too. Interested in how to handle simultaneous conversation. I want to interrupt the assistant as it's talking. |
This is definitely a challenging implementation detail but I think it might be worth emphasizing this early on in the Quickstart documentation. It's not something that I could've intuitively troubleshot without coming to the Github Issues for the repo. |
good idea @kevcmk, just added this to the docs: https://docs.vocode.dev/python-quickstart#a-note-on-echo-cancellation I'll leave this issue open as we brainstorm more options here. |
Push to talk would be a gain already. |
in the new version, you can have the transcriber stop listening while the agent is speaking. here's a code snippet:
|
another fix for this! If you install Krisp.AI[0] (which is already built into some meeting platforms like Discord), you can select the krisp virtual microphone and speaker on startup and feedback issue is solved[1]! [0] https://krisp.ai/ |
Fix proposed by @ajar98 here: #16 (comment) did not work for me so I also needed to change |
guys ,can we connect vocode with sqlite to handle db data in realtime with fastapi? |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
This issue has been automatically closed due to inactivity. Thank you for your contributions. |
* Update Readme with Preview Info (#1) * Update Readme with Preview Info * We're not quite that far along * Update Structure to be more Pleasing to the Eyes * Add changelog to readme --------- Co-authored-by: srhinos <[email protected]> Co-authored-by: Adnaan Sachidanandan <[email protected]> * The Big Diff (#2) * The Big Diff * remove tests on 3.8 and 3.9 * Update README.md * Update README.md * fix turn based quickstart (#3) * [hotfix] remove unused import (#4) * Update README.md * Update README.md * Remove create_speech() from rime synthesizer (#6) * Fix default factory for elevenlabs WS (#12) * dispatch into elvenlabsws if experimental_websocket is on * fix mypy * Merge In Recent Fixes (#14) * [docs sprint] Updates docs for using transcribers (#9) * [docs sprint] phrase trigger documentation (#16) * [docs sprint] update open source quickstarts (#15) * [docs sprint] Add Documentation on Using Vocode's Loguru Implementation (#19) * [docs sprint] Add Documentation on Using Vocode's Loguru Implementation * Remove Tracing --------- Co-authored-by: srhinos <[email protected]> * [docs sprint] Updates docs for using synthesizers (#8) * [docs sprint] using synthesizers docs update * update docs for elevenlabs ws * Apply suggestions from code review Co-authored-by: Adnaan Sachidanandan <[email protected]> --------- Co-authored-by: Adnaan Sachidanandan <[email protected]> * [docs sprint] Updates docs for react quickstart (#10) * [docs sprint] Updates docs for react quickstart * PR feedback * changes azure to override create_speech_uncached (#21) * [docs sprint] Adds docs for conversation mechanics and moves endpointing docs from transcribers (#11) * [docs sprint] Updates docs for using transcribers * Adds docs for conversation mechanics and moves endpointing docs from transcribers * Update docs/open-source/conversation-mechanics.md Co-authored-by: Adnaan Sachidanandan <[email protected]> * use mdx * PR feedback --------- Co-authored-by: Adnaan Sachidanandan <[email protected]> * updates docs for events manager (#7) * add cartesia synthesizer (#17) * add cartesia synthesizer * make Cartesia dependency optional, add it to the synthesizers extra group * lazy import cartesia * improved lazy loading, and added api_key as a config parameter * improvements to cartesia synth * use create_speech_uncached * use existing abstractions default encoding and sample rates * Remove redundant api_key assignment Co-authored-by: Ajay Raj <[email protected]> * Remove default setting of sampling rate Co-authored-by: Ajay Raj <[email protected]> * Remove default setting of audio_encoding Co-authored-by: Ajay Raj <[email protected]> * remove default setting of sampling rate Co-authored-by: Ajay Raj <[email protected]> * Remove redundant setting of audio enconding the output device handles this Co-authored-by: Ajay Raj <[email protected]> * build failed with poetry.lock file. re-updating it --------- Co-authored-by: Ajay Raj <[email protected]> * Unset docs / README changes * Unset docs changes (cont.) * unset poetry version change * update poetry.lock --------- Co-authored-by: Mac Wilkinson <[email protected]> Co-authored-by: srhinos <[email protected]> Co-authored-by: Adnaan Sachidanandan <[email protected]> Co-authored-by: rjheeta <[email protected]>
* [docs sprint] Updates docs for using transcribers (#9) * [docs sprint] phrase trigger documentation (#16) * [docs sprint] update open source quickstarts (#15) * [docs sprint] Add Documentation on Using Vocode's Loguru Implementation (#19) * [docs sprint] Add Documentation on Using Vocode's Loguru Implementation * Remove Tracing --------- Co-authored-by: srhinos <[email protected]> * [docs sprint] Updates docs for using synthesizers (#8) * [docs sprint] using synthesizers docs update * update docs for elevenlabs ws * Apply suggestions from code review Co-authored-by: Adnaan Sachidanandan <[email protected]> --------- Co-authored-by: Adnaan Sachidanandan <[email protected]> * [docs sprint] Updates docs for react quickstart (#10) * [docs sprint] Updates docs for react quickstart * PR feedback * [docs sprint] Adds docs for conversation mechanics and moves endpointing docs from transcribers (#11) * [docs sprint] Updates docs for using transcribers * Adds docs for conversation mechanics and moves endpointing docs from transcribers * Update docs/open-source/conversation-mechanics.md Co-authored-by: Adnaan Sachidanandan <[email protected]> * use mdx * PR feedback --------- Co-authored-by: Adnaan Sachidanandan <[email protected]> * updates docs for events manager (#7) * [docs sprint] python quickstart + working with phone calls (#27) * deprecate SpeakerOutput * remove play.ht default voice id * rename open source quickstarts page * remove building block reference * update python quickstart * extra steps to deprecate speakeroutput * finish telephony docs * fix some references + language in how-to-use-it * fix test * [docs sprint] Add Sentry Docs to OS (#20) * Add Sentry Docs to OS * Remove Tracing * update docs and fix integration * remove free --------- Co-authored-by: srhinos <[email protected]> Co-authored-by: Ajay Raj <[email protected]> * update README * make mark terminated sync instead of async (#28) * [docs sprint] Add Docs on Creating and Using External Actions (#18) Also updated example for action agents * rename sentry + move around docs order * update README paths to docs * more updates to README * [docs sprint] update agent and action docs and move legacy docs (#29) --------- Co-authored-by: Adnaan Sachidanandan <[email protected]> Co-authored-by: Mac Wilkinson <[email protected]> Co-authored-by: srhinos <[email protected]>
Running on a Mac using the hosted version. It seems to get very confused by its own voice coming out of the speaker and being picked up by the microphone.
Using headphones seems to work much better.
not-working.mp4
The text was updated successfully, but these errors were encountered: