echo cancellation when running locally - the output audio is looping back into the input #16

cgreening · 2023-03-30T14:32:22Z

Running on a Mac using the hosted version. It seems to get very confused by its own voice coming out of the speaker and being picked up by the microphone.

Using headphones seems to work much better.

not-working.mp4

codewithcheese · 2023-03-31T07:23:18Z

This is an issue for me too. The mic picks up the assistant's voice, which is then transcribed and replied to, resulting in an awkward conversation with itself.

remykarem · 2023-04-01T01:33:58Z

Same here.

Is there a way we can temporarily pause the input stream when the output stream is being played?
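
A minimal sketch of that idea, assuming the application can tell when playback starts and stops. `MicGate`, `speech_started`, and `speech_finished` are hypothetical names for illustration, not part of vocode:

```python
import threading
from typing import Optional


class MicGate:
    """Drops microphone chunks while the assistant is speaking (hypothetical helper)."""

    def __init__(self) -> None:
        self._speaking = threading.Event()

    def speech_started(self) -> None:
        # Call when the output stream begins playing.
        self._speaking.set()

    def speech_finished(self) -> None:
        # Call when the output stream has finished playing.
        self._speaking.clear()

    def filter(self, chunk: bytes) -> Optional[bytes]:
        # Return None while muted so the caller can skip the transcriber.
        return None if self._speaking.is_set() else chunk


gate = MicGate()
gate.speech_started()
muted = gate.filter(b"\x01\x02")   # dropped while the assistant speaks
gate.speech_finished()
passed = gate.filter(b"\x01\x02")  # forwarded once playback is done
```

The trade-off is that nothing the user says while the gate is closed reaches the transcriber, so the user cannot interrupt the assistant.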

ajar98 · 2023-04-01T18:21:47Z

Hey folks! Indeed, there is no echo cancellation enabled in the system audio conversation. A few things:

This won't happen for turn-based conversation: https://github.com/vocodedev/vocode-python/blob/main/vocode/turn_based/turn_based_conversation.py
I'd love to hear if folks have any ideas for fixing this (that don't involve pausing the input stream, since that precludes the user from interrupting the bot!). Other surfaces like web / phone do echo cancellation out of the box.
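
One classical approach that keeps the microphone open is acoustic echo cancellation with an adaptive filter. The sketch below is a plain-Python normalized LMS (NLMS) filter, not anything vocode ships. It assumes the samples sent to the speaker are available as a reference signal aligned sample-for-sample with the microphone signal. The function name and parameters are hypothetical, and a real system would also need delay estimation and double-talk handling:

```python
import random
from typing import List, Sequence


def nlms_echo_cancel(
    mic: Sequence[float],
    ref: Sequence[float],
    taps: int = 8,
    mu: float = 0.5,
    eps: float = 1e-6,
) -> List[float]:
    """Subtract an adaptive estimate of the speaker echo (ref) from mic."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    cleaned = []
    for mic_sample, ref_sample in zip(mic, ref):
        history = [ref_sample] + history[:-1]
        echo_estimate = sum(w * x for w, x in zip(weights, history))
        error = mic_sample - echo_estimate  # what is left after removing the echo
        norm = sum(x * x for x in history) + eps
        step = mu * error / norm
        weights = [w + step * x for w, x in zip(weights, history)]
        cleaned.append(error)
    return cleaned


def energy(samples: Sequence[float]) -> float:
    return sum(s * s for s in samples)


# Synthetic check: the "mic" hears only a delayed, attenuated copy of the speaker.
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
delay, gain = 3, 0.6
mic = [0.0] * delay + [gain * s for s in ref[:-delay]]
cleaned = nlms_echo_cancel(mic, ref)
print(energy(mic[-500:]), energy(cleaned[-500:]))
```

Because the filter only removes what correlates with the reference signal, the user's own speech would stay in the cleaned signal, which is what makes interruption possible in principle.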

as14g18 · 2023-04-02T19:01:48Z

I think this is a hard problem. Amazon has a few articles on the topic. [1]

Two ideas:

Have a "wake word". For example, Google's "Hey Google".
Have the system learn your voice so it only responds to your voice

[1] https://www.amazon.science/tag/echo-cancellation
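
The wake-word idea could be approximated at the transcript level. This is a rough sketch only: the wake phrase and the function `strip_wake_phrase` are hypothetical, and a real wake-word detector would work on the audio, not on text:

```python
import re
from typing import Optional

WAKE_PHRASE = "hey assistant"  # hypothetical wake phrase


def strip_wake_phrase(transcript: str, wake_phrase: str = WAKE_PHRASE) -> Optional[str]:
    """Return the text after the wake phrase, or None if the phrase is absent."""
    match = re.search(re.escape(wake_phrase), transcript, re.IGNORECASE)
    if match is None:
        return None  # ignore the utterance, e.g. the assistant's own echoed speech
    # Drop punctuation and spaces that directly follow the wake phrase.
    return transcript[match.end():].lstrip(" ,.!?").strip()


command = strip_wake_phrase("Hey assistant, what's the weather?")
ignored = strip_wake_phrase("The weather is sunny today.")
```

Only transcripts containing the wake phrase would be passed to the agent, so echoed output is dropped unless the assistant itself says the phrase.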

rjheeta · 2023-04-02T21:04:10Z

Headphones helped me with the choppy audio too.

Interested in how to handle simultaneous conversation. I want to interrupt the assistant as it's talking.

kevcmk · 2023-04-03T06:33:25Z

This is definitely a challenging implementation detail, but I think it might be worth emphasizing early on in the Quickstart documentation. It's not something that I could've intuitively troubleshot without coming to the GitHub Issues for the repo.

ajar98 · 2023-04-03T06:57:28Z

good idea @kevcmk, just added this to the docs: https://docs.vocode.dev/python-quickstart#a-note-on-echo-cancellation

I'll leave this issue open as we brainstorm more options here.

boorich · 2023-05-10T21:14:44Z

Push to talk would be a gain already.

ajar98 · 2023-05-19T21:53:17Z

In the new version, you can have the transcriber stop listening while the agent is speaking. Here's a code snippet:

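import asyncio
import logging
import signal

# Import paths are assumed from the vocode-python quickstart of this period
# and may differ in other versions.
from vocode.helpers import create_streaming_microphone_input_and_speaker_output
from vocode.streaming.agent.chat_gpt_agent import ChatGPTAgent
from vocode.streaming.models.agent import ChatGPTAgentConfig
from vocode.streaming.models.message import BaseMessage
from vocode.streaming.models.synthesizer import AzureSynthesizerConfig
from vocode.streaming.models.transcriber import (
    DeepgramTranscriberConfig,
    PunctuationEndpointingConfig,
)
from vocode.streaming.streaming_conversation import StreamingConversation
from vocode.streaming.synthesizer.azure_synthesizer import AzureSynthesizer
from vocode.streaming.transcriber.deepgram_transcriber import DeepgramTranscriber

logging.basicConfig()
logger = logging.getLogger(__name__)


async def main():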
    (
        microphone_input,
        speaker_output,
    ) = create_streaming_microphone_input_and_speaker_output(
        use_default_devices=False, use_blocking_speaker_output=False
    )

    conversation = StreamingConversation(
        output_device=speaker_output,
        transcriber=DeepgramTranscriber(
            DeepgramTranscriberConfig.from_input_device(
                microphone_input,
                endpointing_config=PunctuationEndpointingConfig(),
                # Stop transcribing mic input while the agent is speaking.
                mute_during_speech=True,
            )
        ),
        agent=ChatGPTAgent(
            ChatGPTAgentConfig(
                initial_message=BaseMessage(text="What up"),
                prompt_preamble="""The AI is having a pleasant conversation about life""",
            )
        ),
        synthesizer=AzureSynthesizer(
            AzureSynthesizerConfig.from_output_device(speaker_output)
        ),
        logger=logger,
    )
    await conversation.start()
    print("Conversation started, press Ctrl+C to end")
    signal.signal(signal.SIGINT, lambda _0, _1: conversation.terminate())
    while conversation.is_active():
        chunk = microphone_input.get_audio()
        if chunk:
            conversation.receive_audio(chunk)
        await asyncio.sleep(0)


if __name__ == "__main__":
    asyncio.run(main())

ajar98 · 2023-06-01T19:22:43Z

Another fix for this! If you install Krisp.AI[0] (which is already built into some meeting platforms like Discord), you can select the Krisp virtual microphone and speaker on startup, and the feedback issue is solved[1]!

[0] https://krisp.ai/
[1] at reasonable levels of volume - if you blast it, the feedback cancellation won't work

jul3x · 2023-09-03T16:46:48Z

The fix proposed by @ajar98 here: #16 (comment) did not work for me on its own. I also needed to change the use_blocking_speaker_output flag from False to True, after which it worked correctly for my application.

mananmani1 · 2023-09-13T07:34:31Z

Guys, can we connect vocode with SQLite to handle DB data in real time with FastAPI?

github-actions · 2024-04-20T01:47:19Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

github-actions · 2024-04-27T01:47:25Z

This issue has been automatically closed due to inactivity. Thank you for your contributions.

ajar98 added the bug label Apr 17, 2023

ajar98 changed the title ~~Running locally using the hosted version - choppy and confused audio~~ echo cancellation when running locally - the output audio is looping back into the input Apr 29, 2023

ajar98 closed this as completed May 19, 2023

ajar98 reopened this May 19, 2023

github-actions bot added the stale label Apr 20, 2024

github-actions bot closed this as not planned Apr 27, 2024

DanteNoguez pushed a commit to DanteNoguez/vocode-python that referenced this issue Jun 11, 2024

[docs sprint] phrase trigger documentation (vocodedev#16)

dfa66d5
