Skip to content

This Next.js application provides a user interface for interacting with OpenAI's audio APIs, including text-to-speech and speech-to-text. Features include history tracking, audio playback, and download functionality.

License

Notifications You must be signed in to change notification settings

KelvinCampelo/open-aiudio-client

Repository files navigation

Open Aiudio Client

This is a Next.js application that provides a user interface for interacting with OpenAI's audio APIs, including text-to-speech and speech-to-text.

Screenshot

Screenshot

Features

  • Text-to-speech: Convert text into audio using OpenAI's text-to-speech API.
  • Speech-to-text (WIP): Transcribe audio into text using OpenAI's speech-to-text API.
  • History: Keep track of your previous conversions and easily access them again.
  • Audio playback: Play back the generated audio directly in the browser.
  • Download: Download the generated audio files for offline use.

Getting Started

  1. Clone the repository:
   git clone https://github.com/KelvinCampelo/open-aiudio-client
  1. Install dependencies:
    npm install
  1. Start the development server:
    npm run dev
  1. Open your browser and navigate to https://localhost:3000 to see the application in action.

Usage

  1. Fill in your OpenAI API key: Enter your API key in the designated field. You can obtain an API key from the OpenAI website.
  2. Select an API: Choose either "Text-to-speech" or "Speech-to-text" (WIP) from the navigation menu.
  3. Enter your input:
    • For Text-to-speech, provide the text you want to convert.
    • For Speech-to-text, upload an audio file. (WIP)
  4. Configure settings: Customize the voice, speed, and other settings for your conversion.
  5. Generate audio: Click the "Generate" button to start the conversion process.
  6. Play or download: Listen to the generated audio or download it for offline use.

Contributing

Contributions are welcome! Here's how you can contribute:

  • Report issues: If you encounter any bugs or have suggestions for improvements, please open an issue on GitHub.
  • Submit pull requests: If you have code changes or new features to add, please submit a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for more information.

Acknowledgements

  • OpenAI for providing the powerful audio APIs.
  • Next.js for providing a robust framework for building web applications.
  • Tailwind CSS for providing a utility-first CSS framework.

Contact

For any questions or feedback, please contact [email protected]

About

This Next.js application provides a user interface for interacting with OpenAI's audio APIs, including text-to-speech and speech-to-text. Features include history tracking, audio playback, and download functionality.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published