React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, and web speech synthesis API
-
Updated
Jul 17, 2025 - JavaScript
React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, and web speech synthesis API
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.
Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and play it via any AudioSource.
Use OpenAI TTS(Text to Speech) API with Gradio
Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.
Implementation of OpenAI's Realtime API in Unity. Easily integrate low-latency, multi-modal conversations via push-to-talk or text.
Try out the OpenAI Text to Speech API in your browser.
MCP Server for Text to Speech
ESP32-based Open AI Voice chat device (similar ChatGPT). Recording questions with a microphone, transcribing via ElevenLabs or Deepgram STT, then sending to Open AI. Response is played with AI voices on speaker. Supporting ongoing dialogues with saved history for follow-up questions. User defined "system prompts" for creating own "personalities".
This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create speech from provided text using different models and voice options.
Stream GPT response to TTS directly using Flask
OpenAI API powered Twitch chat bot that allows users to ask questions and receive answers via TTS.
Use LLMs to generate short ad-hoc English lessons based on any fiction work (won the Clarifai's NextGen GPT AI Hackathon).
This Python script processes a video file, generates a compelling description, creates a voiceover script in the style of David Attenborough, and synthesizes the voiceover using OpenAI's Text-to-Speech API.
An AI Voice Assistant that can read emails, WhatsApp messages, clipboard data, and captures webcam images and screenshots for contextual understanding.
This project allows you to create short videos based on a given topic. Version 2 (improve transform image and real clip) can be found at: Colab notebook. The reason why version 2 is not in this GitHub is that my laptop cannot afford the Nvidia-smi utility, which is required to install xformers for stable diffusion.
Unofficial web client for OpenAI Text-to-Speech API with multilingual interface and voice options
A powerful and easy-to-use Python library for generating natural-sounding speech using OpenAI's text-to-speech capabilities.
Convert text into natural, human-like speech using OpenAI's cutting-edge Text-to-Speech API.
Add a description, image, and links to the openai-tts topic page so that developers can more easily learn about it.
To associate your repository with the openai-tts topic, visit your repo's landing page and select "manage topics."