Seerah Legacy

ChatGPT Text-to-Speech Application

ChatGPT Text-to-Speech Application

Category: Other

License: Other

Model Type: Speech Synthesis

A Node.js server that accepts text prompts, generates conversational responses via the OpenAI API (GPT‑4 or GPT‑3.5‑turbo), and converts those responses to natural-sounding speech using the Eleven Labs API. Designed for local deployment, it provides an HTTP endpoint that returns audio files for further playback or download.

Key Features

Generates chat-style text using GPT‑4 or GPT‑3.5‑turbo
Converts generated text to speech using Eleven Labs voices
Exposes a POST /chat endpoint for prompt-to-audio functionality
Supports adjustable parameters such as temperature and voice selection
Easy-to-use setup with Node.js and Express framework
Outputs audio files (e.g., MP3) directly via API responses

GitHub

Similar Projects

xShop

xShop

SoundCTM-DiT: Unified Score-Based & Consistency Models for Full-Band Text-to-Sound

SoundCTM-DiT: Unified Score-Based & Consistency Models for Full-Band Text-to-Sound

Text-to-Audio with Bark

Text-to-Audio with Bark

pdfGPT – Chat with Your PDFs using LLMs

pdfGPT – Chat with Your PDFs using LLMs

ElevenLabs Clone

ElevenLabs Clone

Paint-by-Example: Exemplar-based Image Editing with Diffusion Models

Paint-by-Example: Exemplar-based Image Editing with Diffusion Models