I will build custom stt tts pipeline with whisper and elevenlabs

S
shhahhussain
S
shhahhussain
Shah

Level 1

4.8
4.8

About this gig

Description:

Ensure accurate, real-time voice processing with a custom STT/TTS pipeline. I will build a streaming speech-to-text and text-to-speech system using Whisper/Deepgram for STT and ElevenLabs/Azure/Google for TTS, with fallback mechanisms for reliability.

What you get:

  • Fully-functional streaming STT/TTS pipeline for voice data
  • Integration of Whisper or Deepgram for transcription
  • Integration of ElevenLabs, Azure, or Google for high-quality TTS
  • Low-latency WebSocket streaming for real-time performance
  • Error handling and retries to ensure reliability

How I work:

  • Discuss requirements (languages, expected load, providers)
  • Design pipeline architecture for streaming audio
  • Implement STT/TTS integration in backend code
  • Add fallback providers for failover and resilience
  • Test end-to-end with sample streams and metrics

What I need from you:

  • Target languages and accents for transcription
  • Preferred primary and backup STT/TTS services
  • Example audio files for testing
  • Expected usage patterns (concurrent streams, burst traffic)
  • Latency/accuracy targets and constraints

Deliverables:

  • Python code for the STT/TTS pipeline with setup instructions
  • Configuration for selected STT and TTS providers

Get to know Shah

Shah

I build production grade Voice AI agents LiveKit Twilio Python deployed on AWS

5.0(9)

Level 1

  • FromPakistan
  • Member sinceJul 2022
  • Avg. response time1 hour
  • Last delivery1 week
  • Languages

    English
I build production-grade Voice AI agents using LiveKit, Twilio, and Python. I’ve implemented real-time inbound/outbound call flows with low-latency streaming, clean turn-taking, and barge-in handling. I improve reliability by tuning VAD, handling jitter/packet loss, and adding retries plus consistent call-state. I containerize and deploy voice agents on AWS so they run stable in production with logging and monitoring.

My Portfolio

Reviews

2 reviews for this Gig
4.8

(2)
(0)
(0)
(0)
(0)
Rating Breakdown
  • Seller communication level
    5
  • Quality of delivery
    4.5
  • Value of delivery
    5
Sort By
Most relevant
  • C

    carsten_lemche

    DK

    Denmark

    4.7

    Just perfect ! Nice guy, this was a proof of concept quickly delivered and we will probably add more work in the future.

    $200-$400

    Price

    1 day

    Duration

    Helpful?
    Yes
    No
  • P

    plaglobal

    Repeat Client

    US

    United States

    5

    Shah is a professional and great to work with. I highly recommend him!

    $100-$200

    Price

    2 days

    Duration

    Helpful?
    Yes
    No
Reviews

2 reviews for this Gig
4.8

(2)
(0)
(0)
(0)
(0)
Rating Breakdown
  • Seller communication level
    5
  • Quality of delivery
    4.5
  • Value of delivery
    5
Sort By
Most relevant
  • C

    carsten_lemche

    DK

    Denmark

    4.7

    Just perfect ! Nice guy, this was a proof of concept quickly delivered and we will probably add more work in the future.

    $200-$400

    Price

    1 day

    Duration

    Helpful?
    Yes
    No
  • P

    plaglobal

    Repeat Client

    US

    United States

    5

    Shah is a professional and great to work with. I highly recommend him!

    $100-$200

    Price

    2 days

    Duration

    Helpful?
    Yes
    No