I will automate speech-to-text transcription using OpenAI Whisper


About this gig
Are you spending hours manually transcribing audio or video files? Let automation handle it.
I will build you a Python-based transcription tool powered by OpenAI Whisper, one of the most accurate speech-to-text models available today. No monthly subscription, no per-minute API fees. Just run it locally and get results fast.
What you'll receive:
A ready-to-run Python script (or FastAPI endpoint)
Supports MP3, MP4, WAV, M4A, and more
Output in TXT and SRT formats
Clean, timestamped transcripts
Setup instructions included
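For a sense of what the timestamped SRT output involves, here is a minimal sketch, not the delivered script. It assumes segments shaped like the ones Whisper's `transcribe` call returns (dicts with `start` and `end` in seconds, plus `text`); the helper names `format_srt_time` and `segments_to_srt` and the sample segments are made up for illustration.

```python
def format_srt_time(seconds: float) -> str:
    """Convert a time in seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text from dicts that carry 'start', 'end' and 'text' keys."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_srt_time(seg["start"])
        end = format_srt_time(seg["end"])
        # Each cue: running number, time range, then the caption text.
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    # A blank line separates consecutive cues.
    return "\n".join(blocks)


# Hypothetical sample data standing in for real transcription segments.
example = [
    {"start": 0.0, "end": 2.5, "text": " Hello and welcome."},
    {"start": 2.5, "end": 5.0, "text": " Thanks for listening."},
]
print(segments_to_srt(example))
```

With the `openai-whisper` package installed, the `segments` list in the dictionary returned by `model.transcribe(...)` could be passed to `segments_to_srt` in place of the sample data.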
Perfect for:
YouTubers and content creators
Researchers and journalists
Podcasters
Anyone who needs fast, accurate transcripts
Why choose me:
I build clean, documented code, not messy one-offs
Tested on real audio files before delivery
Quick communication and fast revisions
All packages include source code you can reuse and modify freely.
Have questions about your use case? Message me before ordering. I'm happy to help.
Get to know Masato K
- From: Japan
- Member since: Nov 2025
- Avg. response time: 23 hours
Languages
Japanese
FAQ
Do I need an OpenAI API key to use this?
No. OpenAI Whisper runs locally on your machine. No API key, no subscription, no usage fees required.
What audio and video formats are supported?
MP3, MP4, WAV, M4A, WEBM, and most common formats. If you have a specific format, feel free to ask before ordering.
Do I need a GPU to run this?
No, it works on CPU. A GPU will make it faster, but it is not required. Most files under 30 minutes run fine on a standard laptop.
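The CPU/GPU choice can be sketched roughly as follows. This is an illustration under assumptions, not the delivered code: `pick_device` and `transcribe_file` are hypothetical names, the `base` model size is an arbitrary choice, and running `transcribe_file` assumes the `openai-whisper` package (with PyTorch and ffmpeg) is installed.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # normally installed alongside openai-whisper
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def transcribe_file(path: str, model_name: str = "base") -> dict:
    """Transcribe one audio/video file (hypothetical wrapper, for illustration)."""
    import whisper  # assumption: `pip install openai-whisper`, ffmpeg on PATH

    device = pick_device()
    model = whisper.load_model(model_name, device=device)
    # Half precision is only useful on GPU, so request it only there.
    return model.transcribe(path, fp16=(device == "cuda"))


print(pick_device())
```

On a laptop without a GPU this selects `cpu`, and the same call works unchanged on a machine with CUDA available.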
What languages does it support?
Whisper supports 99 languages, including English, Japanese, Spanish, French, and Portuguese. Just let me know your target language.
What exactly will I receive?
You will receive a Python script (or a FastAPI endpoint, depending on the package), output files (TXT and SRT), and clear setup instructions: everything you need to run it immediately.

