Ready the moment you open it. 8-band EQ, loudness normalization, high-quality export — plus Whisper transcription. All in your browser. No install needed.

Professional audio tools, no download required.
Runs entirely in your browser. No downloads, no accounts. Just open the URL and start.
6 peaking bands plus high & low shelves. High pass filter included. Pick a preset for instant results.
Compressor smooths out volume swings, then loudness is auto-normalized to Spotify, YouTube, or Apple Podcasts targets. Consistent, platform-ready audio.
Auto-detect and cut silences over 2 seconds. Or trim manually on the waveform with fade in/out. 500ms padding keeps speech natural.
Runs OpenAI Whisper in your browser. Japanese & English supported. Export as SRT, VTT, or TXT — and cut unwanted segments with one click from the transcript.
Clone a voice from 10–20 seconds of reference audio using Chatterbox in your browser. Generate speech from text and finish it with EQ in the same tab.
Export as MP3 (up to 320 kbps) or WAV. Batch process multiple files and download at once.
Toggle between original and processed audio in one click. Hear the difference instantly.
No file size limits. No usage caps. No watermarks. Commercial use welcome. Free forever.
Your files never leave your device. All processing happens locally in the browser.
Audio processing runs on Web Audio API; transcription on WebGPU/Whisper; voice generation on Chatterbox — all in-browser. No internet needed after the first model download.
No sign-up or login needed. Just open the URL and go.
Yes, completely free. No account, no credit card required. Commercial use is welcome.
No. All processing happens entirely in your browser. Your files never leave your device, ensuring complete privacy.
Input: MP3, WAV, OGG, FLAC, AAC, MP4. Output: audio as MP3 (up to 320 kbps) or WAV; transcripts as SRT, VTT, or TXT.
OpenAI’s Whisper model runs directly in your browser. The model is downloaded and cached on first use, then works offline afterward. Your audio never leaves your device.
Japanese and English. Accuracy depends on audio quality and speaking speed. Choose between a standard model (Whisper tiny) and a higher-accuracy model (base). WebGPU-capable devices get even faster performance.
Yes. Each transcribed segment has a scissors button — one click marks that span to be removed from the exported audio. Great for cutting fillers and dead air.
Resemble AI’s Chatterbox model runs entirely in your browser. Clone a voice from 10–20 seconds of reference audio, then generate speech from text. The model is downloaded once (~2GB) and cached for offline use. Neither your audio nor text is sent to any server.
English only for now (Beta). Every generated clip includes the Resemble Perth watermark so it can be verified as AI-generated later. Please use only your own voice or audio you have explicit permission to clone.
An equalizer (EQ) lets you adjust the balance of different frequency ranges. It can clear up muddiness, add warmth, or sharpen clarity. Just pick a preset — no technical knowledge needed.
Yes, AudioBuff is optimized for mobile browsers. It works on both iPhone and Android.
Podcasts, music, video narration, voice memos, streaming audio — any audio content that needs polishing.
No download. No account.