awesome-recorder v0.1.3
ποΈ Awesome Recorder
Effortless audio recording with built-in Voice Activity Detection and optimized MP3 outputs in modern browsers.
awesome-recorder
is a lightweight, powerful JavaScript library designed for seamless audio capture directly in the browser. It automatically segments speech using advanced Voice Activity Detection (VAD), encoding spoken audio into compact MP3 filesβperfect for web apps, voice assistants, transcription services, and more.
β¨ Key Features
- π€ Automatic Voice Activity Detection β Precisely detects and segments speech.
- π¦ Compact MP3 Encoding β Small, optimized MP3 audio outputs.
- β‘ Simple Async API β Easy-to-use with async generators and async/await.
- π Event-Driven β Real-time speech state notifications.
- π οΈ Full TypeScript Support β Complete type definitions included.
- π WebAssembly Optimized β Ultra-lightweight custom FFmpeg WASM (~1.2 MB).
π© Installation
npm install awesome-recorder
# or
yarn add awesome-recorder
# or
pnpm add awesome-recorder
π§βπ» Quick Start
import { Recorder } from "awesome-recorder";
const recorder = new Recorder();
// Listen for speech state changes (optional)
recorder.on("speechStateChanged", ({ isSpeaking }) => {
console.log(`User is speaking: ${isSpeaking}`);
});
// Start capturing audio segments
async function startRecording() {
try {
for await (const audioChunk of recorder.start()) {
console.log("Detected speech segment:", audioChunk);
// Play audio directly in browser
const audio = new Audio(URL.createObjectURL(audioChunk));
audio.play();
// Or trigger immediate download
const link = document.createElement("a");
link.href = URL.createObjectURL(audioChunk);
link.download = `speech-${Date.now()}.mp3`;
link.click();
}
} catch (error) {
console.error("Recording Error:", error);
}
}
// Stop recording gracefully
function stopRecording() {
recorder.stop();
}
π API Reference
Recorder
Class
Main class for handling recording and voice detection.
new Recorder(vadOptions?: Partial<RealTimeVADOptions> & { preprocessAudio?: (audio: Float32Array) => Float32Array });
Options
preprocessAudio
Optional callback to process audio data before encoding.
Default behavior trims the last 2000 samples.
You can remove unwanted tail noise, apply custom modifications, or access raw audio data with this callback.
Methods
.preload(): Promise<void>
Preloads the VAD model and FFmpeg WASM module..start(): AsyncGenerator<File, void>
Starts audio recording, yielding MP3 segments upon speech detection..stop(): Promise<void>
Stops audio recording..on(event: string, callback: Function): void
Subscribes to recorder events..off(event: string, callback: Function): void
Unsubscribes from recorder events.
Events
speechStateChanged
Emitted with{ isSpeaking: boolean }
when speech state changes.
π WebAssembly Optimized
By default, awesome-recorder
uses an optimized, custom FFmpeg WASM build from @hinagiku/ffmpeg-core
, specifically tailored for minimal size (~1.2 MB). However, you can easily use your own custom build if preferred:
import { setCoreURL, setWasmURL } from "awesome-recorder";
setCoreURL("https://your-cdn.com/ffmpeg-core.js");
setWasmURL("https://your-cdn.com/ffmpeg-core.wasm");
π Advanced Usage
Custom Voice Activity Detection Options
Fine-tune detection sensitivity and timing:
const recorder = new Recorder({
positiveSpeechThreshold: 0.9,
negativeSpeechThreshold: 0.7,
minSpeechFrames: 5,
preSpeechPadFrames: 15,
redemptionFrames: 10,
});
See the @ricky0123/vad-web
Documentation for detailed configuration options.
βοΈ Uploading Audio Segments
Stream recorded segments directly to your backend:
async function streamSegments(recorder: Recorder) {
let segmentCount = 0;
for await (const audioFile of recorder.start()) {
segmentCount++;
const formData = new FormData();
formData.append("segment", audioFile);
fetch("/api/upload-segment", {
method: "POST",
body: formData,
})
.then((res) => res.json())
.then((data) => console.log(`Uploaded segment ${segmentCount}:`, data))
.catch((err) => console.error("Upload failed:", err));
}
}
π Browser Compatibility
Compatible with modern browsers supporting:
- β WebAssembly (WASM)
- β
Web Audio API (
AudioContext
) - β MediaDevices API
β οΈ Notes for Bundlers
Vite
When using Vite, exclude @ffmpeg/ffmpeg
from dependency optimization:
// vite.config.js
export default {
optimizeDeps: {
exclude: ["@ffmpeg/ffmpeg"],
},
};
π Example Project
Check out the practical demo in the example directory:
π License
Released under the MIT License.
β¨ Enjoy effortless, efficient, and powerful audio recording in your web apps! β¨