0.1.3 • Published 4 months ago

awesome-recorder v0.1.3

πŸŽ™οΈ Awesome Recorder

Effortless audio recording with built-in Voice Activity Detection and optimized MP3 outputs in modern browsers.

awesome-recorder is a lightweight JavaScript library for capturing audio directly in the browser. It segments speech automatically using Voice Activity Detection (VAD) and encodes each spoken segment as a compact MP3 file, which suits web apps, voice assistants, transcription services, and more.

✨ Key Features

  • 🎤 Automatic Voice Activity Detection: Precisely detects and segments speech.
  • 📦 Compact MP3 Encoding: Small, optimized MP3 audio outputs.
  • ⚡ Simple Async API: Easy to use with async generators and async/await.
  • 🚀 Event-Driven: Real-time speech state notifications.
  • 🛠️ Full TypeScript Support: Complete type definitions included.
  • 🌐 WebAssembly Optimized: Ultra-lightweight custom FFmpeg WASM (~1.2 MB).

🚩 Installation

npm install awesome-recorder
# or
yarn add awesome-recorder
# or
pnpm add awesome-recorder

πŸ§‘β€πŸ’» Quick Start

import { Recorder } from "awesome-recorder";

const recorder = new Recorder();

// Listen for speech state changes (optional)
recorder.on("speechStateChanged", ({ isSpeaking }) => {
  console.log(`User is speaking: ${isSpeaking}`);
});

// Start capturing audio segments
async function startRecording() {
  try {
    for await (const audioChunk of recorder.start()) {
      console.log("Detected speech segment:", audioChunk);

      // Create one object URL per segment and reuse it below
      const url = URL.createObjectURL(audioChunk);

      // Play the segment directly in the browser
      const audio = new Audio(url);
      audio.addEventListener("ended", () => URL.revokeObjectURL(url));
      audio.play();

      // Or trigger an immediate download instead
      const link = document.createElement("a");
      link.href = url;
      link.download = `speech-${Date.now()}.mp3`;
      link.click();
    }
  } catch (error) {
    console.error("Recording Error:", error);
  }
}

// Stop recording gracefully (stop() returns a promise)
async function stopRecording() {
  await recorder.stop();
}

📚 API Reference

Recorder Class

Main class for handling recording and voice detection.

new Recorder(vadOptions?: Partial<RealTimeVADOptions> & { preprocessAudio?: (audio: Float32Array) => Float32Array });

Options

  • preprocessAudio
    Optional callback to process audio data before encoding.
    Default behavior trims the last 2000 samples.
    You can remove unwanted tail noise, apply custom modifications, or access raw audio data with this callback.
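
As an illustration of a custom callback, the sketch below trims a configurable number of tail samples and then peak-normalizes the audio. The helper names (trimTail, normalizePeak) and the 0.9 target peak are hypothetical choices, not part of awesome-recorder; only the (audio: Float32Array) => Float32Array shape comes from the constructor signature above.

```typescript
// Hypothetical helpers; only the (Float32Array) => Float32Array shape
// matches the documented preprocessAudio option.

/** Drop the last `samples` samples; returns an empty array if the input is shorter. */
function trimTail(audio: Float32Array, samples: number): Float32Array {
  const keep = Math.max(0, audio.length - samples);
  return audio.slice(0, keep);
}

/** Scale so the loudest sample has magnitude `targetPeak`; silence is returned unchanged. */
function normalizePeak(audio: Float32Array, targetPeak = 0.9): Float32Array {
  let peak = 0;
  for (let i = 0; i < audio.length; i++) {
    peak = Math.max(peak, Math.abs(audio[i]));
  }
  if (peak === 0) return audio;
  const gain = targetPeak / peak;
  return audio.map((s) => s * gain);
}

/** Candidate value for the preprocessAudio option. */
const preprocessAudio = (audio: Float32Array): Float32Array =>
  normalizePeak(trimTail(audio, 2000));
```

It would be passed as new Recorder({ preprocessAudio }). The sketch keeps the 2000-sample trim explicitly, on the assumption that a custom callback replaces the default behavior rather than running in addition to it.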

Methods

  • .preload(): Promise<void>
    Preloads the VAD model and FFmpeg WASM module.

  • .start(): AsyncGenerator<File, void>
    Starts audio recording, yielding MP3 segments upon speech detection.

  • .stop(): Promise<void>
    Stops audio recording.

  • .on(event: string, callback: Function): void
    Subscribes to recorder events.

  • .off(event: string, callback: Function): void
    Unsubscribes from recorder events.

Events

  • speechStateChanged
    Emitted with { isSpeaking: boolean } when speech state changes.
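
The methods and event above can be combined as in the following sketch. It is written against a minimal interface (RecorderLike, a hypothetical name) that mirrors only the documented start, stop, on, and off signatures; collectSegments and its maxSegments parameter are likewise illustrative and not part of the library.

```typescript
// Hypothetical interface mirroring only the documented Recorder methods.
interface RecorderLike<T> {
  start(): AsyncGenerator<T, void>;
  stop(): Promise<void>;
  on(event: string, callback: Function): void;
  off(event: string, callback: Function): void;
}

/** Collect up to `maxSegments` segments, then unsubscribe and stop. */
async function collectSegments<T>(
  recorder: RecorderLike<T>,
  maxSegments: number,
  onSpeech?: (isSpeaking: boolean) => void,
): Promise<T[]> {
  if (maxSegments <= 0) return [];

  const handler = (e: { isSpeaking: boolean }) => onSpeech?.(e.isSpeaking);
  recorder.on("speechStateChanged", handler);

  const segments: T[] = [];
  try {
    for await (const segment of recorder.start()) {
      segments.push(segment);
      if (segments.length >= maxSegments) break;
    }
  } finally {
    // Runs on normal completion, early break, and errors alike.
    recorder.off("speechStateChanged", handler);
    await recorder.stop();
  }
  return segments;
}
```

With the real class this might be called as collectSegments(new Recorder(), 3). The sketch assumes that calling stop() after the generator has already finished is harmless, which this README does not state.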

🌐 WebAssembly Optimized

By default, awesome-recorder uses an optimized, custom FFmpeg WASM build from @hinagiku/ffmpeg-core, specifically tailored for minimal size (~1.2 MB). However, you can easily use your own custom build if preferred:

import { setCoreURL, setWasmURL } from "awesome-recorder";

setCoreURL("https://your-cdn.com/ffmpeg-core.js");
setWasmURL("https://your-cdn.com/ffmpeg-core.wasm");

🚀 Advanced Usage

Custom Voice Activity Detection Options

Fine-tune detection sensitivity and timing:

const recorder = new Recorder({
  positiveSpeechThreshold: 0.9,
  negativeSpeechThreshold: 0.7,
  minSpeechFrames: 5,
  preSpeechPadFrames: 15,
  redemptionFrames: 10,
});

See the @ricky0123/vad-web Documentation for detailed configuration options.
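
The frame-based options are easier to reason about as durations. The conversion below assumes 1536-sample frames at a 16 kHz sample rate; both numbers are assumptions about the underlying VAD model rather than something this README states, so confirm them against the vad-web documentation for the version in use. framesToMs is a hypothetical helper.

```typescript
/** Convert a frame count to milliseconds. The defaults are assumptions, not documented values. */
function framesToMs(frames: number, frameSamples = 1536, sampleRate = 16000): number {
  return (frames * frameSamples * 1000) / sampleRate;
}

// Durations for the option values used in the example above.
const timing = {
  minSpeechMs: framesToMs(5),
  preSpeechPadMs: framesToMs(15),
  redemptionMs: framesToMs(10),
};
```

Under these assumptions, redemptionFrames: 10 corresponds to roughly 960 ms of non-speech before a segment is closed, and preSpeechPadFrames: 15 prepends about 1440 ms of audio to each segment.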

☁️ Uploading Audio Segments

Stream recorded segments directly to your backend:

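import { Recorder } from "awesome-recorder";

async function streamSegments(recorder: Recorder) {
  let segmentCount = 0;

  for await (const audioFile of recorder.start()) {
    // Capture the number now; segmentCount may advance before the upload finishes
    const segmentNumber = ++segmentCount;

    const formData = new FormData();
    formData.append("segment", audioFile);

    // Not awaited, so a slow upload does not hold up the next segment
    fetch("/api/upload-segment", {
      method: "POST",
      body: formData,
    })
      .then((res) => {
        if (!res.ok) throw new Error(`HTTP ${res.status}`);
        return res.json();
      })
      .then((data) => console.log(`Uploaded segment ${segmentNumber}:`, data))
      .catch((err) => console.error(`Upload of segment ${segmentNumber} failed:`, err));
  }
}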
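
Uploads over unreliable networks may benefit from retries. The helper below is a generic sketch and not part of awesome-recorder: withRetry, its attempts and baseDelayMs parameters, and the doubling backoff are all illustrative choices.

```typescript
/** Run `task`, retrying on rejection with exponential backoff. Hypothetical helper. */
async function withRetry<T>(
  task: () => Promise<T>,
  attempts = 3,
  baseDelayMs = 500,
): Promise<T> {
  const maxAttempts = Math.max(1, attempts);
  let lastError: unknown;

  for (let attempt = 0; attempt < maxAttempts; attempt++) {
    try {
      return await task();
    } catch (err) {
      lastError = err;
      if (attempt < maxAttempts - 1) {
        // Wait baseDelayMs, then 2x, 4x, ... before the next try.
        const delay = baseDelayMs * 2 ** attempt;
        await new Promise<void>((resolve) => setTimeout(resolve, delay));
      }
    }
  }
  // Every attempt failed; surface the most recent error.
  throw lastError;
}
```

Inside the upload loop, the fetch call could be wrapped as withRetry(() => fetch(...)), with the task throwing on a non-OK response so that HTTP errors are retried as well. A fresh FormData may need to be built per attempt, depending on how the backend consumes the request body.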

🌍 Browser Compatibility

Compatible with modern browsers supporting:

  • ✅ WebAssembly (WASM)
  • ✅ Web Audio API (AudioContext)
  • ✅ MediaDevices API
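
A page can check for these three features before creating a recorder. The function below is a minimal sketch; missingFeatures is a hypothetical helper, not part of awesome-recorder, and it takes the global object as a parameter so that it can be exercised outside a browser.

```typescript
/** Names of the required browser features that are absent from `env`. Hypothetical helper. */
function missingFeatures(env: any = globalThis): string[] {
  const missing: string[] = [];

  if (typeof env.WebAssembly !== "object") {
    missing.push("WebAssembly");
  }
  // Older Safari versions expose the constructor as webkitAudioContext.
  if (typeof (env.AudioContext ?? env.webkitAudioContext) !== "function") {
    missing.push("Web Audio API");
  }
  if (typeof env.navigator?.mediaDevices?.getUserMedia !== "function") {
    missing.push("MediaDevices API");
  }
  return missing;
}
```

An empty result means all three features are present. Note that navigator.mediaDevices is only available in secure contexts (HTTPS or localhost), so the last check can fail on an otherwise capable browser when the page is served over plain HTTP.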

⚠️ Notes for Bundlers

Vite

When using Vite, exclude @ffmpeg/ffmpeg from dependency optimization:

// vite.config.js
export default {
  optimizeDeps: {
    exclude: ["@ffmpeg/ffmpeg"],
  },
};

πŸ” Example Project

Check out the practical demo in the example directory of the repository.

📄 License

Released under the MIT License.

✨ Enjoy effortless, efficient, and powerful audio recording in your web apps! ✨