1.0.3 • Published 4 years ago

@drorgl/youtube-captions-scraper v1.0.3

Weekly downloads
1
License
MIT
Repository
github
Last release
4 years ago

Youtube Captions Scraper

Note: This is forked from syzer/youtube-captions-scraper, typescriptified and added a few other options

  • Fetch Caption Tracks
  • Fetch Subtitles by language
  • Fetch Auto-Generated Captions
  • Fetch Full Text of Captions
  • Optional external HTTP requester

Installation

> npm install @drorgl/youtube-captions-scraper

Usage

import {
  YoutubeCaptions, 
  getSubtitles, 
  getSubtitlesContent 
  } from 'youtube-captions-scraper';

//Get Subtitles for Video
getSubtitles({
  videoID: 'XXXXX', // youtube video id
  lang: 'fr' // default: `en`
}).then(captions => {
  console.log(captions);
});

//Get full transcription as one string
const subtitles = await getSubtitlesContent({ videoID: "P-ygmGxuiEI" });

let youtubeCaptions = new YoutubeCaptions('XXXXX' /*youtube video id*/);

//retrieve caption tracks
let captionTracks = await youtubeCaptions.getCaptionTracks();

//retrieve subtitles by language
let subtitles = await youtubeCaptions.getSubtitles('en' /*optional language*/);

Captions will be an array of object of this format:

{
  "start": Number,
  "dur": Number,
  "text": String, //HTML stripped subtitles
  "htmlText":String
}