0.7.6 • Published 2 years ago

@fablevision/word-timings v0.7.6

Weekly downloads
-
License
MIT
Repository
-
Last release
2 years ago

Word Timings Generator

Uses Vosk to analyze audio files and generate word timings.

Usage

npx word-timings -c ./path/to/my-config.json

Uses a configuration file in the format:

{
    "model": "path/to/model", // Path to the model you've downloaded and unzipped.
    "cache": "path/to/.cachefile", // Optional, name a cache file to use instead of the default. This speeds up later runs.
    "pretty": true, // Optional - if true, pretty prints the output
    "outputs": [
        {
            "file": "path/to/output.json", // path for the output for this group of files
            "globs": ["path/to/*.wav"] // globs of files to batch together into this output
        }
    ]
}

Audio files must be mono PCM .wav files, and are suggested to run in 16khz (although higher sample rates seem to work okay).

Output

Output will be a JSON dictionary of filenames (no path or extension) to arrays of time data.

{
    "myfile": [[0.1, 0.3], 0.4, 0.5, 0.6, [0.8, 1.2]]
}

Time data is an array, where every element is either a tuple representing the start & end time of that word, or a number representing the end time of the word with the start time being the previous word's end time. All times are in seconds.

0.7.6

2 years ago

0.7.5

2 years ago

0.7.4

2 years ago

0.7.2

2 years ago

0.7.1

2 years ago

0.7.3

2 years ago

0.7.0

2 years ago

0.6.1

2 years ago

0.6.0

2 years ago

0.5.1

2 years ago

0.5.0

2 years ago