0.2.4 • Published 2 years ago

readahead v0.2.4

Weekly downloads
-
License
ISC
Repository
github
Last release
2 years ago

Readahead

A fast way to sequentially and asynchronously access buffers of any size from a readable.

readahead will speed it up, especially if your average requested buffer size is less than 4kb

Usage

// const readahead = require('readahead')
import readahead from 'readahead'
import { createReadStream } from 'fs'

async function main () {
  readable = createReadStream('large-file.data')
  const reader = readahead(readable)
  let length = 100
  while (true) {
    const { value, done } = await reader.next(length)
    if (done) break
    console.log(value) // buffer of size <length>
  }
}
main().catch(console.error)

Benchmark

Execution context: MBP 2015

A 1.6GB pcap file was parsed, segmenting packet header and body. No body decoding.

We compare the reading modes "classic" vs "streaming" vs "idle".

In classic mode, the reader reads the individual buffers as needed, which requires many read operations on the disk. Slowing down the process.

In streaming mode, the reader reads a configurable amount of bytes ahead using fs.createReadStream and caches them in memory. Once the parser wants to access these bytes, they are read from memory directly and the rolling window of cached bytes moves along until the end of the file is reached.

In idle mode, nothing happens besides the file being read using fs.createReadStream. This serves as limit for "how fast it gets with NodeJS".

modeduration scompared with idle mode
idle1.1641.0
stream1.5381.3
classic35.40530.4

RAW data

fh.read every buffer:             29.86s user 17.48s system 133% cpu 35.405 total
readStream and readahead parsing:  1.00s user  0.82s system 118% cpu  1.538 total
readStream, no parsing:            0.45s user  0.99s system 124% cpu  1.164 total (idle reference)
0.2.1

2 years ago

0.2.3

2 years ago

0.2.4

2 years ago

0.2.0

3 years ago