@transit/gtfs v3.0.0

License
MIT

Node.js light GTFS loading and manipulation

A naive Node.js library to load and manipulate GTFS datasets.

Installation

npm install --save @transit/gtfs

Usage

The tables of the GTFS are loaded lazily: each table is read only when it is first accessed, not upfront. This lets you manipulate the small tables (like routes.txt or stops.txt) without having to load the big ones (like stop_times.txt).
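
To make the lazy-loading idea concrete, here is a minimal, self-contained sketch. It is not the library's actual implementation: the `LazyTables` class, its `getTable` method, and the in-memory loader functions are hypothetical names invented for this illustration.

```javascript
// Hypothetical sketch of lazy table loading. None of these names belong to
// @transit/gtfs; they only illustrate "load on first access, then cache".
class LazyTables {
  constructor(loaders) {
    this.loaders = loaders; // table name -> function returning an array of rows
    this.cache = new Map(); // table name -> Map of rows, filled on first access
  }

  getTable(name) {
    if (!this.cache.has(name)) {
      // First access: run the loader once and index the rows by their id.
      const rows = this.loaders[name]();
      this.cache.set(name, new Map(rows.map((row) => [row.id, row])));
    }
    return this.cache.get(name);
  }
}

// Records which loaders actually ran, to show that untouched tables stay unloaded.
const loadedTables = [];

const tables = new LazyTables({
  stops: () => {
    loadedTables.push('stops');
    return [{ id: 'S1', stop_name: 'Central Station' }];
  },
  stop_times: () => {
    loadedTables.push('stop_times');
    return [{ id: 'T1:1', stop_id: 'S1' }];
  },
});

const stopName = tables.getTable('stops').get('S1').stop_name;
tables.getTable('stops'); // Second access is served from the cache.

console.log(stopName, loadedTables);
```

In this sketch only the `stops` loader runs, and it runs once; the `stop_times` loader is never called because that table is never accessed.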

Example

If you want to remove all the stops called 'Central Station' and the stop_times using this stop:

const { Gtfs } = require('@transit/gtfs');

const gtfs = new Gtfs('pathToTheFolderContainingTheGtfs');

gtfs.forEachStop((stop) => {
  if (stop.stop_name === 'Central Station') {
    gtfs.removeStop(stop);
  }
});

gtfs.forEachStopTime((stopTime) => {
  if (!gtfs.getStopOfStopTime(stopTime)) {
    gtfs.removeStopTime(stopTime);
  }
});

// Let's also clean up the frequencies, to keep a consistent GTFS.
gtfs.forEachFrequency((frequency) => {
  const fromStop = gtfs.getStopWithId(frequency.from_stop_id);
  const toStop = gtfs.getStopWithId(frequency.to_stop_id);
  
  if (!fromStop || !toStop) {
    gtfs.removeFrequency(frequency);
  }
});

gtfs.exportAtPath('somePathWhereYouWantToExportTheGtfs', (error) => {
  if (error) { throw error; }
  
  // Done
});

Keep in mind: synchronous loading and indexes

This project is Transit's old GTFS implementation, using callbacks and plain objects. Its main advantage is that it is light and simple to use, but it has some drawbacks. The two biggest ones are the indexes and the synchronous loading of tables.

Indexes

The tables are loaded and saved as Maps, to allow O(1) access by id. The routes, for example, are indexed by their route_id value, so that value is stored twice: in route.route_id and as the key of the index.

This indexing is not automatically kept up to date.

If you change the route_id by mutating the route object directly, the index will not be updated, and the table will be corrupted. To properly update the id of a route, remove the route, change its id, and add it back:

const route = gtfs.getRouteWithId('oldId');
gtfs.removeRoute(route);
route.route_id = 'newId';
gtfs.addRoute(route);
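
The underlying problem can be reproduced with a plain Map, without the library. The sketch below is only an illustration of a stale index; the `routes` Map and the route fields are made up for the example and do not reflect the library's internals.

```javascript
// Illustration only: a plain Map standing in for an id-indexed table.
const routes = new Map();
const route = { route_id: 'oldId', route_short_name: '10' };
routes.set(route.route_id, route);

// Mutating the id in place does not move the entry in the Map.
route.route_id = 'newId';
const staleLookup = routes.get('newId');                // undefined: the key was never updated
const stillUnderOldKey = routes.get('oldId') === route; // true: still stored under the old key

// Repair: drop the old key and re-insert the route under its current id.
routes.delete('oldId');
routes.set(route.route_id, route);
const fixedLookup = routes.get('newId');

console.log(staleLookup, stillUnderOldKey, fixedLookup === route);
```

After the in-place mutation, the key and the value disagree, which is why the remove, change, add sequence is needed.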

Synchronous loading

The goal of this implementation was to avoid loading all the tables upfront. Therefore, each table is loaded only when it is required. This makes the code faster to run whenever some tables are not needed at all.

The drawback is that any function could trigger the loading of a table. Since we do not want to turn every function into an async one, the tables are loaded synchronously, which blocks the event loop while a table is being read.
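
The trade-off can be sketched as follows. This is a simplified illustration under stated assumptions, not the library's code: the `SyncStops` class is hypothetical, the sample stops.txt is written to a temporary folder just for the example, and the CSV parsing is deliberately naive (no quoted fields).

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Write a tiny sample stops.txt into a temporary folder (example data only).
const folder = fs.mkdtempSync(path.join(os.tmpdir(), 'gtfs-sketch-'));
fs.writeFileSync(
  path.join(folder, 'stops.txt'),
  'stop_id,stop_name\nS1,Central Station\nS2,Harbour\n'
);

// Hypothetical class: a plain, non-async getter that loads the table on first use.
class SyncStops {
  constructor(folderPath) {
    this.folderPath = folderPath;
    this.stops = null; // Not loaded until the first lookup.
  }

  getStopWithId(stopId) {
    if (this.stops === null) {
      // Synchronous read: blocks the event loop, but keeps this method synchronous.
      const text = fs.readFileSync(path.join(this.folderPath, 'stops.txt'), 'utf8');
      const [header, ...lines] = text.trim().split('\n');
      const columns = header.split(',');
      // Naive CSV parsing: assumes no quoted fields or embedded commas.
      this.stops = new Map(lines.map((line) => {
        const values = line.split(',');
        const stop = Object.fromEntries(columns.map((column, i) => [column, values[i]]));
        return [stop.stop_id, stop];
      }));
    }
    return this.stops.get(stopId);
  }
}

const syncStops = new SyncStops(folder);
const harbour = syncStops.getStopWithId('S2'); // Triggers the load, returns immediately after.

console.log(harbour.stop_name);
```

The caller never needs `await` or a callback, at the cost of a blocking read the first time the table is touched.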

Naming

The wording used in the official GTFS specification has been followed as much as possible, including its inconsistencies. For example, the table containing the stops is "stops", but the table containing the agencies is "agency", because the specification names the files stops.txt and agency.txt.

Most of the time, the name of one item of a table is the singular of the table name (routes -> route, stops -> stop). The exception is shapes.txt: since one item of that table is not a "shape" per se but just a point, the name used is "shapePoint" (consistent with the names shape_pt_sequence, shape_pt_lat and shape_pt_lon in the spec).

Support and contact

Please post any issues you find on the repo of the project, and do not hesitate to contact Transit App or Leo Frachet directly if you have any questions.
