Dataset Constraints
To support a wide range of data and convert it into nodes and edges, we place the following constraints on the dataset:
The file must be in CSV format with a header row.
Each time-based data row must contain the following fields:
Start node: longitude, latitude, ID
End node: longitude, latitude, ID
Edge info: startTime, stopTime, ID
Each time-based data row may have additional fields:
bikeID
usertype
birth year
gender
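The required fields listed above can be checked against a file's header row before any conversion is attempted. The following is a minimal sketch in Python, not a definitive implementation: the column names in REQUIRED_COLUMNS and the function name missing_required_columns are hypothetical, chosen only to mirror the start node, end node, and edge fields, and a real dataset would supply its own names.

```python
import csv

# Hypothetical column names standing in for the required fields;
# a real dataset would use its own header names for each of these.
REQUIRED_COLUMNS = [
    "start_longitude", "start_latitude", "start_id",  # start node
    "end_longitude", "end_latitude", "end_id",        # end node
    "startTime", "stopTime", "edge_id",               # edge info
]


def missing_required_columns(csv_file, required=REQUIRED_COLUMNS):
    """Return the required column names that are absent from the CSV header row."""
    reader = csv.reader(csv_file)
    header = next(reader, [])  # an empty file yields an empty header
    present = {name.strip() for name in header}
    return [name for name in required if name not in present]
```

An empty result means the header satisfies the constraints. Additional columns are ignored, which matches the rule that rows may carry extra fields.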
Example dataset
In our example Hubway dataset, each row meets these requirements and contains the following fields:
"tripduration","starttime","stoptime","start station id","start station name","start station latitude","start station longitude","end station id","end station name","end station latitude","end station longitude","bikeid","usertype","birth year","gender"