Skip to main content
NJ Satellite Image

Introduction

This chapter presents the results of a statewide inference run for extracting sidewalk centerlines across New Jersey. The input data consists of statewide high-resolution aerial imagery available as GeoTIFF tiles, which were preprocessed into model-ready chips through a tiling and normalization pipeline. The run was conducted as the first large-scale test deployment of our geospatial ML pipeline. The primary objectives were:
  • Output Fidelity: Generate statewide sidewalk detections and assess their quality
  • Scalability Check: Validate that our Ray + Triton inference pipeline can handle thousands of tiles
  • Performance Baseline: Measure throughput, bottlenecks, and failure points

Hardware Setup

  • Single GPU (GPU 0)
  • NVIDIA A4000 GPU

Data & Preprocessing

The input consisted of statewide high-resolution aerial imagery of New Jersey, provided as GeoTIFF tiles.

Preprocessing Pipeline

  • Pseudo-color Conversion: To match the model’s training format (NIR-G-B)
  • Normalization: Tile pixel values normalized from UINT16 to UINT8
  • Dynamic Padding: Tile pixel dimensions padded to support overlap between chips
  • Chipping: Large GeoTIFFs divided into fixed-size chips (256×256)

Pipeline Stages

  1. Loader, Reading tiles from storage, applying padding, generating chips
  2. Inference, Queueing, batching, model execution, mask thresholding, reconstruction
  3. Postprocess, Polygonization, filtering, and centerline extraction

Results

Tile and Job Statistics

MetricValue
Total tiles processed9,202
Input tile dimensions5,000 × 5,000 pixels
Chip size256 × 256 pixels
Chips per tile (avg)529
Total chips processed4,867,858
Total pixels processed~319 billion
Estimated statewide coverage~21,350 km²
Total job wall time~17.2 hours
Tiles per minute8.88
Average tile throughput per hour535

Overall Run Performance

MetricValue
Success rate99.66%
Failed tiles31/9,202
Total centerlines extracted889,568
Avg centerlines per tile96.73

Stage Timings (seconds)

Stageavgp50p90p99
Loader6.466.377.028.30
Inference54.4654.0460.8666.06
Postprocess5.193.6010.4622.13
End-to-end66.1164.9975.2988.70

Stage Share of End-to-End Time

Median (p50):
  • Loader: ~9.8%
  • Inference: ~83.2%
  • Postprocess: ~5.5%

Centerline Quality

Compared with OpenStreetMap (OSM) sidewalk centerline annotations: Strengths:
  • Alignment with visible sidewalks
  • High-value annotations where predictions are strong
  • Conservative modeling choice (erring on underprediction)
Limitations:
  • Underprediction & occlusion gaps
  • Some false positives in highways, driveways, parking lots
  • Continuity issues in fragmented segments

Key Takeaways

  • Scalability validated: 9,202 statewide tiles processed in 17.2 hours
  • High reliability: 99.66% success rate
  • Inference-bound performance: ~80–83% of per-tile time
  • Annotation quality: Predictions aligned well with visible sidewalks

Potential Optimizations

  1. Pipeline Throughput, Target 30-50% improvement to achieve ~9-12 hour overnight runs
  2. Larger / Better Training Dataset, Including DVRPC and Boston imagery
  3. Centerline Extraction, Dynamic parameterization and occlusion handling
  4. Dynamic Tile Treatment, Lightweight classifier for routing tiles to specialized processing

Chapter Summary

This statewide case study demonstrates both the scalability of our pipeline and the quality trade-offs of automated sidewalk extraction. The pipeline successfully processed over 9,200 tiles and nearly 5 million chips, validating statewide scale operation. Performance analysis shows that inference is the dominant bottleneck, consuming ~80% of per-tile runtime. The results provide a robust baseline for benchmarking future improvements in throughput and efficiency.