This repository contains the CUDA implementation of the paper "Work-efficient Parallel Non-Maximum Suppression Kernels".