Almost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid