taco
taco copied to clipboard
Implement separate lowerers for C and CUDA
I created two classes: LowererImplC and LowererImplCUDA. They implement lowerForall, lowerWhere and helper functions called by them separately for CPU code and CUDA code. This is the starting point of supporting more types of backends.