Move extern "C" around the main ElementWise function to avoid linkage issues...
Move extern "C" so that it wraps only the main ElementWise function, to avoid linkage issues with the preamble (e.g. when the preamble includes cuda_fp16.h)
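
A minimal sketch of the idea, written as plain host C++ so it can be compiled without a CUDA toolchain. All names here (`scale`, `to_float`, `elementwise_scale`) are hypothetical and are not taken from the actual source. In the real code the entry point would presumably be a `__global__` kernel and the preamble would be user-supplied text. The point is only that the preamble stays outside the `extern "C"` region, because C++-only constructs such as templates cannot be given C linkage.

```cpp
#include <cassert>
#include <cstddef>

// --- preamble (hypothetical stand-in for user code or an included header) ---
// Templates cannot have C linkage, and overloaded functions cannot all share
// one unmangled C name, so this part must stay OUTSIDE any extern "C" region.
template <typename T>
inline T scale(T x, T factor) { return x * factor; }

inline float to_float(int x)   { return static_cast<float>(x); }
inline float to_float(float x) { return x; }

// --- main function ---
// Only the entry point gets C linkage, so its symbol name is not mangled
// and can be looked up by its plain name.
extern "C" void elementwise_scale(float *out, const float *in, std::size_t n)
{
    assert(out != nullptr && in != nullptr);
    for (std::size_t i = 0; i < n; ++i)
        out[i] = scale(to_float(in[i]), 2.0f);
}
```

Wrapping the whole file in `extern "C" { ... }` instead would make the template above ill-formed, which is the kind of linkage error this change is meant to avoid.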