Replaced PyArray_SIZE(ary) with PyArray_NBYTES(ary).
pycuda.driver.register_host_memory() was passing the size of the array in elements, rather than the length of the array in bytes when creating a register_host_memory class. This resulted in "Invalid Argument" errors when calling asynchronous memory copies on the entire array.
Loading
Please register or sign in to comment