The current get_tensor implementation first reads the data into a local buffer, then copies that buffer into a newly allocated one, and finally creates a PyTorch tensor from the new buffer.
The copy can be avoided by allocating a buffer of the desired size up front, registering it with the transfer engine, and reading the data directly into that buffer.
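
As a rough illustration of the difference, here is a minimal pure-Python sketch. The `FakeTransferEngine` class and its `size_of`, `register_buffer`, and `read_into` methods are hypothetical stand-ins, not the project's real API; they only model the rule that the engine writes into registered buffers.

```python
import struct


class FakeTransferEngine:
    """Hypothetical stand-in for a transfer engine (not the real API)."""

    def __init__(self, store):
        self._store = store          # key -> bytes held by the "remote" side
        self._registered = set()     # ids of buffers allowed as targets

    def size_of(self, key):
        return len(self._store[key])

    def register_buffer(self, buf):
        self._registered.add(id(buf))

    def read_into(self, key, buf):
        # Only registered buffers may receive data.
        if id(buf) not in self._registered:
            raise ValueError("buffer is not registered")
        data = self._store[key]
        buf[:len(data)] = data
        return len(data)


def get_with_staging_copy(engine, key, staging):
    """Current pattern: read into a staging buffer, then copy out."""
    n = engine.read_into(key, staging)
    # Extra allocation plus a full copy of the payload.
    return bytearray(memoryview(staging)[:n])


def get_direct(engine, key):
    """Proposed pattern: allocate the exact size and receive in place."""
    out = bytearray(engine.size_of(key))
    engine.register_buffer(out)
    engine.read_into(key, out)
    # A real implementation could wrap `out` in a tensor without copying,
    # for example with torch.frombuffer(out, dtype=...); that step is
    # left out here to keep the sketch dependency-free.
    return out


payload = struct.pack("4f", 1.0, 2.0, 3.0, 4.0)
engine = FakeTransferEngine({"t": payload})

staging = bytearray(64)
engine.register_buffer(staging)

a = get_with_staging_copy(engine, "t", staging)
b = get_direct(engine, "t")
```

Both paths return the same bytes, but `get_direct` skips the intermediate copy. In practice the cost of registering a buffer per call would need to be weighed against the cost of the copy it saves.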
Before submitting a new issue...
Make sure you already searched for relevant issues and read the documentation