[Feature Request]: Zero copy for get_tensor Mooncake store Python API

### Describe your feature request

The current get_tensor implementation first get data to local buffer and then copy the data buffer to a newly allocated buffer. And then created a pytorch tensor using the new buffer.

The copy can be avoided by directly allocating the buffer with desired size and register the buffer for transfer engine. And get the data directly into the allocated buffer.

### Before submitting a new issue...

- [ ] Make sure you already searched for relevant issues and read the [documentation](https://kvcache-ai.github.io/Mooncake/)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature Request]: Zero copy for get_tensor Mooncake store Python API #799

Describe your feature request

Before submitting a new issue...

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request]: Zero copy for get_tensor Mooncake store Python API #799

Description

Describe your feature request

Before submitting a new issue...

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions