
Provide API to get runtime Tensor buffer, dtype, and shape. #1957

Open · LPanosTT opened this issue Jan 24, 2025 · 3 comments · May be fixed by #2370
LPanosTT (Contributor) commented Jan 24, 2025

For the frontends to properly reconstruct a tensor during an intermediate callback, the data type, shape, and data buffer must be provided. Currently, we can only get the tensor data as a std::vector&lt;float&gt; or as a tt::runtime::Tensor, which itself holds a void pointer to a ttnn::Tensor that can't be used unless the frontend also depends on tt-metal.

I believe that if the tt::runtime::Tensor held a TensorDesc for itself, or otherwise carried its own shape and dtype, then calling getOpOutputTensor would suffice.
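For illustration, here is a minimal sketch of what such a self-describing runtime tensor could look like. The TensorDesc fields and the `desc()`/`data()` accessors are hypothetical, not the actual tt::runtime API:

```cpp
#include <cstdint>
#include <vector>

namespace tt::runtime {

// Hypothetical element-type enum; the real runtime's set may differ.
enum class DataType { Float32, BFloat16, UInt32 };

// Hypothetical self-describing metadata a runtime Tensor could carry.
struct TensorDesc {
  std::vector<std::uint32_t> shape; // logical extents
  DataType dtype;                   // element type
  std::uint32_t itemsize;           // bytes per element
};

class Tensor {
public:
  // With accessors like these, a frontend could rebuild the tensor in its
  // own framework without depending on tt-metal / ttnn types.
  const TensorDesc &desc() const { return desc_; }
  const void *data() const { return data_; }

private:
  TensorDesc desc_;
  void *data_ = nullptr; // row-major host buffer
};

} // namespace tt::runtime
```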

tapspatel (Collaborator) commented
@ctodTT I believe you were investigating something similar about returning non-float types. Let's discuss this along with the other issues on your plate right now.

tapspatel (Collaborator) commented
We might want to clean up the APIs as support for the callback grows. Other frontends are starting to integrate this into their ecosystems, so if you think there's a case for cleaner API support, definitely post your suggestions.

LPanosTT (Contributor, Author) commented Jan 27, 2025

I think if the frontends are able to retrieve a buffer, shape, dtype, and strides, that should be enough for any frontend to construct a tensor in whatever framework it wants. Strides are somewhat redundant, though, since ttnn tensors don't have strides and the logical data order is always contiguous (tile layout is technically not contiguous, but we convert to row major anyway). A small sketch of why follows below.
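To make the redundancy concrete: for a contiguous row-major buffer, the strides are fully determined by the shape. A standalone sketch, not tied to the actual runtime API:

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Computes element strides for a contiguous row-major tensor: the innermost
// dimension has stride 1, and each outer stride is the product of all inner
// dimension extents.
std::vector<std::uint32_t>
contiguousStrides(const std::vector<std::uint32_t> &shape) {
  std::vector<std::uint32_t> strides(shape.size(), 1);
  for (std::size_t i = shape.size(); i > 1; --i)
    strides[i - 2] = strides[i - 1] * shape[i - 1];
  return strides;
}

// Example: shape {2, 3, 4} -> strides {12, 4, 1} (in elements).
```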

ctodTT pushed a commit that referenced this issue Mar 5, 2025
This change implements an API on runtime tensors that exposes the metadata and contents of the underlying TTNN tensor. This functionality is pybound as well, allowing easy casting to torch tensors. Please see `runtime/test/python/ttnn/test_runtime_api.py` for an example.

NOTE: This is not the most efficient implementation possible, as there is effectively a double copy to get the data into a pybindable row-major format. There is probably some trickery that can be done later to avoid this, but I would like to get this functionality out ASAP and avoid premature optimization.

Closes #1957
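To illustrate the double copy the note mentions, a purely hypothetical sketch; `untilizeToRowMajor` stands in for the real ttnn tile-to-row-major conversion and is stubbed as a plain copy here to keep the example self-contained:

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the tile-layout -> row-major conversion the
// runtime performs; stubbed as a straight copy for this sketch.
static std::vector<float> untilizeToRowMajor(const float *src, std::size_t n) {
  return std::vector<float>(src, src + n); // copy #1: reach row-major layout
}

// Sketch of the "double copy": one copy to obtain a row-major buffer, and a
// second into an owning container whose lifetime pybind11 can manage.
std::vector<float> getPybindableData(const float *hostData, std::size_t n) {
  std::vector<float> rowMajor = untilizeToRowMajor(hostData, n);
  return std::vector<float>(rowMajor.begin(), rowMajor.end()); // copy #2
}
```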
ctodTT linked a pull request Mar 5, 2025 that will close this issue