ttnn.sharded_to_interleaved_partial
- ttnn.sharded_to_interleaved_partial(input_tensor: ttnn.Tensor, cache_tensor: ttnn.Tensor, num_slices: int, slice_index: int, *, memory_config: ttnn.MemoryConfig | None = None, output_dtype: ttnn.DataType | None = None) → ttnn.Tensor
Converts a partial tensor (one slice) from sharded to interleaved memory layout and writes it into the cache tensor.
- Parameters:
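input_tensor (ttnn.Tensor) – The sharded input tensor holding the slice to convert.
cache_tensor (ttnn.Tensor) – The cache tensor that receives the converted slice.
num_slices (int) – Number of slices the cache tensor is divided into.
slice_index (int) – Index of the slice to write.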
- Keyword Arguments:
memory_config (Optional[ttnn.MemoryConfig]) – Memory configuration for the operation. Defaults to None.
output_dtype (Optional[ttnn.DataType]) – Output data type. Defaults to None, which keeps the input data type.
- Returns:
ttnn.Tensor – The cache tensor with the partial data written.
Example
>>> interleaved_tensor = ttnn.sharded_to_interleaved_partial(tensor, cache_tensor, 4, 2)
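
The call above passes num_slices=4 and slice_index=2. As a rough mental model only, the pure-Python sketch below shows what "writing one slice into a cache" could look like on nested lists. It does not use ttnn and is not the library's implementation: the helper name `write_slice`, the choice to slice along rows, and the zero-based `slice_index` are all assumptions made for illustration.

```python
def write_slice(cache, part, num_slices, slice_index):
    """Copy `part` into one of `num_slices` equal row blocks of `cache`.

    Hypothetical model, not ttnn code. Assumes the cache is split along
    its rows and that slice_index is zero-based.
    """
    total_rows = len(cache)
    if total_rows % num_slices != 0:
        raise ValueError("cache rows must divide evenly into num_slices")
    if not 0 <= slice_index < num_slices:
        raise ValueError("slice_index out of range")
    rows_per_slice = total_rows // num_slices
    if len(part) != rows_per_slice:
        raise ValueError("part must contain exactly one slice of rows")

    # Overwrite only the rows that belong to the selected slice.
    start = slice_index * rows_per_slice
    for offset, row in enumerate(part):
        cache[start + offset] = list(row)
    return cache


cache = [[0, 0] for _ in range(8)]   # 8 rows, so 4 slices of 2 rows each
part = [[1, 2], [3, 4]]              # data for a single slice
result = write_slice(cache, part, num_slices=4, slice_index=2)
```

Under these assumptions only rows 4 and 5 of the cache change, and the returned object is the cache itself, which mirrors the documented return value of "the cache tensor with the partial data written".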