Lab Exercises
Lab 1: Single Core Matrix Multiplication
- Introduction
- Row-Major Memory Layout
- Linear Transformation and Computational Flexibility
- Loop Tiling
- Introduction to Tenstorrent Architecture
- TT-Metalium Programming Model
- Example TT-Metalium Program
- Kernel Compilation and Execution
- Debug Facilities in TT-Metalium
- Matrix Multiplication in TT-Metalium
- Conclusion
- Troubleshooting and Additional Resources
Lab 2: Multi Core Matrix Multiplication
- Introduction
- From Single Core to Multi Core Matrix Multiplication
- Data Reuse in Multi Core Matrix Multiplication
- Potential Additional Optimizations
- Conclusion
Lab 3: Multicast for Improved Data Reuse in Multi Core Matrix Multiplication
- Introduction
- Background: Tenstorrent NoC and Multicast
- Example Multicast Program
- Debugging Hangs with Watcher
- Exercise 1: Debugging Multicast Issues Using Watcher
- Exercise 2: Extending the Standalone Multicast Example
- Exercise 3: Batched Multicast for Improved Throughput
- Applying Multicast to Multi Core Matrix Multiplication
- Exercise 4: Multi Core Matrix Multiplication with Multicast
- Conclusion
- Appendix A: Watcher Log File Format