Skip to content

In single stage pipeline, how memory loading from shared ram is overlapped with mma computation? #1503

Answered by hwu36
wzhcz8902 asked this question in Q&A
Discussion options

You must be logged in to vote

There is no overlap in single stage.

just

loop:
  load
  compute

It saves space at the cost of efficiency.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by wzhcz8902
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants