Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about non-pim logic #10

Open
Drewmackintire opened this issue Mar 21, 2024 · 1 comment
Open

Questions about non-pim logic #10

Drewmackintire opened this issue Mar 21, 2024 · 1 comment

Comments

@Drewmackintire
Copy link

I'm writing the code for executing some useful kernels for gpt3 and other transformer model. After completing kernels, I'd like to compare the PIM & NON-PIM execution latency by PIMBENCHTESTCASES file. However, when I observe the code, I can't see any typical kernel or execution for non-pim kernel. Can you give a tip for mocking or simulating NON-PIM execution kernel? Thanks.

@iamshcha
Copy link
Collaborator

The variety of cases is expected to arise based on the tiling method, the sequence of weight data reads by the xPU, and the management of intermediate results. These cases depend on the configuration of the xPU system, for example, the size of the SPM or the impact of on-chip caches can lead to different optimization techniques. For the non-PIM system, we used an oracle approach that assumes sequential reads of the weight data(see genMemTraffic()), which hide the computational overhead, but note that unlike GEMV, this approach cannot be used for operations with high locality data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants