[Documentation]: Shall we modify the configurations in v2python for the other kernels? #12
Comments
Yes, the current steps to add new kernels are
(Note: Keep this Issue open until the
Theoretically you can, but it's an untested code path right now. The JSON file is what we called the "tuning database" in internal slides (nothing secret, just unsuitable to release as part of the source code repos). The tuning database did not exist until commit 099141a, and before that you needed to specify the However, since the introduction of the tuning database, it has become central to the build process and guides how AOTriton dispatches user inputs to actual GPU kernels. The original approach is not well tested, since its performance is abysmal (could be 10x slower).
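For readers new to the idea, here is a minimal sketch of what database-driven dispatch can look like. It assumes a made-up JSON schema: the field names, the `BLOCK_M`/`BLOCK_N` values, the seqlen bucketing, and the helpers `load_tuning_db` and `select_config` are all hypothetical and are not AOTriton's actual format or code.

```python
import json

# Hypothetical tuning database: every field name and value here is invented
# for illustration and does not reflect AOTriton's real schema.
TUNING_DB_JSON = """
[
  {"arch": "gfx90a", "dtype": "fp16", "max_seqlen": 1024,
   "config": {"BLOCK_M": 128, "BLOCK_N": 64}},
  {"arch": "gfx90a", "dtype": "fp16", "max_seqlen": 4096,
   "config": {"BLOCK_M": 64, "BLOCK_N": 64}}
]
"""

# Untuned fallback, standing in for the slower path used without a database.
DEFAULT_CONFIG = {"BLOCK_M": 16, "BLOCK_N": 16}


def load_tuning_db(text):
    """Index entries by (arch, dtype), sorted by ascending max_seqlen."""
    table = {}
    for entry in json.loads(text):
        key = (entry["arch"], entry["dtype"])
        table.setdefault(key, []).append((entry["max_seqlen"], entry["config"]))
    for rows in table.values():
        rows.sort(key=lambda row: row[0])
    return table


def select_config(table, arch, dtype, seqlen):
    """Return the config of the smallest tuned bucket that fits seqlen."""
    for max_seqlen, config in table.get((arch, dtype), []):
        if seqlen <= max_seqlen:
            return config
    # No tuned entry covers this input, so fall back to the default.
    return DEFAULT_CONFIG


table = load_tuning_db(TUNING_DB_JSON)
print(select_config(table, "gfx90a", "fp16", 512))
print(select_config(table, "gfx90a", "fp16", 2048))
print(select_config(table, "gfx1100", "fp16", 512))
```

The point of the sketch is only that the lookup table, rather than hand-written rules, decides which precompiled kernel variant handles a given input; inputs with no tuned entry fall through to a generic configuration.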
This totally depends on your needs and on the actual kernel you want to compile. The tuning database is an AOT version of Note that a Triton kernel compiled with certain configurations can run without segfaults or other runtime errors, yet still not give you correct results. See
The JSON version is a little bit verbose. However, we have replaced the JSON with a SQLite3 database. You can check the
No, you don't need to. The current database already describes all tensors' shapes and dtypes. More specifically, for
Thanks for your reply!
Certainly we will, but the migration takes time and extensive regression tests.
Description of errors
In the current Readme.md, we need to change the configurations in python/rules.py, but actually we need to make our own folders like /v2python/rules/flash/, right?
Attach any links, screenshots, or additional evidence you think will be helpful.
No response