Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mamba #447 #615

Merged
merged 18 commits into from
Sep 5, 2024
Merged

Mamba #447 #615

merged 18 commits into from
Sep 5, 2024

Conversation

johnnynunez
Copy link
Contributor

@johnnynunez johnnynunez commented Sep 4, 2024

@johnnynunez
Copy link
Contributor Author

johnnynunez commented Sep 5, 2024

@dusty-nv feel free to organize the folders inside jetson-containers.
mamba depends from causalconv1d.
all repositories that use mamba depends of mamba(causalconv1d too) like videosuite or mambavision
all is build with cuda(this folders need a benchmark) because the first when I use mamba, it used casualconv1d and mamba in cpu and it is very slow.

Now we can use LLM models that use mamba :) as you can see on my test.py
also works with images or video.
I hope that communit comes with demos :)

@johnnynunez johnnynunez marked this pull request as ready for review September 5, 2024 00:22
@johnnynunez
Copy link
Contributor Author

Benchmark:
image
Captura de pantalla 2024-09-05 a las 11 50 14

2 scenario:
image

Captura de pantalla 2024-09-05 a las 11 51 44

@dusty-nv
Copy link
Owner

dusty-nv commented Sep 5, 2024

Thanks @johnnynunez, looks like another fun one! 🤣 🐍

@dusty-nv dusty-nv merged commit e161d32 into dusty-nv:dev Sep 5, 2024
@johnnynunez
Copy link
Contributor Author

Thanks @johnnynunez, looks like another fun one! 🤣 🐍

Please see the spam of my messages JAJAJ. I think I need your help because this is going too fast.

@dusty-nv
Copy link
Owner

dusty-nv commented Sep 5, 2024

Haha that does seem fast! Is it producing valid/coherent output? That would be great 👍

@johnnynunez
Copy link
Contributor Author

Haha that does seem fast! Is it producing valid/coherent output? That would be great 👍

it is the original benchmark from mamba:
https://github.com/state-spaces/mamba/blob/main/benchmarks/benchmark_generation_mamba_simple.py

@johnnynunez
Copy link
Contributor Author

@dusty-nv if you can upload casual-conv1d wheel and mamba-ssm wheel, I will thank you

@dusty-nv
Copy link
Owner

dusty-nv commented Sep 5, 2024

@johnnynunez I will start trying to build these tonight/tomorrow and push the wheels & containers 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants