nemo-v2 <- nemo-vt #3
Closed
Signed-off-by: smajumdar <[email protected]>
Signed-off-by: Vladimir Bataev <[email protected]>
Signed-off-by: smajumdar <[email protected]>
* [Temp] VP Fixes Signed-off-by: smajumdar <[email protected]>
* Revert logging Signed-off-by: smajumdar <[email protected]>
---------
Signed-off-by: smajumdar <[email protected]>
(cherry picked from commit b6f46a0)
Signed-off-by: hsiehjackson <[email protected]>
* check for first or last stage Signed-off-by: ericharper <[email protected]>
* remove redundant check Signed-off-by: ericharper <[email protected]>
* fix typo Signed-off-by: ericharper <[email protected]>
* add map_location Signed-off-by: ericharper <[email protected]>
---------
Signed-off-by: ericharper <[email protected]>
* Bug fix to restore act ckpt Signed-off-by: Markel Sanz Ausin <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
---------
Signed-off-by: Markel Sanz Ausin <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
* Bug fix to reset sequence parallelism Signed-off-by: Markel Sanz Ausin <[email protected]>
* Update seq par reset/restore Signed-off-by: Markel Sanz Ausin <[email protected]>
* Add nested loop Signed-off-by: Markel Sanz Ausin <[email protected]>
---------
Signed-off-by: Markel Sanz Ausin <[email protected]>
…ng (#6744)
* fix checkpointed forward and add test for full activation checkpointing Signed-off-by: Abhinav Khattar <[email protected]>
* add method Signed-off-by: Abhinav Khattar <[email protected]>
* add method Signed-off-by: Abhinav Khattar <[email protected]>
---------
Signed-off-by: Abhinav Khattar <[email protected]>
Signed-off-by: smajumdar <[email protected]>
* add call to p2p overlap Signed-off-by: Abhinav Khattar <[email protected]>
* update Jenkins for test Signed-off-by: Abhinav Khattar <[email protected]>
---------
Signed-off-by: Abhinav Khattar <[email protected]>
* fix get param Signed-off-by: ericharper <[email protected]>
* change name Signed-off-by: ericharper <[email protected]>
---------
Signed-off-by: ericharper <[email protected]>
* initial POC for LDDL Bert
* Finish LDDL POC
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* address comments
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* fix merge head
* resolving merge
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* add support for val/test loaders
* change to new LDDL class + add winding
* fix logging level
* fix winding
* test fix
* fixes to winding
* add file system
* add prepemption optimizations
* more logging
* more prints
* better logging
* asfsf
* add barrier
* removing prints
* working with mb lddl loader
* final changes
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* update requirements file with LDDL Signed-off-by: wdykas <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* revert adding to requirements
---------
Signed-off-by: wdykas <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Eric Harper <[email protected]>
…e (#6740)
* Construct FP8 amax reduction group Signed-off-by: Tim Moon <[email protected]>
* update core for CI Signed-off-by: Abhinav Khattar <[email protected]>
---------
Signed-off-by: Tim Moon <[email protected]>
Signed-off-by: Abhinav Khattar <[email protected]>
Co-authored-by: Abhinav Khattar <[email protected]>
* add interfaces for tp_communication overlap
  [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
  Interface to provide custom userbuffer communicator settings by yaml file
  [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
* Construct MPI process group for userbuffers support Signed-off-by: Tim Moon <[email protected]>
---------
Signed-off-by: Tim Moon <[email protected]>
Co-authored-by: Tim Moon <[email protected]>
Co-authored-by: Abhinav Khattar <[email protected]>
* Fix TTS adapter tutorial Signed-off-by: hsiehjackson <[email protected]>
* Fix version Signed-off-by: hsiehjackson <[email protected]>
---------
Signed-off-by: hsiehjackson <[email protected]>
Signed-off-by: MaximumEntropy <[email protected]>
Signed-off-by: Markel Sanz Ausin <[email protected]>
Signed-off-by: Abhinav Khattar <[email protected]>
* add trainer.validate example Signed-off-by: ericharper <[email protected]>
* clean up white space Signed-off-by: ericharper <[email protected]>
* add mbs and gbs to the config Signed-off-by: ericharper <[email protected]>
---------
Signed-off-by: ericharper <[email protected]>
Signed-off-by: Yi Dong <[email protected]>
Signed-off-by: Yi Dong <[email protected]>
* add model pretraining and customization classes Signed-off-by: ericharper <[email protected]>
* fix Signed-off-by: ericharper <[email protected]>
* test width Signed-off-by: ericharper <[email protected]>
* increase middle pane width Signed-off-by: ericharper <[email protected]>
* add modules and datasets Signed-off-by: ericharper <[email protected]>
* remove global in t5 dataset s and fix formatting in megatron base model Signed-off-by: ericharper <[email protected]>
---------
Signed-off-by: ericharper <[email protected]>
* Apply garbage collection inverval to validation steps Signed-off-by: Sangkug Lym <[email protected]>
* [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci
---------
Signed-off-by: Sangkug Lym <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: ericharper <[email protected]>
What does this PR do?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items, you can still open a "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
The contributor guidelines list specific people who can review PRs in various areas.
Additional Information