-
Notifications
You must be signed in to change notification settings - Fork 5
Issues: hplt-project/OpusTrainer
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Move argument parsing in trainer.py from the main method.
enhancement
New feature or request
minor
#60
opened Sep 17, 2024 by
varisd
Add a cleaning rule for URL names, such as Amazon.com -> Amazon.it
#58
opened Jul 19, 2024 by
gregtatum
NoiseModifier should use a tokenizer to generate correct alignments
#55
opened Mar 1, 2024 by
gregtatum
Synthesized alignments should be validated before producing a new sentence pair
#54
opened Mar 1, 2024 by
gregtatum
Merge sentences produces incorrect alignments when used with SentencePiece
#53
opened Mar 1, 2024 by
gregtatum
Add a new stage starting in the training log
enhancement
New feature or request
#47
opened Jan 26, 2024 by
onadegibert
Could random unicode noise potentially generate invalid UTF-8
bug
Something isn't working
#42
opened Nov 20, 2023 by
XapaJIaMnu
Replace logging with tracing
enhancement
New feature or request
#36
opened Sep 19, 2023 by
jelmervdl
Repeating training data beyond epoch limit if New feature or request
minor
batch_size > len(dataset)
enhancement
#30
opened Aug 14, 2023 by
jelmervdl
Use New feature or request
SentencePair
struct instead of str
internally
enhancement
#29
opened Jul 27, 2023 by
jelmervdl
Save intermediate models after each stage
enhancement
New feature or request
#16
opened Apr 25, 2023 by
onadegibert
Reading small datasets forces reshuffle and extra epochs despite number of epochs set
enhancement
New feature or request
minor
#10
opened Mar 22, 2023 by
XapaJIaMnu
ProTip!
Follow long discussions with comments:>50.