Name FFHQBlindJointDataset is not found, use name: FFHQBlindJointDataset_basicsr! #390

Open
hrituraj-hr opened this issue Jul 18, 2024 · 0 comments

hrituraj-hr commented Jul 18, 2024

logger:[
print_freq: 100
save_checkpoint_freq: 5000.0
use_tb_logger: True
wandb:[
project: None
resume_id: None
]
]
dist_params:[
backend: nccl
port: 29413
]
find_unused_parameters: True
root_path: /path/to/your/project/root
is_train: True
dist: True
rank: 0
world_size: 1

Name FFHQBlindJointDataset is not found, use name: FFHQBlindJointDataset_basicsr!
[rank0]: Traceback (most recent call last):
[rank0]: File "basicsr/train.py", line 220, in <module>
[rank0]: train_pipeline(root_path)
[rank0]: File "basicsr/train.py", line 140, in train_pipeline
[rank0]: result = create_train_val_dataloader(opt, logger)
[rank0]: File "basicsr/train.py", line 83, in create_train_val_dataloader
[rank0]: train_set = build_dataset(dataset_opt)
[rank0]: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/data/__init__.py", line 38, in build_dataset
[rank0]: dataset = DATASET_REGISTRY.get(dataset_opt['type'])(dataset_opt)
[rank0]: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/utils/registry.py", line 71, in get
[rank0]: raise KeyError(f"No object named '{name}' found in '{self._name}' registry!")
[rank0]: KeyError: "No object named 'FFHQBlindJointDataset' found in 'dataset' registry!"
E0719 09:56:05.516689 139715360970560 torch/distributed/elastic/multiprocessing/api.py:826] failed (exitcode: 1) local_rank: 0 (pid: 1325241) of binary: /home/cvbl/miniconda3/envs/codeFormer/bin/python
Traceback (most recent call last):
File "/home/cvbl/miniconda3/envs/codeFormer/bin/torchrun", line 8, in <module>
sys.exit(main())
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/elastic/multiprocessing/errors/__init__.py", line 347, in wrapper
return f(*args, **kwargs)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 879, in main
run(args)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 870, in run
elastic_launch(
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 132, in __call__
return launch_agent(self._config, self._entrypoint, list(args))
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 263, in launch_agent
raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:

basicsr/train.py FAILED

Can someone tell me how to fix this?
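
For context, here is a minimal sketch of the lookup behavior the log above suggests: the registry tries the requested name, then the same name with a `_basicsr` suffix, and raises `KeyError` if neither was registered. The `Registry` class and the `DemoDataset` name below are hypothetical stand-ins written for illustration, not the BasicSR source; only the two messages are taken from the log.

```python
class Registry:
    """Toy name -> class registry (illustrative stand-in, not the BasicSR code)."""

    def __init__(self, name):
        self._name = name
        self._obj_map = {}

    def register(self, obj):
        # Used as a decorator: runs only when the defining module is imported.
        self._obj_map[obj.__name__] = obj
        return obj

    def get(self, name, suffix="basicsr"):
        obj = self._obj_map.get(name)
        if obj is None:
            # Fall back to the suffixed name, mirroring the message in the log.
            fallback = f"{name}_{suffix}"
            print(f"Name {name} is not found, use name: {fallback}!")
            obj = self._obj_map.get(fallback)
        if obj is None:
            raise KeyError(f"No object named '{name}' found in '{self._name}' registry!")
        return obj


DATASET_REGISTRY = Registry("dataset")


@DATASET_REGISTRY.register
class DemoDataset:
    """Hypothetical dataset that was registered."""


# A registered name resolves to its class.
found = DATASET_REGISTRY.get("DemoDataset")

# A name that was never registered reproduces the failure pattern from the log.
try:
    DATASET_REGISTRY.get("FFHQBlindJointDataset")
    missing_error = None
except KeyError as err:
    missing_error = err
```

Under this reading, the error means that no class named `FFHQBlindJointDataset` was registered in the `basicsr` package that actually got imported. The traceback shows `basicsr/train.py` running from the project directory while `build_dataset` and the registry come from `site-packages`, so it may be worth checking which copy is loaded (for example by printing `basicsr.__file__`) and whether that copy contains the dataset module. This is an assumption based on the paths in the log, not a confirmed diagnosis.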
