LTOS

Layout-controllable Text-Object Synthesis via Adaptive Cross-attention Fusions

training

run CUDA_VISIBLE_DEVICES=0,1,2 python -m torch.distributed.launch --nproc_per_node=3 --master_port=22411 main.py --yaml_file=configs/text_object_control.yaml --DATA_ROOT=./DATA --batch_size=8 --name=cross_fusion_model --official_ckpt_name="checkpoint_generation_text.pth"

--name: saved filename --official_ckpt_name: pretrained model (you should download the GLIGEN checkpoint from (https://huggingface.co/gligen/gligen-generation-text-box/blob/main/diffusion_pytorch_model.bin))

inference

run python inference.py

dataset_contruction

in data_construction/README.md

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
cldm		cldm
configs		configs
data_construction		data_construction
dataset		dataset
fonts		fonts
grounding_input		grounding_input
ldm		ldm
ocr_reg		ocr_reg
README.md		README.md
SD_input_conv_weight_bias.pth		SD_input_conv_weight_bias.pth
convert_ckpt.py		convert_ckpt.py
distributed.py		distributed.py
glyph_instructions.yaml		glyph_instructions.yaml
inference.py		inference.py
main.py		main.py
projection_matrix		projection_matrix
render_images.py		render_images.py
requirements.txt		requirements.txt
trainer.py		trainer.py
tsv_split_merge.py		tsv_split_merge.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LTOS

training

inference

dataset_contruction

About

Releases

Packages

Languages

Layort/TOF

Folders and files

Latest commit

History

Repository files navigation

LTOS

training

inference

dataset_contruction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages