Hi, thanks for making the code public!
I found a minor bug here. The variable self.pos_embed keeps the CPU copy of the positional embedding, which is the root cause of the .to() call you need during the forward pass. To fix it, you can instead write x = x + self.pos_embed_1, where self.pos_embed_1 is the GPU copy automatically created by PyTorch.
This bug adds CPU-GPU communication time during training, though I am not sure how much it costs in practice.
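For context, here is a minimal sketch of the pattern being described. The class name, tensor shapes, and the way pos_embed_1 is created below are assumptions for illustration, not the actual repo code:

```python
import torch
import torch.nn as nn

class PosEmbedExample(nn.Module):
    """Hypothetical module illustrating the reported pattern."""

    def __init__(self, num_tokens=197, dim=768):
        super().__init__()
        # Plain tensor attribute: NOT registered with the module, so
        # model.to("cuda") leaves it on the CPU.
        self.pos_embed = torch.zeros(1, num_tokens, dim)
        # Registered parameter: PyTorch moves it with the module, so it
        # ends up on the GPU automatically after model.to("cuda").
        self.pos_embed_1 = nn.Parameter(torch.zeros(1, num_tokens, dim))

    def forward(self, x):
        # Buggy version: forces a CPU-to-GPU transfer on every forward pass.
        # x = x + self.pos_embed.to(x.device)
        # Fixed version: the parameter already lives on x's device.
        x = x + self.pos_embed_1
        return x
```

With this pattern, model.to("cuda") moves pos_embed_1 along with the rest of the module, so no per-batch transfer is needed.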