It is pretty slow for copying LoD information between operators. For resnet it will cost roughly 10% time of whole time, including reading data.