According to the publication a ResNet50 can be trained for the Appearance Embedding. How are this parameters trained?