Training process

Hi @dragonfly90!
Many thanks for this awesome repo.
Now I'm trying to make my own version of Realtime MultiPerson Pose Estimation, and I have some questions about your code and your approaches:
1. Why did you replace LinearRegression from the previous version with your own loss, calculated as sqr(prediction - label) * mask? What is the mask needed for? How is the final loss in `MakeLoss(loss_symbol)` calculated? Is it equal to mean(sqr(...))?
2. Why do you augment the data the way you do? Why do we need to crop the images instead of just resizing them?
3. Here is some code from your batch generator:
`for i in range(self._batch_size):`
`image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch]])`
For clarity, let the batch size be 4. As I understand it, you create the same image and maps 4 times and put them into one batch, so the batch contains 4 identical instances. Why? Or is this a mistake, and it should be something like `image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch * self._batch_size + i]])`?
4. Have you tried using a learning rate scheduler?
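
To make question 3 concrete, here is a minimal sketch of the two indexing schemes. The helper names `repeated_indices` and `batch_indices` are hypothetical and not from the repository, and I am assuming that `cur_batch` advances by 1 per batch; the wrap-around at the end of the data is also my own assumption.

```python
def repeated_indices(cur_batch, batch_size):
    """Indices read when every slot uses self.keys[self.cur_batch]."""
    return [cur_batch for _ in range(batch_size)]


def batch_indices(cur_batch, batch_size, num_samples):
    """Indices read with the proposed cur_batch * batch_size + i scheme.

    Hypothetical helper: the modulo keeps the last batch full by wrapping
    around to the start of the data, which is an assumption.
    """
    start = cur_batch * batch_size
    return [(start + i) % num_samples for i in range(batch_size)]


print(repeated_indices(2, 4))    # [2, 2, 2, 2]
print(batch_indices(2, 4, 100))  # [8, 9, 10, 11]
```

With the first scheme, every slot in a batch reads the same key, so the samples can only differ if `getImageandLabel` applies random augmentation on each call.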

I ask these questions because I can't train my net: it always predicts a constant (a trainable constant; the loss decreases to some value and then stops changing). The model I use is the same as yours. I also tried changing each of the possible network parameters, but nothing helps.
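
Since the loss is what stalls in my runs, here is how I currently understand the masked loss from question 1, written in plain NumPy rather than MXNet symbols. The arrays are made-up toy values, and both reductions are shown because which one `MakeLoss` effectively uses is exactly what I am asking.

```python
import numpy as np


def masked_squared_error(prediction, label, mask):
    # Element-wise sqr(prediction - label) * mask: positions where the
    # mask is 0 contribute nothing to the loss.
    return np.square(prediction - label) * mask


# Toy 2x2 "heatmap"; the bottom-right pixel is masked out.
prediction = np.array([[0.5, 0.0], [1.0, 2.0]])
label = np.array([[1.0, 0.0], [1.0, 0.0]])
mask = np.array([[1.0, 1.0], [1.0, 0.0]])

per_element = masked_squared_error(prediction, label, mask)
print(per_element.sum())   # 0.25
print(per_element.mean())  # 0.0625
```

The large error at the masked pixel (2.0 vs 0.0) does not show up in either number, which is what I assume the mask is for.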

Thanks in advance for your answer, and sorry for my bad English :)

Training process #11
