Hi @dragonfly90!
Many thanks for this awesome repository.
Now I'm trying to make my own version of Realtime MultiPerson Pose Estimation, and I have some questions about your code and your approaches:
- Why did you change LinearRegression from the previous version to your own loss, calculated as sqr(prediction-label) * mask? Why is the mask needed? How is the final loss in MakeLoss(loss_symbol) calculated? Is it equal to mean(sqr(...))?
- Why do you augment the data the way you do? Why do we need to crop the images rather than just resize them?
- Here is some code from your batch generator:
for i in range(self._batch_size):
    image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch]])
For clarity, suppose the batch size is 4. As I understand it, this loads the same image and maps 4 times, so the batch contains 4 identical instances. Why? Or is this a mistake, and it should be something like image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch * self._batch_size + i]])?
- Did you try using a learning rate scheduler?
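To make the batch generator question concrete, here is a minimal sketch of the two indexing schemes as I understand them. The function names are hypothetical and only for illustration, and I assume that self.cur_batch is not modified inside the loop and is incremented once per batch; if getImageandLabel applies random augmentation, the 4 instances would differ in content even with the same key.

```python
def indices_as_written(cur_batch, batch_size):
    # Index used in the current loop: self.keys[self.cur_batch].
    # It does not depend on i, so every slot of the batch gets the same key.
    return [cur_batch for _ in range(batch_size)]


def indices_proposed(cur_batch, batch_size):
    # Index I would expect: self.keys[self.cur_batch * self._batch_size + i].
    # Each slot of the batch gets a different key.
    return [cur_batch * batch_size + i for i in range(batch_size)]


# Second batch (cur_batch = 1) with a batch size of 4.
print(indices_as_written(1, 4))  # [1, 1, 1, 1]
print(indices_proposed(1, 4))    # [4, 5, 6, 7]
```

With the first scheme, one epoch would also visit only one key per batch instead of batch_size keys, unless the counter is advanced somewhere I have missed.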
I ask these questions because I can't train my network: it always predicts a constant (a learned constant; the loss decreases to some value and then stops changing). The model I use is the same as yours. I have also tried changing each of the network's parameters, but nothing helps.
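For reference, this is how I currently read the loss from the first question, sqr(prediction-label) * mask, written as a small NumPy sketch rather than with the actual symbols from your code. The function name and the toy arrays are my own, and the two reductions at the end are just the candidates I can think of; which one (if either) MakeLoss effectively uses is exactly what I am asking.

```python
import numpy as np


def masked_squared_error(prediction, label, mask):
    # Element-wise squared error; positions where mask == 0 contribute nothing.
    return np.square(prediction - label) * mask


prediction = np.array([[0.5, 2.0], [1.0, 3.0]])
label = np.array([[0.0, 2.0], [0.0, 0.0]])
mask = np.array([[1.0, 1.0], [0.0, 0.0]])  # second row is masked out

per_element = masked_squared_error(prediction, label, mask)

# Two possible reductions of the per-element loss to a single number.
loss_sum = per_element.sum()
loss_mean = per_element.mean()

print(per_element)
print(loss_sum, loss_mean)
```

In this toy example the second row has large errors, but they are ignored because the mask is zero there.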
Thanks in advance for your answer, and sorry for my bad English :)