Training process #11

@abelyaev-vmk

Hi @dragonfly90!
Many thanks for this awesome repo.
I'm now trying to make my own version of Realtime MultiPerson Pose Estimation, and I have some questions about your code and your approach:

  1. Why did you replace LinearRegression from the previous version with your own loss, calculated as sqr(prediction-label) * mask? What is the mask needed for? How is the final loss in MakeLoss(loss_symbol) calculated: is it equal to mean(sqr(...))?
  2. Why do you augment the data the way you do? Why do we need to crop images rather than just resize them?
  3. Here is some code from your batch generator:
    for i in range(self._batch_size):
        image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch]])
    For a concrete example, let the batch size be 4. As I understand it, you create 4 identical images and maps and put them into a batch, so one batch holds 4 identical instances. Why? Or is this a mistake, and should it be something like image, mask, heatmap, pagmap = getImageandLabel(self.data[self.keys[self.cur_batch * self._batch_size + i]])?
  4. Have you tried using a learning rate scheduler?
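
To make question 3 concrete, here is a minimal sketch of the two indexing schemes. It does not use the real generator: `keys` is a hypothetical list of integers standing in for `self.keys`, no images are loaded, and I assume `cur_batch` is not modified inside the loop.

```python
# Hypothetical stand-in for the generator state (not the repo's real data).
keys = list(range(8))   # pretend dataset with 8 samples
batch_size = 4
cur_batch = 1           # we are building the second batch

# Indexing as in the quoted snippet: the index does not depend on i,
# so every slot of the batch refers to the same sample.
same_index = [keys[cur_batch] for i in range(batch_size)]

# Indexing I would expect: each i selects a different sample.
per_sample = [keys[cur_batch * batch_size + i] for i in range(batch_size)]

print(same_index)   # [1, 1, 1, 1]
print(per_sample)   # [4, 5, 6, 7]
```

If getImageandLabel applies random augmentation, the four entries in the first case would be differently augmented copies of one image rather than exact duplicates, but they would still all come from the same sample.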

I'm asking because I can't get my net to train: it always predicts a constant (a trainable constant; the loss decreases to some value and then stops changing). The model I use is the same as yours. I have also tried changing each of the network parameters, but nothing helps.
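
For reference, this is how I currently read the loss from question 1, written as a small plain-Python sketch. The function name `masked_squared_loss` and the toy values are my own and are not taken from the repo. Whether the final value is the sum or the mean of these terms is exactly what I am unsure about, so the sketch prints both.

```python
def masked_squared_loss(prediction, label, mask):
    """Per-element sqr(prediction - label) * mask (hypothetical helper)."""
    return [(p - l) ** 2 * m for p, l, m in zip(prediction, label, mask)]

# Toy values: the third element has a large error but is masked out.
prediction = [0.5, 0.0, 1.0, 0.25]
label      = [1.0, 0.0, 0.0, 0.25]
mask       = [1.0, 1.0, 0.0, 1.0]

per_element = masked_squared_loss(prediction, label, mask)
total = sum(per_element)
mean = total / len(per_element)

print(per_element)   # [0.25, 0.0, 0.0, 0.0]
print(total, mean)   # 0.25 0.0625
```

In this reading, positions where the mask is 0 contribute nothing to the loss, regardless of how wrong the prediction is there.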

Thanks in advance for your answer, and sorry for my bad English :)
