Not sure why n-step target does not work well. DQN + 3-step target  DQN + 1-step target 