This issue is related to #775 and #1891. I'll try to fix it. On the other hand, is it necessary to enable gradient and error clipping in gru-memory and lstm-memory which are implemented in one cpp file (not in the recurrent layer group)? @hedaoyuan @lcy-seso
This issue is related to #775 and #1891. I'll try to fix it. On the other hand, is it necessary to enable gradient and error clipping in gru-memory and lstm-memory which are implemented in one cpp file (not in the recurrent layer group)? @hedaoyuan @lcy-seso