We observe a consistent performance gap when training AdaMixer with mmcv_full==1.3.5, especially with the longer training schedule. The same problem may also affect other versions with mmcv_full>1.3.3.
To reproduce our results correctly, please use mmcv_full==1.3.3. We are actively investigating the cause and will post updates in this issue.
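
As a quick guard before launching a training run, the installed version can be compared against the recommended one. The snippet below is only a minimal sketch: the helper names (`parse_version`, `is_pinned_version`, `check_mmcv`) are hypothetical and not part of this repository, and it assumes the package is importable as `mmcv` and exposes `mmcv.__version__`.

```python
import re
import warnings

PINNED = "1.3.3"  # version recommended above for reproduction


def parse_version(version):
    """Return the leading numeric components of a version string as a tuple."""
    match = re.match(r"\d+(?:\.\d+)*", version.strip())
    if match is None:
        raise ValueError(f"cannot parse version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def is_pinned_version(installed, pinned=PINNED):
    """True if the installed version matches the pinned one exactly."""
    return parse_version(installed) == parse_version(pinned)


def check_mmcv():
    """Warn if the installed mmcv differs from the pinned version (hypothetical helper)."""
    import mmcv  # assumption: mmcv-full is installed in the current environment

    if not is_pinned_version(mmcv.__version__):
        warnings.warn(
            f"mmcv {mmcv.__version__} detected; results were reproduced "
            f"with mmcv_full=={PINNED}."
        )
```

Calling `check_mmcv()` at the top of a training script would emit a warning rather than abort, so runs with other versions remain possible.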