@pzelasko we are still seeing sawtooth patterns in the loss when we use BucketingSampler, even with fewer buckets.
I think it's because the individual buckets are sorted by length. Would it be possible to shuffle within each bucket, or to always pick each batch at random from either the front or the back of the bucket? Or perhaps each bucket could be either reversed or left as-is, chosen randomly or alternately.
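
To make the suggestion concrete, here is a rough sketch in plain Python. It is not the actual BucketingSampler code: the helper names `randomize_bucket_order` and `draw_batches_from_ends` are made up for illustration, and a bucket is assumed to be just a list of items already sorted by length.

```python
import random
from collections import deque


def randomize_bucket_order(buckets, rng):
    """Hypothetical helper: reverse each length-sorted bucket with probability 0.5."""
    return [list(reversed(b)) if rng.random() < 0.5 else list(b) for b in buckets]


def draw_batches_from_ends(bucket, batch_size, rng):
    """Hypothetical helper: yield batches taken at random from the front
    or the back of a length-sorted bucket until it is exhausted."""
    items = deque(bucket)
    while items:
        take = min(batch_size, len(items))
        if rng.random() < 0.5:
            # Take the shortest remaining items.
            batch = [items.popleft() for _ in range(take)]
        else:
            # Take the longest remaining items, kept in ascending order.
            batch = [items.pop() for _ in range(take)][::-1]
        yield batch


rng = random.Random(0)
bucket = list(range(10))  # stand-in for cuts sorted by duration
for batch in draw_batches_from_ends(bucket, batch_size=3, rng=rng):
    print(batch)
```

Either variant keeps batches homogeneous in length (items are still taken from contiguous runs of the sorted bucket), but the sequence of batch lengths over an epoch is no longer monotonic, which is what I suspect is producing the sawtooth.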