Abstract: We present techniques for scaling Swin Transformer [35] up to 3 billion parameters and making it capable of training with images of up to 1,536x1,536 resolution. By scaling up capacity and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results