Abstract: Vision Transformers (ViTs) have computational costs scaling quadratically with the number of tokens, calling for effective token pruning policies. Most existing policies are handcrafted, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results