EAPT: Efficient Attention Pyramid Transformer for Image Processing

Lin, Xiao; Sun, Shuzhou; Huang, Wei; Sheng, Bin; Li, Ping; Feng, Dagan

doi:10.1109/tmm.2021.3120873

Public

EAPT: Efficient Attention Pyramid Transformer for Image Processing

Published in IEEE Transactions on Multimedia • Oct 20, 2021

Authors:

Xiao Lin

Shuzhou Sun

Wei Huang

Abstract

Recent transformer-based models, especially patch-based methods, have shown huge potentiality in vision tasks. However, the split fixed-size patches divide the input features into the same size patches, which ignores the fact that vision elements are often various and thus may destroy the semantic i...

View

Paywalled

Subject

Computer science

Artificial intelligence

Computer vision

Generate AI Take for this paper

Highlights, strengths & weaknesses, commercial applications, and societal impact — written for this paper on demand.

Subject

Generate AI Take for this paper

Discussions