场景文本检测识别

发布日期: 2023-06-09

2023-06-09 更新

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

Authors:Minghao Fu, Xin Man, Yihan Xu, Jie Shao

While scene text image super-resolution (STISR) has yielded remarkable improvements in accurately recognizing scene text, prior methodologies have placed excessive emphasis on optimizing performance, rather than paying due attention to efficiency - a crucial factor in ensuring deployment of the STISR-STR pipeline. In this work, we propose a novel Efficient Scene Text Image Super-resolution (ESTISR) Network for resource-limited deployment platform. ESTISR’s functionality primarily depends on two critical components: a CNN-based feature extractor and an efficient self-attention mechanism used for decoding low-resolution images. We designed a re-parameterized inverted residual block specifically suited for resource-limited circumstances as the feature extractor. Meanwhile, we proposed a novel self-attention mechanism, softmax shrinking, based on a kernel-based approach. This innovative technique offers linear complexity while also naturally incorporating discriminating low-level features into the self-attention structure. Extensive experiments on TextZoom show that ESTISR retains a high image restoration quality and improved STR accuracy of low-resolution images. Furthermore, ESTISR consistently outperforms current methods in terms of actual running time and peak memory consumption, while achieving a better trade-off between performance and efficiency.
PDF

点此查看论文截图

木子已

https://ipaper.today/2023/06/09/2023-06-09-chang-jing-wen-ben-jian-ce-shi-bie/

本博客所有文章除特別声明外，均采用 CC BY 4.0 许可协议。转载请注明来源木子已 !

场景文本检测识别

I2I Translation

2023-06-09 I2I Translation

I2I Translation

Few-Shot

2023-06-09 Few-Shot

Few-Shot

场景文本检测识别

2023-06-09 更新

ESTISR: Adapting Efficient Scene Text Image Super-resolution for Real-Scenes

打赏用于支持本站流量费