Abstract: In scene text detection, the effective fusion of the detail information and the semantic information is crucial for the accurate results. However, the upsampling operation used in existing ...