Swin-MFINet: Swin transformer based multi-feature integration network for detection of pixel-level surface defects
dc.authorid | UZEN, Huseyin/0000-0002-0998-2130 | |
dc.authorid | Hanbay, Davut/0000-0003-2271-7865 | |
dc.authorwosid | UZEN, Huseyin/CZK-0841-2022 | |
dc.authorwosid | Hanbay, Davut/AAG-8511-2019 | |
dc.contributor.author | Uzen, Huseyin | |
dc.contributor.author | Turkoglu, Muammer | |
dc.contributor.author | Yanikoglu, Berrin | |
dc.contributor.author | Hanbay, Davut | |
dc.date.accessioned | 2024-08-04T20:52:13Z | |
dc.date.available | 2024-08-04T20:52:13Z | |
dc.date.issued | 2022 | |
dc.department | İnönü Üniversitesi | en_US |
dc.description.abstract | Automatic surface defect detection is critical for manufacturing industries such as steel, fabric, and marble. This study proposes a Swin transformer-based model called the Multi-Feature Integration Network (Swin-MFINet) for pixel-level surface defect detection. The proposed model consists of an encoder, a Swin transformer-based decoder, and Multi-Feature Integration (MFI) modules. In the encoder, a pre-trained Inception network is used to extract key features from small datasets. In the decoder, global semantic features are obtained from these initial features using Swin transformer blocks, a recent transformer architecture. In addition, a convolution layer is used in the last step of the decoder, since transformers are limited in capturing small spatial details such as edges, colors, and textures, which are important for detecting some small defects. In the final module, MFI, feature maps from different decoder stages are combined, and a channel squeeze-spatial excitation block is applied to emphasize important features. Finally, a prediction map is obtained by applying a convolution layer followed by a sigmoid activation function to the MFI module output. The performance of the proposed model is analyzed on the MT and MVTec datasets, which contain surface defect images. The proposed model obtained mIoU scores of 81.37% and 77.07%, respectively, on these two datasets. These results outperform the state of the art for the surface defect detection problem. | en_US |
dc.identifier.doi | 10.1016/j.eswa.2022.118269 | |
dc.identifier.issn | 0957-4174 | |
dc.identifier.issn | 1873-6793 | |
dc.identifier.scopus | 2-s2.0-85135376977 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.uri | https://doi.org/10.1016/j.eswa.2022.118269 | |
dc.identifier.uri | https://hdl.handle.net/11616/100818 | |
dc.identifier.volume | 209 | en_US |
dc.identifier.wos | WOS:000888796100009 | en_US |
dc.identifier.wosquality | Q1 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.language.iso | en | en_US |
dc.publisher | Pergamon-Elsevier Science Ltd | en_US |
dc.relation.ispartof | Expert Systems With Applications | en_US |
dc.relation.publicationcategory | Article - International Peer-Reviewed Journal - Institutional Faculty Member | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Pixel-Level Surface Defects Detection | en_US |
dc.subject | Swin Transformers | en_US |
dc.subject | Encoder-Decoder Network | en_US |
dc.subject | Convolutional Neural Network | en_US |
dc.title | Swin-MFINet: Swin transformer based multi-feature integration network for detection of pixel-level surface defects | en_US |
dc.type | Article | en_US |
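
The abstract reports results as mIoU (mean intersection over union) computed on pixel-level prediction maps produced by a sigmoid output. The following is a minimal sketch of how such a score could be computed for one image. It is not the authors' evaluation code: the 0.5 threshold, the two-class setup (0 = background, 1 = defect), and the function names `binarize`, `iou_for_class`, and `mean_iou` are all assumptions made for illustration, and the record does not state the exact protocol used in the paper.

```python
from typing import List, Optional, Sequence

Mask = List[List[int]]


def binarize(prob_map: Sequence[Sequence[float]], threshold: float = 0.5) -> Mask:
    """Turn a sigmoid probability map into a 0/1 mask (threshold is an assumption)."""
    return [[1 if p >= threshold else 0 for p in row] for row in prob_map]


def iou_for_class(pred: Mask, target: Mask, cls: int) -> Optional[float]:
    """Intersection over union for one class; None if the class is absent from both masks."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            p_hit = p == cls
            t_hit = t == cls
            if p_hit and t_hit:
                inter += 1
            if p_hit or t_hit:
                union += 1
    return inter / union if union else None


def mean_iou(pred: Mask, target: Mask, classes: Sequence[int] = (0, 1)) -> float:
    """Average the per-class IoU over the classes present in either mask."""
    scores = [
        s for s in (iou_for_class(pred, target, c) for c in classes) if s is not None
    ]
    if not scores:
        raise ValueError("none of the requested classes occur in the masks")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Hypothetical 2x2 example: one defect pixel in the ground truth, two predicted.
    probabilities = [[0.9, 0.7], [0.1, 0.2]]
    ground_truth = [[1, 0], [0, 0]]
    print(mean_iou(binarize(probabilities), ground_truth))
```

In this toy example the defect class has an IoU of 1/2 and the background class an IoU of 2/3, so the mean is 7/12. A dataset-level score would typically average or accumulate over all test images, and which of the two is used is a further assumption the record does not settle.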