Research Hub

대학 자원

대학 인프라와 자원을 공유해 공동 연구와 기술 활용을 지원합니다.

Loading...

논문 리스트

2023
Integration of Multi-scale CAM and Attention for Weakly Supervised Defects Localization on Surface Defective Apple
(사)한국스마트미디어학회
김진영 외 2명
논문정보
Publisher
스마트미디어저널
Issue Date
2023-10-31
Keywords
-
Citation
-
Source
-
Journal Title
-
Volume
12
Number
9
Start Page
45
End Page
59
DOI
10.30693/SMJ.2023.12.9.45
ISSN
22871322
Abstract
Weakly supervised object localization (WSOL) is a task of localizing an object in an image using only image-level labels. Previous studies have followed the conventional class activation mapping (CAM) pipeline. However, we reveal the current CAM approach suffers from problems which cause original CAM could not capture the complete defects features. This work utilizes a convolutional neural network (CNN) pretrained on image-level labels to generate class activation maps in a multi-scale manner to highlight discriminative regions. Additionally, a vision transformer (ViT) pretrained was treated to produce multi-head attention maps as an auxiliary detector. By integrating the CNN-based CAMs and attention maps, our approach localizes defective regions without requiring bounding box or pixel-level supervision during training. We evaluate our approach on a dataset of apple images with only image-level labels of defect categories. Experiments demonstrate our proposed method aligns with several Object Detection models performance, hold a promise for improving localization.

저자 정보

이름 소속
김진영 지능전자컴퓨터공학과