https://www.mdu.se/

Automated annotation scheme for extending bounding box representation to detect ship locations.
Mälardalen University, School of Innovation, Design and Engineering.
2023 (English). Independent thesis, Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits. Student thesis.
Abstract [en]

Bounding boxes often provide limited information about the shape and location of an object in an image. They are poorly suited to representing objects that have complex shapes or are positioned at an angle. Related works introduce new object representations, including segmentation masks, keypoints, polylines, and regions, that are effective in capturing complex shapes and attributes of an object, but they lack the computational efficiency needed for real-time applications and require annotated datasets. The aim of the thesis is to propose an approach that extends the bounding box representation to include attributes of interest at a low computational cost. Moreover, the approach aims to automatically transform existing bounding boxes into a new object representation. As a result, the thesis is potentially beneficial for real-time applications that need a complex object representation at a small cost, as well as for creating new datasets from existing bounding box data and for detecting faulty bounding boxes. The approach consists of using a segmentation model to compose a new object representation from a bounding box.

The task of object detection is essential in computer vision applications such as autonomous driving, surveillance, and robotics. The traditional method of representing objects with bounding boxes has limitations in capturing complex shapes and attributes of an object. The motivation for this thesis is therefore to propose a low-cost approach that extends the bounding box representation to include attributes of interest. To address this problem, the proposed approach uses a segmentation model to compose a new object representation from a bounding box. The segmentation model generates a mask for the object, which can be used to extract more detailed features, such as object contours, keypoints, and regions.

The approach aims to automatically transform existing bounding boxes into a new object representation, which is potentially beneficial for real-time applications that require a complex object representation at a small cost, as well as for creating new datasets from existing bounding box data and for detecting faulty bounding boxes. In summary, the approach proposed in this thesis provides an efficient and automated way to improve object representation in computer vision tasks. The experimental results show that the proposed approach achieves better object detection accuracy than the traditional bounding box representation, especially for objects with complex shapes and attributes. The approach also has the potential to improve the efficiency of real-time applications that require a complex object representation. Overall, this thesis contributes to the development of more accurate and efficient computer vision systems for various applications.
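The idea of deriving extra attributes from a segmentation mask and of flagging faulty boxes can be illustrated with a minimal sketch. It is not the thesis implementation: it assumes a binary mask in full-image coordinates has already been produced by some segmentation model, and the function names (`mask_to_box`, `box_iou`, `extend_box`), the choice of attributes (centroid and principal-axis angle), and the IoU threshold of 0.5 are all hypothetical choices made for illustration.

```python
import numpy as np

def mask_to_box(mask):
    """Tight axis-aligned box (x_min, y_min, x_max, y_max) around a binary mask."""
    ys, xs = np.nonzero(mask)
    if xs.size == 0:
        return None
    return (int(xs.min()), int(ys.min()), int(xs.max()) + 1, int(ys.max()) + 1)

def box_iou(a, b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    iw = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0

def extend_box(box, mask, iou_threshold=0.5):
    """Attach mask-derived attributes to an existing bounding box.

    Assumption: `mask` is a 2D boolean array for the same image as `box`.
    """
    ys, xs = np.nonzero(mask)
    if xs.size < 2:
        # No usable mask: nothing to derive, so mark the box as suspect.
        return {"box": box, "suspect": True}
    pts = np.stack([xs, ys], axis=1).astype(float)
    centroid = pts.mean(axis=0)
    # Principal axis of the mask pixels: eigenvector of the covariance
    # matrix that belongs to the largest eigenvalue.
    cov = np.cov((pts - centroid).T)
    eigvals, eigvecs = np.linalg.eigh(cov)
    major = eigvecs[:, np.argmax(eigvals)]
    angle = float(np.degrees(np.arctan2(major[1], major[0])) % 180.0)
    tight = mask_to_box(mask)
    iou = box_iou(box, tight)
    return {
        "box": box,
        "tight_box": tight,
        "centroid": (float(centroid[0]), float(centroid[1])),
        "angle_deg": angle,  # orientation of the long axis, in image coordinates
        "iou_with_mask": iou,
        "suspect": iou < iou_threshold,  # hypothetical faulty-box criterion
    }

# Toy example: a thin horizontal object in a 10 x 20 image.
mask = np.zeros((10, 20), dtype=bool)
mask[4:6, 2:18] = True
good = extend_box((2, 4, 18, 6), mask)
loose = extend_box((0, 0, 20, 10), mask)
```

In this toy case the box that matches the mask is kept, while the box covering the whole image has a low overlap with the mask extent and is marked as suspect. A real pipeline would replace the hand-made mask with the output of a segmentation model prompted or cropped by the bounding box.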

Place, publisher, year, edition, pages
2023. p. 44
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:mdh:diva-64420
OAI: oai:DiVA.org:mdh-64420
DiVA, id: diva2:1802261
External cooperation
Groke Technologies Oy
Supervisors
Examiners
Available from: 2023-10-09. Created: 2023-10-04. Last updated: 2023-10-09. Bibliographically approved.

Open Access in DiVA

fulltext (18515 kB), 173 downloads
File information
File name: FULLTEXT01.pdf. File size: 18515 kB. Checksum (SHA-512):
4a15abb60435d0f660857308713a1735ab14b464c2fe3871901f0d979b53f5cc19a97e4fce4417f28b5d8a05c952d5c82d612bc1bced91600fcd5da6d1c48706
Type: fulltext. Mimetype: application/pdf

Total: 173 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.

Total: 499 hits