Automating the Extraction of Data from Semi-Structured texts Using an NLP-based Approach
Mälardalen University, School of Innovation, Design and Engineering.
2023 (English)
Independent thesis Advanced level (degree of Master (Two Years)), 20 credits / 30 HE credits
Student thesis
Alternative title
Automation of Data Extraction from Semi-Structured texts Using NLP (English)
Abstract [en]

In several domains, there is a growing demand for automation software that enhances process efficiency and reliability. The railway industry is a notable example, given its safety-critical nature and the importance it places on reliability and data integrity. To tackle the complexity of modern trains, engineers must use different notations to specify the intended system and subsystems during development. Managing these notations and ensuring accurate translations between them poses significant challenges. Manual revisions and translations are time-consuming, costly, and prone to human error, potentially introducing faults into the system. Automating the extraction of relevant information from these documents can help address these challenges, leading to improved efficiency and accuracy in the development process.

In this thesis, we designed and developed an NLP-based framework for the semi-automatic translation of semi-structured texts into structured data. The framework focuses on ensuring the integrity and reliability of the translated data. We validate the proposed framework on an industrial use case from the railway domain provided by our partner Alstom Rail Sweden AB.

Place, publisher, year, edition, pages
2023, p. 28
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:mdh:diva-63637
OAI: oai:DiVA.org:mdh-63637
DiVA, id: diva2:1776190
External cooperation
Alstom Rail Sweden AB
Subject / course
Miscellaneous
Supervisors
Examiners
Available from: 2023-06-28. Created: 2023-06-27. Last updated: 2023-06-28. Bibliographically approved.

Open Access in DiVA

No full text in DiVA
