https://www.mdu.se/

mdu.sePublications
121 of 2
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Machine Learning for Predictive Modeling and Abstraction in Industrial-Scale Systems
Mälardalen University, Faculty of Engineering and Health Sciences, Department of Computer Science & Engineering.ORCID iD: 0009-0006-2745-4282
2026 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Machine learning is increasingly called upon to guide decisionsin critical industrial applications. Its predictive powerpromises gains in efficiency, yet its black-box nature and lackof guarantees pose risks in contexts where behavior must remainanalyzable and safe. This thesis asks how machine learning can bemade trustworthy, explainable, and efficient enough for engineersto deploy in practice. Three gaps hamper broader adoption. Few works provide formal orstatistical guarantees on ML outputs paired with explanationsthat engineers can act on (Gap~A). Data-driven models thatgeneralize across hardware configurations without retrainingremain rare, and existing simulators are prohibitively slow(Gap~B). Many contributions address individual components ofindustrially motivated problems without combining them intovalidated end-to-end pipelines (Gap~C). To address Gap~A, we apply abstraction to neural networks,showing that inputs with negligible effect on the output can beformally identified and removed, producing simpler yet boundedmodels open to verification. We then introduce a conformalprediction framework for CPU load forecasting that providesstatistically guaranteed coverage intervals, combined withShapley value analysis to trace individual task contributions tothe predicted load. To address Gap~B, we develop a data-drivencache memory surrogate using long short-term memory networks,reproducing cache miss distributions across unseen hardwareconfigurations at a fraction of the simulator's computationalcost. To address Gap~C, we present HASCO, a Hybrid AI SimulationCompiler that translates natural language accident reports intoexecutable vehicular simulation scenarios through a structuredcompilation approach with deterministic validation. Together, these contributions establish a path toward machinelearning that is not merely powerful but trustworthy, explainable,and practically deployable in the industrial workflow.

Place, publisher, year, edition, pages
Västerås: Mälardalens universitet, 2026.
Series
Mälardalen University Press Licentiate Theses, ISSN 1651-9256 ; 384
Keywords [en]
Machine learning; Embedded systems; Trustworthy AI; Explainability; Conformal prediction; Neural network verification; Cache simulation; Scenario-based testing; Industrial systems
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:mdh:diva-76650ISBN: 978-91-7485-758-0 (print)OAI: oai:DiVA.org:mdh-76650DiVA, id: diva2:2055954
Presentation
2026-06-15, room R2-141, Mälardalens universitet, Västerås, 13:15 (English)
Opponent
Supervisors
Funder
Knowledge Foundation, 20220033Knowledge Foundation, 20190038Knowledge Foundation, 20230147
Note

Compilation thesis (sammanläggningsavhandling) comprising four papers. The included papers are: Paper A (LNCS 15250, in press), Paper B (COMPSAC 2025), Paper C (STTT 2025), Paper D (AEiC 2026, accepted).

Available from: 2026-05-13 Created: 2026-04-27 Last updated: 2026-05-25Bibliographically approved
List of papers
1. Abstraction-based Reduction of Input Size for Neural Networks
Open this publication in new window or tab >>Abstraction-based Reduction of Input Size for Neural Networks
Show others...
2026 (English)In: Research in Advanced Low-Code/No-Code Application Development / [ed] Tiziana Margaria, Cham, Switzerland, 2026Conference paper, Oral presentation with published abstract (Refereed)
Abstract [en]

Machine learning is an increasingly popular method for modeling complex systems. A common machine learning model is the neural network, which can be trained to represent complicated functions to a high accuracy. However, neural networks often grow large and complex. Recent work is looking at how to abstract networks to yield simpler representations while retaining some property of the original network — for instance, such that for every input the abstracted network's output is at least as large as the original. In this work, we build on previous ideas and extend them to also consider the input layer. Sometimes the input vector has a large size while only a few of the elements are significant in the computation of the output. We propose to use a trained neural network model to identify insignificant input elements, i.e., elements which do not contain important information. We show how the presented abstraction method for the input layer can be utilized to achieve this.

Place, publisher, year, edition, pages
Cham, Switzerland: , 2026
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 15250
Keywords
Neural network; Abstraction; Dimensionality reduction; Feature selection; Formal verification; Marabou
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-76649 (URN)
Conference
AISoLA 2023 (R@ISE track), Crete, Greece, October 23–28, 2023
Funder
Knowledge Foundation, 20220033
Note

Accepted for publication. Volume production delayed; acceptance confirmed by volume editor Prof. Tiziana Margaria (University of Limerick), March 12, 2026.

Available from: 2026-04-27 Created: 2026-04-27 Last updated: 2026-04-30Bibliographically approved
2. A Conformal Prediction-Based Framework for CPU Load Forecasting: A Black-Box Approach
Open this publication in new window or tab >>A Conformal Prediction-Based Framework for CPU Load Forecasting: A Black-Box Approach
Show others...
2025 (English)In: Proceedings - 2025 IEEE 49th Annual Computers, Software, and Applications Conference, COMPSAC 2025, Institute of Electrical and Electronics Engineers (IEEE) , 2025, p. 361-370Conference paper, Published paper (Refereed)
Abstract [en]

To address safety concerns in industrial systems, we propose a framework for forecasting CPU load with respect to a predetermined threshold, allowing customers to add tasks from a predefined library. Existing tools, akin to Windows Task Manager, provide limited insights due to their aggregate nature and high computational overhead. Our approach uses conformal prediction for rapid uncertainty-aware forecasts and Shapley value analysis to quantify individual task contributions to the CPU load. This proof-of-concept framework improves system safety assessment by addressing key research questions in load prediction and validation, paving the way for refined measurement methodologies in industrial applications.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Series
IEEE Annual Computer Software and Applications Conference Workshops, ISSN 2836-3795
Keywords
Conformal Prediction, Cpu, Forecasting, Load, Shapley, Accident Prevention, Artificial Intelligence, Electric Load Forecasting, Industrial Research, Uncertainty Analysis, Black Box Approach, Conformal Predictions, Industrial Systems, Load Forecasting, Prediction-based, Safety Concerns, Task Managers, Loading
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-73410 (URN)10.1109/COMPSAC65507.2025.00056 (DOI)001575960000048 ()2-s2.0-105016185844 (Scopus ID)9798331574345 (ISBN)
Conference
49th IEEE Annual Computers, Software, and Applications Conference, COMPSAC 2025, Toronto, Canada, 8-11 July, 2025
Available from: 2025-09-24 Created: 2025-09-24 Last updated: 2026-04-27Bibliographically approved
3. Machine learning-based cache miss prediction
Open this publication in new window or tab >>Machine learning-based cache miss prediction
Show others...
2025 (English)In: International Journal on Software Tools for Technology Transfer, ISSN 1433-2779, E-ISSN 1433-2787, Vol. 27, p. 53-80Article in journal (Refereed) Published
Abstract [en]

Integrating machine learning into computer architecture simulation offers a new approach to performance analysis, moving away from traditional algorithmic methods. While existing simulators accurately replicate hardware, they often suffer from slow execution, complex documentation, and require deep CPU knowledge, limiting their usability for quick insights. This paper presents a deep learning-based approach for simulating a key CPU component, cache memory. Our model "learns" cache characteristics by observing cache miss distributions, without needing detailed manual modeling. This method accelerates simulations and adapts to different program needs, demonstrating accuracy comparable to traditional simulators. Tested on Sysbench and image processing algorithms, it shows promise for faster, scalable, and hardware-independent simulations.

Place, publisher, year, edition, pages
Springer Nature, 2025
Keywords
Machine learning, Cache, Simulation
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-71286 (URN)10.1007/s10009-025-00800-6 (DOI)001472171800001 ()2-s2.0-105005271654 (Scopus ID)
Available from: 2025-04-30 Created: 2025-04-30 Last updated: 2026-04-30Bibliographically approved
4. HASCO: A Hybrid AI Simulation Compiler for Semantic Accident Reconstruction
Open this publication in new window or tab >>HASCO: A Hybrid AI Simulation Compiler for Semantic Accident Reconstruction
Show others...
2026 (English)In: Proceedings of the 30th Ada-Europe International Conference on Reliable Software Technologies, Dagstuhl, Germany, 2026Conference paper, Published paper (Refereed)
Abstract [en]

The validation of Automated Driving Systems (ADSs) has shifted from distance-based metrics to Scenario-Based Testing (SBT). Large Language Models (LLMs) have emerged as powerful tools with potential for generating vehicular scenarios at scale. However, generative models used for direct simulation synthesis produce inadequate output, therefore necessitating a more structured compilation approach. We present HASCO (Hybrid AI Simulation COmpiler), a system that translates natural-language driving scene specifications into executable simulation artifacts (XOSC/XODR files) for the esmini/OpenSCENARIO ecosystem. While LLMs excel at narrative parsing, we demonstrate that direct synthesis of simulation artifacts fails in the vast majority of cases due to hallucinated physics or schema violations. To resolve this, HASCO treats scenario creation as a compilation task rather than a generative one. The pipeline supports three compilation paths: direct synthesis, a Python intermediate (via scenariogeneration), and an ontology-guided path that grounds intent into an intermediate representation before compilation. We further evaluate a self-judging mechanism for automated repair. Across six operating modes evaluated on 40 real-world accident reports, the ontology-guided and Python-based compilers achieve 95% and 90% executability rates, respectively, compared to 5% for direct synthesis. We additionally evaluate outputs on semantic fidelity, positioning HASCO as a robust tool for forensic scene reconstruction.

Place, publisher, year, edition, pages
Dagstuhl, Germany: , 2026
Series
Open Access Series in Informatics (OASIcs), ISSN 2190-6807, E-ISSN 2190-6807
Keywords
Scenario-based testing; Large language models; OpenSCENARIO; OpenDRIVE; Automated driving systems; Accident reconstruction; Simulation compiler; HASCO
National Category
Artificial Intelligence
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-76644 (URN)
Conference
30th Ada-Europe International Conference on Reliable Software Technologies (AEiC 2026), 9-12 June 2026, Västerås, Sweden
Funder
Knowledge Foundation, 20220033
Note

Accepted for publication. Camera-ready version submitted. Volume in production at Schloss Dagstuhl – Leibniz-Zentrum für Informatik.

Available from: 2026-04-27 Created: 2026-04-27 Last updated: 2026-06-09Bibliographically approved

Open Access in DiVA

fulltext(7092 kB)30 downloads
File information
File name FULLTEXT03.pdfFile size 7092 kBChecksum SHA-512
005ed23aabdcd7f974a7734c262ae4b264bdb34ebe408dff3a4a85057c18ca2df6ed94fe6a22ae1326b39c5e61fb2adfd0eba651b407d9757ca35886f0168986
Type fulltextMimetype application/pdf

Authority records

Jelacic, Edin

Search in DiVA

By author/editor
Jelacic, Edin
By organisation
Department of Computer Science & Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 30 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 139 hits
121 of 2
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf