https://www.mdu.se/

mdu.sePublications
Change search
Link to record
Permanent link

Direct link
Ahmed, Mobyen Uddin, DrORCID iD iconorcid.org/0000-0003-3802-4721
Alternative names
Biography [swe]

Mobyen Uddin Ahmed is Senior Lecturer/Assistant Professor in Computer Science and Artificial Intelligence at Artificial Intelligence and Intelligent Systems and a member of ESS-H - Embedded Sensor Systems for Health Research Profile. Mobyen has 100+ scientific publication and more than 1173 citations.

He is involved in research and development since 2005 after completing his M.Sc. in Computer Engineering (thesis) from Dalarna University, Sweden. He received his PhD (thesis) in computer science in 2011 from Mälardalen University. He has completed one postdoctoral study between the years 2012 and 2014 in Computer Science and Engineering (Center for Applied Autonomous Sensor Systems) at School of Science and Technology, Örebro University, Sweden

To mention some courses those, I am involved in teaching: Applied Artificial Intelligence, Project in intelligent embedded systemsMachine Learning With Big Data (a distance course for industrial professionals), Databases, Deep learning for industrial imaging (comming), Predictive analytics (comming) etc.

Publications (10 of 147) Show all publications
Barua, A. (2025). Advanced Hybrid Reasoning and Transfer Learning on Multimodal Data with Transformers. Springer Nature Computer Science, 6(3)
Open this publication in new window or tab >>Advanced Hybrid Reasoning and Transfer Learning on Multimodal Data with Transformers
2025 (English)In: Springer Nature Computer Science, ISSN 2662995X, Vol. 6, no 3Article in journal (Refereed) Published
Abstract [en]

Reasoning is a vital process in machine learning (ML), involving making inferences and drawing conclusions based on data. This capability is important for developing intelligent systems which can understand and predict complex patterns. The study investigates reasoning through two distinct methodologies: multimodal reasoning and transfer learning based reasoning. In the first approach, multimodal reasoning is used with a semi-supervised method to label unlabelled datasets. In the second approach, transfer learning has been used to transfer knowledge of data from one model to another. Both approaches are demonstrated using unlabelled vehicular telemetry data. During processing, three sets of telemetry data are used to extract features separately through the autoencoder. These features are then clustered and aligned to create labelled and unlabelled datasets. The eXtreme Gradient Boosting (XGBoost) algorithm achieved over 98% test accuracy when applied to the labelled datasets and was then used to predict labels for the unlabelled datasets, which were later added to the labelled dataset to form three datasets for further processing. In transfer learning, a transformer model specifically designed to handle continuous features is developed. Labelled datasets are applied to the transformer model, one after the other, resulting in three final models, with each model achieving over 80% accuracy. The model’s prediction confidence is also validated using conformal learning, where the final models achieved over 80% accuracy. The transformer model is also separately trained on the datasets and compared with traditional ML models, outperforming the others by achieving an accuracy of 98%. By building on the groundwork laid by this study, future research can push the boundaries of what is possible with reasoning approaches, opening up new paths for scientific exploration and practical applications in different fields.

Keywords
Multimodal reasoning, Semi-supervised learning, Supervised alignment, Transfer learning, Transformer
National Category
Computer and Information Sciences Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-69147 (URN)10.1007/s42979-025-03706-x (DOI)2-s2.0-85218692188 (Scopus ID)
Funder
EU, Horizon 2020, 953432
Available from: 2024-11-15 Created: 2024-11-15 Last updated: 2025-03-12Bibliographically approved
D'Cruze, R. S., Ahmed, M. U., Bengtsson, M., Rehman, A. U., Funk, P. & Sohlberg, R. (2024). A Case Study on Ontology Development for AI Based Decision Systems in Industry. In: Lecture Notes in Mechanical Engineering: . Paper presented at 7th International Congress and Workshop on Industrial AI and eMaintenance, IAI 2023, Luleå, Sweden, 13 June 2023 through 15 June 2023 (pp. 693-706). Springer Science and Business Media Deutschland GmbH
Open this publication in new window or tab >>A Case Study on Ontology Development for AI Based Decision Systems in Industry
Show others...
2024 (English)In: Lecture Notes in Mechanical Engineering, Springer Science and Business Media Deutschland GmbH , 2024, p. 693-706Conference paper, Published paper (Refereed)
Abstract [en]

Ontology development plays a vital role as it provides a structured way to represent and organize knowledge. It has the potential to connect and integrate data from different sources, enabling a new class of AI-based services and systems such as decision support systems and recommender systems. However, in large manufacturing industries, the development of such ontology can be challenging. This paper presents a use case of an application ontology development based on machine breakdown work orders coming from a Computerized Maintenance Management System (CMMS). Here, the ontology is developed using a Knowledge Meta Process: Methodology for Ontology-based Knowledge Management. This ontology development methodology involves steps such as feasibility study, requirement specification, identifying relevant concepts and relationships, selecting appropriate ontology languages and tools, and evaluating the resulting ontology. Additionally, this ontology is developed using an iterative process and in close collaboration with domain experts, which can help to ensure that the resulting ontology is accurate, complete, and useful for the intended application. The developed ontology can be shared and reused across different AI systems within the organization, facilitating interoperability and collaboration between them. Overall, having a well-defined ontology is critical for enabling AI systems to effectively process and understand information.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2024
Keywords
Custom NER, Industrial AI, Machine failures prediction, Ontology development, Artificial intelligence, Decision support systems, Interoperability, Iterative methods, Knowledge management, AI systems, Case-studies, Decision systems, Failures prediction, Machine failure, Machine failure prediction, Ontology's, Ontology
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-65369 (URN)10.1007/978-3-031-39619-9_51 (DOI)2-s2.0-85181980940 (Scopus ID)9783031396182 (ISBN)
Conference
7th International Congress and Workshop on Industrial AI and eMaintenance, IAI 2023, Luleå, Sweden, 13 June 2023 through 15 June 2023
Available from: 2024-01-17 Created: 2024-01-17 Last updated: 2024-01-17Bibliographically approved
Begum, S. & Ahmed, M. U. (2024). Artificial Intelligence in Predictive Maintenance: A Systematic Literature Review on Review Papers. In: Lecture Notes in Mechanical Engineering: . Paper presented at 7th International Congress and Workshop on Industrial AI and eMaintenance, IAI 2023, Luleå, Sweden, 13 June 2023 through 15 June 2023 (pp. 251-261). Springer Science and Business Media Deutschland GmbH
Open this publication in new window or tab >>Artificial Intelligence in Predictive Maintenance: A Systematic Literature Review on Review Papers
2024 (English)In: Lecture Notes in Mechanical Engineering, Springer Science and Business Media Deutschland GmbH , 2024, p. 251-261Conference paper, Published paper (Refereed)
Abstract [en]

The fourth industrial revolution, colloquially referred to as “industry 4.0”, has garnered substantial global attention in recent years. There, Artificial intelligence (AI) driven industrial intelligence has been increasingly deployed in predictive maintenance (PdM), emerging as a vital enabler of smart manufacturing and industry 4.0. Since in recent years the number of articles focusing on Artificial Intelligence (AI) in PdM is high a review on the available literature reviews in this domain would be useful for the future researchers who would like to advance the research in this area and also for the persons who would like to apply PdM in their application domains. Therefore, this study identifies the AI revolution in PdM and focuses on the next stages available in the literature reviews in this area by quality assessment of secondary study. A well-known structured review approach (Systematic Literature Review, or SLR) was employed to perform this tertiary study. In addition, the Scale for the Assessment of Narrative Review Articles (SANRA) approach for evaluating the quality of review papers has been employed to support a few of the research questions. Here, This tertiary study scrutinizes four crucial aspects of secondary articles: (1) their specific research domains, (2) the annual trends in the quantity, variety, and quality (3) a footsteps of top researchers, and (4) the research constraints that review articles face during the time frame of 2015 to 2022. The results show that the majority of the application areas are applied to the manufacturing industry. It also leads to the identification of the revolution of AI in PdM as well. Our final findings indicate that Dr. Cheng et al.’s (2022) review has emerged as the predominant source of information in this field. As newcomers or industrial practitioners, we can benefit greatly from following his insights. The final outcome is that there is a lack of progress in SLR formulation and in adding explainable or interpretive AI methodologies in secondary studies.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2024
Keywords
Artificial Intelligence, Predictive maintenance, Systematic literature review, Industry 4.0, Maintenance, Applications domains, Industrial revolutions, Literature reviews, Quality assessment, Review papers, Smart manufacturing, Structured review, Tertiary study
National Category
Other Engineering and Technologies
Identifiers
urn:nbn:se:mdh:diva-65371 (URN)10.1007/978-3-031-39619-9_18 (DOI)2-s2.0-85181976041 (Scopus ID)9783031396182 (ISBN)
Conference
7th International Congress and Workshop on Industrial AI and eMaintenance, IAI 2023, Luleå, Sweden, 13 June 2023 through 15 June 2023
Available from: 2024-01-17 Created: 2024-01-17 Last updated: 2024-02-26Bibliographically approved
Kabir, M. A., Ahmed, M. U., Begum, S. & Barua, S. (2024). Balancing Fairness: Unveiling the Potential of SMOTE-Driven Oversampling in AI Model Enhancement. In: ACM International Conference Proceeding Series: . Paper presented at 9th International Conference on Machine Learning Technologies, ICMLT 2024, Oslo 24 May 2024 through 26 May 2024 (pp. 21-29). Association for Computing Machinery
Open this publication in new window or tab >>Balancing Fairness: Unveiling the Potential of SMOTE-Driven Oversampling in AI Model Enhancement
2024 (English)In: ACM International Conference Proceeding Series, Association for Computing Machinery , 2024, p. 21-29Conference paper, Published paper (Refereed)
Abstract [en]

In the contemporary landscape of decision support systems, machine learning (ML) algorithms assume a pivotal role in diverse domains, including job screening and loan approvals. Despite their extensive utilization, a persistent challenge arises in the form of biased outcomes, notably influenced by sensitive attributes such as gender and ethnicity. While current research heavily leans on these attributes for fairness, the scarcity of data due to privacy and legal constraints poses a substantial hurdle. Furthermore, imbalances in real-world datasets necessitate the use of class balancing techniques, but conflicting findings on their impact on bias mitigation and overall model performance complicate the pursuit of fairness. This paper conducts a comprehensive investigation, addressing the unique challenge of constructing fair models without explicit reliance on sensitive attributes. It specifically examines the effectiveness of Synthetic Minority Oversampling TEchnique (SMOTE)-driven oversampling methods. The study's findings reveal a significant enhancement in classification performance through SMOTE-driven techniques. These insights advocate for the thoughtful integration of SMOTE-driven oversampling techniques to achieve a balance between model fairness and accuracy. The results provide valuable guidance to researchers and practitioners in the field, contributing to the ongoing dialogue on fairness in machine learning models. 

Place, publisher, year, edition, pages
Association for Computing Machinery, 2024
Keywords
bias mitigation, class balancing, fairness, machine learning, synthetic minority oversampling technique, Adversarial machine learning, Decision supports, Machine learning algorithms, Machine-learning, Over sampling, Sensitive attribute, Support systems, Synthetic minority over-sampling techniques, Contrastive Learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-68579 (URN)10.1145/3674029.3674034 (DOI)001342512100005 ()2-s2.0-85204676853 (Scopus ID)9798400716379 (ISBN)
Conference
9th International Conference on Machine Learning Technologies, ICMLT 2024, Oslo 24 May 2024 through 26 May 2024
Available from: 2024-10-02 Created: 2024-10-02 Last updated: 2024-12-04Bibliographically approved
Rehman, A. U., Nishat, T., Ahmed, M. U., Begum, S. & Ranjan, A. (2024). Chip Analysis for Tool Wear Monitoring in Machining: A Deep Learning Approach. IEEE Access, 12, 112672-112689
Open this publication in new window or tab >>Chip Analysis for Tool Wear Monitoring in Machining: A Deep Learning Approach
Show others...
2024 (English)In: IEEE Access, E-ISSN 2169-3536, Vol. 12, p. 112672-112689Article in journal (Refereed) Published
Abstract [en]

Recent strides in integrating artificial intelligence (AI) with production systems align with the trend towards highly automated manufacturing, demanding smarter machinery. This dovetails with the overarching vision of Industry 4.0, moving beyond conventional models towards employing AI for real-time modeling of production processes, enabling adaptable and learning-enabled models. This study focuses on leveraging cutting-edge deep learning techniques to monitor and classify tool wear using authentic image data from machining processes. Various deep learning algorithms, including CNN, AlexNet, EfficientNetB0, MobileNetV2, CoAtNet-0, and ResNet18, are explored for monitoring and measuring wear through images of machining chips. The collected images of machining chips are categorized as ‘Accepted’, ‘Unaccepted’, and ‘Optimal’. Due to imbalanced datasets, the study investigates two distinct strategies: upsampling and downsampling. The study also aimes to enhance sensitivity for a specific minority class to meet industrial requirements. The study showed that upsampling enhanced accuracy and almost fulfilled the stated requirements, whereas downsampling did not achieve the desired outcomes. The study evaluates and compares the effectiveness of recently introduced deep learning algorithms with other CNN-based architectures in classifying tool wear states in real-world scenarios. It sheds light on the challenges faced by the machining industry, particularly the prevalent issue of class imbalance in real-world machining data. The observed results indicate that ResNet18 and AlexNet outperform other algorithms, achieving a weighted average accuracy of 96% for both multiclass and binary classification problems when considering upsampled datasets. Consequently, the study concludes that both ResNet18 and AlexNet demonstrate adaptability to class imbalances, generalization to real-world machining scenarios, and competitive accuracy.

Place, publisher, year, edition, pages
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC, 2024
Keywords
Deep learning, industry 4.0, machining, neural networks, predictive maintenance, tool wear, industry 4.0, machining, neural networks, predictive maintenance, tool wear
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:mdh:diva-68327 (URN)10.1109/ACCESS.2024.3443517 (DOI)001297306100001 ()2-s2.0-85201273178 (Scopus ID)
Available from: 2024-09-04 Created: 2024-09-04 Last updated: 2024-09-04Bibliographically approved
Bengtsson, M., D'Cruze, R. S., Ahmed, M. U., Sakao, T., Funk, P. & Sohlberg, R. (2024). Combining Ontology and Large Language Models to Identify Recurring Machine Failures in Free-Text Fields. In: Sustainable Production Through Advanced Manufacturing, Intelligent Automation And Work Integrated Learning, Sps 2024: . Paper presented at 9 April 2024 11th Swedish Production Symposium, SPS2024. Trollhattan. 23 April 2024 through 26 April 2024 (pp. 27-38). IOS Press BV, 52
Open this publication in new window or tab >>Combining Ontology and Large Language Models to Identify Recurring Machine Failures in Free-Text Fields
Show others...
2024 (English)In: Sustainable Production Through Advanced Manufacturing, Intelligent Automation And Work Integrated Learning, Sps 2024, IOS Press BV , 2024, Vol. 52, p. 27-38Conference paper, Published paper (Refereed)
Abstract [en]

Companies must enhance total maintenance effectiveness to stay competitive, focusing on both digitalization and basic maintenance procedures. Digitalization offers technologies for data-driven decision-making, but many maintenance decisions still lack a factual basis. Prioritizing efficiency and effectiveness require analyzing equipment history, facilitated by using Computerized Maintenance Management Systems (CMMS). However, CMMS data often contains unstructured free-text, leading to manual analysis, which is resource-intensive and reactive, focusing on short time periods and specific equipment. Two approaches are available to solve the issue: minimizing free-text entries or using advanced methods for processing them. Free-text allows detailed descriptions but may lack completeness, while structured reporting aids automated analysis but may limit fault description richness. As knowledge and experience are vital assets for companies this research uses a hybrid approach by combining Natural Language Processing with domain specific ontology and Large Language Models to extract information from free-text entries, enabling the possibility of real-time analysis e.g., identifying recurring failure and knowledge sharing across global sites.

Place, publisher, year, edition, pages
IOS Press BV, 2024
Keywords
Artificial Intelligence, Experience Reuse, Industrial Maintenance, Large Language Models, Natural Language Processing, Computational linguistics, Decision making, Failure (mechanical), Natural language processing systems, Ontology, Computerized maintenance management system, Free texts, Language model, Language processing, Large language model, Natural languages, Text entry, Maintenance
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:mdh:diva-66565 (URN)10.3233/ATDE240151 (DOI)001229990300003 ()2-s2.0-85191305248 (Scopus ID)9781643685106 (ISBN)
Conference
9 April 2024 11th Swedish Production Symposium, SPS2024. Trollhattan. 23 April 2024 through 26 April 2024
Available from: 2024-05-14 Created: 2024-05-14 Last updated: 2024-07-03Bibliographically approved
Hurter, C., Degas, A., Guibert, A., Poyer, M., Durand, N., Veyrie, A., . . . Aricó, P. (2024). Examining Decision-Making in Air Traffic Control: Enhancing Transparency and Decision Support Through Machine Learning, Explanation, and Visualization: A Case Study. In: Int. Conf. Agent. Artif. Intell.: . Paper presented at International Conference on Agents and Artificial Intelligence (pp. 622-634). Science and Technology Publications, Lda
Open this publication in new window or tab >>Examining Decision-Making in Air Traffic Control: Enhancing Transparency and Decision Support Through Machine Learning, Explanation, and Visualization: A Case Study
Show others...
2024 (English)In: Int. Conf. Agent. Artif. Intell., Science and Technology Publications, Lda , 2024, p. 622-634Conference paper, Published paper (Refereed)
Abstract [en]

Artificial Intelligence (AI) has recently made significant advancements and is now pervasive across various application domains. This holds true for Air Transportation as well, where AI is increasingly involved in decision-making processes. While these algorithms are designed to assist users in their daily tasks, they still face challenges related to acceptance and trustworthiness. Users often harbor doubts about the decisions proposed by AI, and in some cases, they may even oppose them. This is primarily because AI-generated decisions are often opaque, non-intuitive, and incompatible with human reasoning. Moreover, when AI is deployed in safety-critical contexts like Air Traffic Management (ATM), the individual decisions generated by AI models must be highly reliable for human operators. Understanding the behavior of the model and providing explanations for its results are essential requirements in every life-critical domain. In this scope, this project aimed to enhance transparency and explainability in AI algorithms within the Air Traffic Management domain. This article presents the results of the project’s validation conducted for a Conflict Detection and Resolution task involving 21 air traffic controllers (10 experts and 11 students) in En-Route position (i.e. hight altitude flight management). Through a controlled study incorporating three levels of explanation, we offer initial insights into the impact of providing additional explanations alongside a conflict resolution algorithm to improve decision-making. At a high level, our findings indicate that providing explanations is not always necessary, and our project sheds light on potential research directions for education and training purposes.

Place, publisher, year, edition, pages
Science and Technology Publications, Lda, 2024
Keywords
Air Traffic Management, Artificial Intelligence, Conflict Detection and Resolution, eXplainable Artificial Intelligence, User-Centric XAI
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:mdh:diva-69538 (URN)10.5220/0012471900003636 (DOI)2-s2.0-85190650606 (Scopus ID)
Conference
International Conference on Agents and Artificial Intelligence
Available from: 2024-12-12 Created: 2024-12-12 Last updated: 2024-12-19Bibliographically approved
Islam, M. R., Ahmed, M. U. & Begum, S. (2024). iXGB: improving the interpretability of XGBoost using decision rules and counterfactuals. In: Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART: . Paper presented at 16th International Conference on Agents and Artificial Intelligence (ICAART 2024) (pp. 1345-1353). , 3
Open this publication in new window or tab >>iXGB: improving the interpretability of XGBoost using decision rules and counterfactuals
2024 (English)In: Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART, 2024, Vol. 3, p. 1345-1353Conference paper, Published paper (Other academic)
Abstract [en]

Tree-ensemble models, such as Extreme Gradient Boosting (XGBoost), are renowned Machine Learning models which have higher prediction accuracy compared to traditional tree-based models. This higher accuracy, however, comes at the cost of reduced interpretability. Also, the decision path or prediction rule of XGBoost is not explicit like the tree-based models. This paper proposes the iXGB--interpretable XGBoost, an approach to improve the interpretability of XGBoost. iXGB approximates a set of rules from the internal structure of XGBoost and the characteristics of the data. In addition, iXGB generates a set of counterfactuals from the neighbourhood of the test instances to support the understanding of the end-users on their operational relevance. The performance of iXGB in generating rule sets is evaluated with experiments on real and benchmark datasets which demonstrated reasonable interpretability. The evaluation result also supports that the interpretability of XGBoost can be improved without using surrogate methods.

Keywords
Counterfactuals, Explainability, Explainable Artificial Intelligence, Interpretability, Regression, Rule-based Explanation, XGBoost.
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-64916 (URN)10.5220/0012474000003636 (DOI)2-s2.0-85190810293 (Scopus ID)978-989-758-680-4 (ISBN)
Conference
16th International Conference on Agents and Artificial Intelligence (ICAART 2024)
Available from: 2023-11-30 Created: 2023-11-30 Last updated: 2024-05-08Bibliographically approved
Begum, S., Ahmed, M. U., Barua, S., Kabir, M. A. & Masud, A. N. (2024). Research Issues and Challenges in the Computational Development of Trustworthy AI. In: 6th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2024: . Paper presented at 6th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2024, Kota Kinabalu, 26 August 2024 through 28 August 2024. Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Research Issues and Challenges in the Computational Development of Trustworthy AI
Show others...
2024 (English)In: 6th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2024, Institute of Electrical and Electronics Engineers (IEEE), 2024Conference paper, Published paper (Refereed)
Abstract [en]

The development and deployment of AI systems necessitate a steadfast commitment to reliability, safety, security, ethics, and social responsibility. This paper introduces key research issues and challenges for trustworthy AI based on our experience working on several ongoing research projects at Mälardalen University (MDU), Sweden, which considers practical, real-world scenarios from the mobility, transportation, and healthcare domains. Our observations have highlighted several critical technical components that underpin trustworthy AI. These components include fairness, safety, transparency, explainability, accountability, rigorous testing, verification, and a human-centric approach to AI. Notably, these elements align closely with the current state-of-the-art practices in the field.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
National Category
Computer Sciences
Identifiers
urn:nbn:se:mdh:diva-69221 (URN)10.1109/IICAIET62352.2024.10730209 (DOI)2-s2.0-85209658708 (Scopus ID)9798350389692 (ISBN)
Conference
6th IEEE International Conference on Artificial Intelligence in Engineering and Technology, IICAIET 2024, Kota Kinabalu, 26 August 2024 through 28 August 2024
Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2024-12-20Bibliographically approved
Barua, A., Ahmed, M. U., Barua, S., Begum, S. & Giorgi, A. (2024). Second-Order Learning with Grounding Alignment: A Multimodal Reasoning Approach to Handle Unlabelled Data. In: International Conference on Agents and Artificial Intelligence: . Paper presented at 16th International Conference on Agents and Artificial Intelligence, ICAART 2024. Rome 24 February 2024 through 26 February 2024 (pp. 561-572). Science and Technology Publications, Lda, 2
Open this publication in new window or tab >>Second-Order Learning with Grounding Alignment: A Multimodal Reasoning Approach to Handle Unlabelled Data
Show others...
2024 (English)In: International Conference on Agents and Artificial Intelligence, Science and Technology Publications, Lda , 2024, Vol. 2, p. 561-572Conference paper, Published paper (Refereed)
Abstract [en]

Multimodal machine learning is a critical aspect in the development and advancement of AI systems. However, it encounters significant challenges while working with multimodal data, where one of the major issues is dealing with unlabelled multimodal data, which can hinder effective analysis. To address the challenge, this paper proposes a multimodal reasoning approach adopting second-order learning, incorporating grounding alignment and semi-supervised learning methods. The proposed approach illustrates using unlabelled vehicular telemetry data. During the process, features were extracted from unlabelled telemetry data using an autoencoder and then clustered and aligned with true labels of neurophysiological data to create labelled and unlabelled datasets. In the semi-supervised approach, the Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) algorithms are applied to the labelled dataset, achieving a test accuracy of over 97%. These algorithms are then used to predict labels for the unlabelled dataset, which is later added to the labelled dataset to retrain the model. With the additional prior labelled data, both algorithms achieved a 99% test accuracy. Confidence in predictions for unlabelled data was validated using counting samples based on the prediction score and Bayesian probability. RF and XGBoost scored 91.26% and 97.87% in counting samples and 98.67% and 99.77% in Bayesian probability, respectively.

Place, publisher, year, edition, pages
Science and Technology Publications, Lda, 2024
Keywords
Autoencoder, Multimodal Reasoning, Semi-Supervised, Supervised Alignment
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:mdh:diva-66579 (URN)10.5220/0012466500003636 (DOI)2-s2.0-85190658759 (Scopus ID)
Conference
16th International Conference on Agents and Artificial Intelligence, ICAART 2024. Rome 24 February 2024 through 26 February 2024
Available from: 2024-05-08 Created: 2024-05-08 Last updated: 2024-12-19Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-3802-4721

Search in DiVA

Show all publications