https://www.mdu.se/

mdu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Balancing Privacy and Accuracy in Federated Learning for Speech Emotion Recognition
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
RISE Research Institutes of Sweden, Västerås, Sweden.
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
Show others and affiliations
2023 (English)In: Proceedings of the 18th Conference on Computer Science and Intelligence Systems, Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 191-200Conference paper, Published paper (Refereed)
Abstract [en]

Speech Emotion Recognition (SER) is a valuable technology that identifies human emotions from spoken language, enabling the development of context-aware and personalized intelligent systems. To protect user privacy, Federated Learning (FL) has been introduced, enabling local training of models on user devices. However, FL raises concerns about the potential exposure of sensitive information from local model parameters, which is especially critical in applications like SER that involve personal voice data. Local Differential Privacy (LDP) has been successful in preventing privacy leaks in image and video data. However, it encounters notable accuracy degradation when applied to speech data, especially in the presence of high noise levels. In this paper, we propose an approach called LDP-FL with CSS, which combines LDP with a novel client selection strategy (CSS). By leveraging CSS, we aim to improve the representatives of updates and mitigate the adverse effects of noise on SER accuracy while ensuring client privacy through LDP. Furthermore, we conducted model inversion attacks to evaluate the robustness of LDP-FL in preserving privacy. These attacks involved an adversary attempting to reconstruct individuals' voice samples using the output labels provided by the SER model. The evaluation results reveal that LDP-FL with CSS achieved an accuracy of 65-70%, which is 4% lower than the initial SER model accuracy. Furthermore, LDP-FL demonstrated exceptional resilience against model inversion attacks, outperforming the non-LDP method by a factor of 10. Overall, our analysis emphasizes the importance of achieving a balance between privacy and accuracy in accordance with the requirements of the SER application

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2023. p. 191-200
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:mdh:diva-64364DOI: 10.15439/2023F444Scopus ID: 2-s2.0-85179177296ISBN: 9788396744784 (print)OAI: oai:DiVA.org:mdh-64364DiVA, id: diva2:1800436
Conference
18th Conference on Computer Science and Intelligence Systems, September 17–20, 2023. Warsaw, Poland
Available from: 2023-09-26 Created: 2023-09-26 Last updated: 2023-12-20Bibliographically approved
In thesis
1. Balancing Privacy and Performance in Emerging Applications of Federated Learning
Open this publication in new window or tab >>Balancing Privacy and Performance in Emerging Applications of Federated Learning
2023 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Federated Learning (FL) has emerged as a novel paradigm within machine learning (ML) that allows multiple devices to collaboratively train a shared ML model without sharing their private data with a central server. FL has gained popularity across various applications by eliminating the necessity for centralized data storage, thereby improving the confidentiality of sensitive information. Among the new FL applications, this thesis focuses on Speech Emotion Recognition (SER), which involves the analysis of audio signals from human speech to identify patterns and classify the conveyed emotions. When SER is implemented within a FL framework, even though speech data remains on local devices, new privacy challenges emerge during the training phase and the exchange of SER model update parameters between servers and clients. These challenges encompass the potential for privacy leakage and adversarial attacks, including model inversion attacks and membership or property inference attacks, which can be conducted by unauthorized or malicious parties to exploit the shared SER model, compromising client data confidentiality and revealing sensitive information.

While several privacy-preserving solutions have been developed to mitigate potential breaches in FL architectures, those are too generic to be easily integrated into specific applications. Furthermore, incorporating existing privacy-preserving mechanisms into the FL framework can increase communication and computational overheads, which may, in turn, compromise data utility and learning performance.

This thesis aims to propose privacy-preserving methods in FL for emerging security-critical applications such as SER while addressing the challenges related to their effect on performance. First, we categorize and analyze recent research on privacy-preserving mechanisms in FL, with a focus on assessing their effects on FL performance and how to balance privacy and performance across various applications. Second, we design an optimized FL setup tailored to SER applications in order to evaluate effects on performance and overhead. Third, we design and develop privacy-preserving mechanisms within FL to safeguard against potential privacy threats while ensuring the confidentiality of clients' data. Finally, we propose and evaluate new methods for FL in SER and integrate them with appropriate privacy-preserving mechanisms to achieve an optimal balance of privacy with efficiency, accuracy, as well as communication and computation overhead.

Place, publisher, year, edition, pages
Västerås: Mälardalens universitet, 2023
Series
Mälardalen University Press Licentiate Theses, ISSN 1651-9256 ; 349
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-64679 (URN)978-91-7485-621-7 (ISBN)
Presentation
2023-12-14, Paros, Mälardalens universitet, Västerås, 13:00 (English)
Opponent
Supervisors
Available from: 2023-11-07 Created: 2023-11-06 Last updated: 2023-11-23Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopushttps://annals-csis.org/proceedings/2023/pliks/fedcsis.pdf

Authority records

Mohammadi, SamanehSinaei, SimaBalador, AliFlammini, Francesco

Search in DiVA

By author/editor
Mohammadi, SamanehSinaei, SimaBalador, AliFlammini, Francesco
By organisation
Embedded SystemsInnovation and Product Realisation
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 106 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf