https://www.mdu.se/

mdu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
How accuracy of estimated glottal flow waveforms affects spoofed speech detection performance
Mälardalen University, School of Innovation, Design and Engineering.
2020 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

In the domain of automatic speaker verification,  one of the challenges is to keep the malevolent people out of the system.  One way to do this is to create algorithms that are supposed to detect spoofed speech. There are several types of spoofed speech and several ways to detect them, one of which is to look at the glottal flow waveform  (GFW) of a speech signal. This waveform is often estimated using glottal inverse filtering  (GIF),  since, in order to create the ground truth  GFW, special invasive equipment is required.  To the author’s knowledge, no research has been done where the correlation of GFW accuracy and spoofed speech detection (SSD) performance is investigated. This thesis tries to find out if the aforementioned correlation exists or not.  First, the performance of different GIF methods is evaluated, then simple SSD machine learning (ML) models are trained and evaluated based on their macro average precision. The ML models use different datasets composed of parametrized GFWs estimated with the GIF methods from the previous step. Results from the previous tasks are then combined in order to spot any correlations.  The evaluations of the different methods showed that they created GFWs of varying accuracy.  The different machine learning models also showed varying performance depending on what type of dataset that was being used. However, when combining the results, no obvious correlations between GFW accuracy and SSD performance were detected.  This suggests that the overall accuracy of a GFW is not a substantial factor in the performance of machine learning-based SSD algorithms.

Place, publisher, year, edition, pages
2020. , p. 35
Keywords [en]
computer science, machine learning, automatic speech verification, spoofed speech detection, glottal flow waveform, glottal inverse filtering, artificial neural network, logistic regression, support vector machine, classifiers
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:mdh:diva-48414OAI: oai:DiVA.org:mdh-48414DiVA, id: diva2:1437224
Subject / course
Computer Science
Presentation
2020-06-04, 11:10 (English)
Supervisors
Examiners
Available from: 2020-06-17 Created: 2020-06-09 Last updated: 2020-06-17Bibliographically approved

Open Access in DiVA

fulltext(1010 kB)266 downloads
File information
File name FULLTEXT01.pdfFile size 1010 kBChecksum SHA-512
8d40530ff3a9f3a1845b0f7699af3248657f75512cbe07dcafdd58049caad9393f48e72a9ea5faa2a374d54ad782683cf6bee9569874c6664a438a1159b2018d
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Deivard, Johannes
By organisation
School of Innovation, Design and Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 266 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 210 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf