• English
    • Kurdish
        • English
        • Kurdish
      View Item 
      •   Soran University Repository
      • Articles
      • Article
      • View Item
      •   Soran University Repository
      • Articles
      • Article
      • View Item
      • All fields
      • Title
      • Subject
      • Author
      • Year
      • Publisher
      • Source Title
      • ISSN
      • DOI
      • ISBN
      Advanced Search
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Computer Speech and Language

      Spoofing countermeasure for fake speech detection using brute force features

      Author:
      Arsalan Rahman, Mirza,
      ,
      Abdulbasit K., Al-Talabani
      Abstract: Due to the progress in deep learning technology, techniques that generate spoofed speech have significantly emerged. Such synthetic speech can be exploited for harmful purposes, like imper- sonation or disseminating false information. Researchers in the area investigate the useful fea- tures for spoof detection. This paper extensively investigates three problems in spoof detection in speech, namely, the imbalanced sample per class, which may negatively affect the performance of any detection models, the effect of the feature early and late fusion, and the analysis of unseen attacks on the model. Regarding the imbalanced issue, we have proposed two approaches (a Synthetic Minority Over Sampling Technique (SMOTE)-based and a Bootstrap-based model). We have used the OpenSMILE toolkit, to extract different feature sets, their results and early and late fusion of them have been investigated. The experiments are evaluated using the ASVspoof 2019 datasets which encompass synthetic, voice-conversion, and replayed speech samples. Addition- ally, Support Vector Machine (SVM) and Deep Neural Network (DNN) have been adopted in the classification. The outcomes from various test scenarios indicated that neither the imbalanced nature of the dataset nor a specific feature or their fusions outperformed the brute force version of the model as the best Equal Error Rate (EER) achieved by the Imbalance model is 6.67 % and 1.80 % for both Logical Access (LA) and Physical Access (PA) respectively.
      URI: http://192.64.112.23/xmlui/handle/311/83
      Subject: Spoofing countermeasure , Feature fusion , Data balancing , ASVspoof 2019 , Deep neural network , Support vector machine
      Collections :
      • Article
      • Download: (227.6Kb)
      • Show Full MetaData

      Show full item record

      contributor authorArsalan Rahman, Mirza,
      contributor authorAbdulbasit K., Al-Talabani
      date accessioned2025-02-20T15:32:31Z
      date available2025-02-20T15:32:31Z
      date issued2025
      identifier urihttp://192.64.112.23/xmlui/handle/311/83
      description abstractDue to the progress in deep learning technology, techniques that generate spoofed speech have significantly emerged. Such synthetic speech can be exploited for harmful purposes, like imper- sonation or disseminating false information. Researchers in the area investigate the useful fea- tures for spoof detection. This paper extensively investigates three problems in spoof detection in speech, namely, the imbalanced sample per class, which may negatively affect the performance of any detection models, the effect of the feature early and late fusion, and the analysis of unseen attacks on the model. Regarding the imbalanced issue, we have proposed two approaches (a Synthetic Minority Over Sampling Technique (SMOTE)-based and a Bootstrap-based model). We have used the OpenSMILE toolkit, to extract different feature sets, their results and early and late fusion of them have been investigated. The experiments are evaluated using the ASVspoof 2019 datasets which encompass synthetic, voice-conversion, and replayed speech samples. Addition- ally, Support Vector Machine (SVM) and Deep Neural Network (DNN) have been adopted in the classification. The outcomes from various test scenarios indicated that neither the imbalanced nature of the dataset nor a specific feature or their fusions outperformed the brute force version of the model as the best Equal Error Rate (EER) achieved by the Imbalance model is 6.67 % and 1.80 % for both Logical Access (LA) and Physical Access (PA) respectively.en_US
      language isoen_USen_US
      publisherComputer Speech and Languageen_US
      subjectSpoofing countermeasureen_US
      subjectFeature fusionen_US
      subjectData balancingen_US
      subjectASVspoof 2019en_US
      subjectDeep neural networken_US
      subjectSupport vector machineen_US
      titleSpoofing countermeasure for fake speech detection using brute force featuresen_US
      typeArticleen_US
      Digital Repository Software, Supported by Negasht Company

      71 Title Indexed

       
      Digital Repository Software, Supported by Negasht Company

      71 Title Indexed