Synthetic Speech Detection Using Deep Neural Networks

Reimao, Ricardo Amaral Martins

dc.contributor.advisor	Tzerpos, Vassilios
dc.contributor.author	Reimao, Ricardo Amaral Martins
dc.date.accessioned	2019-11-22T18:41:56Z
dc.date.available	2019-11-22T18:41:56Z
dc.date.copyright	2019-05
dc.date.issued	2019-11-22
dc.date.updated	2019-11-22T18:41:56Z
dc.degree.discipline	Computer Science
dc.degree.level	Master's
dc.degree.name	MSc - Master of Science
dc.description.abstract	With advances in deep learning and other techniques, synthetic speech is getting closer to a natural-sounding voice. Some state-of-the-art technologies achieve such a high level of naturalness that even humans have difficulty distinguishing real speech from computer-generated speech. Moreover, these technologies allow a person to train a speech synthesizer on a target voice, creating a model that can reproduce someone's voice with high fidelity. In this research, we thoroughly analyze how synthetic speech is generated and propose deep learning methodologies to detect such synthesized utterances. We first collected a significant number of real and synthetic utterances to create the Fake or Real (FoR) dataset. Then, we analyzed the performance of the latest deep learning models in classifying such utterances. Our proposed model achieves 99.86% accuracy in synthetic speech detection, a significant improvement over human performance (65.7%).
dc.identifier.uri	http://hdl.handle.net/10315/36698
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Artificial intelligence
dc.subject.keywords	machine learning
dc.subject.keywords	synthetic speech
dc.subject.keywords	deep learning
dc.subject.keywords	artificial intelligence
dc.subject.keywords	deep neural networks
dc.subject.keywords	neural networks
dc.subject.keywords	synthetic speech detection
dc.subject.keywords	TTS
dc.subject.keywords	text to speech
dc.subject.keywords	speech generation
dc.subject.keywords	speech synthesis
dc.subject.keywords	audio classification
dc.subject.keywords	speech classification
dc.title	Synthetic Speech Detection Using Deep Neural Networks
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Name: Reimao_Ricardo_AM_2019_Masters.pdf
Size: 3.12 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.87 KB
Format: Plain Text

Name: YorkU_ETDlicense.txt
Size: 3.39 KB
Format: Plain Text

Collections

Computer Science and Engineering