Investigating fuzzy methods for multilingual speaker identification

Lima, Thales Aguiar de

Investigating fuzzy methods for multilingual speaker identification

Página do item simplificado Estatísticas

dc.contributor.advisor	Abreu, Marjory Cristiany da Costa
dc.contributor.advisorID		pt_BR
dc.contributor.author	Lima, Thales Aguiar de
dc.contributor.authorID		pt_BR
dc.contributor.referees1	Santin, Altair Olivo
dc.contributor.referees1ID		pt_BR
dc.contributor.referees2	Pereira, Mônica Magalhães
dc.contributor.referees2ID		pt_BR
dc.date.accessioned	2020-10-05T17:14:37Z
dc.date.available	2020-10-05T17:14:37Z
dc.date.issued	2020-08-27
dc.description.resumo	Speech is a crucial ability for humans to interact and communicate. Speech-based technologies are becoming more popular with speech interfaces, real-time translation, and budget healthcare diagnosis. Besides, the use of voice for system identification is an important and relevant topic. There are several ways of doing it, but most are dependent on the language the user speaks. However, if the idea is to create an all inclusive and reliable system that uses speech as its input, we must take into account that people can and will speak different languages and accents. This research evaluates closed-set text-independent speaker identification systems on a multilingual setup, including both fuzzy and crisp models. Our experiments are performed using three widely spoken languages which are Portuguese, English, and Chinese. Then, we extracted 13-MFCCs, along with log-Energy and its respective delta and delta-delta from signals to use as our feature vector. We adopted four classifiers: Fuzzy C-Means, Fuzzy k-Nearest Neighbours, k-Nearest Neighbours, and Support Vector Machines. Initial tests indicated the systems have certain robustness on multiple languages. Where results with more languages decreases our accuracy; however our investigation suggests these impacts are from number of classes.	pt_BR
dc.description.sponsorship	Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES	pt_BR
dc.identifier.citation	LIMA, Thales Aguiar de. Investigating fuzzy methods for multilingual speaker identification. 2020. 66f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2020.	pt_BR
dc.identifier.uri	https://repositorio.ufrn.br/handle/123456789/30245
dc.language	pt_BR	pt_BR
dc.publisher	Universidade Federal do Rio Grande do Norte	pt_BR
dc.publisher.country	Brasil	pt_BR
dc.publisher.initials	UFRN	pt_BR
dc.publisher.program	PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO	pt_BR
dc.rights	Acesso Aberto	pt_BR
dc.subject	Speaker identification	pt_BR
dc.subject	Speaker recognition	pt_BR
dc.subject	Fuzzy	pt_BR
dc.subject	Signal processing	pt_BR
dc.subject	Multilingual speech systems	pt_BR
dc.subject	Portuguese	pt_BR
dc.subject	English	pt_BR
dc.subject	Mandarin	pt_BR
dc.title	Investigating fuzzy methods for multilingual speaker identification	pt_BR
dc.type	masterThesis	pt_BR

Arquivos

Pacote Original

Agora exibindo 1 - 1 de 1

Nome:: Investigatingfuzzymethods_Lima_2020.pdf
Tamanho:: 3.03 MB
Formato:: Adobe Portable Document Format

Baixar

Coleções

PPGSC - Mestrado em Sistemas e Computação

SIGAA

Investigating fuzzy methods for multilingual speaker identification

Arquivos

Pacote Original

Coleções