Investigating fuzzy methods for multilingual speaker identification
dc.contributor.advisor | Abreu, Marjory Cristiany da Costa | |
dc.contributor.advisorID | pt_BR | |
dc.contributor.author | Lima, Thales Aguiar de | |
dc.contributor.authorID | pt_BR | |
dc.contributor.referees1 | Santin, Altair Olivo | |
dc.contributor.referees1ID | pt_BR | |
dc.contributor.referees2 | Pereira, Mônica Magalhães | |
dc.contributor.referees2ID | pt_BR | |
dc.date.accessioned | 2020-10-05T17:14:37Z | |
dc.date.available | 2020-10-05T17:14:37Z | |
dc.date.issued | 2020-08-27 | |
dc.description.resumo | Speech is a crucial ability for humans to interact and communicate. Speech-based technologies are becoming more popular with speech interfaces, real-time translation, and budget healthcare diagnosis. Besides, the use of voice for system identification is an important and relevant topic. There are several ways of doing it, but most are dependent on the language the user speaks. However, if the idea is to create an all inclusive and reliable system that uses speech as its input, we must take into account that people can and will speak different languages and accents. This research evaluates closed-set text-independent speaker identification systems on a multilingual setup, including both fuzzy and crisp models. Our experiments are performed using three widely spoken languages which are Portuguese, English, and Chinese. Then, we extracted 13-MFCCs, along with log-Energy and its respective delta and delta-delta from signals to use as our feature vector. We adopted four classifiers: Fuzzy C-Means, Fuzzy k-Nearest Neighbours, k-Nearest Neighbours, and Support Vector Machines. Initial tests indicated the systems have certain robustness on multiple languages. Where results with more languages decreases our accuracy; however our investigation suggests these impacts are from number of classes. | pt_BR |
dc.description.sponsorship | Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - CAPES | pt_BR |
dc.identifier.citation | LIMA, Thales Aguiar de. Investigating fuzzy methods for multilingual speaker identification. 2020. 66f. Dissertação (Mestrado em Sistemas e Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2020. | pt_BR |
dc.identifier.uri | https://repositorio.ufrn.br/handle/123456789/30245 | |
dc.language | pt_BR | pt_BR |
dc.publisher | Universidade Federal do Rio Grande do Norte | pt_BR |
dc.publisher.country | Brasil | pt_BR |
dc.publisher.initials | UFRN | pt_BR |
dc.publisher.program | PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO | pt_BR |
dc.rights | Acesso Aberto | pt_BR |
dc.subject | Speaker identification | pt_BR |
dc.subject | Speaker recognition | pt_BR |
dc.subject | Fuzzy | pt_BR |
dc.subject | Signal processing | pt_BR |
dc.subject | Multilingual speech systems | pt_BR |
dc.subject | Portuguese | pt_BR |
dc.subject | English | pt_BR |
dc.subject | Mandarin | pt_BR |
dc.title | Investigating fuzzy methods for multilingual speaker identification | pt_BR |
dc.type | masterThesis | pt_BR |
Arquivos
Pacote Original
1 - 1 de 1
Carregando...
- Nome:
- Investigatingfuzzymethods_Lima_2020.pdf
- Tamanho:
- 3.03 MB
- Formato:
- Adobe Portable Document Format
Carregando...