Canuto, Anne Magaly de PaulaBarreto, Cephas Alves da Silveira2023-11-012023-11-012023-07-24BARRETO, Cephas Alves da Silveira. Seleção e rotulagem de instâncias para métodos semissupervisionados indutivos. Orientador: Anne Magaly de Paula Canuto. 2023. 166f. Tese (Doutorado em Ciência da Computação) - Centro de Ciências Exatas e da Terra, Universidade Federal do Rio Grande do Norte, Natal, 2023.https://repositorio.ufrn.br/handle/123456789/55155In recent years, the use of Machine Learning (ML) techniques to solve real problems has become very common and a technological pattern adopted in plenty of domains. However, several of these domains do not have enough labelled data to give ML methods a good performance. This problem led to the development of Semi-supervised methods, a type of method capable of using labelled and unlabelled instances in its model building. Among the semi-supervised learning techniques, the inductive methods stand out. The wrapper methods, a particular category within inductive methods, use a process, often iterative, that involves: training the method with labelled data; selection of the best data from the unlabelled set; and labelling the selected data. Despite showing a simple and efficient process, errors in the selection or labelling processes are common, which deteriorate the final performance of the method. This research aims to reduce selection and labelling errors in wrapper methods to establish selection and labelling approaches that are more robust and less susceptible to errors. To this end, this work proposes a selection and labelling approach based on classification agreement and a selection and agreement approach based on distance metric as an additional factor to an already used selection criterion (e.g. confidence or agreement). The proposed approaches can be applied to any wrapper method and were tested on 42 datasets with Self-training, Co-training and Boosting methods. The results obtained indicate that the proposals bring gains for both methods in terms of accuracy and F-measure.Acesso AbertoComputaçãoAprendizado de máquinaAprendizado semissupervisionadoMétodos wrapperSeleção e rotulagem de instânciasSeleção e rotulagem de instâncias para métodos semissupervisionados indutivosSelection and labelling of instances for indictive semi-supervised methodsdoctoralThesisCNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::SISTEMAS DE COMPUTACAO