University of Rwanda Digital Repository

Classification of answers and questions using natural language processing


dc.contributor.author Rutayisire, Olivia
dc.date.accessioned 2021-08-18T12:01:11Z
dc.date.available 2021-08-18T12:01:11Z
dc.date.issued 2021
dc.identifier.uri http://hdl.handle.net/123456789/1384
dc.description Master's Dissertation en_US
dc.description.abstract The last decade has seen rapid and significant growth of digital technology worldwide. One of the key solutions that institutions are adopting to keep pace with this trend is the use of question answering systems and chatbots to automate some of the services their users need. Among these automated services is responding to questions asked by users. The biggest challenge lies in classifying questions and answers the way a human being would. This research aims to identify advanced implementations that can be used to optimize the usage of question answering systems. Three different models were built in Python and trained on a labeled Kaggle dataset of questions and answers classified from various perspectives, to better understand the questions and their respective answers. The labels comprise 21 question-related features and 9 answer-related features, each scored in the range 0 to 1. The algorithms used in the models are Ridge Regression, a Recurrent Neural Network using Long Short-Term Memory, and a Neural Network built with the Keras library. Ridge Regression obtained a maximum validation accuracy of 0.37, the Recurrent Neural Network with Long Short-Term Memory had an accuracy of 0.40, and the Keras Neural Network performed best on our training dataset, with a validation accuracy of 0.58. Models better suited to text classification, such as BERT, should be applied to the text data, and more features should be considered when training the model so that it classifies better. Additionally, focusing on fewer but more meaningful perceptions when choosing labeled features could boost the accuracy of the model. en_US
dc.language.iso en en_US
dc.publisher University of Rwanda en_US
dc.subject Text Classification, Question Answering systems, NLP, Ridge Regression, Deep QA, Long Short Term Memory, Keras, Neural Network en_US
dc.title Classification of answers and questions using natural language processing en_US
dc.type Thesis en_US