Folksonomy Based Question Answering System

dc.contributor.advisorMoldovan, Dan
dc.creatorRamaswamy, Swetha
dc.date.accessioned2020-01-29T16:20:18Z
dc.date.available2020-01-29T16:20:18Z
dc.date.created2019-08
dc.date.issued2019-08
dc.date.submittedAugust 2019
dc.date.updated2020-01-29T16:20:19Z
dc.description.abstractFinancial data is on the rise. Most of this data is unstructured in nature. A major contributor to this data is in the form of web articles. With an increase in such data, there is a need for techniques that parse unstructured information. This thesis project presents an overview of a folksonomy-based approach for a Question Answering system. The proposed system is divided into two steps, the first step processes the contextual information using techniques such as document-to-vector, tf-idf and topic modelling which forms the level 1 granularity. The variant of word2vec in the form of paragraph2vec is used for achieving a sentence level granularity (level 2). Various combinations of level 1 and level 2 granularity are explored, and the best combination is sought after. The concepts of folksonomy, which is social and contextual tagging, is associated with reduction of search space. The search space is a combination of all possible answers in which the correct answer resides. The idea is to reduce the search space such that different algorithms have the ability of finding the correct answers. The models are then stress tested by varying different parameters. The parameters are obtained after performing a grid search. While finding the best model, more than 12,000 models were generated and tested. The best model was tested on two test cases where it generated an accuracy of 61% and 64%.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10735.1/7195
dc.language.isoen
dc.rights©2019 Swetha Ramaswamy. All Rights Reserved.
dc.subjectNatural language generation (Computer science)
dc.subjectQuestion-answering systems
dc.subjectMetadata
dc.titleFolksonomy Based Question Answering System
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.grantorThe University of Texas at Dallas
thesis.degree.levelMasters
thesis.degree.nameMSCS

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
ETD-5608-011-RAMASWAMY-260803.46.pdf
Size:
3.45 MB
Format:
Adobe Portable Document Format
Description:
Thesis
No Thumbnail Available
Name:
Final Testcase 1 Questions and Answers.csv
Size:
70.22 KB
Format:
Unknown data format
No Thumbnail Available
Name:
Final Tescase 2 Questions.xlsx
Size:
11.36 KB
Format:
Microsoft Excel XML

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.85 KB
Format:
Plain Text
Description: