Beyond Data: Efficient Knowledge-guided Learning for Sparse and Structured Domains

Kokel, Harsha 1991-

Beyond Data: Efficient Knowledge-guided Learning for Sparse and Structured Domains

dc.contributor.advisor	Natarajan, Sriraam
dc.contributor.advisor	Busso-Recabarren, Carlos A.
dc.contributor.committeeMember	Tadepalli, Prasad
dc.contributor.committeeMember	Iyer, Rishabh
dc.contributor.committeeMember	Gogate, Vibhav
dc.creator	Kokel, Harsha 1991-
dc.creator.orcid	0000-0002-7548-3719
dc.date.accessioned	2023-09-15T15:47:36Z
dc.date.available	2023-09-15T15:47:36Z
dc.date.created	2023-05
dc.date.issued	May 2023
dc.date.submitted	May 2023
dc.date.updated	2023-09-15T15:47:37Z
dc.description.abstract	The field of AI has made great advances in recent years. Most of these advances have focused on leveraging more data and finding new architectures to improve system performance. However, collecting data can lead to exorbitant costs. This is especially the case for structured domains where the data conforms to some standardized format (like tabular data, relational databases, etc.). In structured domains, an expert might be required to collect and organize data; necessitating time and effort. Further, learning explicitly from data is neither sufficient nor favorable. Enormous data can cause concerns for safety, lack of fairness, and a substantial carbon footprint. So looking beyond learning from data, this dissertation focuses on finding principled ways to leverage rich human knowledge for sparse and structured domains to guide the learning procedure. In particular, this dissertation looks at four challenges that arise when models are learned in structured domains and propose to tackle them using explicit human knowledge. First, we consider the challenge of learning from sparse and noisy data in the successful gradient boosting framework and propose to use domain-specific trend information to improve prediction. Second, we consider the challenge of learning to generalize across multiple tasks and objects in sequential decision making. We address this challenge by proposing a framework that takes inspiration from human’s ability to generalize by identifying compositionality and generating abstract representations. Third, we consider the challenging task of human-machine collaborative problem solving and propose a framework that uses natural language communication for effective bi-directional interaction. Finally, the fourth challenge we consider is the problem of a large hypothesis space when dealing with domains with heterogeneous objects. We identify the lack of a language bias—typed object representations—in recent neurosymbolic architectures and devise an approach to incorporate the bias. In this dissertation, we demonstrate various ways to incorporate domain-specific knowledge from humans in training AI systems. We conclude that using domain knowledge not only reduces the sample complexity but also improves the performance and generalization abilities of the model.
dc.format.mimetype	application/pdf
dc.identifier.uri
dc.identifier.uri	https://hdl.handle.net/10735.1/9859
dc.language.iso	English
dc.subject	Computer Science
dc.title	Beyond Data: Efficient Knowledge-guided Learning for Sparse and Structured Domains
dc.type	Thesis
dc.type.material	text
thesis.degree.college	School of Engineering and Computer Science
thesis.degree.department	Computer Science
thesis.degree.grantor	The University of Texas at Dallas
thesis.degree.name	Doctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1

Name:: KOKEL-PRIMARY-2023.pdf
Size:: 13.75 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: proquest_license.txt
Size:: 6.37 KB
Format:: Plain Text
Description:

Download

Name:: license.txt
Size:: 1.98 KB
Format:: Plain Text
Description:

Download

Collections

UTD Theses and Dissertations