Beyond Data: Efficient Knowledge-guided Learning for Sparse and Structured Domains

dc.contributor.advisorNatarajan, Sriraam
dc.contributor.advisorBusso-Recabarren, Carlos A.
dc.contributor.committeeMemberTadepalli, Prasad
dc.contributor.committeeMemberIyer, Rishabh
dc.contributor.committeeMemberGogate, Vibhav
dc.creatorKokel, Harsha 1991-
dc.creator.orcid0000-0002-7548-3719
dc.date.accessioned2023-09-15T15:47:36Z
dc.date.available2023-09-15T15:47:36Z
dc.date.created2023-05
dc.date.issuedMay 2023
dc.date.submittedMay 2023
dc.date.updated2023-09-15T15:47:37Z
dc.description.abstractThe field of AI has made great advances in recent years. Most of these advances have focused on leveraging more data and finding new architectures to improve system performance. However, collecting data can lead to exorbitant costs. This is especially the case for structured domains where the data conforms to some standardized format (like tabular data, relational databases, etc.). In structured domains, an expert might be required to collect and organize data; necessitating time and effort. Further, learning explicitly from data is neither sufficient nor favorable. Enormous data can cause concerns for safety, lack of fairness, and a substantial carbon footprint. So looking beyond learning from data, this dissertation focuses on finding principled ways to leverage rich human knowledge for sparse and structured domains to guide the learning procedure. In particular, this dissertation looks at four challenges that arise when models are learned in structured domains and propose to tackle them using explicit human knowledge. First, we consider the challenge of learning from sparse and noisy data in the successful gradient boosting framework and propose to use domain-specific trend information to improve prediction. Second, we consider the challenge of learning to generalize across multiple tasks and objects in sequential decision making. We address this challenge by proposing a framework that takes inspiration from human’s ability to generalize by identifying compositionality and generating abstract representations. Third, we consider the challenging task of human-machine collaborative problem solving and propose a framework that uses natural language communication for effective bi-directional interaction. Finally, the fourth challenge we consider is the problem of a large hypothesis space when dealing with domains with heterogeneous objects. We identify the lack of a language bias—typed object representations—in recent neurosymbolic architectures and devise an approach to incorporate the bias. In this dissertation, we demonstrate various ways to incorporate domain-specific knowledge from humans in training AI systems. We conclude that using domain knowledge not only reduces the sample complexity but also improves the performance and generalization abilities of the model.
dc.format.mimetypeapplication/pdf
dc.identifier.uri
dc.identifier.urihttps://hdl.handle.net/10735.1/9859
dc.language.isoEnglish
dc.subjectComputer Science
dc.titleBeyond Data: Efficient Knowledge-guided Learning for Sparse and Structured Domains
dc.typeThesis
dc.type.materialtext
thesis.degree.collegeSchool of Engineering and Computer Science
thesis.degree.departmentComputer Science
thesis.degree.grantorThe University of Texas at Dallas
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
KOKEL-PRIMARY-2023.pdf
Size:
13.75 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
proquest_license.txt
Size:
6.37 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
license.txt
Size:
1.98 KB
Format:
Plain Text
Description: