Ensembles of Oblique Decision Trees

dc.contributor.advisorNatarajan, Sriraam
dc.creatorMajumder, Torsha
dc.date.accessioned2020-08-26T16:35:05Z
dc.date.available2020-08-26T16:35:05Z
dc.date.created2020-05
dc.date.issued2020-05
dc.date.submittedMay 2020
dc.date.updated2020-08-26T16:35:06Z
dc.description.abstractEnsemble methods such as bagging are widely used as they can improve generalization performance and stability compared to individual base estimators. Such methods often use decision trees as base estimators. Standard decision-tree algorithms learn univariate splits in decision nodes resulting in axis-parallel decision boundaries, which can lead to very deep decision trees owing to the limited representation power within each decision node. Oblique decision trees, which have seen recent renewed interest, learn multivariate linear splits in decision nodes. Oblique decision trees are generally shallower than their axis-parallel counterparts owing to the increased representative power of multivariate splits. This thesis analyzes the performance of different oblique decision tree algorithms when used as a base estimator in a bagging ensemble. In particular, the thesis explores the trade-offs between increased node complexity (oblique splits vs. axis-parallel splits) and tree complexity (shallower oblique trees vs. deeper axis-parallel trees), and their effect on ensemble performance. Bagging ensembles of several state-of-the-art oblique tree algorithms are compared with standard bagging approaches on different data sets. This analysis highlights two key results: (1) randomization is a powerful and efficient technique for tree-learning for ensemble learning owing to its ability to promote ensemble diversity by decreasing estimator correlation; (2) for larger problems, with more features or a large number of classes, optimization-based oblique decision tree ensembles are effective but at the expense of a greater computational cost.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10735.1/8818
dc.language.isoen
dc.rights©2020 Torsha Majumder. All rights reserved.
dc.subjectMachine learning
dc.subjectDecision trees
dc.subjectAlgorithms
dc.titleEnsembles of Oblique Decision Trees
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Science
thesis.degree.grantorThe University of Texas at Dallas
thesis.degree.levelMasters
thesis.degree.nameMSCS
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ETD-5608-011T-261853.63.pdf
Size:
5.27 MB
Format:
Adobe Portable Document Format
Description:
Thesis
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description: