Application of Machine Learning in Drug Discovery

dc.contributor.advisorNourani, Mehrdad
dc.creatorKadiyala, Susmitha Sri
dc.date.accessioned2019-04-24T22:44:14Z
dc.date.available2019-04-24T22:44:14Z
dc.date.created2018-12
dc.date.issued2018-12
dc.date.submittedDecember 2018
dc.date.updated2019-04-24T22:46:24Z
dc.description.abstractDrug Discovery is a highly complicated process. On average, it takes 6 to 12 years to manufacture a drug and have the product released in the market. Even after a huge investment of money, time and hard work, one cannot assure the success of the drug after its release. The recent advancement in the field of machine learning helps us to reduce the risk in this field of science. This thesis aims at analyzing the applications of machine learning in the field of bio-medical science. Usage of a simpler organism for the implementation of the experiments is highly convenient. Therefore, a machine learning model to predict the chemical compounds effect on aging of Caenorhabditis elegans was proposed using the Drug Age database. This database includes the features of Molecular Descriptors and Gene Ontology. In this work, a new feature selection scheme is proposed for an efficient classification task using random forests. We explain the benefits of our feature selection method in comparison with the base-line support vector machine and artificial neural network classifiers. Secondly, another application of machine learning which is presented in the work is the prediction of Drug-Target Interaction using Weisfeiler-Lehman Neural Machine. Prediction of a possible interaction between a drug and a target enables the biochemists to speed up the process of target validation and discovery. A public-domain data set which corresponds to four different target protein types is used for the analysis purpose. The algorithm aims at creating a subgraph from the network formed by the drugs and targets which is then taken through graph labeling, resulting in the formation of an adjacency matrix. This matrix defines the presence of an interaction used for training a model. The results of the proposed method out performed the standard state of art approaches like the similarity based methods in terms of AUC.
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10735.1/6367
dc.language.isoen
dc.subjectAging
dc.subjectCaenorhabditis elegans
dc.subjectDrug development
dc.subjectMachine learning
dc.subjectDrug targeting
dc.subjectBiomedical engineering
dc.titleApplication of Machine Learning in Drug Discovery
dc.typeThesis
dc.type.materialtext
thesis.degree.departmentComputer Engineering
thesis.degree.grantorThe University of Texas at Dallas
thesis.degree.levelMasters
thesis.degree.nameMS

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ETD-5608-010-KADIYALA-9433.61.pdf
Size:
1.23 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.85 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.85 KB
Format:
Plain Text
Description: