Now showing items 1-5 of 5
Determining the Impact of Missing Values on Blocking in Record Linkage
(Springer Verlag, 2019-03-20)
Record linkage is the process of integrating information from the same underlying entity across disparate data sets. This process, which is increasingly utilized to build accurate representations of individuals and ...
Transforming Entity-Relationship Diagrams to Relational Schemas Using a Graph Grammar Formalism
(Institute of Electrical and Electronics Engineers Inc., 2018-12)
As a formal tool extended from string grammars, graph grammars provide an intuitive yet formal way to define and transform various visual languages. This paper proposes an approach to transform Entity-Relationship diagrams ...
When Algorithmic Predictions Use Human-Generated Data: A Bias-Aware Classification Algorithm for Breast Cancer Diagnosis
(INFORMS: Institute for Operations Research and the Management Sciences, 2018-12-20)
When algorithms use data generated by human beings, they inherit the errors stemming from human biases, which likely diminishes their performance. We examine the design and value of a bias-aware linear classification ...
Multistream Classification for Cyber Threat Data with Heterogeneous Feature Space
(Association for Computing Machinery, Inc, 2019-05)
Under a newly introduced setting of multistream classification, two data streams are involved, which are referred to as source and target streams. The source stream continuously generates data instances from a certain ...
Invited Paper: Semantic IoT Data Description and Discovery in the IoT-Edge-Fog-Cloud Infrastructure
(Institute of Electrical and Electronics Engineers Inc., 2019-04-04)
Many IoT systems are data intensive, where a large volume of data steadily get generated from a large number of sensors in the system. These data are continuous, thus, how to store and manage them is an important issue. ...