Classified Enhancement Model for Big Data Storage Reliability Based on Boolean Satisfiability Problem
Abstract
Disk reliability is a serious problem in big data infrastructure. Although the reliability of disk drives has improved greatly over the past few years, they remain the most vulnerable core components in a server. When they fail, the consequences can be severe: recovering the data can take days, and sometimes the data is lost forever. Such outcomes are unacceptable for critical data. XOR parity is a typical method for generating reliability syndromes and thereby improving data reliability, yet in practice data can still be lost. In most storage systems, reliability is improved by allocating additional disks in Redundant Arrays of Independent Disks (RAID), which increases hardware costs and is therefore difficult to sustain in cost-constrained environments. Improving data integrity without raising hardware costs has thus attracted considerable interest among big data researchers. The challenge is that, when creating non-traditional RAID geometries, care must be taken to respect data-dependence relationships so that the new RAID strategy actually improves reliability, and this is an NP-hard problem. In this paper, we present an approach that characterizes these challenges as high-dimensional variants of the n-queens problem, enabling practical solutions via the SAT solver MiniSAT, and we use a greedy algorithm to analyze each queen's attack domain as the basis for reliability syndrome generation. Extensive experiments show that the proposed approach is feasible in software-defined data centers and that the algorithm's performance meets the current requirements of big data environments. © 2019, Springer Science+Business Media, LLC, part of Springer Nature.
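To give a concrete sense of the SAT formulation the abstract refers to, the sketch below encodes the classical two-dimensional n-queens problem as a DIMACS CNF file that an off-the-shelf MiniSAT binary can solve. This is a minimal simplification for illustration only: the paper works with high-dimensional variants of the problem, and the variable numbering, function names, and file names here are assumptions of this sketch, not part of the authors' implementation.

```python
from itertools import combinations

def nqueens_cnf(n):
    """Encode the 2-D n-queens problem as CNF clauses.
    Variable numbering (an assumption of this sketch):
    cell (row r, column c) -> r * n + c + 1."""
    var = lambda r, c: r * n + c + 1
    clauses = []

    # At least one queen in every row.
    for r in range(n):
        clauses.append([var(r, c) for c in range(n)])

    # At most one queen per row and per column.
    for i in range(n):
        for a, b in combinations(range(n), 2):
            clauses.append([-var(i, a), -var(i, b)])  # row i
            clauses.append([-var(a, i), -var(b, i)])  # column i

    # At most one queen per diagonal (both directions).
    cells = [(r, c) for r in range(n) for c in range(n)]
    for (r1, c1), (r2, c2) in combinations(cells, 2):
        if abs(r1 - r2) == abs(c1 - c2):
            clauses.append([-var(r1, c1), -var(r2, c2)])

    return clauses

def write_dimacs(clauses, n_vars, path):
    """Write the clauses in DIMACS CNF format, e.g. for `minisat queens8.cnf model.txt`."""
    with open(path, "w") as f:
        f.write(f"p cnf {n_vars} {len(clauses)}\n")
        for clause in clauses:
            f.write(" ".join(map(str, clause)) + " 0\n")

if __name__ == "__main__":
    n = 8
    write_dimacs(nqueens_cnf(n), n * n, "queens8.cnf")
```

A satisfying assignment returned by MiniSAT maps directly back to board cells (true variables mark queen positions), which is the kind of solution the paper's higher-dimensional encoding would feed into its greedy attack-domain analysis.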