Marten, Dennis and Meyer, Holger and Dietrich, Daniel and Heuer, Andreas (2019) Sparse and Dense Linear Algebra for Machine Learning on Parallel-RDBMS Using SQL. Open Journal of Big Data (OJBD), 5 (1). pp. 1-34.
|
Text
OJBD_2019v5i1n01_Marten.pdf Download (10MB) | Preview |
Abstract
While computational modelling gets more complex and more accurate, its calculation costs have been increasing alike. However, working on big data environments usually involves several steps of massive unfiltered data transmission. In this paper, we continue our work on the PArADISE framework, which enables privacy aware distributed computation of big data scenarios, and present a study on how linear algebra operations can be calculated over parallel relational database systems using SQL. We investigate the ways to improve the computation performance of algebra operations over relational databases and show how using database techniques impacts the computation performance like the use of indexes, choice of schema, query formulation and others. We study the dense and sparse problems of linear algebra over relational databases and show that especially sparse problems can be efficiently computed using SQL. Furthermore, we present a simple but universal technique to improve intra-operator parallelism for linear algebra operations in order to support the parallel computation of big data.
Item Type: | Article |
---|---|
Subjects: | Forschungsthemen > Datenbanken für Assistenzsysteme Forschungsthemen > Big Data Analytics Autorenart > DBIS-Publikationen Rahmenprojekte > PArADISE |
Depositing User: | Dbis Admin |
Date Deposited: | 07 Dec 2018 13:35 |
Last Modified: | 07 Dec 2018 13:44 |
URI: | https://eprints.dbis.informatik.uni-rostock.de/id/eprint/971 |
Actions (login required)
View Item |