Sparse and Dense Linear Algebra for Machine Learning on Parallel-RDBMS Using SQL

Marten, Dennis and Meyer, Holger and Dietrich, Daniel and Heuer, Andreas (2019) Sparse and Dense Linear Algebra for Machine Learning on Parallel-RDBMS Using SQL. Open Journal of Big Data (OJBD), 5 (1). pp. 1-34.

[img]
Preview
Text
OJBD_2019v5i1n01_Marten.pdf

Download (10MB) | Preview
Official URL: https://www.ronpub.com/ojbd/OJBD_2019v5i1n01_Marte...

Abstract

While computational modelling gets more complex and more accurate, its calculation costs have been increasing alike. However, working on big data environments usually involves several steps of massive unfiltered data transmission. In this paper, we continue our work on the PArADISE framework, which enables privacy aware distributed computation of big data scenarios, and present a study on how linear algebra operations can be calculated over parallel relational database systems using SQL. We investigate the ways to improve the computation performance of algebra operations over relational databases and show how using database techniques impacts the computation performance like the use of indexes, choice of schema, query formulation and others. We study the dense and sparse problems of linear algebra over relational databases and show that especially sparse problems can be efficiently computed using SQL. Furthermore, we present a simple but universal technique to improve intra-operator parallelism for linear algebra operations in order to support the parallel computation of big data.

Item Type: Article
Subjects: Forschungsthemen > Datenbanken für Assistenzsysteme
Forschungsthemen > Big Data Analytics
Autorenart > DBIS-Publikationen
Rahmenprojekte > PArADISE
Depositing User: Dbis Admin
Date Deposited: 07 Dec 2018 13:35
Last Modified: 07 Dec 2018 13:44
URI: http://eprints.dbis.informatik.uni-rostock.de/id/eprint/971

Actions (login required)

View Item View Item