文章基本信息

标题：HyPer: Adapting Columnar Main-Memory Data Management for Transactional AND Query Processing
本地全文：下载
作者：Alfons Kemper Thomas Neumann Florian Funke Viktor Leis Henrik M¡§uhe TU M¡§unchen ; Faculty of Informatics, firstname.lastname@in.tum.de
期刊名称：Bulletin of the Technical Committee on Data Engineering
出版年度：2012
卷号：35
期号：01
出版社：IEEE Computer Society
摘要：Traditionally, business applications have separated their data into an OLTP data store for high through- put transaction processing and a data warehouse for complex query processing. This separation bears severe maintenance and data consistency disadvantages. Two emerging hardware trends allow the con- solidation of the two disparate workloads onto the same database state on one system: the increasing main memory capacities of several terabytes per server and multi-threaded processing based on multi- core parallelism. The prevalent data representation of hybrid OLTP&OLAP main memory database systems is columnar in order to achieve best possible query execution performance for OLAP applica- tions. In order to shield the OLTP transaction processing from long-running queries without costly lock- ing/latching, all queries are executed on an arbitrarily recent snapshot of the data. The paper contrasts several snapshotting techniques for columnar data (twin block, versioning) with the hardware-supported shadow paging we employ in the HyPer system. While OLAP-query processing can rely mostly on colum- nar scans, high OLTP throughput requirements necessitate index techniques for exact match and small range queries. Despite the ever growing capacity, main memory is still a scare resource. Therefore, com- pressing main memory resident databases is beneficial. The paper will devise techniques that achieve good compression ratios without hurting the mission-critical OLTP throughput by adaptively separating cold (i.e. immutable) data for aggressive compression from the hot (i.e. mutable) working set data that remains uncompressed.