Calagator

Support Calagator on Patreon

Get Involved: Blog | Forum | Code
Something not right? File an issue

Change 44123

Time

Attribute with previous and current values

Change #44123

2017-05-18

13:08:54

create Calagator::Event 1250471938 Papers We Love @ PDX - Gorilla: A Fast, Scalable, In-Memory Time Series Database Roll back

description	nil	→	<p>This paper will be presented by Johannes Hoff.</p> <p> This paper describes how Facebook stores 700 million data points (timestamp and value) per minute and keeps 26 hours of it available in memory for fast access. We will mainly look at the compression algorithms making this possible.</p> <p>Here is the abstract:</p> <p> """</p> <p>Large-scale internet services aim to remain highly available and responsive in the presence of unexpected failures. Pro-viding this service often requires monitoring and analyzing tens of millions of measurements per second across a large number of systems, and one particularly effective solution is to store and query such measurements in a time series database (TSDB).</p> <p>A key challenge in the design of TSDBs is how to strike the right balance between efficiency, scalability, and relia-bility. In this paper we introduce Gorilla, Facebook’s in-memory TSDB. Our insight is that users of monitoring sys-tems do not place much emphasis on individual data points but rather on aggregate analysis, and recent data points are of much higher value than older points to quickly detect and diagnose the root cause of an ongoing problem. Gorilla op-timizes for remaining highly available for writes and reads, even in the face of failures, at the expense of possibly drop-ping small amounts of data on the write path. To improve query efficiency, we aggressively leverage compression tech-niques such as delta-of-delta timestamps and XOR’d floating point values to reduce Gorilla’s storage footprint by 10x. This allows us to store Gorilla’s data in memory, reduc-ing query latency by 73x and improving query throughput by 14x when compared to a traditional database (HBase)-backed time series data. This performance improvement has unlocked new monitoring and debugging tools, such as time series correlation search and more dense visualization tools. Gorilla also gracefully handles failures from a single-node to entire regions with little to no operational overhead.</p> <p><br>"""</p> <p> PDF: <a href="http://www.vldb.org/pvldb/vol8/p1816-teller.pdf"></a><a href="http://www.vldb.org/pvldb/vol8/p1816-teller.pdf" class="linkified">http://www.vldb.org/pvldb/vol8/p1816-teller.pdf</a></p> <p> <a href="https://github.com/papers-we-love/portland/blob/master/code-of-conduct.md">Reminder: Papers We Love PDX has a code of conduct.</a></p>
id	nil	→	1250471938
source_id	nil	→	996336012
start_time	nil	→	2017-05-31 18:00:00 -0700
title	nil	→	Papers We Love @ PDX - Gorilla: A Fast, Scalable, In-Memory Time Series Database
url	nil	→	https://www.meetup.com/Papers-We-Love-PDX/events/240093042/
venue_id	nil	→	202395395

calagator.org 1.1.0