InfoQ Homepage Data Storage Content on InfoQ
-
Speed of Apache Pinot at the Cost of Cloud Object Storage with Tiered Storage
Neha Pawar discusses how to query data on the cloud directly with sub-seconds latencies, diving into data fetch and optimization strategies, challenges faced and learnings.
-
EBtree - Design for a Scheduler and Use (Almost) Everywhere
Andjelko Iharos explores the goals, design and the choices behind the implementations of EBtree, and how they produce a very fast and versatile data storage for many of HAProxys advanced features.
-
Algorithms behind Modern Storage Systems
Alex Petrov talks about modern storage system approaches, discussing storage internals, evaluation techniques to choose a database best suitable for a certain data.
-
Beam aboard the Eclipse User Storage Service
Christopher Guindon and Denis Roy introduce Eclipse USS and its SDK, discussing plans for its future and showing how to get started using this service.
-
Elements of Scale
Ben Stopford examines tools, mechanisms and tradeoffs that allow a data architecture to scale, from disk formats to fully blown architectures for real-time storage, streaming and batch processing.
-
Zen: Pinterest's Graph Storage Service
This talk goes over the design motivation for Zen and describe its internals including the API, type system and HBase backend.
-
Solidifying the Cloud: How Google Backs up the Internet
Raymond Blum discusses some of the challenges, solutions and discarded alternatives in creating durable storage systems at Google scale.
-
How Facebook Scales Big Data Systems
Jeff Johnson introduces Apollo, a hierarchical NoSQL data system meant to deal with Facebook's distributed storage needs.
-
The Code that Isn't There
Scott Vokes presents some lesser-known data structures and shows how probability distributions and content-addressable storage can become tools to shape global system behavior.
-
Building a Reliable Data Store
Jeremy Edberg presents the data stores used by Netflix and Reddit, some of the best practices and lessons for surviving outages.
-
Building Google Cloud Storage
Nathan Herring presents the available storage options at Google, the ideal characteristics of a storage service, and the actual implementation of Google Cloud Storage.
-
Facebook News Feed: Social Data at Scale
Serkan Piantino discusses news feeds at Facebook: the basics, infrastructure used, how feed data is stored, and Centrifuge – a storage solution.