InfoQ Homepage Storage Content on InfoQ
-
Understanding Architectures for Multi-Region Data Residency
Alex Strachan discusses challenges to build multi-region data storages, understanding why and when a business needs to do this, who are the real stakeholders, and who owns what.
-
Mind Your State for Your State of Mind
Pat Helland provides a partial taxonomy of diverse storage solutions available over a distributed cluster.
-
Storage Made Easy with Spring Boot, ECS, and PCF
Presenters discuss the journey to create a service broker, make it consumable as a Tile in PCF, using ECS S3 as object storage.
-
Alluxio: The Journey Thus Far and the Road ahead
Gene Pang introduces Alluxio, an open-source memory-speed virtual distributed storage system, integrations with other storage systems and some of the improvements they are working on.
-
Big Ideas: Decentralized Storage
David Vorick talks about the need for distributed/decentralized storage, real life use cases for distributed storage systems, dealing with data loss in a distributed system, overviewing IPFS and Sia.
-
Petabytes Scale Analytics Infrastructure @Netflix
Tom Gianos and Dan Weeks discuss Netflix' overall big data platform architecture, focusing on Storage and Orchestration, and how they use Parquet on AWS S3 as their data warehouse storage layer.
-
Beam aboard the Eclipse User Storage Service
Christopher Guindon and Denis Roy introduce Eclipse USS and its SDK, discussing plans for its future and showing how to get started using this service.
-
Elements of Scale
Ben Stopford examines tools, mechanisms and tradeoffs that allow a data architecture to scale, from disk formats to fully blown architectures for real-time storage, streaming and batch processing.
-
Zen: Pinterest's Graph Storage Service
This talk goes over the design motivation for Zen and describe its internals including the API, type system and HBase backend.
-
Solidifying the Cloud: How Google Backs up the Internet
Raymond Blum discusses some of the challenges, solutions and discarded alternatives in creating durable storage systems at Google scale.
-
The Code that Isn't There
Scott Vokes presents some lesser-known data structures and shows how probability distributions and content-addressable storage can become tools to shape global system behavior.
-
Building a Reliable Data Store
Jeremy Edberg presents the data stores used by Netflix and Reddit, some of the best practices and lessons for surviving outages.