NoSQL Content on InfoQ

Articles

Development

Picking an Active-Active Geo Distribution Strategy: Comparing Merge Replication and CRDT

Modern distributed applications are fuelling the growing demand for distributed active-active, multi-master databases. While most popular databases support multi-master deployment, different databases employ different techniques. LWW, MVCC, merge replication and CRDTs deliver eventual consistency, offering read and write access with local latency and remaining available during network partitions.

Roshan Kumar
on Jun 12, 2018
AI, ML & Data Engineering

Columnar Databases and Vectorization

In this article, author Siddharth Teotia discusses the Dremio database, which is based on Apache Arrow with vectorization capabilities.

Siddharth Teotia
on May 27, 2018
Architecture & Design

Polyglot Persistence Powering Microservices

At Netflix, the cloud database engineering team is responsible for providing several flavors of data persistence as a service to microservice development teams. Roopa Tangirala explained how her team has created self-service tools that help developers easily implement the appropriate data store for each project's needs.

Thomas Betts, Roopa Tangirala
on Apr 10, 2018
AI, ML & Data Engineering

Introducing FaunaDB Serverless Cloud

FaunaDB Serverless Cloud is the managed version of FaunaDB, a serverless, object-relational, globally replicated, strongly consistent, temporal database that can be deployed on multiple clouds, such as AWS, GCP, and Azure, or on premises.

Matt Freels
on Jun 14, 2017
Java

Introducing Reladomo - Enterprise Open Source Java ORM, Batteries Included! (Part 2)

Goldman Sachs is widely known as a leader in investment banking, but it is also very much a leading technology firm. Continuing our exploration of Reladomo, the primary Java ORM used at GS and now open source, GS Technology Fellow Mohammad Rezaei looks at advanced features such as sharding, caching, bitemporal access, performance, and testing.

Mohammad Rezaei
on Jun 13, 2017
AI, ML & Data Engineering

Pascal Desmarets on NoSQL Data Modeling Best Practices

NoSQL databases are specialized to store different types of data, such as key-value, document, column-family, time-series, graph, and IoT data. Pascal Desmarets talks about how data modeling in NoSQL databases compares with modeling in relational databases.

Srini Penchikala
on May 01, 2017
AI, ML & Data Engineering

Big Data Processing Using Apache Spark - Part 6: Graph Data Analytics with Spark GraphX

In this article, author Srini Penchikala discusses the Apache Spark GraphX library, which is used for graph data processing and analytics. The article includes sample code for graph algorithms such as PageRank, Connected Components, and Triangle Counting.

Srini Penchikala
on Mar 14, 2017
AI, ML & Data Engineering

Analysis and Mitigation of NoSQL Injections

NoSQL data storage systems lack the security measures and awareness that are required for data protection. Because code analysis alone is insufficient to prevent attacks in today's typical large-scale deployment, certain mitigations should be done throughout the entire software life cycle.

Anton Puzanov, Aviv Ron, Alexandra Shulman-Peleg
on Jan 18, 2017
AI, ML & Data Engineering

Cassandra: The Definitive Guide, 2nd Edition Book Review and Interview

Cassandra: The Definitive Guide, 2nd Edition, by Jeff Carpenter and Eben Hewitt, covers version 3.0 of the Cassandra NoSQL database. The authors discuss several important topics related to this popular database, including data modeling and Cassandra architecture. InfoQ spoke with Jeff Carpenter about the book and about Cassandra's current features and future roadmap.

Srini Penchikala
on Jan 05, 2017
AI, ML & Data Engineering

Chris Fregly on the PANCAKE STACK Workshop and Data Pipelines

InfoQ interviews Chris Fregly, organizer of the 4000+ member Advanced Spark and TensorFlow Meetup, about the PANCAKE STACK workshop, Spark, and building data pipelines for machine learning.

Dylan Raithel
on Aug 29, 2016
AI, ML & Data Engineering

Virtual Panel: Current State of NoSQL Databases

NoSQL databases have been around for several years now and have become a common choice of data storage for managing semi-structured and unstructured data. These databases offer many advantages in terms of linear scalability and better performance for both data writes and reads. InfoQ spoke with four panelists to get different perspectives on the current state of NoSQL databases.

Srini Penchikala
on Aug 02, 2016
AI, ML & Data Engineering

Martin Van Ryswyk on DataStax Enterprise Graph Database

DataStax recently announced a new product called DataStax Graph to store graph data models. It is based on the open source Titan graph database and uses the Gremlin query language from the Apache TinkerPop framework. InfoQ spoke with Martin Van Ryswyk about the new product.

Srini Penchikala
on May 17, 2016
