InfoQ Homepage Database Content on InfoQ
-
Indestructible Storage in the Cloud with Apache Bookkeeper
At Salesforce, we required a storage system that could work with two kinds of streams, one stream for write-ahead logs and one for data. But we have competing requirements from both of the streams. Being the pioneers in cloud computing, we also required our storage system to be cloud-aware as the requirements of availability and durability are ever more increasing.
-
How Optimizing MLOps Can Revolutionize Enterprise AI
In this article, author Monte Zweben discusses data science architecture, containerization, and how new solutions like Feature Store can help with the full lifecycle of machine learning processes.
-
The Future of Data Engineering
Chris Riccomini examines the current and future states of the art in data pipelines, data streaming, and data warehousing. He presents a six-stage evolution that data ecosystems follow, from a simple monolith to a complex data-microwarehouse architecture as the data engineers who manage them solve problems and clarify their roles as infrastructure engineers, rather than data stewards.
-
Who is on the Team?
Ahmad Fahmy and Cesario Ramos take the changes to the new Scrum Guide as an opportunity to explore what it means to be "on a team." They draw on research to create an ACID test to differentiate who is on the team and who isn't. They discuss different mental models around the idea of a team with the hopes that you take this opportunity to discuss and elevate the roles within your organization.
-
The Evolution of Precomputation Technology and its Role in Data Analytics
In this article, author Yang Li discusses the importance of precomputation techniques in databases, OLAP and data cubes, and some of the trends in using precomputation in big data analytics.
-
Performance Tuning Techniques of Hive Big Data Table
In this article, author Sudhish Koloth discusses how to tackle performance problems when using Hive Big Data tables.
-
The Brain is Neither a Neural Network Nor a Computer: Book Review of The Biological Mind
Underlying much of artificial intelligence research is the idea that the essence of an individual resides in the brain. This is contrary to neuroscience which has discovered that a brain cannot work independently from the body and its environment. Understanding this enables us see what is reasonable to expect from artificial intelligence, as well as technology designed to improve human life.
-
Building an SQL Database Audit System Using Kafka, MongoDB and Maxwell's Daemon
In this article, the author discusses the importance of a database audit logging system outside of traditional built-in data replication, using technologies like Kafka, MongoDB, and Maxwell's Daemon.
-
Overcoming Data Scarcity and Privacy Challenges with Synthetic Data
In this article, the author discusses the importance of using synthetic data in data analytics projects, especially in financial institutions, to solve the problems of data scarcity and more importantly data privacy.
-
Understanding Similarity Scoring in Elasticsearch
In this article, the author discusses the importance of Relevancy Score for developing Search Engine solutions and how to calculate the relevancy score using Elasticsearch's similarity module.
-
Beyond the Database, and beyond the Stream Processor: What's the Next Step for Data Management?
Databases have been around forever with the same shape: you make a request to your data and then you receive an answer. Now, stream processors came along with a different approach: data isn’t locked up, it is in motion. Understand how stream processors and databases relate and why there is an emerging new category of databases that focus on data that stays in place as well as data that moves.
-
The End of the Privacy Shield Agreement Could Lead to Disaster for Hyperscale Cloud Providers
The recent ending of the Privacy Shield agreement by the European Court of Justice (ECJ) might impact cloud adoption. This article looks at the demise of this agreement, and possible solutions.