InfoQ Homepage Data Content on InfoQ
-
The Future of Data Engineering: Chris Riccomini at QCon San Francisco
At QCon San Francisco 2019, Chris Riccomini presented “The Future of Data Engineering”. The key takeaway of his talk is about reaching an end goal with data engineering, which is having a fully automated decentralized data warehouse.
-
Lessons Learned from Innovating at Google: Frame the Problem, Use Data, and Define the MVP
The truly great, innovative, useful ideas come mostly from two sources: your target users, and people working in the organization - not necessarily those with a "product manager" hat. Experimentation can help us to materialize ideas into actual products and technology. Framing the problem, using data, and defining the MVP can help us to increase the chance of success in innovation.
-
Microsoft Announces Public Preview of Azure Data Share
Microsoft has announced the public preview of Azure Data Share, which provides capabilities to share data with users in the own organization, as well as with other organizations. Essentially, Microsoft positions the recently announced service as a big data tool, though it’s also possible to share individual files.
-
DevOps Needs Continuous Improvement to Succeed
Continuous improvement is not a new thing and is often misunderstood. To be successful, we can take guidance from agile principles and apply them to the DevOps world, argued Mirco Hering, managing director at Accenture. At Agile Portugal 2019 he spoke about DevOps leadership in the age of agile.
-
Los Angeles CTO Roundtable about AI and Data
The recent "Leaders in Data CTO Roundtable" in Los Angeles included discussions about an artificial intelligence (AI) framework/platform for business, data in the next five years, data software stacks, and acquiring data talent.
-
Introducing TensorFlow Privacy, a New Machine Learning Library for Protecting Sensitive Data
In a recent blog post, TensorFlow announced TensorFlow Privacy, an open source library that allows researchers and developers to build machine learning models that have strong privacy. Using this library ensures user data are not remembered through the training process based upon strong mathematical guarantees.
-
Microsoft Announces New Azure Analytics Services ADLS, ADX and More
Microsoft has announced the general availability of two new Azure analytics services - Azure Data Lake Storage Gen2 (ADLS) and Azure Data Explorer (ADX). Furthermore, Microsoft also announced the preview of Azure Data Factory Mapping Data Flow.
-
Microsoft Announces the General Availability of Azure Data Box Disk
In a recent blog post, Microsoft has announced the general availability of Azure Data Box Disk, an SSD-based solution for offline data transfer to Azure. Furthermore, Microsoft also announced the public preview of Azure Data Box Blob Storage – a feature allowing customers to copy data to Blob Storage on a Data Box.
-
Google Cloud Announces Transfer Appliance in Beta for Cloud Data Migrations in the EU
Google announced that Transfer Appliance, a high-capacity server that lets customers move large amounts of data to Google Cloud Platform (GCP) quickly and securely, is available in beta in the European Union (EU). Google will handle the data transfer with Transfer Appliance in GCP in the EU, and data will not leave the EU.
-
Bank of America - Blockchain Data Storage Patent Released
On April 12, the United States Patent and Trademark Office (USPTO) released a patent filing from the Bank of America outlining their plans for a permissioned blockchain implementation that enables personal and business data sharing. A user will authorize service providers to securely access their data, but only for the specific records they have access to.
-
Baidu Release Huge Dataset "ApolloScape" for Autonomous Vehicle Research
Baidu, the Chinese internet giant, has released ApolloScape, a massive data-set for autonomous vehicle simulation and research. ApolloScape is an order of magnitude more complex than similar open data-sets. It is part of Apollo, Baidu's vehicle simulation and hardware platform. With this release, Baidu strengthens its position in the automated driving sector.
-
Chile’s Energy Regulator to Adopt Blockchain
PV magazine, a publication focused on reporting photovoltaics (solar power generation), has announced the Chile Energy Regulator is set to adopt blockchain in March 2018. The regulator plans to use blockchain technology to transparently record market prices, marginal costs, fuel prices and compliance documentation.
-
Data-Driven Thinking for Continuous Improvement
Organizations need an objective way to measure performance and tie actions back to business outcomes to improve continuously. Avvo uses a data-driven decision framework with an autonomous team model and a practice of retrospectives to help people make better decisions and proposals for continuous improvement.
-
LinkedIn Ordered to Allow Scraping of Public Profile Data
A United States federal judge has ruled that Microsoft’s LinkedIn cannot block third party web scrapers from scraping data from publicly available profiles.
-
Netflix Introduces Hollow, a Java Library for Processing In-Memory Datasets
Netflix recently introduced Hollow, a Java library and toolset for processing in-memory datasets that aren’t characterized as “big data.” A single producer provides datasets from which many consumers have read-only access. The communication mechanism between producer and consumer includes real-time dataset changes.