InfoQ Homepage S3 Content on InfoQ
-
AWS Announces Physical Data Transfer Terminal for High-Speed Uploads
AWS has recently introduced AWS Data Transfer Terminal, a new option for high-speed data uploads. Currently available only in the US, Data Transfer Terminals provide a physical location where customers can bring their storage devices for fast data transfer to and from the AWS cloud.
-
AWS Introduces S3 Tables Bucket: Is S3 Becoming a Data Lakehouse?
AWS has recently announced S3 Tables Bucket, managed Apache Iceberg tables optimized for analytics workloads. According to the cloud provider, the new option delivers up to 3x faster query performance and up to 10x higher transaction rates for Apache Iceberg tables compared to standard S3 storage.
-
Amazon S3 Introduces Metadata Feature for Improved Data Management and Querying in Preview
Amazon Web Services (AWS) has launched S3 Metadata, enhancing data management for S3 users. This new capability enables near real-time querying and analysis of S3 data via organized metadata updates. By adopting Apache Iceberg, it ensures interoperability and scalability, allowing businesses to efficiently leverage their data for analytics and AI applications.
-
From Aurora DSQL to Amazon Nova: Highlights of re:Invent 2024
The 2024 edition of re:Invent has just ended in Las Vegas. As anticipated, AI was a key focus of the conference, with Amazon Nova and a new version of Sagemaker among the most significant highlights. However, the announcement that generated the most excitement in the community was the preview of Amazon Aurora DSQL, a serverless, distributed SQL database with active-active high availability.
-
AWS Amplify and Amazon S3 Integration Simplifies Static Website Hosting
AWS has launched a seamless integration between Amplify Hosting and S3, revolutionizing static website deployment. This robust solution enables developers to quickly publish sites, configure custom domains, and manage updates effortlessly, all backed by Amazon's global CDN. Enjoy fast, secure hosting without complex setups—optimized for performance and efficiency.
-
Amazon Introduces Storage Browser for S3
Amazon has recently announced the alpha release of Storage Browser for Amazon S3, providing end users with a simple interface for accessing data stored in S3. The project is available in the AWS Amplify JavaScript and React client libraries.
-
Improving Distributed System Data Integrity with Amazon S3 Conditional Writes
AWS recently announced support for conditional writing in Amazon S3, allowing users to check for the existence of an object before creating it. This feature helps prevent overwriting existing objects when uploading data, making it easier for applications to manage data.
-
AWS Introduces Malware Detection for Object Uploads to Amazon S3
At the latest re:Inforce cloud security conference, AWS announced GuardDuty Malware Protection for Amazon S3. This new malware scanning feature for Amazon S3 enables teams to detect malware in new object uploads using Amazon GuardDuty.
-
Amazon S3 Unauthorized Request Billing Issue: an Empty S3 Bucket Can Dramatically Increase the Costs
Maciej Pocwierz, a senior software engineer, recently revealed a significant issue—an empty S3 bucket can unexpectedly result in a substantial AWS bill. In his case, nearly 100,000,000 S3 PUT requests were executed within a single day, leading to a bill that was far from negligible.
-
Cloudflare R2 Storage Introduces Event Notifications and Infrequent Access Storage Tier
During the recent Developer Week, Cloudflare announced that the object storage R2 now supports event notifications, which automatically trigger Workers in response to data changes. Additionally, the migration service Super Slurper now extends its support to Google Cloud Storage and a new infrequent access storage tier is available in private beta.
-
QCon London: How Duolingo Sent 4 Million Push Notifications in 6 Seconds During the Super Bowl Break
As part of the Super Bowl marketing campaign, Duolingo sent out 4 million mobile push notifications when the company’s five-second ad aired during the commercial break. At QCon London, Doulingo’s engineers presented the asynchronous AWS architecture responsible for broadcasting messages to millions of users across seven US cities.
-
Reddit Migrates Media Metadata from S3 and Other Systems into AWS Aurora Postgres
Reddit consolidated its media metadata storage into a new architecture using AWS Aurora Postgres. Previously, the company sourced media metadata from various systems, including directly from AWS S3. The new solution simplifies media metadata retrieval and handles 100k+ requests per second with latency below 5ms (p90).
-
Netflix Uses Metaflow to Manage Hundreds of AI/ML Applications at Scale
Netflix recently published how its Machine Learning Platform (MLP) team provides an ecosystem around Metaflow, an open-source machine learning infrastructure framework. By creating various integrations for Metaflow, Netflix already has hundreds of Metaflow projects maintained by multiple engineering teams.
-
Instacart Creates Real-Time Item Availability Architecture with ML and Event Processing
Instacart combined machine learning with event-based processing to create an architecture that provides customers with an indication of item availability in near real-time. The new solution helped to improve user satisfaction and retention by reducing order cancellations due to out-of-stock items. The team also created a multi-model experimentation framework to help enhance model quality.
-
Amazon OpenSearch Zero ETL with S3 and New OR1 Instances
Amazon has announced the preview of the Amazon OpenSearch Service's zero-extraction, transformation, and loading (ETL) integration with Amazon S3, offering a novel method to analyze operational logs in Amazon S3 and S3-based data lakes without the need to switch between services. Amazon also announced the new OR1 instances for Amazon OpenSearch Service.