Redshift is a petabyte-scale, fully managed data warehouse service in the cloud. Amazon customers will be able to deploy their data warehouse solutions and analyze large amounts of information by using the same set of business intelligence and analytics applications they regularly use. Using a wide variety of innovations, such as its method for storing stores each column sequentially, Amazon will offer data warehouse services in the cloud ranging from hundreds of gigabytes to a petabyte.
In his blog, Werner Vogels, CTO of Amazon.com mentions:
Today, we are excited to announce the limited preview of Amazon Redshift, a fast and powerful, fully managed, petabyte-scale data warehouse service in the cloud. Amazon Redshift enables customers to obtain dramatically increased query performance when analyzing datasets ranging in size from hundreds of gigabytes to a petabyte or more, using the same SQL-based business intelligence tools they use today. Customers have been asking us for a data warehouse service for some time now and we’re excited to be able to deliver this to them.
In terms of price, a user can get started with a single 2TB Amazon Redshift node for $0.85/hour On-Demand, then pay by the hour, with no long-term commitments or upfront costs. With two main node types of offerings, High Extra Large (XL) Node and High Storage Eight Extra Large (8XL) DW, Amazon expects to reach a wide customer base with this offering, from medium to large organizations needing data warehousing services.
High Storage Extra Large (XL) DW Node:
CPU: 2 virtual cores
Memory: 15 GiB
Storage: 3 HDD with 2 TB of local attached storage
Disk I/O: Moderate
Price per node: $0.85 per hour (On-demand pricing)
High Storage Eight Extra Large (8XL) DW Node:
CPU: 16 virtual cores
Memory: 120 GiB
Storage: 24 HDD with 16 TB of local attached storage
Network: 10 Gigabit Ethernet with support for cluster placement groups
Disk I/O: Very High
Price per node: $6.80 per hour (On-demand pricing)
Additionally, Redshift integrates with Amazon Simple Storage Service (S3) and Amazon DynamoDB, with support for Amazon Relational Database Service (RDS) and Amazon Elastic MapReduce coming soon. It is also possible to connect SQL-based clients or business intelligence tools to Amazon Redshift with the use of standard PostgreSQL drivers over JDBC or ODBC connections, and so far, Jaspersoft and MicroStrategy have already certified Amazon Redshift for use with their platforms.
Consumerization and technology transparency are reaching the data warehouse space, and apparently for good.