Amazon’s Cloud Raining Gifts for 2010



By Adam Kraut

January 20, 2010 | Inside the Box | Over the 2009 holiday shopping season, Amazon.com processed an estimated 73 items per second and 6.3 million items on a typical day. That may not come as much of a surprise. But what you may not know is that it’s all powered by cloud computing from Amazon Web Services (AWS). And Amazon isn’t even the biggest AWS customer, according to CEO Werner Vogels. Amazon’s Infrastructure as a Service is so far ahead in the cloud that competitors are struggling to keep pace. AWS is not about to rest on its laurels. Fortunately for the rapidly growing number of AWS users, it is aggressively innovating and improving its services. One might even say that during the 2009 holiday season, it left us some presents under the tree.

One of the major barriers to cloud adoption in enterprise IT departments, as with running data through any hosted service, comes down to security. Most of it boils down to IT staff protecting their jobs or downright paranoia. As my colleague Chris Dagdigian said at Bio-IT World Europe (see “The C Word,” Bio•IT World, Nov 2009), “It’s very funny to see people demanding security practices on the cloud that they’re unable to run in-house.”

To address those customers’ concerns, Amazon recently announced Virtual Private Cloud (VPC) service. VPC offers a secure bridge between existing IT infrastructure and cloud resources through a Virtual Private Network (VPN) connection. With VPC enabled, your company’s security services and firewalls can extend to cover AWS compute resources running on Elastic Cloud Compute (EC2). While this should ease concerns at pharmaceutical companies and other security-conscious IT departments, it does come with the caveat of VPN overhead. If your internal IT bandwidth is poor, it will become the bottleneck when pushing terabytes of data through VPC.

The Data Flow Problem

Another major announcement from AWS is the Import/Export service for Amazon’s Simple Storage Service (S3). Let’s say you have 200 TB of data you want to load into S3 for analysis on EC2. No matter how fat your link to Amazon, it’s going to take a long time to move that data. Dagdigian alluded to this service last April at the Bio-IT World Expo, when he said: “If the ingest problem can be solved… I see petabyte-scale datasets that would flock to utility storage services.” Amazon has answered the call with S3 Import/Export. You put your data on USB or SATA hard drives and send it to Amazon through standard mail. Amazon takes those disks and physically loads them up in their datacenter running S3. After a couple of days, your data are available for processing on EC2 or distributing to customers and colleagues.

When it comes to raw data S3 is a fantastic solution, but what about all the data living in relational databases? Amazon knows that an infinitely scalable relational database is impossible to engineer. As an alternative, AWS has offered SimpleDB, a non-relational distributed database service. In addition there’s Elastic Block Store (EBS) providing elastic disk storage that many customers use underneath of their own managed MySQL, Postgres, and Oracle instances. Amazon realized their customers were spending too much time managing MySQL on top of EC2 and EBS. Relational Database Service (RDS) gives AWS users an API for a self-contained MySQL database instance without having to launch new EC2 servers or deal with EBS volumes and snapshots. Currently in beta, RDS supports up to 20 databases per customer each allowing up to 1TB of storage. There’s nothing to install, configure, or tune. Simply issue a few commands to launch a fully functional database server with the same on-demand pricing we’ve come to appreciate from AWS.

Perhaps the most interesting announcement from AWS is a new pricing model called Spot Instances. While the 10 cents/hour on-demand pricing of EC2 is what made the service initially so popular, James Hamilton, vice president of AWS, calls Spot Instances “a fundamental innovation in how computation is sold.” Spot pricing allows customers to bid on instances effectively balancing the peak and off-peak capacity of EC2. Under this model the spot market drives EC2 pricing. If demand is low you pay less, if demand is high you pay more. Workloads with soft time constraints such as compression, encryption, and exhaustive sampling can be processed at a potentially lower cost than standard EC2 rates.

Amazon is actively engaged in making life easier in the AWS world. According to Vogels, it’s not just about enterprise cost savings but agility in the cloud. “This is not a standard product that is finished. It’s been a continuous improvement process since Day 1.”

Adam Kraut is a scientific consultant at the Bio Team. He can be reached at kraut@bioteam.net

Click here to login and leave a comment.  

0 Comments

Add Comment

Text Only 2000 character limit

Page 1 of 1



White Papers & Special Reports

sgi - whp 1
Turning Genomics Data into Practical Insight
Sponsored by SGI

With worldwide sequencing capacity approaching 13 quadrillion DNA bases annually turning genomics data into knowledge is a true computational challenge. Read this paper and learn how the SGI UV coherent shared memory platform can:  

  • Speed results time while cost competitively tackling the most difficult computational problems across all omics disciplines. 
  • Push performance by scaling to extraordinary levels, up to 256 sockets (2,560 cores, 4,096 threads) per single system (one OS image). 

Provide support for up to 16TB of coherent shared memory in a single system image enabling extreme efficiency across a wide range of compute demands. 



accerlys-logo_2012_wh
New Complimentary Market Survey…
Collaborations and Communications Within Drug Discovery Research
Sponsored by Accelrys
This survey was conducted by the Cambridge Healthtech Media Group in January, 2012. It was sponsored by Accelrys related to their HEOS initiative to gather valid information around externalizing collaborative research while improving communications in the cloud. With 310 qualified industry respondents the survey findings reveal useful usage and trends patterns.  An insightful follow-on discussion and webinar related to this survey, and the HEOS by Scynexis SaaS portal is also available on the Bio-IT World website for complementary viewing.
 


Job Openings

tessella logo 
Scientific Software Engineer
Boston MA
$70,000 to $95,000
 

Tessella delivers software engineering and consulting services to leading pharmaceutical and biotech companies. We are recruiting Software Engineersto work with skilled bioinformaticians and scientists to identify business needs and recommend and develop technical solutions. Applicants require BS, MS or PhD in bioinformatics, biology or chemistry and 2+ years of software development in either: Java, C#, C++, C or VB.NET. 

Apply at http://jobs.tessella.com   

 

oxford nanopore logo 


 Early Access Collaborations Managers
Oxford Nanopore Technologies is developing a novel technology, GridIONTM for the direct, electronic analysis of DNA/RNA and other analytes.  As the system approaches the market, we are building a team of technically knowledgeable, highly motivated candidates with excellent customer service and facilitation skills to join our company as Collaboration Managers.  This is a unique opportunity to work with world-leading genomics customers throughout the early adoption phase of a new generation of DNA sequencing technology.. This is a facilitative, enabling role with responsibility for managing technology development collaborations with key customers at leading genomics institutions.  It will include long term management of the collaboration plan and milestones and associated meetings and documentation. Click here to find out more and apply   

Oxford Nanopore's GridION technology, VP, Sales and Marketing Oxford Nanopore Technologies is a fast-moving technology company that is developing a novel electronic molecular analysis technology. The technology is adaptable for the analysis of DNA/RNA, proteins, chemicals and other molecules.  It is therefore suitable for use in a variety of markets including scientific research and clinical applications.  As the technology approaches the market, Oxford Nanopore is seeking a visionary VP of sales and marketing to join the senior team.  The candidate will embrace the opportunities afforded by entering the market with a truly disruptive technology that has the potential to expand the number of users and the variety of applications in each target market.  This is a rare opportunity to influence the commercial strategy at an early phase of its commercial lifetime, in a well funded company.  Oxford Nanopore welcomes applications from candidates with a track record of high-level strategic commercial  leadership, who wish to apply a fresh approach to existing markets.  Experience in Life Sciences/DNA sequencing is central to this role, however we will consider your application if you have experience of disruptive technologies in other related industries.  We are particularly interested in candidates with strong expertise in the use of digital technologies for sales and marketing of scientific/technical products.  Click to  Apply  


 

For reprints and/or copyright permission, please contact  Tim McLucas, (781) 972-1342, tmclucas@healthtech.com .