Analytics Best Practices
     

Home

Open Data Services

Client Success Stories

News

White Papers

Training Courses

Newsletters

About the Principals

About Open Data

Jobs at Open Data

Contacting Open Data

 


Analytics Best Practices:
A One Day Course

Using Clouds to Provide
Scalable and On-Demand Analytics

Future Courses

We'll be giving courses and tutorials in cloud computing and analytics in the Fall. If you are interested in attending a course before then, it is also possible to arrange an in-house training course or tutorial. If you are interested, please contact us.

General Information

Cloud computing doesn't yet have a standard definition, but a good working definition is to define clouds as racks of commodity computers that provide on-demand resources and services over a network, usually the Internet, with the scale and the reliability of a data center. This one day course gives a quick introduction to cloud computing and analytics. It covers several different types of clouds, describes what is new about cloud computing, and discusses some of the advantages and disadvantages that clouds offer when building and deploying analytic models. It includes three case studies, a survey of vendors, and information about setting up your first cloud.

There are two different, but related, types of clouds: the first category of clouds provide computing instances on demand, while the second category of clouds provide computing capacity on demand. Both use the same underlying hardware, but the first is designed to scale out by providing additional computing instances, while the second is designed to support data- or compute-intensive applications by scaling capacity. Amazon's EC2 and S3 services are an example of the first type of cloud. The Hadoop system is an example of the second type of cloud. In this course we cover both types of clouds and how they are best used in analytic projects.

Course Benefits

  • Learn about the different types of clouds.
  • Learn the benefits of using clouds for analytics.
  • Learn how to structure your first analytic project using clouds.
  • Learn how to speed up the development of analytic models and the deployment of analytics into operational systems using clouds.

Who Should Attend?

  • Managers and executives interested in an introduction to cloud computing and its applications to analytics.
  • Statisticians, modelers and data mining professionals interested in learning how to do analytics using cloud computing.
  • Anyone thinking about using clouds for analytics.

Course Description

This one day course will cover the following topics:

  • Introduction to clouds
  • Strategies for deploying analytics over clouds
  • Benefits of using clouds for analytics
  • What is available: Survey of cloud vendors
  • Standards and the lack of standards for cloud computing
  • Building your first cloud
  • Case Study 1: Scoring data using analytic models and Amazon's EC2 cloud
  • Case Study 2: Identifying compromised web sites using Hadoop
  • Case Study 3: Processing financial times series using cloud computing

Schedule


TimeTopic
8-9 amContinental Breakfast (included)
9-10 amIntroduction to Clouds
10-11 amStrategies for Deploying Analytics Over Clouds
11-12 pmWhat is Available? A Vendor Survey
12-1 pmLunch (included)
1-2 pmCase Study 1: Scoring Data Using Amazon: A Case Study in Data Quality
2-3 pmCase Study 2: Identifying Compromises in Log Files Using Hadoop
3-4 pmYour First Cloud: Building your own, using Amazon, trying other vendors...
4-5 pmCase Study 3: Analyzing Financial Time Series
5 pmAdjourn