Free Trial

Safari Books Online is a digital library providing on-demand subscription access to thousands of learning resources.

Overview

Gain a clear perspective on the future of big data—and all the analytics, architectures, techniques, tools, and technologies you need to use data successfully. With this complete video compilation, you’ll get a front-row seat to the keynotes, workshops, and sessions at O’Reilly’s Strata Conference Santa Clara 2014. You can download these videos or stream them through our HD player.

Subscriber Reviews

Average Rating: 4.066666666666666 out of 5 rating Based on 15 Ratings

"Very little new material covered" - by Anonymous on 22-JUN-2014
Reviewer Rating: 1 star rating2 star rating3 star rating4 star rating5 star rating
Very little new material covered compared to Strata 2013 session "Hadoop Data Warehousing with Hive - Dean Wampler".


Report as Inappropriate

"Government Segment Marketing Manager" - by vonbernetta on 10-APR-2014
Reviewer Rating: 1 star rating2 star rating3 star rating4 star rating5 star rating
The ultimate resource on Big Data.
Report as Inappropriate

"I can't" - by I can't view on 02-APR-2014
Reviewer Rating: 1 star rating2 star rating3 star rating4 star rating5 star rating
I can't view this video and any other too on windows 8

The video does not start, only show Loading

Report as Inappropriate

Table of Contents

Chapter/Selection

Time

Tutorials

Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 1

Preview

00:47:16

Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 2

Preview

00:42:36

Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 3

Preview

00:49:01

Introduction to Machine Learning with IPython and scikit-learn - Olivier Grisel - Part 4

Preview

00:35:02

IPython In Depth - Brian Granger and Fernando Prez - Part 1

Preview

01:03:59

IPython In Depth - Brian Granger and Fernando Prez - Part 2

Preview

00:50:08

IPython In Depth - Brian Granger and Fernando Prez - Part 3

Preview

00:47:57

Building a Data Platform - John Akred, Richard Williamson, and Stephen O'Sullivan - Part 1

Preview

00:43:01

Building a Data Platform - John Akred, Richard Williamson, and Stephen O'Sullivan - Part 2

Preview

00:46:04

Building a Data Platform - John Akred, Richard Williamson, and Stephen O'Sullivan - Part 3

Preview

00:53:31

Building a Data Platform - John Akred, Richard Williamson, and Stephen O'Sullivan - Part 4

Preview

00:33:13

Design Thinking for Dummies (Data Scientists) - Michael Stringer, Dean Malmgren, and Laurie Skelly - Part 1

Preview

00:21:53

Design Thinking for Dummies (Data Scientists) - Michael Stringer, Dean Malmgren, and Laurie Skelly - Part 2

Preview

00:21:54

Design Thinking for Dummies (Data Scientists) - Michael Stringer, Dean Malmgren, and Laurie Skelly - Part 3

Preview

00:39:54

Dissecting Data Science Algorithms using Spreadsheets - John Foreman - Part 1

Preview

00:47:05

Dissecting Data Science Algorithms using Spreadsheets - John Foreman - Part 2

Preview

00:46:05

Dissecting Data Science Algorithms using Spreadsheets - John Foreman - Part 3

Preview

00:44:07

Dissecting Data Science Algorithms using Spreadsheets - John Foreman - Part 4

Preview

00:29:28

Introduction to Hadoop 2.0 - Rich Raposa - Part 1

Preview

00:52:02

Introduction to Hadoop 2.0 - Rich Raposa - Part 2

Preview

00:33:39

Introduction to Hadoop 2.0 - Rich Raposa - Part 3

Preview

00:57:53

Introduction to Hadoop 2.0 - Rich Raposa - Part 4

Preview

00:40:29

Large-scale Machine Learning Cookbook using GraphLab - Carlos Guestrin - Part 1

Preview

00:37:57

Large-scale Machine Learning Cookbook using GraphLab - Carlos Guestrin - Part 2

Preview

00:40:39

Large-scale Machine Learning Cookbook using GraphLab - Carlos Guestrin - Part 3

Preview

00:48:23

Large-scale Machine Learning Cookbook using GraphLab - Carlos Guestrin - Part 4

Preview

00:35:46

From Scattered to Scatterplots: An Introduction to d3.js - Scott Murray - Part 1

Preview

00:33:12

From Scattered to Scatterplots: An Introduction to d3.js - Scott Murray - Part 2

Preview

00:47:45

From Scattered to Scatterplots: An Introduction to d3.js - Scott Murray - Part 3

Preview

00:42:17

From Scattered to Scatterplots: An Introduction to d3.js - Scott Murray - Part 4

Preview

00:43:17

Effective Data Science With Scalding - Vitaly Gordon - Part 1

Preview

00:43:03

Effective Data Science With Scalding - Vitaly Gordon - Part 2

Preview

00:48:15

Big Data Workflows on Mesos Clusters - Florian Leibert, Paco Nathan, and Benjamin Hindman - Part 1

Preview

00:45:33

Big Data Workflows on Mesos Clusters - Florian Leibert, Paco Nathan, and Benjamin Hindman - Part 2

Preview

00:29:23

Big Data Workflows on Mesos Clusters - Florian Leibert, Paco Nathan, and Benjamin Hindman - Part 3

Preview

00:35:19

Big Data Workflows on Mesos Clusters - Florian Leibert, Paco Nathan, and Benjamin Hindman - Part 4

Preview

00:37:06

Adviser: Learning How to get A Second Opinion on Your Analysis when it's Important to get it Right - Leland Wilkinson - Part 1

Preview

00:45:30

Adviser: Learning How to get A Second Opinion on Your Analysis when it's Important to get it Right - Leland Wilkinson - Part 2

Preview

00:39:39

Adviser: Learning How to get A Second Opinion on Your Analysis when it's Important to get it Right - Leland Wilkinson - Part 3

Preview

01:00:00

Building Real-Time Apps with Apache HBase - Ronan Stokes - Part 1

Preview

00:28:29

Building Real-Time Apps with Apache HBase - Ronan Stokes - Part 2

Preview

00:33:34

Building Real-Time Apps with Apache HBase - Ronan Stokes - Part 3

Preview

00:45:00

Building Real-Time Apps with Apache HBase - Ronan Stokes - Part 4

Preview

00:43:31

Data Transformation: Skills of the Agile Data Wrangler - Joe Hellerstein, and Jeffrey Heer - Part 1

Preview

00:44:57

Data Transformation: Skills of the Agile Data Wrangler - Joe Hellerstein, and Jeffrey Heer - Part 2

Preview

00:49:35

Hardcore Data Science

Hardcore Data Science Opening Remarks - Ben Lorica

Preview

00:02:16

Extreme Machine Learning - Alexander Gray

Preview

00:44:57

What the #@)*$ is Big Data? A Holistic View of Data and Algorithms - Alice Zheng

Preview

00:42:34

Overcoming the Barriers to Production-Ready Machine-Learning Workflows - Henrik Brink, and Joshua Bloom

Preview

00:25:22

Anomaly Detection - Ted Dunning

Preview

00:31:19

Neural Networks for Machine Perception - Ilya Sutskever

Preview

00:29:58

The Predictive Business - Kira Radinsky

Preview

00:37:39

Can We Make Big Data Management Easier? - Magda Balazinska

Preview

00:41:28

Design Challenges for Real Predictive Platforms - Max Gasner

Preview

00:31:18

Machine Learning Gremlins - Ben Hamner

Preview

00:30:58

Algebra for Scalable Analytics - Oscar Boykin

Preview

00:32:21

Data-Driven Business Day

Introduction to Data Driven Business Day - Alistair Croll

Preview

00:07:31

Those Numbers Wont Measure Themselves - Farrah Bostic

Preview

00:20:50

Social Data Intelligence: Integrating Social and Enterprise Data for Competitive Advantage - Susan Etlinger

Preview

00:18:22

Open Data: Its Not Just for Governments - Jen van der Meer

Preview

00:19:50

The Insight Economy - Krista Schnell

Preview

00:19:29

9 Levers for Converting Big Data and Analytics into Results - Christy Maver

Preview

00:11:33

Deploying a Data Sciences Team -- The Promise and the Pitfalls - Diane Chang

Preview

00:16:21

Sensing Best Practices - Ben Waber

Preview

00:22:28

Leveraging Value from Open Data Through Collaboration -Peter Pirnejad

Preview

00:17:45

Becoming a Learning Organization: From Data Teams to Corporate Influence - Pamela Peele

Preview

00:15:25

Making Big Data Small - Baron Schwartz

Preview

00:19:29

Big Data Meets Big Infrastructure: Going Underground in One Major European City - Narendra Mulani

Preview

00:11:13

The Era of Data-Powered Government - Beth Blauer

Preview

00:19:19

TripIt Uses Data to Organize Itineraries, No Matter Where You Book - Edith Harbaugh

Preview

00:11:57

Keynotes

Crossing the Chasm: What's New, What's Not - Geoffrey Moore

Preview

00:13:34

Evolution from Apache Hadoop to the Enterprise Data Hub - Amr Awadallah

Preview

00:05:34

Collecting Massive Data via Crowdsourcing - John Schitka

Preview

00:05:12

Empowering Personalized Learning with Big Data - Ramona Pierson

Preview

00:09:40

Hadoop in 5 Minutes or Less - John Schroeder

Preview

00:05:19

People are Data Too - Farrah Bostic

Preview

00:05:56

Bringing Big Data to One Billion People - Quentin Clark

Preview

00:10:01

Small Data in Sports: Little Differences that Mean Big Outcomes - David Epstein

Preview

00:09:18

The Art of Good Practice - Rodney Mullen

Preview

00:09:39

Big Data Moonshots and Ground Control - Joe Hellerstein and Tutti Taygerly

Preview

00:10:40

Data Science and Smart Systems: Creating the Digital Brain - Kaushik Das

Preview

00:10:56

How Companies are Using Spark, and Where the Edge in Big Data Will Be - Matei Zaharia

Preview

00:11:21

In-Hadoop Analytics: Bringing analytics to big data - Anjul Bhambhri

Preview

00:06:58

Record Linkage and Other Statistical Models for Quantifying Conflict Casualties in Syria - Megan Price

Preview

00:10:19

Ben Fry Keynote

Preview

00:09:58

Survivorship Bias and the Psychology of Luck - David McRaney

Preview

00:18:53

Sessions

Apache Hadoop and the Emergence of the Enterprise Data Hub - Eli Collins

Preview

00:39:22

Information Visualization for Large-Scale Data Workflows - Michael Conover

Preview

00:36:03

Adaptive Adversaries: Building Systems to Fight Fraud and Cyber Intruders - Ari Gesher

Preview

00:42:23

Fighting Global Cybercrime and BotNets using Big Data - Bryan Hurd and Herain Oberoi

Preview

00:38:08

Navigating the Big Data Vendor Landscape - Edd Dumbill

Preview

00:43:43

Best Practices for Hadoop In Production - Panel Discussion Facilitated by Forrester Analyst - Mike Gualtieri

Preview

00:38:19

Thorn in the Side of Big Data: Too Few Artists - Chris Re

Preview

00:39:48

10,000: The Most Dangerous Number in Sports - David Epstein

Preview

00:39:28

You're Halfway There: Moving from Insight to Action - Bob Filbin

Preview

00:40:17

Building the Next Generation Data Architecture with Hadoop, Data Warehouse & Data Discovery Platform - Bill Franks

Preview

00:36:17

Minority Report Meets Big Data: Touch and Interactive Big Data is Here - Justin Langseth, and Eva Andreasson

Preview

00:40:59

Machine Learning for Social Change - Fernand Pajot

Preview

00:30:20

Harness Data in Real-Time with Infinite Storage - Yuvaraj Athur Raghuvir

Preview

00:38:02

You Don't Need to Boil the Big Data Ocean with Hadoop - Ben Werther, and Sanjay Mathur

Preview

00:38:52

Predictive Modeling in the Cloud with Scikit-learn and IPython - Olivier Grisel

Preview

00:37:46

Mining Student Notes in Real Time to Provide Study Guides - Perry Samson

Preview

00:52:58

Thinking with Data - Max Shron

Preview

00:35:40

Building a Data-centered Data Center for Agile Development - Justin Makeig

Preview

00:43:30

Evolving Data Governance for the Big Data Enterprise - Scott Lee and Rachel Haines

Preview

00:41:11

Making Big Data Cost Effective in a Bare Metal Cloud - Harold Hannon

Preview

00:41:29

How Evernote Does Conversion Using Hadoop Analytics - Damon Cool

Preview

00:30:40

Crowdsourcing at Locu: How I Learned to Stop Worrying and Love the Crowd - Adam Marcus

Preview

00:24:19

Building a Lightweight Discovery Interface for Chinese Patents - Eric Pugh

Preview

00:40:11

Superconductor: Scaling Charts with Design and GPUs - Leo Meyerovich

Preview

00:22:52

Break Down Data Silos with Apache Accumulo - Adam Fuchs

Preview

00:21:05

Organizing Big Data with the Crowd - Lukas Biewald

Preview

00:14:19

Scalable PostgreSQL as your data platform - Ben Redman

Preview

00:33:10

Unlocking the Secrets of Gertrude Stein - Ian Timourian

Preview

00:41:38

A Different Look at Data and Security - Learning to Live with Fear - Pablos Holman

Preview

00:42:08

Stand Back, I'm Going To Try Science! - Rachel Poulsen and John Akred

Preview

00:20:20

Collaborative Advanced Analytics For Big Data - Bruno Aziza

Preview

00:39:39

Network Science Made Simple: SNA for Pie Chart Makers - Marc Smith

Preview

00:16:21

How Twitter Monitors Millions of Time-series - Yann Ramin

Preview

00:34:50

Harvard's Clean Energy Project: Big Data Maps To Renewable Energy - Kai Trepte

Preview

00:36:37

Working With Time Series Data Using Apache Cassandra - Patrick McFadin

Preview

00:15:46

Friending Graph Analytics: Large-Scale Graph Processing Made Easy - Ted Willke

Preview

00:21:34

Transforming Search Engine Marketing at Ask.com - Mohit Sati

Preview

00:41:14

Music Videos and Gastronomification for Big Data Analysis - Brian Abelson, and Thomas Levine

Preview

00:37:59

Soylent Mean: Data Science is Made of People - Cameran Hetrick and Kimberly Stedman

Preview

00:36:24

Big Data: Beyond Bare-Metal? - Mike Wendt

Preview

00:32:09

Secrets of Apache Hive Queries and UDFs - Shrikanth Shankar

Preview

00:42:14

Twitter and HP HAVEn: The Big Data Big Picture - Sanjay Goil

Play Video

00:39:32

Data Science How to Build and Deploy a Team of Data Scientists - Diane Chang, Steven Hillion, Nick Kolegraff, and Matthew Gee

Preview

00:39:05

The Netflix Data Platform - A Recipe for High Business Impact - Kurt Brown

Preview

00:42:40

Bedtime Stories: Learning from Sleep Data - Monica Rogati

Preview

00:37:58

Tracking a Soccer Game with Big Data - Srinath Perera

Preview

00:36:47

Data Transformation: A User-Centric Approach to Accessing and Analyzing Big Data - Joe Hellerstein

Preview

00:38:50

Apache Hadoop 2.0: Migration from 1.0 to 2.0 - Vinod Kumar Vavilapalli

Preview

00:53:04

Getting a Handle on Hadoop and its Potential to Catalyze a New Information Architecture Model - Milan Vaclavik

Preview

00:42:04

The Sidekick Pattern: Using Small Data to Increase the Value of Big Data - Abe Gong

Preview

00:30:51

Exascale Data Analytics @ Facebook - Sambavi Muthukrishnan

Preview

00:44:54

Sending Millions of Surveys Around the World on Mobile Phones - Max Richman

Preview

00:40:18

Business Data Lake: An Evolution in Data Infrastructure - Jeffrey Kelly, Steven Hirsch, Steve Jones, and Sabrina Dahlgren

Preview

00:42:01

Expressing Yourself in R - Hadley Wickham

Preview

00:34:59

Data Journalism - Organized Crime and Corruption Reporting - Drew Sullivan

Preview

00:38:48

The Inflection Point - Hadoop and Big Data Analytics - Anjul Bhambhri

Preview

00:44:00

Spreadsheets: The Dark Matter of Big Data - Felienne Hermans

Preview

00:44:18

Scale-Invariant Intelligence - Vin Sharma

Preview

00:39:19

Probabilistic Programming: What, Why, How, and When - Beau Cronin

Preview

00:38:55

Beyond Hadoop MapReduce: Interactive Advertising Insights with Shark @ Yahoo! - Nandu Jayakumar and Tim Tully

Preview

00:41:03

Machine Learning for Machine Data - David Andrzejewski - Part 1

Preview

00:44:50

Machine Learning for Machine Data - David Andrzejewski - Part 2

Preview

00:44:45

Lessons from the Trenches: edo Interactive Leverages Hadoop to Build Customer Loyalty - Rob Rosen, and Tim Garnto

Preview

00:36:14

The IPython Notebook: Get Close to Your Data with Python and JavaScript - Brian Granger

Preview

00:45:33

Government Data on Both Sides of the Bridge - Moderated by: Jesse Robbins - Panelists: Shannon Spanhake and Eddie Tejeda

Preview

00:42:01

Enabling Business Transformation with Analytics over Real-time Streaming Data - Anand Venugopal, and Pranay Tonpay

Preview

00:35:52

The Next Wave of SQL-on-Hadoop: Building a Virtual EDW on Native Hadoop Data - Marcel Kornacker

Preview

00:47:04

How Comcast Turns Big Data into Real-Time Operational Insights - Patrick Shumate

Preview

00:42:06

Chicago Bars, Prisoners Dilemma, and Practical Models in Search -Chris Harland

Preview

00:38:01

Big Industrial Internet Data: Connecting and Optimizing at New Scales - Steven Gustafson and Parag Goradia - Part 1

Preview

00:34:24

Big Industrial Internet Data: Connecting and Optimizing at New Scales - Steven Gustafson, and Parag Goradia - Part 2

Preview

00:34:17

FAST and FURIOUS Big Data Analytics Meets Hadoop - Wayne Thompson, and Paul Kent

Preview

00:41:34

The Urgent Need to Appify Big Data - Ryan Cunningham

Preview

00:30:41

Unboxing Data Startups - Michael Abbott

Preview

00:38:50

Apache Hive & Stinger: Petabyte Scale SQL, IN Hadoop - Owen O'Malley, and Alan Gates

Preview

00:41:00

Querying Petabytes of Data in Seconds - Reynold Xin, and Sameer Agarwal

Preview

00:37:19

The Need for Speed & Scale: A Database for Real-Time Analytics - Eric Frenkiel

Preview

00:37:05

Graph All The Things! 11: Graph Database Use Cases That Aren't Social - Emil Eifrem

Preview

00:20:24

Graph Analysis with One Trillion Edges on Apache Giraph - Avery Ching

Preview

00:34:09

Big Data for Big Power: Smart Meters does not mean Smart Grids - Brett Sargent

Preview

00:36:02

The Last Mile: Challenges and Opportunities in Data Tools - Wes McKinney

Preview

00:18:30

Are We Data Scientists or Data Janitors? - Nenshad Bardoliwalla

Preview

00:39:12

Session with Ben Fry

Preview

00:36:02

Data for Good - Moderated by: Jake Porway - Panelists: Drew Conway, Rayid Ghani, and Elena Eneva

Preview

00:46:25

NonStop HBase - Making HBase Continuously Available for Enterprise Deployment - Jagane Sundar

Preview

00:35:21

Apache Mesos as an SDK for Building Distributed Frameworks - Paco Nathan

Preview

00:20:39

Agile Analytics - Neal Ford

Preview

00:19:25

Socializing Search. Professionally. - Sriram Sankar, and Daniel Tunkelang

Preview

00:39:41

Big Data for Better Data Centers - Krishna Raj Raja and Balaji Parimi

Preview

00:40:28

One Size Does Not Fit All: Analyzing Data at Scale with AWS - Rahul Pathak

Preview

00:19:17

Making Choices: What Kind of Relationship are You Seeking with Your Database? - J.R. Arredondo

Preview

00:35:12

StatusWolf: Creating Dashboards That Don't Suck Using Art and Engineering - Mark Troyer

Preview

00:32:52

Real-Time Analytics with NewSQL: Why Hadoop is not enough - Raj Bains

Preview

00:30:23

MLbase: Distributed Machine Learning Made Easy - Ameet Talwalkar and Evan Sparks

Preview

00:39:47

Real-time Analytics with Open Source Technologies - Fangjin Yang, and Gian Merlino

Preview

00:33:35

Extras

The publisher has provided additional content related to this title.


Description
Content

Visit the errata page for Strata Conference Santa Clara 2014: Complete Video Compilation

  • Errata

Visit the catalog page for Strata Conference Santa Clara 2014: Complete Video Compilation

  • Catalog Page