SAS 041714

download SAS 041714

of 35

Transcript of SAS 041714

  • Utilizing Big Data Analytics

    with Hadoop

    Fern Halper @fhalper

    TDWI Research Director for Advanced Analytics

    April 17, 2014

  • Sponsor

  • 3

    Speakers

    Fern Halper Research Director for

    Advanced Analytics,

    TDWI

    Tapan Patel Product Marketing Manager,

    SAS

  • Agenda

    The evolving big data ecosystem

    Status of big data, analytics,and hadoop

    Considerations for getting started

    4

  • New TDWI Checklist

    Free to download

    http://tdwi.org/research/list/tdwi-

    checklist-

    reports.aspx

  • An evolving ecosystem

    6

    Hadoop

    Big data

    Advanced Analytics

    in-memory

  • Examining the pieces: Big Data

    7

    Social

    M2M/IoT

    Text

    Mobile/Location Volume

    Formats

  • 70% of those respondents

    using or currently using predictive

    analytics are utilizing big data

    (source: TDWI Predictive Analytics Best Practices Report, 2014)

    8

  • Examining the pieces: Analytics The Analytics Spectrum

    Excel Dashboards and Reports

    Other BI Visualization Advanced Analytics

    9

  • Advanced Analytics

    10

    Advanced analytics provides algorithms for

    complex analysis of either structured or unstructured

    data. It includes sophisticated statistical models,

    machine learning, text analytics, advanced

    visualization, and other advanced

    data mining techniques.

  • Examining the pieces: Hadoop

    HDFS/MapReduce

    Schema on read

    Ecosystem of tools

    Commercial distributions

    11

  • In-memory analytics

    Performance

    Interactivity

    12

  • Status: Evolving architectures

    13

    Source: (TDWI Evolving Data Warehouse Architectures In the Age of Big Data, 2014) n=1688 responses

    What technical issues or practices are driving change in your DW architecture?

    Select all that apply.

  • Status: Big data pieces

    14

  • Status: Analytics pieces

    15

  • Considerations

    16

    Defining the problem

    Data preparation

    Analyzing the data

    Making it work (i.e., the team)

    Governance

  • Data preparation

    ETL vs. ELT

    Data quality

    Metadata

    17

  • Data exploration

    18

    Query

    Visualization

    Descriptive statistics

  • Analysis

    19

    Data mining

    Supervised

    Unsupervised

    Other analytics

  • Operationalize

    20

    Business process

    In-database scoring

  • Skills

    21

    Computing

    Analytic modeling

    Creative thinker

    Communicator

  • Big Data:

    The Big Data Maturity Model

    22

  • Poll Question

    Are you making use of Hadoop for advanced

    analytics

    Yes

    No, but were thinking about it

    No, and no plans to do so

    Dont know

    23

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    UTILIZING BIG DATA ANALYTICS

    WITH HADOOP

    TAPAN PATEL, PRODUCT MARKETING MANAGER, SAS

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    DATA TO DECISION LIFECYCLE

    TEXT COMPETITIVE

    ADVANTAGE

    PREPARE

    DATA

    EX

    PL

    OR

    E

    DA

    TA

    DEVELOP

    MODELS

    DE

    PL

    OY

    &

    MO

    NIT

    OR

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    ACCESS TO HADOOP

    HADOOP

    Hive QL

    SAS SERVER

    Push some of SAS processing to Hadoop 1

    Key Offerings: SAS/Access to Hadoop

    SAS/Access to Cloudera Impala

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    EMBEDDED PROCESS FRAMEWORK

    HADOOP

    SAS Data Step & DS2

    SAS SERVER

    Push SAS processing to Hadoop with MapReduce 2

    Key Offerings: SAS Scoring Accelerator for Hadoop

    SAS Data Quality Accelerator for Hadoop

    SAS Code Accelerator for Hadoop

    SAS Data Management

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    SAS

    IN-MEMORY ANALYTICS AND HADOOP

    In-memory processing; use Hadoop for storage persistence and commodity computing 3

    SAS LASR ANALYTIC

    SERVER

    SAS IN-MEMORY

    SAS IN-MEMORY

    SAS IN-MEMORY

    SAS IN-MEMORY

    SAS IN-MEMORY

    HADOOP WEB CLIENTS APPLICATIONS ERP

    SCM

    CRM

    Images

    Audio

    and Video

    Machine

    Logs

    Text

    f Web and

    Social

    Data Discovery and Visualization

    Statistics and Predictive Analytics

    Data Management

    Text Analytics

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    SAS

    VISUAL

    STATISTICS INTERACTIVE PREDICTIVE ANALYTICS

    EXPLORE AND

    DISCOVER PREDICT AND

    REFINE

    DEPLOY AND

    MONITOR

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    SAS

    VISUAL

    STATISTICS INTERACTIVE PREDICTIVE ANALYTICS

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    SAS

    IN-MEMORY

    STATISTICS FOR

    HADOOP

    WHAT IS IT

    Provides a single interactive programming environment

    for Hadoop to perform:

    analytical data manipulation

    variable transformations

    exploratory analysis

    statistical modeling and machine learning

    integrated modeling comparison and scoring

    Takes advantage of distributed in-memory computing

    optimized for analytical workloads

    TEXT

    MANIPULATE

    DATA

    EX

    PL

    OR

    E

    DA

    TA

    DEVELOP

    MODELS

    SC

    OR

    E

  • Copyr i g ht 2014 , SAS Ins t i tu t e Inc . A l l r ights reser ve d .

    SAS

    IN-MEMORY STATISTICS FOR HADOOP

    PRODUCT DEMONSTRATION

  • 33

    Questions?

  • 34

    Download a free

    copy of the report

    Download the report as a PDF file at:

    http://tdwi.org/research/2014/03/

    checklist-utilizing-big-data-

    analytics-with-hadoop

    Feel free to distribute the PDF file

    of any TDWI Checklist Report

  • 35

    Contact Information

    If you have further questions or comments:

    Fern Halper, TDWI [email protected]

    Tapan Patel, SAS [email protected]