Introduction to Data Science Methodology

John B. Rollins, Ph.D. IBM Analytics | IBM Corporation Foundational Data Science Methodology © 2015 IBM Corporation

Introduction

§ Why we are interested in data science
  - Solve problems and answer questions
  - Gain useful insights through modeling to predict outcomes or discover underlying patterns

§ Rapidly evolving technologies
  - Platform growth
  - In-database analytics
  - Text analysis
  - Automation


Data science methodology

§ Why?
  - To provide a guiding strategy

§ What?
  - General strategy that guides the processes and activities within a given domain
  - Does not depend on particular technologies or tools
  - Not a set of techniques or recipes
  - Provides the data scientist with a framework for how to proceed to obtain answers


Methodology diagram

[Figure: ten stages arranged in a cycle, listed here in cycle order]
Business Understanding, Analytic Approach, Data Requirements, Data Collection, Data Understanding, Data Preparation, Modeling, Evaluation, Deployment, Feedback
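
The stages named in the diagram can be written down as a small data structure. The following is only a minimal sketch, not part of the original slides: the `Stage` enum and the `next_stage` helper are hypothetical names, and the stage order is an assumption based on the usual reading of the cycle, since the extracted diagram does not preserve its arrows.

```python
from enum import Enum


class Stage(Enum):
    """Stages named in the methodology diagram, in an assumed cycle order."""
    BUSINESS_UNDERSTANDING = "Business Understanding"
    ANALYTIC_APPROACH = "Analytic Approach"
    DATA_REQUIREMENTS = "Data Requirements"
    DATA_COLLECTION = "Data Collection"
    DATA_UNDERSTANDING = "Data Understanding"
    DATA_PREPARATION = "Data Preparation"
    MODELING = "Modeling"
    EVALUATION = "Evaluation"
    DEPLOYMENT = "Deployment"
    FEEDBACK = "Feedback"


def next_stage(stage: Stage) -> Stage:
    """Return the stage after `stage`; Feedback wraps back to the first stage."""
    ordered = list(Stage)  # Enum members iterate in definition order
    return ordered[(ordered.index(stage) + 1) % len(ordered)]
```

Calling `next_stage(Stage.FEEDBACK)` returns `Stage.BUSINESS_UNDERSTANDING`, which mirrors the idea that feedback from a deployed model leads back into a new round of business understanding. The sketch is deliberately free of any particular tool or library, in line with the point that the methodology does not depend on specific technologies.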
