John B. Rollins, Ph.D. IBM Analytics | IBM Corporation Foundational Data Science Methodology © 2015 IBM Corporation
Views 205 Downloads 6 File size 191KB
John B. Rollins, Ph.D. IBM Analytics | IBM Corporation
Foundational Data Science Methodology
© 2015 IBM Corporation
Introduction § Why we are interested in data science - Solve problems and answer questions - Gain useful insights through modeling to predict outcomes or discover underlying patterns
§ Rapidly evolving technologies - Platform growth - In-database analytics - Text analysis - Automation
2
© 2015 IBM Corporation
Data science methodology § Why? - To provide a guiding strategy
§ What? - General strategy that guides the processes and activities within a given domain - Does not depend on particular technologies or tools - Not a set of techniques or recipes - Provides the data scientist with a framework for how to proceed to obtain answers
3
© 2015 IBM Corporation
Methodology diagram Business Understanding
Analytic Approach
Data Requirements
Feedback
Data Collection
Deployment
Data Understanding
Evaluation
Modeling
4
Data Preparation
© 2015 IBM Corporation