Title: Data Science for ERP Software Developers

Prev Book Contents » Next Book Contents » Price »
Detailed Comparision with SAP Business One »»
Special Offer, Learner Pack = 8 Books + Software
Free, 6 Books + Software
Pay only for 2
Table of Contents
The objective of this book is to teach the Data Science for ERP Software Developers using python, tensorflow and keras.Chapter 1: Introduction to Data science
- Why this book ?
- What is Data Science? Getting value out of data
- Skills necessary for a Data Scientist? Why Python for Data Science?
- Why Python?
- Case Study: Soccer Data Analysis
- Steps in Data Science process
- Importance of defining the well stated data science problem
- Steps in data science in detail
- Acquiring data
- Exploring Data
- Pre-Processing Data
- Scaling and Transformation
- Dimensionality reduction
- Data manipulation
- Analyze Data
- Classification
- Regression
- Clustering
- Association analysis
- Reporting Insights
- Turning Insights into Action
Chapter 2: Python Programming for Data science
- Install Python on Windows using Anaconda Python Distribution
- Why Jupyter Notebook?
- Python Variables
- Python Loops
- for Loop
- While Loop
- Python conditions
- Relational Operators
- Logical Operators
- Data Structures - lists
- Indexing Lists
- Iterating over a list
- Modifying Items in Lists
- Appending to a list
- Removing from a list
- Merging two lists using Extend method
- Iterate over Multiple Lists Simultaneously
- Applying Functions To List Items
- List slicing
- Constructing a List with List Items
- String functions
- Concatenation, Strip, Split, Slicing
- Find a substring
- String Formatting
- Tuples in Python
- Creating a tuple
- Accessing Values in Tuples
- Dictionaries
- Accessing Dictionary Elements with Keys
- Using Functions to Access Elements
- Modifying Dictionaries
- Remove items from a Dictionary
- Numpy Library
- Introduction to Arrays
- Create a rank one ndarray
- Two-dimensional arrays
- Create a rank two numpy array
- Summing the Values in an Array
- Indexing and Slice indexing
- Using combinations of integer indexing and slice indexing to create different shaped matrices
- Numpy: ndarray boolean indexing
- Numpy: ndarray Datatypes and Operations
- Arithmetic Array Operations
- Numpy Statistical Operations
- Matrix Computations
- ndarrays broadcasting
- Broadcasting rules and Example
Chapter 3: Pandas -Data Analysis Library
- Introduction to Pandas Library
- Pandas Data Structures
- Pandas Series
- Pandas DataFrame
- Differences between ndarrays and Series Objects
- Indexing and Slicing Series
- Integer-location based indexing Example
- Series Initialized with Dictionaries
- DataFrames
- Example 1:- Create a Dataframe from a Dictionary of equal length lists with columns attribute
- Example 2:- Create a dataframe from a dictionary of equal length lists with index attribute
- Example 3 :- Create DataFrame from dictionary of Pandas Series
- Example 4:- Create DataFrame from Pandas Series
- Example 5 :- Sort the DataFrame's rows in descending order
- Example 6 :- Selection and Indexing Methods for Pandas DataFrames
- 1. Selecting pandas data using "iloc"
- 2. Selecting pandas data using "loc"
- 2a. Label-based / Index-based indexing using .loc
- 2b. Boolean / Logical indexing using .loc
- Example 7 :- Setting values in DataFrames using .loc
- Data Cleaning with Pandas
- Handling Data Quality Issues
- Interpolation of the data
- Handling Missing Values
- Drop fields using dropna() function
- Drop rows with all cells having missing values
- Drop rows with any cell having missing values
- Drop column containing all missing Values
- Replace function
- Checks for any row is Null
- Fill missing data gaps forward and backward
- Fill in missing data with zeros
- Fill in missing in preTestScore with the mean value of preTestScore
- Pandas: Data Ingestion
- Loading CSV into Pandas DataFrame
- Loading a JSON file into Pandas DataFrame
- Join And Merge Pandas Dataframe
- The concat function
- Another example for Concatenation
- Merge with outer join
- Merge with inner join
- Merge with right join
- Merge with left join
- Statistical Analysis with DataFrames
- Summarizing, Aggregating, and Grouping data in Python Pandas
- Grouping by multiple variables
- Multiple Statistics per Group
- Data Visualization in Python
- Introduction to Matplotlib
- Line Graph
- Scatter plot
- Making A Matplotlib Scatterplot From A Pandas Dataframe
Chapter 4: ML algorithms and logic behind it
- Introduction to Machine Learning
- What is Machine Learning and main applications of Machine Learning? What does actually Machine(model or statistical model) learn?
- Concepts in Machine Learning
- What is a statistical model?
- What exactly is being learnt in machine learning?
- What are Inputs to Statistical models?
- Model selection
- Common Machine Learning use cases
- Categories of Machine Learning
- Classification
- Regression
- Cluster Analysis
- Association analysis
- Terminology Related to Machine Learning
- Classification in detail
- Binary classification
- Multi-class classification
- Building and Applying a Classification Model
- What is a Machine Learning Model?
- Building Machine Learning Model
- Building Classification Model
- Training phase
- Testing phase
- Algorithms to build a classification model
- K-nearest Neighbors Classification
- Knn classification application
- Understand the knn algorithm using the example
- Similarity distance measures
- Code and explanation from notebook
- Another Example for Classification
- Decision Trees
- How does a decision tree determine the best way to split the set of samples at a node?
- Tree Induction Example
- Decision boundaries
- Decision Trees Example code in the notebook with explanation