Essential Data Science Topics to Real World Applications
Essential Data Science Topics to Real World Applications
What is Data Science
} Goal of Data Science : Turn data
into data products.
Data
Science – Visual Definition
} Data Science is currently a popular topic in industry due to its applications in:
} Quantitative decision making and control.
} Identifying risks, opportunities using ‘what-if’ analysis.
} Predictive Analytics
Data Science – Key Topics
} Data Visualization: Finding patterns in Data.
} Data Analysis (Exploratory Analysis): Understand Empirical Properties. Statistical Methods are used.
} Data Analytics (Predictive Analysis): Forecast Future Data Values.
Data Visualization
} Data visualization is defined as a graphical representation that contains the information and the data.
Data Visualization Diagrams
Benefits of Data Visualization
} Powerful technique to explore the data with presentable and interpretable results.
Data Visualization Tools
} MS-Excel is the simple tool to create charts from 2-dimensional table.
} Tableau is one of the leaders in this field. It is a Business Intelligence (BI) tool.
} Plotly is one of the most popular plug-in to create charts using R or Python
}
Right Choice of Visualization: No visualization is
one-size-fits-all. It’s crucial to choose the right visualization technique for
each type of data on a dashboard to ensure its usability and to avoid confusion
or misinterpretation.
}
Consistency:
Stick to specific
color-coding, fonts, styles and visualization elements when showing the same
metrics across different dashboards.
}
Personalization: Keep the goals of different end-users in mind when deciding what
visualizations and data should be included in a dashboard. Important
information for one user may be unessential or for the others.
Data Analysis
} Exploratory Analysis, which
answers the question, ‘What happened?’
} Exploratory Analysis is the type of
data analysis that helps describe, show or summarize data points in a
constructive way and identify patterns that fulfill every condition of the
data.
} Mainly statistical analysis. It gives
you a conclusion of the distribution of data, helps you detect errors and
outliers. Enables you to identify similarities among variables, thus making you
ready for arriving at statistical conclusion.
Statistical Methods
} Analysis of Variance (ANOVA) tells you if there are any statistical differences between the means of three or more independent groups (such as age, gender, income).
} This type of study gives the researcher the flexibility to use both quantitative and qualitative data in order to discover the properties of the population.
} Helps us to choose the right machine learning model for predictive analytics.
} R Programming
} Applying data patterns towards effective decision-making.
} Machine learning (ML) is a type of Artificial Intelligence (AI) that allows software applications to become more accurate at predicting outcomes without being explicitly programmed to do so.
} Machine learning enables models to train on data sets before being deployed.
Supervised Learning Algorithms
} Supervised learning is a type of machine learning in which machines are trained using well ‘labeled’ training data, and on basis of that data, machines predict the output.
Comments
Post a Comment