Data Science is a combination of multiple disciplines that uses statistics, data analysis, and machine learning to analyze data and to extract knowledge and insights from it.
What is Data Science?
Data Science is about data gathering, analysis and decision-making.
Data Science is about finding patterns in data, through analysis, and make future predictions.
By using Data Science, companies are able to make:
- Better decisions (should we choose A or B)
- Predictive analysis (what will happen next?)
- Pattern discoveries (find pattern, or maybe hidden information in the data)
MICROSOFT EXCEL
- Introduction to Data Analysis
- Perform data cleaning by removing blank spaces as well as incorrect and outdated information
- Format and adjust data using conditional formatting
- Perform data calculations using formulas
- Lookup (HLOOKUP, VLOOKUP and XLOOKUP)
- Conditional Functions (SUMIF(s), COUNTIF(s), AVERAGEIF(s))
- Text Functions (CONCAT, FIND, LEN, RIGHT, MID, CONCAT, PROPER, LOWER, UPPER)
- Logical Functions (AND, IFS, OR, SWITCH, TRUE, FALSE)
- Math Functions (SUM, PRODUCT, SQRT, SUMPRODUCT, LOG)
- Accounting Formulas (IRR, MIRR, XIRR, IPMT, PPMT)
- Organize data using sorting and filtering
- Create visualizations using graphing and charting
- Calculate, summarize, aggregate and analyze data using pivot tables
- Aggregate data for analysis
MICROSOFT POWER BI
- Introduction to Microsoft Power BI
- Data Cleaning and Transformation using PowerQuery
- Building Dashboards and Reports
- Analysis using Data Analysis Expressions (DAX)
- Data Modelling and Relationships
SQL
- Introduction to SQL and Database
- Retrieving Data from Database using SELECT, DISTINCT, WHERE, ORDER BY, TOP N, CASE.
- Aggregate Functions, GROUP BY and HAVING clauses.
- Joining Multiple Tables using JOINS
- Advanced SQL: SUBQUERIES, COMMON TABLE EXPRESSION, WINDOW FUNCTIONS
PYTHON
- Basic Python Concepts: VARIABLES, CONTROL FLOWS, FUNCTIONS, DATA STRUCTURES, ERROR PROCESSING
- Introduction to Python Packages for Analysis: NUMPY, PANDAS and MATPLOTLIB.
- Data Wrangling (Gathering, Accessing and Cleaning)
- Data Visualisation using MATPLOTLIB and Seaborn