Data Analytics Process
Data Analytics Team includes Data Scientist, Data Analysts and Data Engineers who are involved in one or more Steps of the whole Data Analytics process.
The Data Analytics Process is divided into following Steps:-
The Data Analytics Process is divided into following Steps:-
- Collection of Data
- Integration of Data
- Preparation of Data
- Analytical Model Development
- Analytical Model Testing
- Analytical Model Revision
- Data Reporting / Visualization
Data Collection
Data Scientists identify the information they need for a particular analytics application and then work on their own or with data engineers and IT staffers to assemble it for use.
Data Integration
Data from different source systems may need to be combined via data integration routines, transformed into a common format and loaded into an analytics system such as Hadoop cluster, NoSQL database or data warehouse.
Data Preparation
Once the data that is needed is in place, the next step is to find and fix data quality problems that could affect the accuracy of analytics applications. That includes running data profiling and data cleansing jobs to make sure that the information in a data set is consistent and that errors and duplicate entries are eliminated. Additional data preparation work is then done to manipulate and organize data for the planned analytics use and data governance policies are applied to ensure that the data adheres to corporate standards and is being used properly.
Analytical Model Development
After Data Preparation, the data analytics work begins in earnest. A data scientist builds an analytical model using predictive modelling tools or other analytical software and programming languages such as Python, Scala, R and SQL.
Analytical Model Testing
In this phase, the model developed is initially run against a partial data set to test its accuracy.
Analytical Model Revision
Once the model is initially tested, it's revised and tested again, a process known as "training" the model that continues until it functions as intended. Finally, the model is run in production mode against the full data set, something that can be done once to address a specific information need or on an ongoing basis as the data is updated.
Data Reporting / Visualization
The last step in the data analytics process is communicating the results generated by analytical models to business executives and other end users to aid in their decision making. That usually is done with the help of data visualization techniques, which analytics teams used to create charts and other infographics designed to male their findings easier to understand. Data visualizations are often incorporated into BI dashboard applications that display data on a single screen and can be updated in real time as new information becomes available.
Comments
Post a Comment