What do data scientists do? Do you have this question, then here are the answers for you. A data scientist makes predictions using past data to make that prediction and answer the questions by using past data which can not be answered by normal techniques.
|
In every field we have problems to solve like wise a data scientist also have problems to solve, when you have some question to answer you need to define that question, what are the challenges that you are facing, what area you have to solve in that question, it is not a good idea to travelling without knowing our destination so you should define your question before you answer it.
Define the ideal data set
After defining our question its time to define the ideal data set that you should use to answer the question, a professional data scientist will define his data set by his intuition because the data you are going to define to solve your question will play a major role in the outcome of your results, so define the data set which you are going to play with.
Obtain the data
Once we define our data set we have to obtain it, often data became very large day by day it is increasing and become very cheap. We can get data from many sources like data created by human by different action example social media etc, data created by industries and by machines lite ATM's, and many more things, you should be clear from where you have to obtain your ideal data set.
Clean the data
We can not start making jewelry with the raw gold, we need to clean in and extract the pure gold then you to start making beautiful jewelry, same thing should be considered with the data you have obtained. You cannot process further without cleaning the data as it is a raw data you obtain it would have the data that you don't need, you need to extract the data which you need to process.
Exploratory data analysis
An exploratory data analysis builds on a descriptive analysis by searching for discoveries, trends, correlations, or relationships between the measurements of multiple variables to generate ideas or hypotheses.
Statistical prediction/modeling
Statistical modeling is the traditional way to analyse the problem, you need to make statistical modeling or prediction, an intuition about what's going to happen in the next sample you might take.
Interpret results
Interpreting your results, challenging them. Then synthesizing them and writing them up in reproducible ways that can be shared with other people.
Data products
Finally, we're going to talk about distributing results through things like interacting graphics, also through right ups and presentations, and finally through interactive apps built on top of R or Python based on your comfort.





