This is a site for the data science aspirants, who are passionate about data science and for the people who wants to start their career in data science from beginning.

Saturday, 5 November 2016

Prerequisites for a data scientist

By 23:12
Want to became a data scientist..!, Then you must know the prerequisites before you start cranking. As data scientist is the sexiest job of 21st century, many are willing to start their career as a data scientist. You might have heard that to start career in data science one should have an expert skills in various domains, but the truth is if you have the good basic knowledge in maths, statistics and programming and communication skills then everyone can start their career in data science.
It is better to have basics of mathematics concepts like linear algebra, calculus, probability and statistics, these skills are must to learn data science. If you passionate about data science then things are very easy to learn.
If you have the basic knowledge as we discussed above then you should have a basic knowledge on the following tools


Hadoop: It is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. No data scientist can escape from learning this tool as data scientist have to work with a huge data-sets which is highly difficult with normal storage systems.
Hive: It allows sql queries on dataset stored in a hadoop cluster, that means hadoop itself does not support all the things which need the supporting tools too.
Mahout: It is to build an environment for quickly creating salable performance machine learning applications, machine learning is the trending technology where is very helpful in many industries so one such machine learning applications can be create using mahout and you knowledge on linear algebra and calculus plays a major role in machine learning.
Spark: It is a fast and general engine for large-scale data processing. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python and R shells.
Storm: It is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lots of fun to use!
So we have discussed the all the major prerequisites for a data scientist, If you are passionate about a data science then all this tools and skills are easy to learn and then you can play with a huge data to find predictions.


Here is my youtube channel please follow and subscribe ill get you more video tutorials and articles to help you to learn data science on from this internet world yourself by showing you the right stuff to learn from.

Thanks guys will see you in next article.



Read More...

Friday, 4 November 2016

The Modern data scientist

By 08:55
Data scientists are in very high demand. There is not enough talent and skills to fill the jobs. Do you know Why? Because the sexiest job of 21th century requires a mixture of different domain skills, multidisciplinary skills ranging from an intersection of Linear mathematics, Algebra, statistics, computer science, data visualization and business. Finding a data scientist is very hard and finding a people who understand who a data scientist is, is equally hard. “Being a data scientist is not only about data crunching. It’s about understanding the business challenge, finding out the predictions of future by modeling the data, and communicating their findings to the business, simply you should be good at playing with data and passionate about huge data. The one who learns the skills needed for data science should not call themselves as data scientists as It means a doctorate which needs very good expertise in playing with huge data, but that doesn't mean one cannot be a data scientist if you are passionate about data science then things will automatically come into your hands.
” Jean-Paul Isson, Monster Worldwide, Inc. says "It is very likely that you will not be able to hire a data science soloist, who can solve all your data problems. The skill-set presented the modern data team should be equipped. In the picture above we can see the skills of modern data scientist.
Read More...