Neural Networks, the buzzword of the date were in fact invented in the mid-1900s. The algorithm that trains a neural network, called the backpropagation algorithm was formulated by the famous Geoffrey Hinton in the 1960s. So why do you hear about it now? Because we...
Your First Step in Machine Learning with R
Machine Learning is the study of statistics and algorithms which help computers to arrive at conclusions without any external guidance, solely depending upon recurring trends and patterns in the available data. Machine Learning follows various techniques to...
Methods to Manage Amazon Spark Application Memory on Amazon EMR
It is said that most of the modern-day data have been generated in the past few years. The advancement in technology and the growth of the internet is the source of such voluminous unstructured data which is rapidly changing the dynamics of our daily lives. No...
Is Programming knowledge Required to Pursue Data Science?
The term ‘Data Science’ has become a buzzword in the past couple of years. A lot of people who work in various domains such as IT and Business wants to make a shift to this new career option. Even people with a lot of experience as much as 15 years want to make a...
Is Data Scraping One of the Most Demanded Skill in Data Science?
Imagine every living and non-living entity hooked to the internet, generating bits of information as long as connectivity is maintained. This generation of small bits is a vital sign to deduce if the entity is active or inactive in the wide global network...
Top 5 Statistical Concepts for Data Scientist
Introduction Data science is a comprehensive blend of maths, business and technology. One has to go from data inference to algorithm development and then all the way to use available technology to draw the solutions for complex problems. At its heart, all we have is...
Concept of Cross-Validation in R
Introduction The most beautiful aspect of machine learning is its ability to make predictions on the never seen before data points. In order to estimate the performance of ML models, we need to take our dataset and divide it into two parts. One part powers the model...
Top 10 Machine Learning Algorithms
Introduction Machine learning paradigm is ruled by a simple theorem known as “No Free Lunch” theorem. According to this, there is no algorithm in ML which will work best for all the problems. To state, one can not conclude that SVM is a better algorithm than decision...
How Data Science is Revolutionizing the Film Industry?
Movies and Data Science! The first thing many of us must have thought of must be the movie Moneyball. The film is about how a statistician uses his technical knowledge to create a baseball team full of underrated players to have a successful tournament....
Univariate Analysis – A Key to the Mystery Behind Data!
Exploratory Data Analysis or EDA is that stage of Data Handling where the Data is intensely studied and the myriad limits are explored. EDA literally helps to unfold the mystery behind such data which might not make sense at first glance. However, with detailed...