Learning at par Industry-standards with Industry experts made my Career Transition possible
Background
Education: BE in Electrical
Previous Profile Company: CGI Profile: Software Engineer (Mainframe) Project: John Hancock (Manulife) Insurance Services Domain: Insurance Location: India
Current Profile Company: CGI Profile: Lead Business Analyst Project: ERP and BI Techno-functional SME Domain: Finance and HRM Location: India
My journey into Data Science
Why Data Science?
Early on in my career, I learnt that the IT industry is constantly upgrading. One of my projects was shut down because the client wanted to upgrade to newer tech, in another company the entire project-team was asked to shift or upgrade with the latest tech. Since then, upgrading with the industry has been my core mantra to stay ahead with the times. After 3 years in IT as a Mainframe developer, I switched to Data Warehousing. After that, progression to Data Science was the natural course.
Why Dimensionless?
I checked a lot of courses on other portals like Coursera and Udemy etc. But I knew that what I needed was a more personalized and interactive course that would give me practical knowledge. My colleague did a course through Dimensionless Techademy. He described the course and frankly, I found it hard to trust him since other institutes with similar features were priced comparatively high. After a lot of research and contemplation, I decided to take a Demo class. Looking at the teaching methodology and course content, by the end of the demo I was convinced to join them.
Experience with Dimensionless?
Teachers are highly qualified and industry-experienced
These were all met at Dimensionless, so I am completely satisfied. All these features are hard to find in any other online courses. The teachers then became my mentors and still help me even after the completion of my course.
I am planning to do the AI specialization course with them next.
Career Transition to Data Science
From a Mainframe Developer in Insurance domain to a Lead Business Analyst in ERP and BI domain, and now entering into the Data Science and Advanced Analytics field, my career has taken a complete 360-degree turn. I am applying whatever I learnt to my work in real-time. Apart from theory and practical hands-on, industry case-studies and domain-related use-cases helped me a lot. I’ve finally cleared an interview for an internal-shift (At the time of this interview.)
Now there are a multitude of data science resources out there, all of whom claim to be the “best possible introductory to advanced material and courseware on the subject of data science”. Now I’ve made mistakes in choosing my data science references to buy and keep (and use) but I’ll be sharing what I’ve learned through experience to be the most effective for these particular topics. This list is both effective and born out of experience by going through them one at a time. The list of resources contains the following items:
eBooks (or Books – your choice):
Developing Analytic Talent: Becoming a Data Scientist by Vincent Granville
Introduction to Machine Learning with Python by Muller & Guido.
R for Data Science by Wickham & Grolemund.
Hands-On Machine Learning with Scikit-Learn & TensorFlow by Aurelion Geron
Statistics by Freedman, Pisani, & Purves
Websites:
www.stackoverflow.com (for doubts and errors during coding)
www.kaggle.com (for Data Science Competitions and worldwide rankings)
1. Developing Analytic Talent: Becoming a Data Scientist by Vincent Granville
Now there are not many books that I would recommend for a professional data scientist, but this book is written by an authority with 15 years of experience in the data science field working on some seriously large-scale projects for the best companies in the world. And it shows. This single book contains some of the latest and the best methods to achieve what you need to be a professional data scientist. And it’s not just teaching theory. Every chapter has multiple case studies taken from the experiences in the industry. Vincent Granville is recognized worldwide as one of the best-known resource talents in data science. The level is a little advanced, and it is not recommended for beginners. But this is the perfect book for advanced-intermediate to professional data scientists. If you want to know how to work professionally as a data scientist, this book is for you. But this is only for intermediate, advanced, and professional data scientists since you need to know the basics before starting on this book.
Now, this is a book for beginners, with just a basic knowledge of numpy, pandas and matplotlib required. This is perhaps the most effective way to learn the Scikit-Learn data science library since the authors are two of the core contributors to the scikit-learn package as an open source project. They literally know the library inside out, since they both contributed heavily to creating it! The explanations are simple and the time spent working on the exercises and source code in this book will be highly beneficial if you want to master scikit-learn and its associated libraries.
3. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data by Wickham & GrolemundThis is another beginner-friendly book, teaching all the basics of R clearly and concisely for those with basic programming skills. R is a language intended for manipulation of raw data, and it is an excellent complement to your toolset if you already know Python and are preparing for a career as a data scientist. The IDE used is RStudio, which is bundled with the Anaconda distribution of Python and ML libraries. Both authors are chief scientists involved in the RStudio software development team and are also members of the R Foundation. This book gets you up and running in R effectively and quickly.
This book has received massive acclaim from the data science community for the breadth of knowledge which it provides and is one of the best books on this topic till date. TensorFlow coverage is excellent, and there are methodologies that this book teaches to get your data science project perfectly executed immediately. The TensorFlow (with some Keras) coverage is the most simple and easy to understand among all the various TensorFlow tutorials I personally have found both on the Web as well as in the few available ebooks. If you want to work in Deep Learning but don’t know how to get started, this book is for you (it covers Deep Learning as well)!
Once upon a time, if a developer was stuck on a programming problem, he would have to go through several textbooks each over 500 pages long to find the answer to his problem. Not any more! StackOverflow is a site that is a platform for questions and doubts on nearly every type of programming language available, including Python, R, scikit-learn, TensorFlow, Keras, pandas, numpy, scipy, Theano, PyTorch, matplotlib and dozens more (both languages and libraries). It shows the power of crowd-sourcing problems since it is much easier to find the answer to a problem from 50,000 people than just four or five which would be the case if you were studying from a few teachers. You can simply copy-paste your error message in your data science compiler tool into StackOverflow and the site will return fully worked out and clearly explained solutions to your problem.
The way I see it – StackOverflow was a defining moment in programming. Once upon a time, debugging was a challenge. Now for nearly 90% or more of all bugs and errors, StackOverflow has your answer, explained in clear English, with the corrected source code. What more could you want? These days, anyone can become a developer in any language, thanks to this single site. And the concept has become so popular that there are now a multitude of crowd-sourcing answers to questions platform websites such as www.math-exchange.com, www.stack-exchange.com, and around 10 to 20 more sites that provide this functionality for that particular field, be it Mathematics, English, and even Christianity!
These days, just having an impressive profile on Kaggle will be enough to land you a job interview at the very least. Kaggle is a site that has been hosting data science competitions for many years. The competition is immense and intense, but so are the tutorials and the articles are also equally powerful and instructive. If you want to be a data scientist, not having a decent Kaggle profile is inexcusable. Kaggle will be like a showcase of your data science skills to the entire world. Even if you don’t rank very high, consistency and practice can get you there more often than not.
And there is another side to it, a course designed purely for the purpose of winning data science competitions, available on Coursera (How to Win Data Science Competitions: Learn from Top Kagglers), available on this link: https://www.coursera.org/learn/competitive-data-science. However, this course is not for beginners, it is only for those who already have a strong background in machine learning and machine learning libraries with practical programming knowledge experience.
Dimensionless.in is an elite data science training company that imparts industry level experience and knowledge to those with a real thirst to learn. Training is given from the basics, leading to a strong foundation. Now, anyone with discipline and persistence can learn data science and become a data scientist. All the faculty in the courses are IIT alumni. The training received is personalized to cater to the needs of each student. There are only 20-30 students in a single batch.
Going by popular wisdom, this level of detail and attention is not available in any of the major data science training centers and course curators. Most organizations focus on marketing and volume (quantity). But Dimensionless Technologieshas their focus not on quantity but in quality. The following three major courses are offered:
An in-depth explanation for each course is given at each of the links. Do check them out and have an in-depth view of the potential that is here to be tapped, for your benefit.
And Finally…
AI & ML (Artificial Intelligence & Machine Learning) is the future for nearly every single industry. The question on every CEO’s and CIO’s mind will be this: Why should we set up a staffed division in our company for any role when an automated machine with just a high one-time investment (for which operating costs are literally non-existent (compared to paying a salary to 100 staff members – you can do the math) can do the same job for us more reliably, more efficiently, more consistently, and more accurately than people when they do it as staff or employees? That burning question is racing through nearly every industry in the world right now. Thousands of jobs will be automated and the biggest demand in all sectors will be for machine learning experts who are also highly skilled in domain knowledge of that company (say hospitals, for e.g.).
Don’t be scared of the incoming changes. Change is humanity’s best friend. Without change, life would be boring. Changes are not just challenges, they are also opportunities for much higher-paying and much less laborious jobs than the jobs you hold currently. And, if by chance, you happen to be a student reading this article, you now know which industry you should focus on – completely. All the best, and rememberto enjoy the process of learning. Regardless of your age, this is the best time to be alive – ever. Because domain knowledge is available more widely today than at any time in the past. Be enthusiastic. Be positive. Be disciplined. Be focused.And make the right choices at the right times – and no, its never too late when you have quality trainers ready to mentor you. May the thrill of learning a completely new concept with truly enlightened insight never leave you. Once again, all the best.