Email the Author
You can use this page to email Irfan Elahi about Scala Programming for Big Data Analytics.
About the Book
It's an open secret that we are living in the world of Big Data. Organisations are currently experiencing disruptive paradigm shift wherein they are increasingly adopting Big Data technologies to process large volumes of data to derive insights with the goal to orchestrate innovation and to stay competitive. As a result of that, demand of candidates with strong skill-set in these areas is experiencing exponential growth and people with skills in Big Data are among the highest paid ones as well.
In Big Data landscape, Hadoop is the de-facto framework that powers big data platforms with its suite of services and Apache Spark is the leading distributed and in-memory computing engine in Hadoop ecosystem. Apache Spark is being used for a diverse variety of Big Data use-cases like machine learning, ETL, graph analytics to name a few and is experiencing phenomenal growth and adoption in businesses all around the world. And Scala is the lingua-franca of Apache Spark i.e. Not only Apache Spark (and many other frameworks like Apache Kafka) is developed in Scala but it is also the recommended language for Apache Spark development as it provides the best performance and access to all the latest features in Apache Spark API releases. Thus, to develop skill-set in Apache Spark and build your career in this promising domain, there is a critical prerequisite i.e. you need to learn Scala!
Learning Scala has manifold benefits on its own as Scala is one of the hottest JVM based programming languages out there and candidates skilled in Scala are among the highest paid ones.
The challenge with Scala is that there is a steep learning curve. Scala combines advanced constructs from functional programming and object oriented principles and candidates willing to learn Scala become overwhelmed with the complexity and depth of language. On the other hand, specifically for getting started with Apache Spark development, one generally needs to master a subset of key concepts in Scala language. This itself is another issue because there is not a single book/resource out there that covers Scala programming language with the focus of Big Data development. Though there is no shortage of books/tutorials on Scala but they cover concepts with unnecessary depth and breadth which may not be relevant for Big Data development.
And this is exactly the problem that my book "Scala Programming for Big Data Analytics" addresses which has been written with one crisp goal: teach you just enough Scala only relevant for Big Data i.e. Apache Spark development with no fluff! Instead of bogging you down with needless details of irrelevant and complex concepts of Scala programming language features, the book covers only the most important concepts with laser-focus and necessary depth along with highlighting the best practices based on my versatile experience of using this language.
The book is crafted to be fully hands-on. If you'll follow this book, you'll find the impression that I am holding your hand and teaching you the concepts starting from the very basics. Each section of the book is complimented with series of hands-on code examples. The book will start by introducing Scala and will then will naturally progress to the topics including variables (mutable/immutable), data types, functions, collections, flow control, libraries usage and exception handling along with a gentle emphasis on object oriented programming and functional programming concepts wherever necessary coupled with best practices based on my versatile experience of using this language.
This book doesn't cover Apache Spark. Rather, it covers the key Scala programming language concepts necessary to develop mastery in Apache Spark. After this book, you will be able to learn Apache Spark with no hassle or even use Scala alone as its a general purpose language.
You don't need to have any prior programming language experience to use this book and you'll be able to do hands-on practice on your system (Windows/Mac/Linux) without any software cost.
Now you are one step away from learning one of the most in-demand languages i.e. Scala to excel your career in the promising and lucrative domain of Big Data. Get this book now and let's learn Scala!
About the Author
Irfan Elahi is a Senior Consultant in Deloitte Australia specialising in Big Data and Machine Learning.
His primary focus lies in using Big Data and Machine Learning to support business growth with multifaceted and strong ties to the Telecommunications, Energy, Retail and Media industry. He has worked on a number of projects in Australia in end-to-end life cycle to design, prototype, develop and deploy production-grade Big Data solutions in Amazon Web Services (AWS) and Azure to support a number of use-cases ranging from enterprise data warehousing, ETL offloading, analytics, batch processing and stream processing while employing leading commercial Hadoop solutions like Cloudera and Hortonworks. He has worked closely with clients’ system and software engineering team in DevOps space to enhance the continuous integration and continuous deployment (CICD) processes and manage Hadoop cluster’s operations and security.
Additionally, Irfan is leading the Data-stream of Deloitte’s ClearLight platform to setup a multi-tiered and multi-tenant Big Data platform in Amazon Web Services based on best practices to facilitate firm’s strategic initiatives like trainings, managed services and prototyping for potential clients.
In addition to his technology competencies, Irfan has recently presented in DataWorks Summit in Sydney in 2017 about in-memory big data technologies and in a number of meetups all around the world. He also remained involved delivering knowledge transfer sessions, training and workshops about Big Data and Machine Learning, both within firm and at clients. He also has launched Udemy courses on Apache Spark for Big Data Analytics and R Programming for Data Science with more than 18,000 students from 145 countries enrolled in them.