The other day I saw a question on where to start to learn Big Data. Well, it dawned on me that I have created a few resources that might be useful and so I share them here. It feels good to have a few resources that can help people get started.
If you want to set up Hadoop clusters using Cloudera you could watch these online trainings:
Creating Your First Big Data Hadoop Cluster Using Cloudera CDH
Preparing a Production Hadoop Cluster with Cloudera: Databases
Deploying Hadoop with Cloudera CDH to AWS
Deploying and Scaling Cloudera Enterprise on Microsoft Azure (this one is FREE)
They get you started with a development cluster, then a production grade cluster, then a deployment in the AWS cloud and then on Azure, including a module on managed Big Data with Cloudera Altus
Once you have a cluster, you can watch this course to use HUE to work with Hive, Pig, Impala and more.
Take Control of Your Big Data with HUE in Cloudera CDH
If you want to learn about search engines, you can check these on Solr
Getting Started with Enterprise Search Using Apache Solr
Implementing Search in .NET Applications
And regarding Spark, which IMHO is one of the best platforms that you can learn now then you can take either of these courses, which help you get started with either Python or Scala
Developing Spark Applications with Python & Cloudera
Developing Spark Applications Using Scala & Cloudera
I hope this helps. IMHO, learning Big Data is one of the best moves that you can make at the moment.