A Few Resources to Get Started with Search and Big Data

by Xavier

The other day I saw a question about where to start learning Big Data. It dawned on me that I have created a few resources that might be useful, so I am sharing them here. It feels good to have a few resources that can help people get started.

If you want to set up Hadoop clusters using Cloudera, you could watch these online training courses:

Creating Your First Big Data Hadoop Cluster Using Cloudera CDH

Preparing a Production Hadoop Cluster with Cloudera: Databases

Deploying Hadoop with Cloudera CDH to AWS

Deploying and Scaling Cloudera Enterprise on Microsoft Azure (this one is FREE)

They get you started with a development cluster, then a production-grade cluster, then a deployment in the AWS cloud, and finally one on Azure, including a module on managed Big Data with Cloudera Altus.

Once you have a cluster, you can watch this course to learn how to use HUE to work with Hive, Pig, Impala and more:

Take Control of Your Big Data with HUE in Cloudera CDH

If you want to learn about search engines, you can check out these courses on Solr:

Getting Started with Enterprise Search Using Apache Solr

Implementing Search in .NET Applications

And regarding Spark, which IMHO is one of the best platforms that you can learn right now, you can take either of these courses, which help you get started with Python or Scala:

Developing Spark Applications with Python & Cloudera

Developing Spark Applications Using Scala & Cloudera

I hope this helps. IMHO, learning Big Data is one of the best moves that you can make at the moment.