Hooked on Hadoop

Thursday, February 18, 2016

Provisioning a Cloudera Hadoop cluster on Azure using an ARM template

https://blogs.msdn.microsoft.com/cloud_solution_architect/2016/03/06/provisioning-a-cloudera-hadoop-cluster-on-azure/

Posted by Anagha Khanolkar at 5:41 PM 376 comments:
Email ThisBlogThis!Share to TwitterShare to FacebookShare to Pinterest
Older Posts Home
Subscribe to: Posts (Atom)

About Me

My photo
Anagha Khanolkar
I am a Solution Architect, for Big Data and Analytics at Microsoft, in the Azure Cloud space
View my complete profile

Search

Blog archive

  • ▼  2016 (1)
    • ▼  February (1)
      • Provisioning a Cloudera Hadoop cluster on Azure us...
  • ►  2014 (1)
    • ►  February (1)
  • ►  2013 (41)
    • ►  December (1)
    • ►  November (2)
    • ►  October (2)
    • ►  September (11)
    • ►  August (1)
    • ►  July (12)
    • ►  June (10)
    • ►  May (2)

Labels

.hiveRC (1) 1..1 (1) 1..many (1) Capture Output (1) CombineFileInputFormat (1) CompositeInputFormat (1) Distributed Cache (3) Format conversion (2) genericUDF (1) Hive (1) Hive UDF (2) Java MapReduce Code Sample (9) KeyValueTextInputFormat (1) LazyOutputFormat (1) Lookup (2) Map File (3) Map-side join (3) MultipleOutputs (1) NLineInputFormat (1) Oozie (2) Pass ouput between actions (1) Pig UDF (1) Reduce-side join (1) Secondary Sort (2) Sequence File (1) Shell Action (1) simple UDF (1) Subworkflow action (1) UDF (2)

Popular Posts

  • Apache Sqoop - Part 1: Import data from mysql into HDFS
    Apache Sqoop Apache Sqoop is a tool designed for efficiently transferring bulk data in a distributed manner between Apache Hadoop and...
  • Apache Sqoop - Part 3: Export from HDFS/Hive into mysql
    What's in the blog? My notes on exporting data out of HDFS and Hive into mySQL with examples that one can try out.  My first blog on A...
  • Apache Oozie - Part 1: Workflow with hdfs and email actions
    What's covered in this blog? Apache Oozie documentation (version 3.3.0) on - workflow, hdfs action, email action, and a sample appli...
  • Reduce-side joins in Java map-reduce
    1.0. About reduce side joins Joins of datasets done in the reduce phase are called reduce side joins.  Reduce side joins are easier to imp...
  • Apache Sqoop - Part 5: Scheduling Sqoop jobs in Oozie
    What's covered in the blog? 1. Documentation on the Oozie sqoop action 2. A sample workflow (against syslog generated logs) that incl...

Total Pageviews

Simple theme. Theme images by Ollustrator. Powered by Blogger.