Hooked on Hadoop

Thursday, February 18, 2016

Provisioning a Cloudera Hadoop cluster on Azure using an ARM template

https://blogs.msdn.microsoft.com/cloud_solution_architect/2016/03/06/provisioning-a-cloudera-hadoop-cluster-on-azure/

Posted by Anagha Khanolkar at 5:41 PM 178 comments:
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Newer Posts Older Posts Home
Subscribe to: Posts (Atom)

About Me

My photo
Anagha Khanolkar
I am a Solution Architect, for Big Data and Analytics at Microsoft, in the Azure Cloud space
View my complete profile

Search

Blog archive

  • ▼  2016 (1)
    • ▼  February (1)
      • Provisioning a Cloudera Hadoop cluster on Azure us...
  • ►  2014 (1)
    • ►  February (1)
  • ►  2013 (41)
    • ►  December (1)
    • ►  November (2)
    • ►  October (2)
    • ►  September (11)
    • ►  August (1)
    • ►  July (12)
    • ►  June (10)
    • ►  May (2)

Labels

.hiveRC (1) 1..1 (1) 1..many (1) Capture Output (1) CombineFileInputFormat (1) CompositeInputFormat (1) Distributed Cache (3) Format conversion (2) genericUDF (1) Hive (1) Hive UDF (2) Java MapReduce Code Sample (9) KeyValueTextInputFormat (1) LazyOutputFormat (1) Lookup (2) Map File (3) Map-side join (3) MultipleOutputs (1) NLineInputFormat (1) Oozie (2) Pass ouput between actions (1) Pig UDF (1) Reduce-side join (1) Secondary Sort (2) Sequence File (1) Shell Action (1) simple UDF (1) Subworkflow action (1) UDF (2)

Popular Posts

  • Apache Sqoop - Part 1: Import data from mysql into HDFS
    Apache Sqoop Apache Sqoop is a tool designed for efficiently transferring bulk data in a distributed manner between Apache Hadoop and...
  • Apache Sqoop - Part 3: Export from HDFS/Hive into mysql
    What's in the blog? My notes on exporting data out of HDFS and Hive into mySQL with examples that one can try out.  My first blog on A...
  • Apache Oozie - Part 1: Workflow with hdfs and email actions
    What's covered in this blog? Apache Oozie documentation (version 3.3.0) on - workflow, hdfs action, email action, and a sample appli...
  • Reduce-side joins in Java map-reduce
    1.0. About reduce side joins Joins of datasets done in the reduce phase are called reduce side joins.  Reduce side joins are easier to imp...
  • Apache Hive: The .hiverc file
    What is .hiverc file? It is a file that is executed when you launch the hive shell - making it an ideal place for adding any hive configur...

Total Pageviews

Simple theme. Theme images by Ollustrator. Powered by Blogger.