Tuesday, February 25, 2014

Cascading extensions for Accumulo

I recently had the opportunity to work on extending Cascading to read/write to Accumulo.
Versions - Cascading 2.5.2 and Accumulo 1.5.0.

The source code is at -
https://github.com/airawat/cascading.accumulo

Examples of using the AccumuloTap are at -
https://github.com/airawat/cascading.accumulo.examples

The examples cover the following functionality-
1.  Querying Accumulo from Cascading.
2.  Performing Accumulo table operations like - create table, create table with splits, check if table exists, delete table & flush, from Cascading 
3.  Dump data in Accumulo to HDFS from Cascading.
4.  Export data in Accumulo to HDFS, after transposing to a flat, delimited format with column headers.
5.  Import data in HDFS, in a flat delimited format into Accumulo.
6.  Read data in Accumulo and write (back) to Accumulo 
7.  Export data in Accumulo into Mysql


20 comments:

  1. It is only after attending the hadoop hadoop online training, I was selected for job in an MNC in India. Thanks for support provided by the informative blogs like this.

    ReplyDelete
  2. List of Do Follow Social Bookmarking Sites in United States of America.
    www.topleadsmedia.com

    ReplyDelete
  3. Some topics covered,may be it helps someone,HDFS is a Java-based file system that provides scalable and reliable data storage,and it was designed to span large clusters of commodity servers. HDFS has demonstrated production scalability of up to 200 PB of storage and a single cluster of 4500 servers, supporting close to a billion files and blocks.
    http://www.computaholics.in/2015/12/hdfs.html
    http://www.computaholics.in/2015/12/mapreduce.html
    http://www.computaholics.in/2015/11/hadoop-fs-commands.html

    ReplyDelete
  4. Thank u for giving this valuable information..it's really very nice

    ReplyDelete
  5. very nice article on Big Data. With the explosion of big data, companies are faced with data challenges in three different areas. hadoop online training

    ReplyDelete
  6. thakyou it vry nice blog for beginners
    https://www.emexotechnologies.com/courses/big-data-analytics-training/big-data-hadoop-training/

    ReplyDelete
  7. This comment has been removed by the author.

    ReplyDelete
  8. I would like to thank you for the efforts you have made in writing this article. I am hoping the same best work from you in the future as well. In fact your creative writing abilities has inspired me to start my own BlogEngine blog now. Really the blogging is spreading its wings rapidly. Your write up is a fine example of it.
    python training in rajajinagar | Python training in btm | Python training in usa

    ReplyDelete
  9. Wow it is really wonderful and awesome thus it is very much useful for me to understand many concepts and helped me a lot. it is really explainable very well and i got more information from your blog.

    blueprism training in chennai | blueprism training in bangalore | blueprism training in pune | blueprism online training

    ReplyDelete
  10. Thanks for such a great article here. I was searching for something like this for quite a long time and at last I’ve found it on your blog. It was definitely interesting for me to read  about their market situation nowadays.
    Data Science Course in Indira nagar | Data Science Course in Electronic city

    Python course in Kalyan nagar | Data Science course in Indira nagar

    Data Science Course in Marathahalli | Data Science Course in BTM Layout

    ReplyDelete
  11. I was recommended this web site by means of my cousin. I am now not certain whether this post is written through him as nobody else recognise such precise about my difficulty. You're amazing! Thank you!
    Microsoft Azure online training
    Selenium online training
    Java online training
    Python online training
    uipath online training

    ReplyDelete
  12. This comment has been removed by the author.

    ReplyDelete
  13. Excellent post. I learned a lot of information from this blog and Its useful for gain my knowledge. Keep blogging
    Apache hive Training in Electronic City

    ReplyDelete
  14. Wow, this article is good, a friend recently asked me about this, I will refer her to your post. Read more about azure training in chennai from our website.

    ReplyDelete
  15. Great article,thank you for sharing this awesome blog with us.

    thank you so much,keep updating...

    big data hadoop training

    hadoop administration training

    ReplyDelete
  16. It has been simply incredibly generous with you to provide openly
    what exactly many individuals would’ve marketed for an eBook to end
    up making some cash for their end, primarily given that you could
    have tried it in the event you wanted.
    dba course in chennai
    java training institute in chennai
    node js course in chennai

    ReplyDelete
  17. Your good knowledge and kindness in playing with all the pieces were
    very useful. I don’t know what I would have done if I had not
    encountered such a step like this.
    oracle certification in Chennai
    asp net training in Chennai

    ReplyDelete
  18. These are genuinely fantastic ideas about blogging really. You have touched some very nice points here. Please keep up this good writing.

    VBSPU BA 1st Year Result
    VBSPU BA 2nd Year Result
    VPSPU BA 3rd Year Result

    ReplyDelete