Sep 19, 2021 · 3 min read
Sqoop Import from MySQL to Hive & HBase
In the last post, we discussed Sqoop concepts and demos showing how to move data from an RDBMS to HDFS and from HDFS back to an RDBMS. Today we will...
Sep 12, 2021 · 4 min read
Apache Sqoop – SQL to Hadoop
Today we are going to discuss a very efficient tool: Apache Sqoop. The heading itself tells you the meaning and purpose of the tool, which is...
Aug 20, 2021 · 3 min read
Twitter Sentiments Analysis
A few weeks back, we used Flume to download tweets from Twitter on the topic we wanted (I used keywords from the Big Data world). Today,...
Aug 18, 2021 · 3 min read
Flume & Spark Streaming Integration
In this post, we will integrate Flume and Spark Streaming. We will use Spark Streaming to process the data that we get from Flume. Flume...
Aug 16, 2021 · 2 min read
Spark Streaming: Streaming Word Count
In the last post, we used Flume from the Hadoop toolset to stream data from Twitter to an HDFS location for analysis. In this post, we are going to...
Aug 1, 2021 · 3 min read
Extract Streaming Data with Apache Flume from Twitter to HDFS
Today we are going to do something exciting. We will use another tool from the Hadoop ecosystem and play with real-time streaming data. We...
Jul 25, 2021 · 5 min read
Apache Oozie - Workflow Scheduler for Hadoop
Today we will look at Oozie, the workflow scheduler component of Hadoop. We can use it to schedule our jobs. For comparison, this tool...
Jul 3, 2021 · 1 min read
Running Hive Queries in Spark
Hive queries can be executed from the Spark shell. Hive support comes with the Spark library as HiveContext, which inherits from SQLContext....