Sep 19, 20213 minSqoop Import from MySQL to Hive & HBaseIn last post we discussed Sqoop concepts and demos on how to move data from RDBMS to HDFS and from HDFS back to RDBMS. Today we will...
Sep 12, 20214 minApache Sqoop – SQL to HadoopToday we are going to discuss a very efficient tool – Apache Sqoop. The heading itself tells you the meaning and purpose of tool which is...
Aug 20, 20213 minTwitter Sentiments AnalysisFew weeks back we had used flume to download tweets from twitter on the topic that we wanted (I had used BigData world keywords). Today,...
Aug 18, 20213 minFlume & Spark Streaming IntegrationIn this post, we will integrate Flume and Spark Streaming. We will use Spark Streaming to process data that we will get from Flume. Flume...
Aug 16, 20212 minSpark Streaming: Streaming Word CountIn last post, we used flume from Hadoop toolset to stream data from Twitter to HDFS location for analysis. In this blog, we are going to...
Aug 1, 20213 minExtract Streaming Data with Apache Flume from Twitter to HDFSToday we are going to do something exciting. We will use another tool from Hadoop ecosystem and play with Real-Time Streaming data. We...
Jul 25, 20215 minApache Oozie - Workflow Scheduler for HadoopWe will today understand the workflow scheduler component of Hadoop – Oozie. We can use it to schedule our jobs. For comparison this tool...
Jul 3, 20211 minRunning Hive Queries in SparkHive quries can be executed from Spark Shell. Hive comes with Spark Library as HiveContext, which is inherited from SQLContext....