Spark for Beginners: Tutorials – Spark Twitter analysis with spark SQL – example

Français Français

In this tutorial, we’ll do a simple analysis of sentimental Tweets Spark with SQL on a json file. This exercise is designed in Java to retrieve a stream of Tweets and Scala for spark SQL scripts. You will find the Repo Github link in the tutorial.

Architecture

above illustrates the architecture of our application.

 

Continue reading Spark for Beginners: Tutorials – Spark Twitter analysis with spark SQL – example

Spark for Beginners: Tutorials – Apache Spark Streaming Twitter java example

Français Français

In this chapter, we will walk you through using Spark Streaming to process live tweet streams. Remember, Spark Streaming is a component of Spark that provides highly scalable, fault-tolerant streaming processing. These exercises are designed as standalone Java programs which will receive and process Twitter’s real sample tweet streams. You will find it the Gist Github links in the tutorial.

Create a Twitter developer account

This video Demonstrate how to create a twitter application. First go to https://apps.twitter.com/.



Continue reading Spark for Beginners: Tutorials – Apache Spark Streaming Twitter java example

Spark for Beginners: Tutorials – Connecting To Cassandra

Français Français

Welcome, we will discover in this tutorial how to connecting Spark with Cassandra database using the Java language. The code will be done in Java you will find it the Gist Github links in the tutorial.

Video Demo

Continue reading Spark for Beginners: Tutorials – Connecting To Cassandra

Spark for beginners: Introduction

Français Français

What is Spark ?

Apache Spark is a framework of open source for Big Data processing built to perform sophisticated analysis and designed for speed and ease of use. This was originally developed by AMPLab, UC Berkeley University in 2009 and spent as open source Apache project in 2010.

Continue reading Spark for beginners: Introduction