
Apache Zeppelin is a web-based notebook for interactive data analytics. We already described how to install Apache Spark. If you have followed our any guide on big data tools installation, it is few minutes work to install and use Zeppelin. Zeppelin can be pre-built package or can be build from source. Here is how to install apache zeppelin on ubuntu 16.04 building from source.

What Are Needed To Install Apache Zeppelin On Ubuntu 16.04

For our guide Java, Scala, Apache Spark, Maven, npm, Node.js needed. We will describe to install all of them as steps. As reader, you can skip the things already installed.

Steps To Install Apache Zeppelin On Ubuntu 16.04

As we said above, we have to install Java, Scala and Spark. First install Java :

Advertisement ---

apt update apt upgrade sudo apt-add-repository ppa:webupd8team/java apt update apt install oracle-java8-installer 1 2 3 4 5 apt update apt upgrade sudo apt - add - repository ppa : webupd8team / java apt update apt install oracle - java8 - installer

Check whether Java is correctly installed :

java -version 1 java - version

Next, we will install Scala :

wget http://downloads.lightbend.com/scala/2.12.0/scala-2.12.0.tgz sudo mkdir /usr/local/src/scala tar -xvf scala-2.12.0.tgz -C /usr/local/src/scala/ 1 2 3 wget http : / / downloads . lightbend . com / scala / 2.12.0 / scala - 2.12.0.tgz sudo mkdir / usr / local / src / scala tar - xvf scala - 2.12.0.tgz - C / usr / local / src / scala /

Next we need to open the profile file :

nano .bashrc 1 nano . bashrc

Add these :

export SCALA_HOME=/usr/local/src/scala/scala-2.12.0 export PATH=$SCALA_HOME/bin:$PATH 1 2 export SCALA_HOME = / usr / local / src / scala / scala - 2.12.0 export PATH = $ SCALA_HOME / bin : $ PATH

And reload by sourcing it :

source .bashrc 1 source . bashrc

Next we will check whether Scala is correctly installed and install Git, sbt :

scala -version apt install git wget https://bintray.com/artifact/download/sbt/debian/sbt-0.13.6.deb sudo dpkg -i sbt-0.13.6.deb ## OR apt update apt install sbt 1 2 3 4 5 6 7 scala - version apt install git wget https : / / bintray . com / artifact / download / sbt / debian / sbt - 0.13.6.deb sudo dpkg - i sbt - 0.13.6.deb # # OR apt update apt install sbt

Next we will install Apache Spark :

wget http://d3kbcqa49mib13.cloudfront.net/spark-2.0.2-bin-hadoop2.7.tgz tar -xvf spark-2.0.2-bin-hadoop2.7.tgz mv spark-2.0.2-bin-hadoop2.7/ spark cd conf/ cp spark-env.sh.template spark-env.sh nano spark-env.sh 1 2 3 4 5 6 wget http : / / d3kbcqa49mib13 . cloudfront . net / spark - 2.0.2 - bin - hadoop2 . 7.tgz tar - xvf spark - 2.0.2 - bin - hadoop2 . 7.tgz mv spark - 2.0.2 - bin - hadoop2 . 7 / spark cd conf / cp spark - env . sh . template spark - env . sh nano spark - env . sh

Add the following lines to spark-env.sh :

JAVA_HOME=/usr/lib/jvm/java-8-oracle SPARK_WORKER_MEMORY=4g PYSPARK_PYTHON=/home/<username>/anaconda3/bin/python source spark-env.sh 1 2 3 4 JAVA_HOME = / usr / lib / jvm / java - 8 - oracle SPARK_WORKER_MEMORY = 4g PYSPARK_PYTHON = / home / < username > / anaconda3 / bin / python source spark - env . sh

The run pyspark :

pyspark 1 pyspark

Next we will install npm, Node.js, maven, Zeppelin notebook :

apt update apt install npm apt install nodejs-legacy apt install libfontconfig wget http://www-eu.apache.org/dist/maven/maven-3/3.3.9/binaries/apache-maven-3.3.9-bin.tar.gz sudo tar -zxf apache-maven-3.3.9-bin.tar.gz -C /usr/local/ sudo ln -s /usr/local/apache-maven-3.3.9/bin/mvn /usr/local/bin/mvn node --version mvn --version git clone https://github.com/apache/zeppelin.git 1 2 3 4 5 6 7 8 9 10 apt update apt install npm apt install nodejs - legacy apt install libfontconfig wget http : / / www - eu . apache . org / dist / maven / maven - 3 / 3.3.9 / binaries / apache - maven - 3.3.9 - bin . tar . gz sudo tar - zxf apache - maven - 3.3.9 - bin . tar . gz - C / usr / local / sudo ln - s / usr / local / apache - maven - 3.3.9 / bin / mvn / usr / local / bin / mvn node -- version mvn -- version git clone https : / / github . com / apache / zeppelin . git

Next we need to open the profile file :

nano .bashrc 1 nano . bashrc

Add these :

export M2_HOME=/usr/local/apache-maven-3.3.9 export PATH=${M2_HOME}/bin:${PATH} 1 2 export M2_HOME = / usr / local / apache - maven - 3.3.9 export PATH = $ { M2_HOME } / bin : $ { PATH }

Then :

source .bashrc sudo npm install -g bower mvn clean install -DskipTests cd zeppelin/zeppelin-web # debug bower install # Run mvn clean install -DskipTests 1 2 3 4 5 6 7 8 source . bashrc sudo npm install - g bower mvn clean install - DskipTests cd zeppelin / zeppelin - web # debug bower install # Run mvn clean install - DskipTests

Start the daemon.sh :

bin/zeppelin-daemon.sh start 1 bin / zeppelin - daemon . sh start

Next steps you can follow official website :

https://zeppelin.apache.org/docs/0.6.0/install/install.html#building-from-source 1 https : / / zeppelin . apache . org / docs / 0.6.0 / install / install . html # building - from - source

Tagged With

This Article Has Been Shared 982 Times! Pinterest

About Abhishek Ghosh Abhishek Ghosh is a Businessman, Orthopaedic Surgeon, Author and Blogger. You can keep touch with him on Twitter - @AbhishekCTRL.