apache-zeppelin

Pass and process an array from JS to Spark in Zeppelin

余生颓废 提交于 2019-12-11 10:19:45
问题 I have to visualiase some map thingy in Zeppelin via spark and JS (angular interpreter in this case). Zeppelin has this binding feature but it gives a little knowledge about usage. I would like to click on the button, evaluate some stuff, than send the id_selected_arr to the spark via binding. In %angular window <input type="button" value="Click me" ng-click="eval()" ng-model="id_selected_arr"/> In %spark2 window var result = z.angular("id_selected_arr") I get: result: Object = id_selected

Why does helium give 'Unexpected token' error in Apache Zeppelin?

微笑、不失礼 提交于 2019-12-11 05:49:33
问题 I am trying to add volume-leaflet via helium to Apache zeppelin (0.7) as per these instructions. I get this error when trying to enable it: ./~/leaflet/dist/leaflet.css Module parse failed: /usr/hdp/2.6.0.3-8/zeppelin/local-repo/vis/node_modules/leaflet/dist/leaflet.css Unexpected token (3:0) You may need an appropriate loader to handle this file type. SyntaxError: Unexpected token (3:0) at Parser.pp$4.raise (/usr/hdp/2.6.0.3-8/zeppelin/local-repo/vis/node_modules/acorn/dist/acorn.js:2221:15)

apache zeppelin additional repository import

蓝咒 提交于 2019-12-10 20:15:51
问题 How to import an additional dependency in zeppelin? I am trying to import a library (saddle) in apache zeppelin, however ... import org.saddle._ <console>:21: error: object saddle is not a member of package org import org.saddle._ ^ I've tried using the %dep interpreter (dependency loader) like this: %dep z.reset() z.addRepo("Saddle Repo").url("http://mvnrepository.com/artifact/org.scala-saddle/saddle_2.10/1.0.1") z.load("org.scala-saddle:saddle_2.10:1.3.0") I've also tried in the GUI by 1.

Container killed by YARN for exceeding memory limits

好久不见. 提交于 2019-12-10 18:22:17
问题 I am creating a cluster in google dataproc with the following characteristics: Master Standard (1 master, N workers) Machine n1-highmem-2 (2 vCPU, 13.0 GB memory) Primary disk 250 GB Worker nodes 2 Machine type n1-highmem-2 (2 vCPU, 13.0 GB memory) Primary disk size 250 GB I am also adding in Initialization actions the .sh file from this repository in order to use zeppelin. The code that I use works fine with some data but if I use bigger amount of, I got the following error: Container killed

zeppelin hive interpreter throws ClassNotFoundException

亡梦爱人 提交于 2019-12-10 15:18:26
问题 I have deployed zeppelin 0.6 and configured hive under Jdbc interpreter. Tried executing %hive show databases Throws: org.apache.hive.jdbc.HiveDriver class java.lang.ClassNotFoundException java.net.URLClassLoader.findClass(URLClassLoader.java:381) java.lang.ClassLoader.loadClass(ClassLoader.java:424) sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:331) java.lang.ClassLoader.loadClass(ClassLoader.java:357) java.lang.Class.forName0(Native Method) java.lang.Class.forName(Class.java:264)

How to use Zeppelin to access aws spark-ec2 cluster and s3 buckets

試著忘記壹切 提交于 2019-12-10 13:48:35
问题 I have an aws ec2 cluster setup by the spark-ec2 script. I would like to configure Zeppelin so that I can write scala code locally on Zeppelin and run it on the cluster (via master). Furthermore I would like to be able to access my s3 buckets. I followed this guide and this other one however I can not seem to run scala code from zeppelin to my cluster. I installed Zeppelin locally with mvn install -DskipTests -Dspark.version=1.4.1 -Dhadoop.version=2.7.1 My security groups were set to both

Apache Zeppelin 0.8.0-Snapshot “Failed to exec build:dist” ( Building Zeppelin: web Application)

怎甘沉沦 提交于 2019-12-10 12:06:09
问题 I am building Apache Zeppelin 0.8.0 from maven and I have to use advanced features provided by Zeppelin e.g. Apache Zeppelin Notebook Authorization allow "Runners" But I am trying while different versions of node and npm but still getting the following error during mvn clean package -DskipTests for building Building Zeppelin: web Application. Following is the error log from debug log: /root/.npm/_logs/2018-03-22T10_38_10_265Z-debug.log Also when this new version( 0.8.0) will release? 1

How can I pretty print a wrappedarray in Zeppelin/Spark/Scala?

拈花ヽ惹草 提交于 2019-12-10 11:36:26
问题 In this question I was told how to print a dataframe using zeppelin's z.show command. This works well except for 'WrappedArray' appearing in the lemma column: I have tried this: z.show(dfLemma.select(concat_ws(",", $"lemma"))) but it just gave me a list of words, not nicely formatted and I also want the racist column in my output. Any help is much appreciated. 回答1: Here's a suggestion for formatting your array column: import org.apache.spark.sql.Column import org.apache.spark.sql.functions._

How is an imported name resolved in Scala? (Spark / Zeppelin)

和自甴很熟 提交于 2019-12-10 08:50:44
问题 I have a script running in a paragraph with the Spark interpreter in Zeppelin. It has an import and the name imported can be resolved from the global namespace and also from a function, but not from a method inside a class. This runs well on my computer's installation of Scala (2.12) but it doesn't work in Zeppelin (Scala 2.11). import java.util.Calendar def myFun: String = { // this works return Calendar.getInstance.toString } class MyClass { def myFun(): String = { // this doesn't return

Hello world in zeppelin failed

一曲冷凌霜 提交于 2019-12-10 02:02:38
问题 I just installed apache zeppelin (built from latest source from git repo) and successfully saw it is up and running in the port 10008. I created a new note book with a single line of code val a = "Hello World!" And run this paragraph and saw the below error java.net.ConnectException: Connection refused at java.net.PlainSocketImpl.socketConnect(Native Method) at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:350) at java.net.AbstractPlainSocketImpl.connectToAddress