Workspaces
Note
This tab is visible only for Databricks clusters.
In this section, you are provided with instructions to add and set up your workspace.
On the Unravel UI, click Workspaces. The Workspaces manager page is displayed.
Click Add Workspaces. The Add Workspace dialog is displayed.
Enter the following details and click Add.
Field
Description
Workspace Id
Databricks workspace ID.
Workspace Name
Databricks workspace name.
Instance (Region) URL
Regional URL where the Databricks workspace is deployed.
Tier
Select a subscription option:
Standard
Premium
Enterprise (AWS)
Dedicated (AWS)
Token
Personal access token to authenticate to and access Databricks REST APIs. Refer to Authentication using Databricks personal access tokens to create personal access tokens.
From the Workspace manager, click Configure Cluster . The Setup Databricks with Unravel page is displayed.
You must update the following settings under Advanced options for every cluster (Automated /Interactive) in your workspace. Use configurations in Spark 2.4.x and below tab or Spark 3.0.x and above tab, whichever is applicable.
Spark/SparkConfig
Copy the following snippet to Spark > Spark Conf. Replace
<Unravel DNS or IP Address>
.Note
For spark-submit jobs, click Configure spark-submit and copy the following snippet in the Set Parameters > Parameters text box as spark-submit parameters. Replace
<Unravel DNS or IP Address>
."--conf", "spark.eventLog.enabled=true", "--conf", "spark.eventLog.dir=dbfs:/databricks/unravel/eventLogs/", "--conf", "spark.unravel.shutdown.delay.ms=300", "--conf", "spark.unravel.server.hostport=
<Unravel DNS or IP Address>
:4043", "--conf", "spark.executor.extraJavaOptions= -Dcom.unraveldata.client.rest.request.timeout.ms=1000 -Dcom.unraveldata.client.rest.conn.timeout.ms=1000 -javaagent:/dbfs/databricks/unravel/unravel-agent-pack-bin/btrace-agent.jar=config=executor,libs=spark-version (2.4/3.0)
", "--conf", "spark.driver.extraJavaOptions= -Dcom.unraveldata.client.rest.request.timeout.ms=1000 -Dcom.unraveldata.client.rest.conn.timeout.ms=1000 -javaagent:/dbfs/databricks/unravel/unravel-agent-pack-bin/btrace-agent.jar=config=driver,script=StreamingProbe.btclass,libs=spark-version (2.4/3.0)
"spark.eventLog.enabled true spark.eventLog.dir dbfs:/databricks/unravel/eventLogs/ spark.unravel.server.hostport
<Unravel DNS or IP Address>
:4043 spark.unravel.shutdown.delay.ms 300 spark.executor.extraJavaOptions -Dcom.unraveldata.client.rest.request.timeout.ms=1000 -Dcom.unraveldata.client.rest.conn.timeout.ms=1000 -javaagent:/dbfs/databricks/unravel/unravel-agent-pack-bin/btrace-agent.jar=config=executor,libs=spark-version (2.4/3.0)
spark.driver.extraJavaOptions -Dcom.unraveldata.client.rest.request.timeout.ms=1000 -Dcom.unraveldata.client.rest.conn.timeout.ms=1000 -javaagent:/dbfs/databricks/unravel/unravel-agent-pack-bin/btrace-agent.jar=config=driver,script=StreamingProbe.btclass,libs=spark-version (2.4/3.0)
Logging
Select DBFS as Destination, and copy the following as Cluster Log Path.
dbfs:/cluster-logs/
Init Script
In the Init Scripts tab, set Destination to DBFS. Copy the following as the Init script path and click Add.
dbfs:/databricks/unravel/unravel-db-sensor-archive/dbin/install-unravel.sh