Table of Contents

Driven Administrator Guide

version 1.2

Installing and Configuring the Driven Plugin

Overview

To enable Driven, you must install the Driven Plugin in your Hadoop environment so that it can collect telemetry data and send it to the Driven Server. The plugin for the licensed, self-hosted version of Driven is different from the Driven Beta plugin in that it does not automatically send data to cascading.io, so it must be configured to know where to send the data.

For information about Driven Plugin setup in environments that use the cascading.io service as part of the beta program, see Getting Started with Driven for Driven Beta.

Prerequisites

  • The Driven Server must be installed and configured before you can set up the Driven Plugin.

  • Your data applications must have network access to your installation of the Driven Server.

Installing the Plugin

The following steps document how to obtain and configure the Driven Plugin. See the documentation for Amazon Elastic MapReduce or Maven and Gradle if you use those environments. If you plan to package the Driven Plugin directly in your application, you can add the Maven spec to your project file as a dependency. Adding the plugin to your Maven repositories is particularly useful if you plan to use Driven from your IDE as you develop a Cascading application.

  • On UNIX, Linux, and Mac: Run the following command:

$ wget -i http://files.concurrentinc.com/driven/1.2/driven-plugin/latest-jar.txt

The JAR file includes all dependencies.

Configuring the Plugin

Step 1: Make the plugin accessible to your application

Add the path of the plugin to the cascading-service.properties file in your Hadoop configuration directory (${HADOOP_CONF}). If you do not have the properties file, create it in ${HADOOP_CONF}. For Hadoop 1.x, the configuration directory may be ${HADOOP_HOME}/conf. For Hadoop 2.x, the configuration directory is ${HADOOP_HOME}/etc/hadoop.

$ echo cascading.management.service.jar=${PATH}driven-plugin-1.2.jar >> ${HADOOP_CONF}/cascading-service.properties

Alternatively, set the path to the plugin by adding it to the HADOOP_CLASSPATH environment variable:

$ export HADOOP_CLASSPATH=$HADOOP_CLASSPATH:${PATH}/driven-plugin-1.2.jar

Step 2: Set the Driven Server host URL

Specify the URL for the Driven Server in the Cascading service properties file in the following command format:

$ echo cascading.management.document.service.hosts=${DRIVEN_SERVER_URL} >> ${HADOOP_CONF}/cascading-service.properties

Alternatively, set the host in an environment variable. Example:

$ export DRIVEN_SERVER_HOSTS=${DRIVEN_SERVER_URL}

Step 3 (Optional): Using a Driven API Key

Each Driven Team has a unique API key. By associating an application with an API key, you make the application searchable by other members of your team.

  1. Log in to the Driven Server to open the Driven web interface.

  2. Hover over your user name in the upper-right corner of the window.

  3. Click My Teams.

  4. Record an API key for the team of your choice.

  5. Configure the Driven Plugin with the Driven API key by one of the following methods:

Store the API key parameter in the cascading-service properties file. Example:

$ echo cascading.management.document.service.apikey=${API_KEY} >> ${HADOOP_CONF}/cascading-service.properties

Alternatively, you can set the API key in an environment variable. Example:

$ export DRIVEN_API_KEY=${API_KEY}

Additional Configuration Settings

The Driven Plugin has additional configuration options that primarily are for administrative tasks. For more information, see Driven Plugin Configuration Parameters.

Next