Explorer 2

Quick tour of features and functionality

Getting Started

This page can be accessed by clicking the Explorer option under the compass icon

This is being released as a tech preview and supports JDBC sources

Select Your Data Source

Navigate the tree explorer and browse your predefined connections

Create a new OwlCheck by clicking +Create OwlCheck

The edit icon indicates this OwlCheck can be edited and resumed.

View Data is an interactive option to run queries and explore the data

The bar chart icon will take you to a profile page of the dataset created prior to Explorer 2

Select The Scope and Define a Query

The orange SQL Editor icon allows you to test a query

Pick Date Column if your dataset contains an appropriate time filter

Click Build Model -> to Save and Continue
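As a rough illustration of what a scoped query with a date column looks like, the sketch below filters a table to a single run date. It uses an in-memory SQLite database only so that it runs anywhere; the table name `orders`, the column `order_date`, and the helper `scoped_rows` are hypothetical and are not part of Explorer.

```python
import sqlite3


def scoped_rows(conn, table, date_column, run_date):
    """Return the rows of `table` whose `date_column` equals `run_date`."""
    # Identifiers cannot be bound as SQL parameters, so validate them first.
    for name in (table, date_column):
        if not name.isidentifier():
            raise ValueError(f"unsafe identifier: {name!r}")
    sql = f"SELECT * FROM {table} WHERE {date_column} = ?"
    return conn.execute(sql, (run_date,)).fetchall()


# Hypothetical sample data standing in for a JDBC source.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, order_date TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "2020-01-01"), (2, "2020-01-02"), (3, "2020-01-02")],
)

rows = scoped_rows(conn, "orders", "order_date", "2020-01-02")
print(rows)
```

Testing the equivalent query in the SQL Editor before building the model is a quick way to confirm that the chosen date column really narrows the data to one run.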

Transform Tab (optional)

The Transform gear icons let you apply common transformation functions

Click Build Model -> to Save and Continue

Profile

Various options can be applied to customize analysis

Click Save, then click Records to continue

Records

Click a column checkbox to apply record deltas

This should be applied to low cardinality columns

Click Save, then click Pattern to continue
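To judge which columns count as low cardinality before ticking their checkboxes, a quick check on a data sample can help. The sketch below is only an illustration: the function name `low_cardinality_columns` and the 0.2 distinct-ratio threshold are assumptions, not values used by the product.

```python
def low_cardinality_columns(rows, max_distinct_ratio=0.2):
    """Return the columns whose distinct-value ratio is at or below the threshold.

    `rows` is a list of dicts sharing the same keys. The threshold is an
    assumed rule of thumb, not a product setting.
    """
    if not rows:
        return []
    selected = []
    for column in rows[0].keys():
        distinct = len({row[column] for row in rows})
        if distinct / len(rows) <= max_distinct_ratio:
            selected.append(column)
    return selected


# Hypothetical sample: `id` is unique per row, `status` has two values.
sample = [{"id": i, "status": "open" if i % 2 else "closed"} for i in range(20)]
print(low_cardinality_columns(sample))
```

Here `status` qualifies (2 distinct values in 20 rows) while `id` does not, since every row has its own value.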

Pattern (optional)

Toggle on Pattern to enable this layer

Click +Add to define a group and series of columns

A key indicates a sub grouping or bucketing

Click Save, then click Outlier to continue

Outlier (optional)

Click Save, then click Dupe to continue

Dupe (optional)

The Dupe Score slider controls the degree of fuzzy matching

Click Save, then click Source to continue
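The idea of a score threshold for fuzzy matching can be sketched with Python's standard `difflib` module. This is not the matching algorithm the product uses, which this page does not describe. The function `find_dupes`, the 0 to 100 scale, and the assumption that a higher score means a stricter match are all illustrative.

```python
from difflib import SequenceMatcher
from itertools import combinations


def find_dupes(values, min_score=85):
    """Return (a, b, score) for each pair whose similarity reaches min_score.

    The score is difflib's similarity ratio scaled to 0-100; the scale and
    the default threshold are assumptions made for this sketch.
    """
    pairs = []
    for a, b in combinations(values, 2):
        ratio = SequenceMatcher(None, a.lower(), b.lower()).ratio()
        score = round(ratio * 100)
        if score >= min_score:
            pairs.append((a, b, score))
    return pairs


names = ["John Smith", "Jon Smith", "Alice Jones"]
pairs = find_dupes(names)
print(pairs)
```

Raising `min_score` toward 100 leaves only near-exact matches, while lowering it flags looser matches and usually more false positives.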

Source (optional)

Navigate to the source dataset

Click Preview to interlace the columns

Manually map the columns by dragging left to right or deselect columns

Click Save, then click Save/Run to continue
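The mapping step can be thought of as matching columns by name first, then applying manual corrections and exclusions. The sketch below illustrates that idea only. The function `map_columns`, its name normalization rule, and the sample column names are hypothetical and do not describe how Preview interlaces columns internally.

```python
def map_columns(target_cols, source_cols, overrides=None, excluded=()):
    """Map target columns to source columns.

    Columns are matched on a normalized name (lowercase, underscores
    removed). `overrides` plays the role of a manual drag mapping and
    `excluded` plays the role of deselected columns.
    """
    overrides = overrides or {}

    def normalize(name):
        return name.lower().replace("_", "")

    by_normalized = {normalize(s): s for s in source_cols}
    mapping = {}
    for col in target_cols:
        if col in excluded:
            continue  # deselected column, left unmapped
        if col in overrides:
            mapping[col] = overrides[col]  # manual mapping wins
        elif normalize(col) in by_normalized:
            mapping[col] = by_normalized[normalize(col)]
    return mapping


mapping = map_columns(
    ["cust_id", "name", "notes"],
    ["CUSTID", "full_name", "NOTES"],
    overrides={"name": "full_name"},
    excluded=["notes"],
)
print(mapping)
```

In this example `cust_id` is matched automatically, `name` needs a manual mapping because the source calls it `full_name`, and `notes` is left out.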

Run

Select an agent

Click Estimate Job

Click Run to start the job

Advanced deployment optimizations are available in Parallel JDBC and Deployment Mode

AutoProfile

AutoProfile allows you to select a set of databases and tables to be cataloged quickly. Each selected table is profiled and added to the Owl Catalog via the selected agent. Alerts, job schedules, and limit values can also be set.

When you expand a data source in the Explorer page, you see a list of its databases and their associated tables. AutoProfile is triggered when you select the ones you want and click scan. This takes you to a separate page where you can configure the AutoProfile parameters.

A SparkSubmit is launched for each table, so make sure the agent configuration is reasonable and the host has enough resources to handle one JVM per table.
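A back-of-the-envelope calculation can show whether a host is likely to cope with one JVM per table. The sketch below is plain arithmetic under stated assumptions. The function `max_concurrent_jobs`, the memory reserved for the operating system, and the sample figures are hypothetical, and the sketch does not reflect how the agent actually schedules jobs.

```python
import math


def max_concurrent_jobs(host_memory_gb, host_cores, job_memory_gb,
                        job_cores=1, reserved_memory_gb=2):
    """Estimate how many per-table JVMs fit on the host at the same time.

    All inputs are assumptions supplied by the reader; the result is the
    smaller of the memory-bound and core-bound limits.
    """
    usable_memory = host_memory_gb - reserved_memory_gb
    by_memory = int(usable_memory // job_memory_gb)
    by_cores = host_cores // job_cores
    return max(0, min(by_memory, by_cores))


# Hypothetical host: 32 GB RAM, 8 cores, 4 GB per job.
concurrent = max_concurrent_jobs(host_memory_gb=32, host_cores=8, job_memory_gb=4)

# If 20 tables are selected, this many sequential waves would be needed.
tables = 20
waves = math.ceil(tables / concurrent)
print(concurrent, waves)
```

If the number of selected tables is far above the estimate, it may be worth scanning fewer tables at a time or lowering the per-job memory in the agent configuration.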

AutoProfile wizard