Datameer provides users with data discovery & transformation capabilities to visually explore and transform their data to identify the key insights for improving data driven decisions. Users can transform their data using a variety of no-code, low-code and SQL transformation recipes and then quickly publish this data back into Snowflake to support the development of BI dashboards, reports and data science driven insights in your organization. All this can be delivered without the need for IT assistance or formal training and without leaving a single user interface.
In this training lab you will learn how to model and transform your data, create highly valuable datasets, and publish the data into Snowflake. No coding skills are necessary using our no-code tools or writing code using our SQL toolset. Or even use both! We will provide you with a test Datameer & Snowflake instance for this hands-on lab.
This entry-level lab introduces you to the user interface and basic capabilities of Datameer. When done with the lab you should be ready to create your own data transformations against your own Snowflake data.
Anyone who will be exploring & transforming data in Snowflake. Includes technical roles like members of an Analytics & BI team, data engineers to non-technical business users in the various lines of business. No SQL or technical skills are needed to do this lab.
In this workshop lab, you will experience the following capabilities in Datameer:
For the purposes of this workshop, you play the role of a loans analyst. Throughout this lab we will be working with some current and historical loans data from our fictitious loans company. We will begin with looking at our current loans data only, but throughout the course of the lab will incorporate some geographical information to transform & understand our loan portfolio by state/region being able to spot business opportunities in these regions.
Click the link below to get to the trial setup:
The first time you login to Datameer you can choose to be taken directly to a Project, supported by an onboarding checklist which guides you though essential Datameer features. We will begin exploring and transforming data in the next module.
But before, let's have a quick look at the Datameer Workbench UI:
Let's add our first data source by clicking the + button next to the LOANS
table from the Data Browser. This will add the table to our Workbench for discovery and transformation.
We can explore this dataset in three different ways:
GRADE
column, and you can see the number of values, empty values, uniques, nulls, and the distribution of values.Let's now add another dataset to this Project - this time by adding the ZIPCODE
dataset by selecting the + button next to the table. You should see both added datasets in your Flow Area.
Click on the LOANS node and select the + button to bring up the list of no-code transformations.
Let's apply a filter by doing the following:
Here we can see a Filter condition in the UI with a Recipe step on the right hand side. You may see another purple hint appear on the screen informing you that you have created your first Receipe.
Let's apply another transformation. The ´Extract´ operation allows us to extract a certain number of characters from one field to a new column name. In our example we want to extract the first 3 digits from our ZIP_CODE
column and populate this in a new column name called ZIP
. (Please note we use an INDEX value starting at 0 and extract 3 characters from this position to get the first 3 characters from this column.)
Here are the steps for an Extract:
You should now see the following flow:
The Formula Builder can be used to enter low-code formulas on a columnar basis. All functions can be found on the right hand side, grouped together by different categories (STRING, NUMERIC, DATE & TIME, CONVERSION, CONTEXT & CONDITIONAL) - selecting a category allows you to browse the Snowflake functions and obtain help on how to use these against your data. Inside the Formula Builder we can select a field to transform using a variety of functions or we can select the + button to add a new column.
Let's apply a formula to concatenate the first 4 characters from the LOAD_ID
column with the MEMBER_ID
column and put a -
separator between the two values:
CONCAT(LEFT(Source.LOAN_ID,4),'-',MEMBER_ID)
You will see the following screen after entering the formula described in Step 3 above.
LOANS
data with our ZIPCODE
(POSTCODE) dataset. Let's apply a join:
Join configuration:
The resultant flow:
Datameer provides functionality to perform exploration of datasets through dimensions and measures without having to export to a BI tool. For example, to calculate the average interest rate by loan grade and purpose, perform the following steps:
You will then see the aggregated results, which are always calculated based on the full data set. Additionally users can filter and sort the explorations. The explorations are tied to the nodes, and can be retrieved by clicking on the saved Explore tab. Currently the explorations are saved in the browser's cache, with future plans to persist them as nodes, as well as to provide visual exploration. You can also visualize the results by clicking on the bar chart icon.
Users can create persistent aggregation and pivot nodes. For example, we can create an aggregation of the interest rate by the state:
Aggregate configuration:
The updated pipeline flow:
We now want to deploy our newly transformed joined and aggregated data back into Snowflake:
Your transformations are now deployed to Snowflake as a table!
Users can upload files through Datameer which are then materialized as Snowflake tables. You can download a sample CSV file here.
To upload a file, perform the following:
The file is now materialized as a table.
Users can also add SQL nodes as part of the overall transformation process. SQL nodes can be combined with no- or low-code nodes. We can try the following to insert a SQL node into our pipeline:
SELECT
LOAN_ID, MEMBER_ID, LOAN_AMNT, LEFT(TERM, 2)
FROM LOANS
WHERE
LOAN_AMNT < 10000 AND LOAN_STATUS = 'Current'
You can now see the SQL node in your Flow Area:
This training was designed to be an introduction in how to use Datameer for data transformation geared towards a business user or analyst. Through this lab we have walked through how the Datameer Project Workbench enables us to quickly explore & transform Snowflake data in a self-service manner using a no-code, low-code and SQL based approach. Because Datameer is SaaS and pushes all queries to Snowflake for execution, Datameer can combine its user interface and the unlimited power of Snowflake to give users a quick & easy way to transform your data, but with no limit on the amount of data you wish to analyze. Unlock the value of all your data and users!
We're happy to help with whatever questions you have! Reach out to us using our support address at support@datameer.com
Documentation: https://documentation.datameer.com/datameer/
Knowledge Base: https://help.datameer.com/hc/en-us
Datameer Blog: https://www.datameer.com/blog/
Resources/Case Studies: https://www.datameer.com/resources/