Jump to content

Column Correlation Python Data Function for Spotfire® 1.2.0


2 Screenshots

Summary

This Python data function calculates the correlation coefficients between columns of data. Multiple correlation methods are available which are: Pearson, Spearman and Kendall

Overview

This Python data function calculates the correlation coefficients between columns of data. Correlation analysis is an important step in comparing data to determine whether it is highly correlated or not, and if so is that negatively or positively correlated. This can help determine relationships in data, as well as aid in data reduction by removing highly correlated data in use cases such as data science model building.

In this data function, three methods are available. These are: Pearson, Spearman and Kendall. All methods return a correlation score between -1 and 1 indicating the correlation score between two columns of data. A score of -1 is the maximum negative correlation, a score of 1 is the maximum positive correlation and a score of 0 means there is no correlation.

More info can be found here.

Also available as a Statistica data function.

 

Installing the data function

Follow the online guide available here to register a data function in Spotfire.

 

Configuring the data function

Each data function may require inputs from the Spotfire analysis and will return outputs to the Spotfire analysis. For each data function, these need to be configured once the data function is registered. To learn about how to configure data functions in Spotfire please view this video:

For more information on Spotfire visit the Spotfire training page.

 

Data function library

There exists a large number of data functions covering various features. Feel free to review what is available on the Data Function Library.

Release 1.2.0

Published: October 2021

Changes to previous release:

  • Added new encode strings option to allow correlation of strings as well as numeric data

Release includes:

  • Data function
  • Dxp with example usage
  • Documentation
  • License information

 

Release 1.1.0

Published: May 2021

Changes to previous release:

  • Updated SFD file to correct version
  • Improved Python code for error handling
  • Added compatibility for Spotfire 11.3 and above
  • Added 2nd example to Spotfire DXP showing optional feature selection parameter

Release includes:

  • Data function
  • Dxp with example usage
  • Documentation
  • License information

×
×
  • Create New...