Jump to content

Integration Mods for Spotfire® Data Science and TIBCO® Data Virtualization Release 1.2.0


1 Screenshot

Summary

21 new Spotfire® Data Science - Team Studio operators that enable machine learning with Spark 2.4 for a wide range of data sources provided by TIBCO® Data Virtualization.

Overview

Spotfire® Data Science - Team Studio version 6.6 can exploit the power of TIBCO® Data Virtualization (TIBCO® DV) in order to connect to a diverse set of data sources. In order to combine the power of TIBCO DV to seamlessly handle many data sources ? including cloud data sources such as AWS S3 ? with the machine learning capabilities of Spark 2.4, we provided 21 new operators designed to work with TIBCO DV data sources. This allows users to build a complete data science workflow whilst still being able to use standard database operators provided by Spotfire® Data Science - Team Studio. The SQL queries are pushed down to TIBCO DV, and the machine learning executes in Spark. 

 

The new operators are provided in this community Exchange offering, and each operator is individually documented. The result is a set of scalable operators capable of processing large volumes of data.

 

This Exchange download includes the .jar files for the 21 Mods, along with an integration pack to set up the connectivity between TIBCO DV and Spotfire® Data Science - Team Studio. Follow this Knowledge Base article for Mods installation guidelines. 

 

Follow the provided installation instructions to configure Spotfire® Data Science - Team Studio and TIBCO DV. 

 

More details can be found in this overview of Integration Mods for Spotfire® Data Science and TIBCO® Data Virtualization.

 

 

Release 1.2.0

Published: July 2022

Release includes:

  • JAR file with 21 Team Studio operators extending the Team Studio functionality
  • Documentation for this integration pack
  • Documentation for all new operators
  • License information

What is new compared to previous version:

  • New Import Excel operator.
  • Bug fix for TDV ModelStore detection.



Release 1.1.0

Published: May 2022

Release includes:

  • JAR file with 20 Team Studio operators extending the Team Studio functionality
  • Documentation for this integration pack
  • Documentation for all new operators
  • License information

What is new compared to previous version:

  • Added support for the following Spark configurations:
    • Kerberos for Spark with Yarn cluster manager
    • Spark Standalone cluster manager
    • Spark running locally on the Team Studio server.
  • Added support for TIBCO DV 8.5.
  • ModelStore is now managed by TIBCO DV on the local file system or designated database.
  • Bug fixes for normalization in Elastic-Net Logistic Regression, Elastic-Net Linear Regression and K-Means Clustering.
  • Simplified ModelStore setup in the installation scripts.
  • New option in Export to File Storage to choose an external file system, for example a S3 bucket not associated with the Spark cluster.

 

Release 1.0.0

Published: June 2021

Initial release includes:

  • JAR file with 19 Team Studio operators extending the Team Studio functionality
  • Documentation for this integration pack
  • Documentation for all new operators
  • License information

×
×
  • Create New...