Mohd Abid Dhamiri Bin Atan Posted July 12, 2021

Just an arbitrary example here: let's say I would like to split the Red Wine Quality dataset (https://www.kaggle.com/uciml/red-wine-quality-cortez-et-al-2009) into 3 datasets in 75%, 20% and 5% ratios for training, validation and testing of my model. What are the different methods of achieving this in Spotfire? I would like to rely purely on Spotfire, without any R code involved.

P.S. I am not actually using the Red Wine Quality dataset mentioned above, but my current dataset has a similar number of variables and observations.
Gaia Paolini Posted July 13, 2021

Create an intermediate random column with values between zero and one:

[RANDOM01]: Rand(123)

Since this column will be (pseudo-)uniformly distributed, you can then assign rows to a sample with:

[sAMPLE_ID]: case when [RANDOM01]
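
To make the idea behind the uniform random column concrete, here is a minimal sketch of the thresholding logic in Python, used only to illustrate the technique outside Spotfire. The cut points 0.75 and 0.95 are an assumption derived from the 75%/20%/5% ratios in the question, not taken from the expression above; the function name `assign_sample`, the labels, and the row count are hypothetical.

```python
import random


def assign_sample(u: float) -> str:
    """Map a uniform draw in [0, 1) to a sample label via cumulative cut points."""
    # Assumed cut points: 0.75 = 75% train, 0.95 = 75% + 20% validation.
    if u < 0.75:
        return "Train"
    if u < 0.95:
        return "Validation"
    return "Test"


# Seeded generator so the assignment is reproducible, similar in spirit to a fixed seed.
rng = random.Random(123)
n_rows = 1599  # illustrative row count, not the asker's actual dataset

labels = [assign_sample(rng.random()) for _ in range(n_rows)]
counts = {name: labels.count(name) for name in ("Train", "Validation", "Test")}
shares = {name: counts[name] / n_rows for name in counts}
print(shares)
```

Because each row is assigned independently, the resulting shares are only approximately 75%, 20% and 5%; with a few thousand rows the deviation is usually small.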