Code for distributing highly comparative time-series analysis computations, using hctsa, on a computing cluster using pbs or slurm using Matlab (without linking to a mySQL database).
A basic pipeline:
- Set up a large
HCTSA.matfile for your computation on your local machine usingTS_Init. - Ensure that the hctsa version on your computing cluster is identical to the local version used to run
TS_Init(otherwise results could be inconsistent). - Transfer the (uncomputed)
HCTSA.matfile onto the cluster - Set the parameters
tsMin,tsMax, andnumPerJobinHCTSA_run.sh. These parameters determine howHCTSA.matwill be distributed into segments, each of which will be submitted as a cluster job. - Run
HCTSA_run.shin the parent directory which should contain theHCTSA.matfile. This will generate a set of directories containing subsets of time series. (NB: you may need to grant yourself permission to execute:chmod u+x HCTSA_run.sh) - When all computations are complete, stitch all the subsections of the main
HCTSAfile back together again usingcombineBatchFiles. This yields a fully computedHCTSA.matfile. 😄