Changelog¶

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog and this project adheres to Semantic Versioning

3.2.1 - 2020-07-02¶

Added¶

Major documentation update.

Changed¶

PyRadiomics setup not dependent on pre-install of numpy anymore, so altered travis, yml, setup, and documentation.
PREDICT updated, change dependencies.
Defaults are now a bit different: see argumentation in the documentation. Main thing is that PyRadiomics and PREDICT both compute certain features, so it’s redundant to use both. Additionally, the wavelet features from PyRadiomics add +1000 features, which did not seem to help in many experiments while majorly slowing down the computation.

Fixed¶

Catch error when PCA does not converge in fit and score function.

3.2.0 - 2020-06-26¶

Added¶

Labelprocessing can now also handle having patient ID in the feature files. Was required for ComBat.

Changed¶

Output of plot_SVM function is better ordered.
Several defaults, as we now have PyRadiomics fully embedded, resulting in a large increase in features.
No more overrides for the full config, as the default now is the full config.

Fixed¶

Cardinality of decomposition tool was incorrect.
ComBat integration in WORC network now works properly.
Documentation didn’t build due to C-extension dependencies of PyRadiomics, and thus therefore also PREDICT. Fixed in setup.py and readthedocs files.
Documentation building.
Bugfix in segmentix test output url.
Bugfix in SimpleWORC facade when using features.
Warning when using Evaluation pipeline without images.

3.1.4 - 2020-05-26¶

Added¶

Catch error if number of segmentations supplied does not match number of images.
Add support in SimpleWORC and BasicWORC for multiple segmentations per patient.
Chi2 test in statistical testing.
fastr tool to make boxplots of all features, overall and per class.
Added this boxplot tool to the evaluate workflow.
Option in evaluation to overfit feature scaling to test set: should only be used to assess differences between the training and test sets, not in an actual model.
Option to delete small objects in segmentation.
Option to within the preprocessing, use a dilated ROI.
Otsu thresholding as mask for preprocessing.
Memory for each fastr node is now in a dictionary of the WORC object and can be easily changed.
PyRadiomics now fully embedded and configurable.
ComBat harmonization: currently as separate tool on full dataset, not in cross-validation.
Computation of ICC, and thresholding object to use ICC for feature selection.
Added groupwise feature selection per feature extraction toolbox.
Feature converter tool, to convert features from a toolbox to WORC compatible format.
RobustScaler for feature scaling.
Decomposition to evaluate network.
Combat: in WORK workflow.

Changed¶

Resampling of objects is now after feature selection.
Made plot_SVM function more memory efficient.
For PCA, Relief, VarianceThreshold, and SelectFromModel feature selection, you can now simply supply a float to determine the percentage of times this method is used in the created workflows.
Moved load_features from trainclassifier to file_io.
Matching of PID from labels from label file to other objects is now all converted to lower case.
Refactoring of WORC network building.
Segmentix tool is cleaned up. Segmentix script is moved to processing.

Fixed¶

Order of methods in preprocessing function of SearchCV did not correspond with that in fitandscore.
Replace spaces in uri conversion of sources in SimpleWORC.
Check whether all fitandscore jobs succeeded, otherwise throw error.
Bug in PCA when n_components > min(n_samples, n_features)
Random seed is now set and passed to PCA, Relief and all classifiers for reproducability of the results.
Evaluate can now also accept multiple feature toolboxes.

3.1.3 - 2020-01-24¶

Added¶

Some options for the plot_images slicer function.
Validators to check your inputs before executing experiment.
Timer in classification log.
Backwards compatibility in fastr config.
Option for fixed seed in cross-validation.
BasicWORC Facade.
Support for computing confidence intervals on differences between models.
Error when config file cannot be found.

Changed¶

Preprocessor slows down progress and is not always neccesary. Made it optional.
Moved the preprocessor to the SearchCV script to do once on the training set, not in within the algorithm optimization.
Ensembling during training is turned-of, as it takes to much time. Only used when plotting the performance.
Debug flag includes fixed seed in cross-validation.
Joblib now uses by default only 1 core and threading backend.

Fixed¶

Bug in patient naming of plotting function: if ensembling was done in training, do not re-ensemble.

3.1.2 - 2019-12-09¶

Added¶

Support for Oncoradiomics RadiomiX tool
Groupwise Search includes GLDZM, Fractal, location, NGTDM, NGLDM, wavelet, and rgrd features
SimpleWORC now also accepts features instead of images and segs as input
Preprocessor object to preprocess the features before any fitting algorithms are ran. Mostly to detect possible faults/errors in the data.

Changed¶

On runtime, copy config.d file if it does not exist yet.

Fixed¶

KNN imputation gave an error if >80% of the feature values were missing. Added preprocessing function to remove these features.

3.1.1 - 2019-11-28¶

Added¶

Travis continuous integration for Windows.
Removed RT Struct Reader, as it was buggy and not used.

Changed¶

SimpleWORC now properly uses the fastr FileSystem plugin, instead of supplying the absolute filepaths.

Fixed¶

Under Windows, creation of the fastr home folder if it did not exist did not work when using pip for installing. We now use a new feature of fastr to simply add the WORC folder to the fastr config at WORC import time.
When debugging, override manual tempdir setting and always use default. On Windows, Travis gives errors if the tempdir is not in a vfs mount.
Use shutil.move in the datadownloader, as os.rename could not overwrite files.

3.1.0 - 2019-10-16¶

Added¶

Thresholding option in plot_SVM.
NPV (Negative Preditive Value) to classification metrics.
Facade for easier interaction with WORC.
Thresholding option in plot_SVM.
Function to create fixed splits for cross validation.
n_splits parameter for train-test cross validation.
Added generalization score.
Parameter to choose how many of the optimal settings to save (maxlen).
Option to combine multiple onevsrest models in plot_SVM.
StatsticalTestThreshold feature selection for multilabel problems.
Support for test sets in which only one class is present in various plotting functions and the metrics.
Installation: create fastr home if it does not exist yet.
Boostrapping as performance evaluation in plot_SVM.
Confidence intervals for boostrapping based on percentile.
Catch for if patient is in the test set, but not the overall label set.
Downloader for downloading example datasets.
ReadTheDocs.yml for configuration of documentation.
Unit test included in Travis.
Various detectors.

Changed¶

Plot_SVR is removed: it’s now embedded in plot_SVM.
Moved statistical feature selection to last step in fit and score.
Also the minimum train and validation score are now saved.
Put scaler at top of fitandscore function.
Make link in file conversion if output is same format as input.
Sort keys in performance output JSON.
VarianceThreshold features selection on by default.
Removed grid_scores from SearchCV as support is dropped in sklearn > 0.20
Renamed IntermediateFacade to SimpleWORC
Use inspect to find packagedir

Fixed¶

Metric computation can now handle it when both the truth and the predicted labels are from a single class.
Plotting module now correctly initialized.
Plot_SVM now also works properly for regression.
Masks for ROI normalization now properly added.
Preprocessing: mask needed to be cast to binary.
Failed workflows now return nan instead of zero for performance.
Several bugs in multilabel performance evaluation
Ring in segmentix was in sagital instead of axial direction.
Added replacenan in features before applying SMOTE.
Metadata test was not passed to calcfeatures: bugfix.
Bugfix: overide labels in facade when predict_labels is called.
Several bugfixes in the overrides in the facade configbuilder.
Various print commands converted to Python3: .format prints were still left and sometimes buggy.
StatisticalTestFeatures and PlotRankedScores tools only accepted cardinality of 1.
Bugfixes in many plotting functions: opening files with ‘w’ instead of ‘wb’ due to python3 conversion, Compatibility issues with plot_SVM due to conversion.
Except error when Grahpviz is not installed.
Symlinking in worccastcovert not supported by Windows, reverted to copying.
Bugfix in create_ensemble in SearchCV when using ensemble = 1.

3.0.0 - 2019-05-08¶

Added¶

Now ported to Python3.6+ (Python 2 is no longer supported!). Thereby also to fastr3.
Compatibility for Windows. Some small changes in functions, as some packages behaviour differently under Windows. Also, adjusted sink and source paths to use OS file separator.
Config is now also a sink.

Changed¶

PCE and DTI node removed, as they were not open source.
Pinfo file can now also be a csv. Txt is still supported.
Use fastr as default for hyperparameter search parallelization instead of Joblib, as this is much more flexible.
When the conffidence interval cannot be computed, just use the mean.

Fixed¶

WORC_config.py was not correctly copied in Windows due to incorrect path separation.
Source creation for the config was only for Linux.
In numpy 1.15>, booleans cannot be subtracted. Fixed an error due to this in segmentix by using bitwise_xor instead.
Bug when using masks, but not for all images, and segmentix.
Cardinality of classify node was incorrect.

2.1.3 - 2019-04-08¶

Changed¶

PREDICT was updated, so had to update the requirements. Changed it to a minimum of PREDICT to prevent these issues in the future.

2.1.2 - 2019-04-02¶

Added¶

Dummy workflow in segmentix and calcfeatures PREDICT tools.
Added several new PREDICT parameters.
Slicer tool.

Changed¶

Memory for elastix tool is now larger.

Fixed¶

-Evaluate framework now correctly adopts the name you give it.

2.1.1 - 2019-02-15¶

Added¶

Several new PREDICT variables to the config.
Multilabel classification workflow.
New oversampling strategy.
RankedSVM multilabel classification and Relief feature selection.

Changed¶

Major reduction in memory usage, especially due to PREDICT updates.
Only use first configuration in the classify config.
Outputs are now in multiple subfolders instead of one big folder.

Fixed¶

Minor bug in test workflow: needed str of label in appending to classify.
There was a bug in using a .ini file as a config.

2.1.0 - 2018-08-09¶

Added¶

Feature imputation settings in WORC config.
PCA settings in WORC config.
Dummy file, which can generally be accepted by WORC.
Preprocessing is now a separate node before the calcfeatures node.
Started working on a RTStructReader tool.
Added EditElastixTransformFile node to set FinalBSplineInterpolationOrder to 0 in Elastix. Neccesary for transforming segmentations.
Registred image is also saved as a sink.
Tex, Zip and PNG Datatypes
Plot ROC tool for PREDICT
Plot SVM tool for PREDICT
Plot Barchart tool for PREDICT
Plot Ranked Scores tool for PREDICT
Plot statistical test tool for PREDICT
Tools: Evaluation network. Can currently be run only serparately: future work includes the optional addition of the Evaluate network to the WORC network.
Settings for PREDICT General, which contains the joblib Parallel settings and whether a temporary save will be made after each cross validation.

Changed¶

Separate sinks for the output segmentations of the elastix and segmentix nodes.
Switched from using PXCastConvert to WORCCastConvert, hence ITK is not anymore required as well as ITK tools.

Fixed¶

Patientclass ID was used for both test and training. Now given separate names.
When elastix is used but segmentix isn’t, there was a bug.
DataFile dataype is now a TypeGroup instead of an URLType.
Last transformation output from elastix is passed further to the network.
Set FinalBSplineInterpolationOrder to 0 before transforming segmentation with transformix.
Bug: when giving multiple feature sources, only the first was used.

2.0.0 - 2018-02-13¶

Added¶

Elastix and transformix as separate workflow in the tools folder. Can be used through the WORC.Tools attribute.
Example data for elastix and transformix tool.
Workflow for separate training and testing set
FASTR tool for applying ttest to all features. Works similar to the trainclassifier tool in terms of inputs and outputs.

Changed¶

Option for multiple modalities. Supports infinitely many inputs per object.
Moved many PREDICT parameters to the configuration file.
When using a multimodal workflow with only a single segmentation, Elastix will automatically be used for registration. Note that you have to put the reference segmentation on the first modality!

Fixed¶

Proper combining of features from multiple modalities to classify tool.
Minor bugs in segmentix tool.
For multiple modalities, add only optional sources like metadata when present.

1.0.0rc1 - 2017-05-08¶

First release