Model Tuner Logo

Changelog

Important

Complete version release history available here

Version 0.0.15a

Contains all previous fixes relating to:

  • CatBoost support (early stopping, and support involving resetting estimators).

  • Pipeline steps now support hyperparameter tuning of the resamplers (SMOTE, ADASYN, etc.).

  • Removed older implementations of impute and scaling and moved onto supporting only custom pipeline_steps.

  • Fixed bugs in stratification with regards to length mismatch of dependent variable when using column names to stratify.

  • Cleaned a removed multiple lines of unused code and unused initialisation parameters.

Version 0.0.014a

In previous versions, the train_val_test_split method allowed for stratification either by y (stratify_y) or by specified columns (stratify_cols), but not both at the same time. There are use cases where stratification by both the target variable (y) and specific columns is necessary to ensure a balanced and representative split across different data segments.

Enhancement

Modified the train_val_test_split method to support simultaneous stratification by both stratify_y and stratify_cols. This was inside the method achieved by implementing the following logic that ensures both y and the specified columns are considered during the stratification process.

stratify_key = pd.concat([X[stratify_cols], y], axis=1)

strat_key_val_test = pd.concat(
   [X_valid_test[stratify_cols], y_valid_test], axis=1
)

Version 0.0.013a

  • Updated bootstrapper

  • evaluate_bootstrap_metrics

  • Added notebooks/xgb_early_bootstrap_test.py to test it

  • Updated requirements.txt file for dev testing

  • Fixed sampling error on low number of samples inside bootstrapper

Version 0.0.012a

  • Xgboost bug fixes

  • Zenodo updates

  • Pickle model fixes with np import

  • ADASYN and SMOTE fix with no fit happening when calibrating

Version 0.0.011a

  • updated readme for PyPI

  • previous version not saved on setup; re-release to 0.0.11a

Version 0.0.010a

  • updated readme for PyPI

Version 0.0.09a

  • number of estimators now extracted from XGBoost model object

  • early stopping fixed

Version 0.0.08a

AutoKerasClassifier

  • Changed layers key to store count instead of list to avoid exceeding MLflow’s 500-char limit.

  • Simplified function by removing key filtering loop.

Version 0.0.07a

  • Kfold threshold tuning fix

Version 0.0.06a

  • Updating best_params: ref before assignment bug

Version 0.0.05a

  • Bootstrapper: - Fixed import bugs - Fixed Assertion bug to do with metrics not being assigned

  • Early stopping: - Leon: fixed bug with SelectKBest and ADASYN where the wrong code chunk was being utilized - Arthur: Verbosity fix

Version 0.0.02a

  • temporarily commented out updated apache software license string in setup.py

  • updated logo resolution