Feature
Feature¶
-
class
best.modules.feature.FeatureAugmentorModule¶ Feature augmentation using an ‘augment_features’ function from the ‘PiesUtils’ package. See the code for additional details.
-
fit(X=None, Y=None)¶
-
fit_transform(X, Y=None)¶
-
transform(X)¶
-
-
class
best.modules.feature.Log10Module¶ -
fit(X, Y=None)¶
-
fit_transform(X, Y=None)¶
-
transform(X, Y=None)¶
-
-
class
best.modules.feature.LogModule¶ -
fit(X, Y=None)¶
-
fit_transform(X, Y=None)¶
-
transform(X, Y=None)¶
-
-
class
best.modules.feature.PCAModule(var_threshold=0.98)¶ -
fit(X, y=None)¶ Fit the model with X.
- Parameters
X (array-like of shape (n_samples, n_features)) – Training data, where n_samples is the number of samples and n_features is the number of features.
y (Ignored) –
- Returns
self – Returns the instance itself.
- Return type
object
-
fit_transform(X, y=None)¶ Fit the model with X and apply the dimensionality reduction on X.
- Parameters
X (array-like of shape (n_samples, n_features)) – Training data, where n_samples is the number of samples and n_features is the number of features.
y (Ignored) –
- Returns
X_new – Transformed values.
- Return type
ndarray of shape (n_samples, n_components)
Notes
This method returns a Fortran-ordered array. To convert it to a C-ordered array, use ‘np.ascontiguousarray’.
-
-
class
best.modules.feature.PCAModuleSVD(var_threshold=0.98)¶ -
fit(X, Y=None)¶
-
fit_transform(X, Y=None)¶
-
transform(X, Y=None)¶
-
-
class
best.modules.feature.ZScoreModule(trainable=False, continuous_learning=False, multi_class=False)¶ Z-score normalization compatible with scikit.pipeline.Pipeline Enables continuous learning - enabling continuous adaptation.
- Modes
Zscore normalization
- Zscore normalization with fixed mean and std values based on the initial training dataset
Possible category-wise normalization with mean and std values estimated from the training dataset - number of features is multiplied by number of categories
- Zscore normalization with an initial mean and std values trained on the training dataset - adaptation during inference
https://stats.stackexchange.com/questions/211837/variance-of-subsample
-
continuous_learning¶ If true - An instance updates mean and variance values during each prediction step. Initial outlier filtering is recommended
- Type
bool
-
trainable¶ If false - An instance normalizes inference data based on their current mean value and std If true - An instance remembers mean and variance values of training data
- Type
bool
-
multi_class¶ If true - An instance performs normalization for each training class separately Number of output features is multiplied by a number of training categories
- Type
bool
-
mean¶ Trained mean values for each feature. In case multi_class == True -> list of numpy ndarrays for each category
- Type
numpy ndarray / list
-
std¶ - Type
numpy ndarray
-
N¶ - Type
int
-
fit(X=None, Y=None)¶ - Parameters
X (numpy ndarray) – shape[n_samples, n_features]
Y (list or numpy array, optional) – category reference for each sample - required only for option with multi_class normalization
- Returns
- Return type
None
-
fit_transform(X=None, Y=None)¶ - Parameters
X (numpy ndarray) – shape[n_samples, n_features]
Y (list or numpy array, optional) – category reference for each sample - required only for option with multi_class normalization
- Returns
transformed_data – shape[n_samples, n_features]
- Return type
numpy ndarray
-
transform(X=None)¶ - Parameters
X (numpy ndarray) – shape[n_samples, n_features]
- Returns
transformed_data – shape[n_samples, n_features]
- Return type
numpy ndarray