Grey Wolf Optimization
The Grey Wolf Optimizer (GWO) mimics the leadership hierarchy and hunting mechanism of grey wolves in nature. Four types of grey wolves — alpha, beta, delta, and omega — are employed to simulate the leadership hierarchy. In addition, the three main steps of hunting — searching for prey, encircling prey, and attacking prey — are implemented to perform optimization.
Import
Example
from sklearn.metrics import log_loss
"""
define your own objective function,
make sure the function receives four parameters,
fit your model and return the objective value !
"""
def objective_function_topass(model, X_train, y_train, X_valid, y_valid):
    """Example objective: fit the model on the training split and return the
    validation log-loss (the value the optimizer will minimize)."""
    model.fit(X_train, y_train)
    valid_probabilities = model.predict_proba(X_valid)
    return log_loss(y_valid, valid_probabilities)
# import an algorithm !
from zoofs import GreyWolfOptimization
# create object of algorithm
# create object of algorithm (minimize=True because log-loss is being minimized)
algo_object=GreyWolfOptimization(objective_function_topass,
                                 n_iteration=20,
                                 population_size=20,
                                 minimize=True)
import lightgbm as lgb
lgb_model = lgb.LGBMClassifier()
# fit the algorithm; X_train/y_train/X_valid/y_valid are your own data splits
algo_object.fit(lgb_model,X_train, y_train, X_valid, y_valid,verbose=True)
# plot the objective-score history of the run
algo_object.plot_history()
# extract the best feature set found during the run
algo_object.best_feature_list
Methods
__init__(self, objective_function, n_iteration=1000, timeout=None, population_size=50, method=1, minimize=True, logger=None, **kwargs)
special
Parameters:
Name | Type | Description | Default |
---|---|---|---|
objective_function |
user made function of the signature 'func(model,X_train,y_train,X_test,y_test)' |
The function must return a value, that needs to be minimized/maximized. |
required |
n_iteration |
int |
Number of times the optimization algorithm will run |
1000 |
timeout |
int |
Stop operation after the given number of second(s). If this argument is set to None, the operation is executed without time limitation and n_iteration is followed |
None |
population_size |
int, default=50 |
Total size of the population |
50 |
method |
{1, 2}, default=1 |
Choose between the two methods of grey wolf optimization |
1 |
minimize |
bool, default=True |
Defines if the objective value is to be maximized or minimized |
True |
logger |
Logger or None, optional (default=None) |
|
None |
**kwargs |
None |
Any extra keyword argument for objective_function |
{} |
Attributes:
Name | Type | Description |
---|---|---|
best_feature_list |
ndarray of shape (n_features) |
list of features with the best result of the entire run |
Source code in zoofs\greywolfoptimization.py
def __init__(self,
             objective_function,
             n_iteration: int = 1000,
             timeout: int = None,
             population_size=50,
             method=1,
             minimize=True,
             logger=None,
             **kwargs):
    """
    Parameters
    ----------
    objective_function : user made function of the signature 'func(model,X_train,y_train,X_test,y_test)'
        The function must return a value, that needs to be minimized/maximized.
    n_iteration : int, default=1000
        Number of times the optimization algorithm will run
    timeout : int, default=None
        Stop operation after the given number of second(s).
        If this argument is set to None, the operation is executed without
        time limitation and n_iteration is followed
    population_size : int, default=50
        Total size of the population
    method : {1, 2}, default=1
        Choose between the two methods of grey wolf optimization
    minimize : bool, default=True
        Defines if the objective value is to be maximized or minimized
    logger : Logger or None, optional (default=None)
        - accepts `logging.Logger` instance.
    **kwargs
        Any extra keyword argument for objective_function

    Attributes
    ----------
    best_feature_list : ndarray of shape (n_features)
        list of features with the best result of the entire run
    """
    super().__init__(objective_function, n_iteration, timeout, population_size, minimize, logger, **kwargs)
    # Binary-GWO update variant to use (1 or 2); passed to _check_params
    # for validation when fit() is called.
    self.method=method
fit(self, model, X_train, y_train, X_valid, y_valid, verbose=True)
Parameters:
Name | Type | Description | Default |
---|---|---|---|
model |
machine learning model's object |
machine learning model's object |
required |
X_train |
pandas.core.frame.DataFrame of shape (n_samples, n_features) |
Training input samples to be used for machine learning model |
required |
y_train |
pandas.core.frame.DataFrame or pandas.core.series.Series of shape (n_samples) |
The target values (class labels in classification, real numbers in regression). |
required |
X_valid |
pandas.core.frame.DataFrame of shape (n_samples, n_features) |
Validation input samples |
required |
y_valid |
pandas.core.frame.DataFrame or pandas.core.series.Series of shape (n_samples) |
The target values (class labels in classification, real numbers in regression). |
required |
verbose |
bool,default=True |
Print results for iterations |
True |
Source code in zoofs\greywolfoptimization.py
def fit(self, model, X_train, y_train, X_valid, y_valid, verbose=True):
    """
    Run grey wolf optimization to select the best scoring feature subset.

    Parameters
    ----------
    model : machine learning model's object
        machine learning model's object
    X_train : pandas.core.frame.DataFrame of shape (n_samples, n_features)
        Training input samples to be used for machine learning model
    y_train : pandas.core.frame.DataFrame or pandas.core.series.Series of shape (n_samples)
        The target values (class labels in classification, real numbers in regression).
    X_valid : pandas.core.frame.DataFrame of shape (n_samples, n_features)
        Validation input samples
    y_valid : pandas.core.frame.DataFrame or pandas.core.series.Series of shape (n_samples)
        The target values (class labels in classification, real numbers in regression).
    verbose : bool, default=True
        Print results for iterations

    Returns
    -------
    best_feature_list : list
        Names of the features in the best scoring subset of the entire run.
    """
    self._check_params(model, X_train, y_train, X_valid, y_valid, self.method)
    self.feature_score_hash = {}
    self.feature_list = np.array(list(X_train.columns))
    self.best_results_per_iteration = {}
    self.best_score = np.inf
    self.best_dim = np.ones(X_train.shape[1])
    self.initialize_population(X_train)
    self.best_score_dimension = np.ones(X_train.shape[1])
    # Alpha, beta and delta wolves hold the best, second best and third best
    # solutions found so far; start them at +inf fitness (minimization).
    self.alpha_wolf_dimension, self.alpha_wolf_fitness = np.ones(
        X_train.shape[1]), np.inf
    self.beta_wolf_dimension, self.beta_wolf_fitness = np.ones(
        X_train.shape[1]), np.inf
    self.delta_wolf_dimension, self.delta_wolf_fitness = np.ones(
        X_train.shape[1]), np.inf
    if self.timeout is not None:
        timeout_upper_limit = time.time() + self.timeout
    else:
        timeout_upper_limit = time.time()
    for i in range(self.n_iteration):
        # Short-circuit `and` (was bitwise `&`) for the timeout check.
        if (self.timeout is not None) and (time.time() > timeout_upper_limit):
            warnings.warn("Timeout occurred")
            break
        # `a` decays linearly from 2 to 0 across the run, moving the pack
        # from exploration towards exploitation.
        a = 2 - 2 * ((i + 1) / self.n_iteration)
        self.fitness_scores = self._evaluate_fitness(
            model, X_train, y_train, X_valid, y_valid)
        self.iteration_objective_score_monitor(i)
        # Promote the three fittest individuals into the leader hierarchy.
        top_three_fitness_indexes = np.argsort(self.fitness_scores)[:3]
        for fit, dim in zip(np.array(self.fitness_scores)[top_three_fitness_indexes],
                            self.individuals[top_three_fitness_indexes]):
            if fit < self.alpha_wolf_fitness:
                # New overall best: shift alpha -> beta -> delta.
                self.delta_wolf_fitness = self.beta_wolf_fitness
                self.beta_wolf_fitness = self.alpha_wolf_fitness
                self.alpha_wolf_fitness = fit
                self.delta_wolf_dimension = self.beta_wolf_dimension
                self.beta_wolf_dimension = self.alpha_wolf_dimension
                self.alpha_wolf_dimension = dim
                continue
            if (fit > self.alpha_wolf_fitness) and (fit < self.beta_wolf_fitness):
                # New second best: shift beta -> delta.
                self.delta_wolf_fitness = self.beta_wolf_fitness
                self.beta_wolf_fitness = fit
                self.delta_wolf_dimension = self.beta_wolf_dimension
                self.beta_wolf_dimension = dim
                continue
            if (fit > self.beta_wolf_fitness) and (fit < self.delta_wolf_fitness):
                self.delta_wolf_fitness = fit
                self.delta_wolf_dimension = dim
        if self.method in (1, 2):
            # Encircling coefficients per leader: C ~ U(0, 2), A ~ U(-a, a).
            C1 = 2 * np.random.random((self.population_size, X_train.shape[1]))
            A1 = 2 * a * np.random.random(
                (self.population_size, X_train.shape[1])) - a
            d_alpha = abs(C1 * self.alpha_wolf_dimension - self.individuals)
            C2 = 2 * np.random.random((self.population_size, X_train.shape[1]))
            A2 = 2 * a * np.random.random(
                (self.population_size, X_train.shape[1])) - a
            d_beta = abs(C2 * self.beta_wolf_dimension - self.individuals)
            C3 = 2 * np.random.random((self.population_size, X_train.shape[1]))
            A3 = 2 * a * np.random.random(
                (self.population_size, X_train.shape[1])) - a
            d_delta = abs(C3 * self.delta_wolf_dimension - self.individuals)
        if self.method == 2:
            # Variant 2: average the three leader-guided candidate positions
            # and squash through a sigmoid to get bit-selection probabilities.
            X1 = abs(self.alpha_wolf_dimension - A1 * d_alpha)
            X2 = abs(self.beta_wolf_dimension - A2 * d_beta)
            X3 = abs(self.delta_wolf_dimension - A3 * d_delta)
            self.individuals = np.where(np.random.uniform(size=(
                self.population_size, X_train.shape[1])) <= self.sigmoid((X1 + X2 + X3) / 3), 1, 0)
        if self.method == 1:
            # Variant 1: stochastic crossover between the three leader-guided
            # binary candidate positions.
            # BUGFIX: Y2 and Y3 previously reused A1; each leader's step must
            # use its own coefficient (A2 for beta, A3 for delta).
            Y1 = np.where((self.alpha_wolf_dimension + np.where(self.sigmoid(A1 * d_alpha) >
                          np.random.uniform(size=(self.population_size, X_train.shape[1])), 1, 0)) >= 1, 1, 0)
            Y2 = np.where((self.beta_wolf_dimension + np.where(self.sigmoid(A2 * d_beta) >
                          np.random.uniform(size=(self.population_size, X_train.shape[1])), 1, 0)) >= 1, 1, 0)
            Y3 = np.where((self.delta_wolf_dimension + np.where(self.sigmoid(A3 * d_delta) >
                          np.random.uniform(size=(self.population_size, X_train.shape[1])), 1, 0)) >= 1, 1, 0)
            # Each individual adopts Y1, Y2 or Y3 with equal probability.
            r = np.random.uniform(size=(self.population_size, X_train.shape[1]))
            self.individuals[r < (1 / 3)] = Y1[r < (1 / 3)]
            self.individuals[(r >= (1 / 3)) & (r < (2 / 3))] = Y2[(r >= (1 / 3)) & (r < (2 / 3))]
            self.individuals[r >= (2 / 3)] = Y3[r >= (2 / 3)]
        self.verbose_results(verbose, i)
        # Refresh the running best feature-name list from the best dimension.
        self.best_feature_list = list(
            self.feature_list[np.where(self.best_dim)[0]])
    return self.best_feature_list
plot_history(self)
inherited
Plot objective score history
Source code in zoofs\greywolfoptimization.py
def plot_history(self):
    """
    Plot objective score history
    """
    history = pd.DataFrame.from_dict(self.best_results_per_iteration).T
    history = history.reset_index()
    history.columns = ['iteration', 'best_score',
                       'objective_score', 'selected_features']
    fig = go.Figure()
    # Raw per-iteration objective scores as markers.
    fig.add_trace(go.Scatter(x=history['iteration'],
                             y=history['objective_score'],
                             mode='markers',
                             name='objective_score'))
    # Running best score as a connected line.
    fig.add_trace(go.Scatter(x=history['iteration'],
                             y=history['best_score'],
                             mode='lines+markers',
                             name='best_score'))
    fig.update_xaxes(title_text='Iteration')
    fig.update_yaxes(title_text='objective_score')
    fig.update_layout(title="Optimization History Plot")
    fig.show()