An intelligent technique for initial distribution in stochastic optimization methods

Preprint

Article

An intelligent technique for initial distribution in stochastic optimization methods

Altmetrics

Downloads

Views

Comments

A peer-reviewed article of this preprint also exists.

Vasileios Charilogis,

Ioannis G. Tsoulos^*,

VASILIOS STAVROU

Vasileios Charilogis,

Ioannis G. Tsoulos^*,

VASILIOS STAVROU

This version is not peer-reviewed

Submitted:

18 September 2023

Posted:

20 September 2023

You are already at the latest version

Alerts

Abstract

The need to find the global minimum in multi-dimensional functions is a critical problem in many fields of science and technology. Effectively solving this problem requires the creation of initial solution estimates, which are subsequently used by the optimization algorithm to search for the best solution in the solution space. In the context of this article, a novel approach to generating the initial solution distribution is presented which is applied to a genetic optimization algorithm. Using the k-means clustering algorithm, a distribution based on data similarity is created. This helps in generating initial estimates that may be more tailored to the problem. Additionally, the proposed method employs a rejection sampling algorithm to discard samples that do not yield better solution estimates in the optimization process. This allows the algorithm to focus on potentially optimal solutions, thus improving its performance. Finally, the article presents experimental results from the application of this approach to various optimization problems, providing the scientific community with a new method for addressing this significant problem.

Keywords:

Subject: Computer Science and Mathematics - Artificial Intelligence and Machine Learning

1. Introduction

The task of locating the global minimum of a continuous and differentiable function f can be defined as:

x^{*} = \arg min_{x \in S} f (x)

(1)

with S:

S = [a_{1}, b_{1}] \otimes [a_{2}, b_{2}] \otimes \dots [a_{n}, b_{n}]

This task finds application in a variety of real world problems, such as problems from physics [1,2,3], chemistry [4,5,6], economics [7,8], medicine [9,10] etc. The methods aimed at finding the global minimum are divided into two major categories: deterministic methods and stochastic methods. The most frequently encountered techniques of the first category are interval techniques [11,12], which partition the initial domain of the objective function until a promising subset is found to find the global minimum. The second category includes the vast majority of methods and in its ranks one can find methods such as Controlled Random Search methods [13,14,15], Simulated Annealing methods [16,17,18], Differential Evolution methods [19,20], Particle Swarm Optimization (PSO) methods [21,22,23], Ant Colony optimization methods [24,25], etc. Furthermore, a variety of hybrid techniques have been proposed, such as hybrid Multistart methods [26,27], hybrid PSO techniques [28,29,30] etc. Also, many parallel optimization methods [31,32] have appeared during the past years or methods that take advantage of the modern GPU processing units [33,34].

One of the basic techniques included in the area of stochastic techniques is Genetic Algorithms, initially proposed by John Holland [35]. The operation of genetic algorithms is inspired by biology, and for this reason, it utilizes the idea of evolution through genetic mutation, natural selection, and crossover [36,37,38].

Genetic algorithms can be combined with machine learning to solve complex problems and optimize models. More specifically, the genetic algorithm has been applied in many machine learning applications, such as in the article by Ansari et al, which deals with the recognition of digital modulation signals. In this article, the genetic algorithm is used to optimize machine learning models by adjusting their features and parameters to achieve better signal recognition accuracy [39]. Additionally, in the study by Ji et al, a methodology is proposed that uses machine learning models to predict amplitude deviation in hot rolling, while genetic algorithms are employed to optimize the machine learning models and select features to improve prediction accuracy [40]. Furthermore, in the article by Santana, Alonso, and Nieto, which focuses on the design and optimization of 5G networks in indoor environments, the use of genetic algorithms and machine learning models is identified for estimating path loss, which is critical for determining signal strength and coverage indoors [41].

Another interesting article is by Liu et al, which discusses the use of genetic algorithms in robotics [42]. The authors propose a methodology that utilizes genetic algorithms to optimize the trajectory and motion of digital twin robots. A similar study was presented by Nonoyama et al [43], where the research focused on optimizing energy consumption during the motion planning of a dual-arm industrial robot. The goal of the research is to minimize energy consumption during the process of object retrieval and placement. To achieve this, both genetic algorithms and particle swarm optimization algorithms are used to adjust the robot’s motion trajectory, thereby increasing its energy efficiency.

The use of genetic algorithms is still prevalent even in the business world. In the article by Liu et al [44], the application of genetic algorithms in an effort to optimize energy conservation in a high-speed Methanol Spark Ignition engine fueled with Methanol and gasoline blends is discussed. In this study, genetic algorithms were used as an optimization technique to find the best operating conditions for the engine, such as the air-fuel ratio, ignition timing, and other engine control variables, aiming to save energy and reduce energy consumption and emissions. In another research, the optimization of the placement of electric vehicle charging stations is carried out [45]. Furthermore, in the study by Chen and Hu [46], the design of an intelligent system for agricultural greenhouses using genetic algorithms is presented to provide multiple energy sources. Similarly, in the research by Min, Song, Chen, Wang, and Zhang [47], an optimized energy management strategy for hybrid electric vehicles is introduced using a genetic algorithm based on fuel cells in a neural network under startup conditions.

Moreover, genetic algorithms are extremely useful in the field of medicine, as they are employed in therapy optimization, medical personnel training, genetic diagnosis, and genomic research. More specifically, in the study by Doewes, Nair & Sharma [48], data from blood analyses and other biological samples were used to extract characteristics related to the presence of the SARS-CoV-2 virus that causes COVID-19. In this article, genetic algorithms are used for data analysis and processing to extract significant characteristics that can aid in the effective diagnosis of COVID-19. Additionally, there are studies that present the design of dental implants for patients using artificial neural networks and genetic algorithms [49,50]. Lastly, the contribution of genetic algorithms is significant in both implant techniques [51,52] and surgeries [53,54].

The current work aims to improve the efficiency of the genetic algorithm in global optimization problems, by introducing a new way of initializing the population’s chromosomes. In the new initialization technique, the k-means [55] method is used to find initial values of the chromosomes that will lead to finding the global minimum faster and more efficient than chromosomes generated by some random distribution. Also, the proposed technique discards chromosomes which, after applying the k-means technique, are close to each other.

The rest of this article is organized as follows: in Section 2 the proposed method is discussed in detail, in Section 3 the used test functions as well the experimental results are fully outlined and finally in Section 4 some conclusions and future guidelines are listed.

2. The proposed method

The fundamental operation of a genetic algorithm mimics the process of natural evolution. The algorithm begins by creating an initial population of solutions, called chromosomes that represents a potential solution to the objective problem. The genetic algorithm operates by reproducing and evolving populations of solutions through iterative steps. Following the analogy to natural evolution, the genetic algorithm allows optimal solutions to evolve through successive generations. The main steps of the used genetic algorithm are described below:

Initialization Step

(a)

Set $N_{c}$ as the number of chromosomes.

(b)

Set $N_{g}$ the maximum number of allowed generations.

(c)

Initialize randomly the $N_{C}$ chromosomes in in S.

(d)

Set as $p_{s}$ the selection rate of the algorithm, with $p_{s} \leq 1$ .

(e)

Set as $p_{m}$ the mutation rate, with $p_{m} \leq 1$ .

(f)

Set iter=0.
Fitness calculation Step

(a)

For $i = 1, \dots, N_{g}$ do

i

Calculate the fitness $f_{i} = f (g_{i})$ of chromosome $g_{i}$ .

(b)

EndFor
Genetic operations step

(a)

Selection procedure. The chromosomes are sorted according to their fitness values. The $(1 - p_{s}) \times N_{c}$ chromosomes with the lowest fitness values are transferred intact to the next generation. The remain chromosomes are substituted by offspings created in the crossover procedure. During the selection process for each offspring two parents are selected from the population using the tournament selection.

(b)

Crossover procedure: For every pair $(z, w)$ of selected parents two additional chromosomes $\tilde{z}$ and $\tilde{w}$ are produced using the following equations:

$\begin{matrix} \tilde{z_{i}} & = & a_{i} z_{i} + (1 - a_{i}) w_{i} \\ \tilde{w_{i}} & = & a_{i} w_{i} + (1 - a_{i}) z_{i} \end{matrix}$

(2)

The value $a_{i}$ is a randomly selected number with $a_{i} \in [- 0.5, 1.5]$ [56].

(c)

Mutation procedure: For each element of every chromosome, a random number $r \in [0, 1]$ is drawn. The corresponding element is altered randomly if $r \leq p_{m}$ .
Termination Check Step

(a)

Set $i t e r = i t e r + 1$

(b)

If $i t e r \geq N_{g}$ or the proposed stopping rule of Tsoulos [57] is hold, then goto Local Search Step, else goto b.
Local Search Step. Apply a local search procedure to chromosome of the population with the lowest fitness value and report the obtained minimum. In the current work the BFGS variant of Powell [58] was used as a local search procedure.

The current work proposes a novel method to initiate the chromosomes, that utilizes the well - known technique of k-means. The significance of the initial distribution in solution finding within optimization is essential across various domains and techniques. Apart from genetic algorithms, the initial distribution impacts other optimization methods like Particle Swarm Optimization (PSO)[59], Evolution Strategies[60], and neural networks[61]. The initial distribution defines the starting solutions that will evolve and improve throughout the algorithm. If the initial population contains solutions close to the optimum, it increases the likelihood of evolved solutions being in proximity to the optimal solution. Conversely, if the initial population is distant from the optimum, the algorithm might need more iterations to reach the optimal solution or even get stuck in a suboptimal solution. In conclusion, the initial distribution influences the stability, convergence speed, and quality of optimization algorithm outcomes. Thus, selecting a suitable initial distribution is crucial for the algorithm’s efficiency and the discovery of the optimal solution in a reasonable time [63,64].

2.1. Proposed initialization Distribution

The present work replaces the randomness of the initialization of the chromosomes by using the k-means technique. More specifically, the method takes a series of samples from the objective function and then the k-means method is used to locate the centers of these points. These centers can then be used as chromosomes in the genetic algorithm.

The k-means algorithm emerged in 1957 by Stuart Lloyd in the form of Lloyd’s algorithm[65], although the concept of clustering based on distance had been introduced earlier. The name ’k-means’ was introduced around 1967 by James MacQueen[66]. The k-means algorithm is a clustering algorithm widely used in data analysis and machine learning. Its primary objective is to partition a dataset into k clusters, where data points within the same cluster are similar to each other and differ from data points in other clusters. Specifically, k-means seeks cluster centers and assigns samples to each cluster, aiming to minimize the distance within clusters and maximize the distance between cluster centers[67]. The algorithm steps are presented in algorithm 1

Algorithm 1: The k-Means algorithm.

1.

Set the number of clusters k

2.

Repeat

(a)

Set

S_{j} =, j = 1 . . k

(b)

For every point

x_{i}, i = 1, . . ., N_{m}

i: Set $j^{*} = {min}_{i = 1}^{k} \{D (x_{i}, c_{j})\}$ .
ii.: Set $S_{j^{*}} = S_{j^{*}} \cup \{x_{i}\}$ .

(c)

EndFor

(d)

For every center

c_{j}, j = 1 . . k

i.: Set as $M_{j}$ the number of points in $S_{j}$
ii.: Compute $c_{j}$ as

$c_{j} = \frac{1}{M_{j}} \sum_{i = 1}^{M_{j}} x_{i}$

(e)

EndFor

3.

Calculate the quantities

s_{j}

σ_{j}^{2} = \frac{\sum_{i = 1}^{M_{j}} {(x_{i} - c_{j})}^{2}}{M_{j}}

4.

Stopthe algorithm, if there is no change in centers

c_{j}

The algorithm terminates when cluster centers don’t change significantly between consecutive iterations, implying that the clusters have stabilized in their final form[68,69].

2.2. Chromosome rejection rule

An additional technique for discarding chromosomes where they are similar or close to each other is listed and applied below. Specifically, each chromosome is extensively compared to all the other chromosomes, and those that have very small or negligible Euclidean distance between them are sought, implying their similarity. Subsequently, the algorithm incorporates these chromosomes into the final initial distribution table, while chromosomes that are not similar are discarded.

Algorithm 2: Chromosome rejection rule

1.

Set C the set of centers,

C = \{c_{i}, i = 1, \dots, k\}

2.

Set

ϵ > 0

a small positive number

3.

For every center

c_{i}

(a)

For every center

c_{j}, i \neq j

i.: If $∥c_{i} - c_{j}∥ \leq ϵ$ thenremove $c_{i}$ from C.

(b)

EndFor

(c)

If then

4.

EndFor

5.

Return the final set of centers C

2.3. The proposed sampling procedure

The proposed sampling procedure has the following major steps:

Take $N_{m}$ random samples from the objective function using uniform distribution
Calculate the k centers of the $N_{m}$ points using the k-means algorithm provided in algorithm 1.
Remove from the set of centers C, points that are closed to each other.
Return the set of centers C as the set of chromosomes.

3. Experiments

In the following, the benchmark functions used in the experiments as well as the experimental results are presented. The test functions used here was proposed in a variety of research papers [70,71].

3.1. Test functions

The definition of the test functions used are given below

Bf1 (Bohachevsky 1) function:

f (x) = x_{1}^{2} + 2 x_{2}^{2} - \frac{3}{10} cos (3 π x_{1}) - \frac{4}{10} cos (4 π x_{2}) + \frac{7}{10}

with

x \in {[- 100, 100]}^{2}

Bf2 (Bohachevsky 2) function:

$f (x) = x_{1}^{2} + 2 x_{2}^{2} - \frac{3}{10} cos (3 π x_{1}) cos (4 π x_{2}) + \frac{3}{10}$

with $x \in {[- 50, 50]}^{2}$ .
Branin function: $f (x) = {(x_{2} - \frac{5.1}{4 π^{2}} x_{1}^{2} + \frac{5}{π} x_{1} - 6)}^{2} + 10 (1 - \frac{1}{8 π}) cos (x_{1}) + 10$ with $- 5 \leq x_{1} \leq 10, 0 \leq x_{2} \leq 15$ .
CM function:

$f (x) = \sum_{i = 1}^{n} x_{i}^{2} - \frac{1}{10} \sum_{i = 1}^{n} cos (5 π x_{i})$

where $x \in {[- 1, 1]}^{n}$ . In the conducted experiments the value $n = 4$ was used.
Camel function:

$f (x) = 4 x_{1}^{2} - 2.1 x_{1}^{4} + \frac{1}{3} x_{1}^{6} + x_{1} x_{2} - 4 x_{2}^{2} + 4 x_{2}^{4}, x \in {[- 5, 5]}^{2}$
Easom function:

$f (x) = - cos (x_{1}) cos (x_{2}) exp ({(x_{2} - π)}^{2} - {(x_{1} - π)}^{2})$

with $x \in {[- 100, 100]}^{2} .$
Exponential function, defined as:

$f (x) = - exp (- 0.5 \sum_{i = 1}^{n} x_{i}^{2}), - 1 \leq x_{i} \leq 1$

The values $n = 4, 8, 16, 32$ were used in the executed experiments.
Griewank2 function:

$f (x) = 1 + \frac{1}{200} \sum_{i = 1}^{2} x_{i}^{2} - \prod_{i = 1}^{2} \frac{cos (x_{i})}{\sqrt{(i)}}, x \in {[- 100, 100]}^{2}$
Griewank10 function. The function is given by the equation

$f (x) = \sum_{i = 1}^{n} \frac{x_{i}^{2}}{4000} - \prod_{i = 1}^{n} cos (\frac{x_{i}}{\sqrt{i}}) + 1$

with $n = 10$ .
Gkls function. $f (x) = G k l s (x, n, w)$ , is a function with w local minima, described in [72] with $x \in {[- 1, 1]}^{n}$ and n a positive integer between 2 and 100. The values $n = 2, 3$ and $w = 50$ were used in the conducted experiments.
Goldstein and Price function

$\begin{matrix} f (x) & = & [1 + {(x_{1} + x_{2} + 1)}^{2} \\ (19 - 14 x_{1} + 3 x_{1}^{2} - 14 x_{2} + 6 x_{1} x_{2} + 3 x_{2}^{2})] \times \\ [30 + {(2 x_{1} - 3 x_{2})}^{2} \\ (18 - 32 x_{1} + 12 x_{1}^{2} + 48 x_{2} - 36 x_{1} x_{2} + 27 x_{2}^{2})] \end{matrix}$

With $x \in {[- 2, 2]}^{2}$ .
Hansen function: $f (x) = \sum_{i = 1}^{5} i cos [(i - 1) x_{1} + i] \sum_{j = 1}^{5} j cos [(j + 1) x_{2} + j]$ , $x \in {[- 10, 10]}^{2}$ .
Hartman 3 function:

$f (x) = - \sum_{i = 1}^{4} c_{i} exp (- \sum_{j = 1}^{3} a_{i j} {(x_{j} - p_{i j})}^{2})$

with $x \in {[0, 1]}^{3}$ and $a = (\begin{matrix} 3 & 10 & 30 \\ 0.1 & 10 & 35 \\ 3 & 10 & 30 \\ 0.1 & 10 & 35 \end{matrix}), c = (\begin{matrix} 1 \\ 1.2 \\ 3 \\ 3.2 \end{matrix})$ and

$p = (\begin{matrix} 0.3689 & 0.117 & 0.2673 \\ 0.4699 & 0.4387 & 0.747 \\ 0.1091 & 0.8732 & 0.5547 \\ 0.03815 & 0.5743 & 0.8828 \end{matrix})$
Hartman 6 function:

$f (x) = - \sum_{i = 1}^{4} c_{i} exp (- \sum_{j = 1}^{6} a_{i j} {(x_{j} - p_{i j})}^{2})$

with $x \in {[0, 1]}^{6}$ and $a = (\begin{matrix} 10 & 3 & 17 & 3.5 & 1.7 & 8 \\ 0.05 & 10 & 17 & 0.1 & 8 & 14 \\ 3 & 3.5 & 1.7 & 10 & 17 & 8 \\ 17 & 8 & 0.05 & 10 & 0.1 & 14 \end{matrix}), c = (\begin{matrix} 1 \\ 1.2 \\ 3 \\ 3.2 \end{matrix})$ and

$p = (\begin{matrix} 0.1312 & 0.1696 & 0.5569 & 0.0124 & 0.8283 & 0.5886 \\ 0.2329 & 0.4135 & 0.8307 & 0.3736 & 0.1004 & 0.9991 \\ 0.2348 & 0.1451 & 0.3522 & 0.2883 & 0.3047 & 0.6650 \\ 0.4047 & 0.8828 & 0.8732 & 0.5743 & 0.1091 & 0.0381 \end{matrix})$
Potential function. The molecular conformation corresponding to the global minimum of the energy of N atoms interacting via the Lennard-Jones potential[73] is used a test function here and it is defined by:

$V_{L J} (r) = 4 ϵ [{(\frac{σ}{r})}^{12} - {(\frac{σ}{r})}^{6}]$

The values $N = 3, 5$ were used in the conducted experiments.
Rastrigin function.

$f (x) = x_{1}^{2} + x_{2}^{2} - cos (18 x_{1}) - cos (18 x_{2}), x \in {[- 1, 1]}^{2}$
Rosenbrockfunction.

$f (x) = \sum_{i = 1}^{n - 1} (100 {(x_{i + 1} - x_{i}^{2})}^{2} + {(x_{i} - 1)}^{2}), - 30 \leq x_{i} \leq 30 .$

The values $n = 4, 8, 16$ were used in the conducted experiments.
Shekel 5function.

f (x) = - \sum_{i = 1}^{5} \frac{1}{(x - a_{i}) {(x - a_{i})}^{T} + c_{i}}

with

x \in {[0, 10]}^{4}

and

a = (\begin{matrix} 4 & 4 & 4 & 4 \\ 1 & 1 & 1 & 1 \\ 8 & 8 & 8 & 8 \\ 6 & 6 & 6 & 6 \\ 3 & 7 & 3 & 7 \end{matrix}), c = (\begin{matrix} 0.1 \\ 0.2 \\ 0.2 \\ 0.4 \\ 0.4 \end{matrix})

Shekel 7 function.

f (x) = - \sum_{i = 1}^{7} \frac{1}{(x - a_{i}) {(x - a_{i})}^{T} + c_{i}}

with

x \in {[0, 10]}^{4}

and

a = (\begin{matrix} 4 & 4 & 4 & 4 \\ 1 & 1 & 1 & 1 \\ 8 & 8 & 8 & 8 \\ 6 & 6 & 6 & 6 \\ 3 & 7 & 3 & 7 \\ 2 & 9 & 2 & 9 \\ 5 & 3 & 5 & 3 \end{matrix}), c = (\begin{matrix} 0.1 \\ 0.2 \\ 0.2 \\ 0.4 \\ 0.4 \\ 0.6 \\ 0.3 \end{matrix})

Shekel 10 function.

f (x) = - \sum_{i = 1}^{10} \frac{1}{(x - a_{i}) {(x - a_{i})}^{T} + c_{i}}

with

x \in {[0, 10]}^{4}

and

a = (\begin{matrix} 4 & 4 & 4 & 4 \\ 1 & 1 & 1 & 1 \\ 8 & 8 & 8 & 8 \\ 6 & 6 & 6 & 6 \\ 3 & 7 & 3 & 7 \\ 2 & 9 & 2 & 9 \\ 5 & 5 & 3 & 3 \\ 8 & 1 & 8 & 1 \\ 6 & 2 & 6 & 2 \\ 7 & 3.6 & 7 & 3.6 \end{matrix}), c = (\begin{matrix} 0.1 \\ 0.2 \\ 0.2 \\ 0.4 \\ 0.4 \\ 0.6 \\ 0.3 \\ 0.7 \\ 0.5 \\ 0.6 \end{matrix})

Sinusoidal function:

$f (x) = - (2.5 \prod_{i = 1}^{n} sin (x_{i} - z) + \prod_{i = 1}^{n} sin (5 (x_{i} - z))), 0 \leq x_{i} \leq π .$

The values of $n = 4, 8, 16$ and $z = \frac{π}{6}$ was used in the conducted experiments.
Test2N function:

$f (x) = \frac{1}{2} \sum_{i = 1}^{n} x_{i}^{4} - 16 x_{i}^{2} + 5 x_{i}, x_{i} \in [- 5, 5] .$

The function has $2^{n}$ in the specified range and in our experiments we used $n = 4, 5, 6, 7$ .
Test30N function:

$f (x) = \frac{1}{10} {sin}^{2} (3 π x_{1}) \sum_{i = 2}^{n - 1} ({(x_{i} - 1)}^{2} (1 + {sin}^{2} (3 π x_{i + 1}))) + {(x_{n} - 1)}^{2} (1 + {sin}^{2} (2 π x_{n}))$

with $x \in [- 10, 10]$ , with $30^{n}$ local minima in the search space. For our experiments we used $n = 3, 4$ .

3.2. Experimental results

The freely available software OPTIMUS was utilized for the experiments, available at the following address: https://github.com/itsoulos/OPTIMUS(accessed on 9 September 2023). The machine used in the experiments was an AMD Ryzen 5950X with 128GB of RAM, running the Debian Linux operating system. To ensure research reliability, the experiments were executed 30 times for each objective function, employing different seeds for the random generator, and reporting the mean values. The values used for the parameters in the experiments are listed in Table 1. For the experimental tables, the following notation is used:

The column UNIFORM indicates the incorporation of uniform sampling in the genetic algorithm. In this case, $N_{c}$ randomly selected chromosomes using uniform sampling are used in the genetic algorithm.
The column TRIANGULAR defines the usage of triangular distribution for the initial samples of the genetic algorithm. For this case, $N_{c}$ randomly selected chromosomes with triangular distribution are used in the genetic algorithm.
The column KMEANS denotes the application of k - means sampling as proposed here in the genetic algorithm. In this case, $N_{m}$ randomly selected points were sampled from the objective function and k centers were produced using the k - means algorithm. In order to have a reliable comparison with the above distributions, the number of centers equals the number of randomly generated chromosomes $N_{g}$ .
The numbers in cells represent the average number of function calls required to obtain the global minimum. The fraction in parentheses denotes the percentage where the global minimum was successfully discovered. If this fraction is absent, then the global minimum was successfully discovered in all runs.
In every table, an additional line was added under the name TOTAL, representing the total number of function calls and, in parentheses, the average success rate in finding the global minimum.

Table 2 presents the three different distributions for the initialization of chromosomes, along with the objective function evaluations. It is evident that with the proposed initialization, the evaluations are fewer compared to the other two initialization methods. Specifically, compared to the uniform initialization, there is a reduction of 47.88%, while in comparison to the triangular initialization, the reduction is 50.25%. As for the success rates, no significant differences are observed and graphically outlined in Figure 1.

An additional set of experiments was performed to verify the reliability of the proposed technique with high-dimensional objective functions. The functions

High Conditioned Elliptic function, defined as

$f (x) = \sum_{i = 1}^{n} {(10^{6})}^{\frac{i - 1}{n - 1}} x_{i}^{2}$
Cm function, defined as

$f (x) = \sum_{i = 1}^{n} x_{i}^{2} - \frac{1}{10} \sum_{i = 1}^{n} cos (5 π x_{i})$

were used as test cases in this series of experiments. For the first function the dimension values

n = 1, \dots, 20

were used and the comparative results are outlined in Table 3 and graphically in Figure 2. It is evident that with the proposed initialization, the results are improved compared to those of the uniform distribution. Additionally, as expected, the required function evaluations increase in parallel with the dimension of the problem.

Likewise, a series of experiments were conducted for the CM function with the dimension n increased from 2 to 30, and the results are shown in Table 4 and graphically in Figure 3. The proposed initialization method requires fewer function calls to obtain the global minimum of the function and also the average success rate with the proposed initialization method reaches 100%, whereas with the uniform distribution it is smaller by 15%.

4. Conclusions

In this work, an innovative chromosome initialization method for genetic algorithms was proposed that utilizes the well-known k-means technique. These genetic algorithms are used to find the global minimum of multidimensional functions. This method replaces the initialization of chromosomes in genetic algorithms which is traditionally performed by some random distribution with centers produced by the k-means technique. In addition, in this technique, centers that are close enough are rejected from being genetic algorithm chromosomes. The above procedure significantly reduced the required number of function calls compared to random distributions and furthermore, in difficult high-dimensional functions, it appears to be a more efficient technique at finding the global minimum than random distributions. Future research may include incorporation of parallel techniques such as MPI [74] or OpenMP [75] to speed up the method or application of the initialization process to other stochastic techniques such as Particle Swarm Optimization or Differential Evolution.

Author Contributions

V.C., I.G.T. and V.S. conceived the idea and methodology and supervised the technical part regarding the software. V.C. conducted the experiments, employing several datasets, and provided the comparative experiments. I.G.T. performed the statistical analysis. V.S. and all other authors prepared the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

The experiments of this research work were performed at the high performance computing system established at Knowledge and Intelligent Computing Laboratory, Department of Informatics and Telecommunications, University of Ioannina, acquired with the project “Educational Laboratory equipment of TEI of Epirus” with MIS 5007094 funded by the Operational Programme “Epirus” 2014–2020, by ERDF and national funds.

Conflicts of Interest

The authors declare no conflict of interest.

Sample Availability

Not applicable.

References

L. Yang, D. Robin, F. Sannibale, C. Steier, W. Wan, Global optimization of an accelerator lattice using multiobjective genetic algorithms, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 609, pp. 50-57, 2009. [CrossRef]
E. Iuliano, Global optimization of benchmark aerodynamic cases using physics-based surrogate models, Aerospace Science and Technology 67, pp.273-286, 2017. [CrossRef]
Q. Duan, S. Sorooshian, V. Gupta, Effective and efficient global optimization for conceptual rainfall-runoff models, Water Resources Research 28, pp. 1015-1031 , 1992. [CrossRef]
S. Heiles, R. L. Johnston, Global optimization of clusters using electronic structure methods, Int. J. Quantum Chem. 113, pp. 2091– 2109, 2013. [CrossRef]
W.H. Shin, J.K. Kim, D.S. Kim, C. Seok, GalaxyDock2: Protein–ligand docking using beta-complex and global optimization, J. Comput. Chem. 34, pp. 2647– 2656, 2013.
A. Liwo, J. Lee, D.R. Ripoll, J. Pillardy, H. A. Scheraga, Protein structure prediction by global optimization of a potential energy function, Biophysics 96, pp. 5482-5485, 1999. [CrossRef]
Zwe-Lee Gaing, Particle swarm optimization to solving the economic dispatch considering the generator constraints, IEEE Transactions on 18 Power Systems, pp. 1187-1195, 2003.
C. D. Maranas, I. P. Androulakis, C. A. Floudas, A. J. Berger, J. M. Mulvey, Solving long-term financial planning problems via global optimization, Journal of Economic Dynamics and Control 21, pp. 1405-1425, 1997. [CrossRef]
Eva K. Lee, Large-Scale Optimization-Based Classification Models in Medicine and Biology, Annals of Biomedical Engineering 35, pp 1095-1109, 2007. [CrossRef]
Y. Cherruault, Global optimization in biology and medicine, Mathematical and Computer Modelling 20, pp. 119-132, 1994. [CrossRef]
M.A. Wolfe, Interval methods for global optimization, Applied Mathematics and Computation 75, pp. 179-206, 1996.
T. Csendes and D. Ratz, Subdivision Direction Selection in Interval Methods for Global Optimization, SIAM J. Numer. Anal. 34, pp. 922–938, 1997. [CrossRef]
W. L. Price, Global optimization by controlled random search, Journal of Optimization Theory and Applications 40, pp. 333-348, 1983.
Ivan Krivy, Josef Tvrdik, The controlled random search algorithm in optimizing regression models, Computational Statistics & Data Analysis 20, pp. 229-234, 1995. [CrossRef]
M.M. Ali, A. Torn, and S. Viitanen, A Numerical Comparison of Some Modified Controlled Random Search Algorithms, Journal of Global Optimization 11,pp. 377–385,1997. [CrossRef]
S. Kirkpatrick, CD Gelatt, , MP Vecchi, Optimization by simulated annealing, Science 220, pp. 671-680, 1983.
L. Ingber, Very fast simulated re-annealing, Mathematical and Computer Modelling 12, pp. 967-973, 1989.
R.W. Eglese, Simulated annealing: A tool for operational research, Simulated annealing: A tool for operational research 46, pp. 271-281, 1990. [CrossRef]
R. Storn, K. Price, Differential Evolution - A Simple and Efficient Heuristic for Global Optimization over Continuous Spaces, Journal of Global Optimization 11, pp. 341-359, 1997. [CrossRef]
J. Liu, J. Lampinen, A Fuzzy Adaptive Differential Evolution Algorithm. Soft Comput 9, pp.448–462, 2005.
J. Kennedy and R. Eberhart, Particle swarm optimization, Proceedings of ICNN’95 - International Conference on Neural Networks, 1995, pp. 1942-1948 vol.4. [CrossRef]
Riccardo Poli, James Kennedy kennedy, Tim Blackwell, Particle swarm optimization An Overview, Swarm Intelligence 1, pp 33-57, 2007.
Ioan Cristian Trelea, The particle swarm optimization algorithm: convergence analysis and parameter selection, Information Processing Letters 85, pp. 317-325, 2003. [CrossRef]
M. Dorigo, M. Birattari and T. Stutzle, Ant colony optimization, IEEE Computational Intelligence Magazine 1, pp. 28-39, 2006.
K. Socha, M. Dorigo, Ant colony optimization for continuous domains, European Journal of Operational Research 185, pp. 1155-1173, 2008. [CrossRef]
M. Perez, F. Almeida and J. M. Moreno-Vega, Genetic algorithm with multistart search for the p-Hub median problem, Proceedings. 24th EUROMICRO Conference (Cat. No.98EX204), Vasteras, Sweden, 1998, pp. 702-707 vol.2.
H. C. B. d. Oliveira, G. C. Vasconcelos and G. B. Alvarenga, A Multi-Start Simulated Annealing Algorithm for the Vehicle Routing Problem with Time Windows, 2006 Ninth Brazilian Symposium on Neural Networks (SBRN’06), Ribeirao Preto, Brazil, 2006, pp. 137-142.
B. Liu, L. Wang, Y.H. Jin, F. Tang, D.X. Huang, Improved particle swarm optimization combined with chaos, Chaos Solitons and Fractals 25, pp. 1261-1271, 2005. [CrossRef]
X.H. Shi, Y.C. Liang, H.P. Lee, C. Lu, L.M. Wang, An improved GA and a novel PSO-GA based hybrid algorithm, Information Processing Letters 93, pp. 255-261, 2005. [CrossRef]
Harish Garg, A hybrid PSO-GA algorithm for constrained optimization problems, Applied Mathematics and Computation 274, pp. 292-305, 2016. [CrossRef]
J. Larson and S.M. Wild, Asynchronously parallel optimization solver for finding multiple minima, Mathematical Programming Computation 10, pp. 303-332, 2018. [CrossRef]
H.P.J. Bolton, J.F. Schutte, A.A. Groenwold, Multiple Parallel Local Searches in Global Optimization. In: Dongarra J., Kacsuk P., Podhorszki N. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2000. Lecture Notes in Computer Science, vol 1908. Springer, Berlin, Heidelberg, 2000.
R. Kamil, S. Reiji, An Efficient GPU Implementation of a Multi-Start TSP Solver for Large Problem Instances, Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Computation, pp. 1441-1442, 2012.
Van Luong T., Melab N., Talbi EG. (2011) GPU-Based Multi-start Local Search Algorithms. In: Coello C.A.C. (eds) Learning and Intelligent Optimization. LION 2011. Lecture Notes in Computer Science, vol 6683. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25566-3_24. [CrossRef]
J.H. Holland, Genetic algorithms. Scientific american 267, pp. 66-73, 1992.
J. Stender, Parallel Genetic Algorithms:Theory & Applications. Edition: IOS Press, 1993.
D. Goldberg, Genetic Algorithms in Search, Optimization and Machine Learning, Addison-Wesley Publishing Company, Reading, Massachussets, 1989.
Z. Michaelewicz, Genetic Algorithms + Data Structures = Evolution Programs. Springer - Verlag, Berlin, 1996. [CrossRef]
S. Ansari, K. Alnajjar, M. Saad, S. Abdallah, A. Moursy, Automatic Digital Modulation Recognition Based on Genetic-Algorithm-Optimized Machine Learning Models, IEEE Access 10, pp. 50265-50277, 2022. [CrossRef]
Y. Ji, S. Liu, M. Zhou, Z. Zhao, X. Guo, L. Qi, L., A machine learning and genetic algorithm-based method for predicting width deviation of hot-rolled strip in steel production systems. Information Sciences 589, pp. 360-375, 2022.
Y. Hervis Santana, R. Martinez Alonso, G. Guillen Nieto, L. Martens, W. Joseph, D. Plets, Indoor genetic algorithm-based 5G network planning using a machine learning model for path loss estimation. Applied Sciences 12, 3923, 2022. [CrossRef]
X. Liu, D. Jiang, B. Tao, G. Jiang, Y. Sun, J. Kong, B. Chen, Genetic algorithm-based trajectory optimization for digital twin robots. Frontiers in Bioengineering and Biotechnology 9, 793782, 2022. [CrossRef]
K. Nonoyama, Z. Liu, T. Fujiwara, M.M. Alam, T. Nishi, Energy-efficient robot configuration and motion planning using genetic algorithm and particle swarm optimization. Energies 15, 2074, 2022. [CrossRef]
K. Liu, B. Deng, Q. Shen, J. Yang, Y. Li, Optimization based on genetic algorithms on energy conservation potential of a high speed SI engine fueled with butanol–gasoline blends, Energy Reports 8, pp. 69-80, 2022. [CrossRef]
G. Zhou, Z. Zhu, S. Luo, Location optimization of electric vehicle charging stations: Based on cost model and genetic algorithm, Energy 247, 123437, 2022. [CrossRef]
Q. Chen, X. Hu, Design of intelligent control system for agricultural greenhouses based on adaptive improved genetic algorithm for multi-energy supply system, Energy Reports 8, pp. 12126-12138, 2022. [CrossRef]
D. Min, Z. Song, H. Chen, T. Wang, T. Zhang, Genetic algorithm optimized neural network based fuel cell hybrid electric vehicle energy management strategy under start-stop condition, Applied Energy 306, 118036, 2022. [CrossRef]
R.I. Doewes, R. Nair, T. Sharma, Diagnosis of COVID-19 through blood sample using ensemble genetic algorithms and machine learning classifier, World Journal of Engineering 19, pp. 175-182, 2022. [CrossRef]
S. Choudhury, M. Rana, A. Chakraborty, S. Majumder, S. Roy, A. RoyChowdhury, S. Datta, Design of patient specific basal dental implant using Finite Element method and Artificial Neural Network technique. Journal of Engineering in Medicine 236, pp. 1375-1387, 2022. [CrossRef]
M.I. El-Anwar, M.M. El-Zawahry, A three dimensional finite element study on dental implant design, Journal of Genetic Engineering and Biotechnology 9, pp. 77-82, 2011. [CrossRef]
Zheng, Q. & Zhong, J. (2022). Design of Automatic Pronunciation Error Correction System for Cochlear Implant Based on Genetic Algorithm. ICMMIA: Application of Intelligent Systems in Multi-modal Information Analytics pp 1041–1047.
O. Brahim, B. Hamid, N. Mohammed, Optimal design of inductive coupled coils for biomedical implants using metaheuristic techniques. E3S Web Conf.: Volume 351, 2022. [CrossRef]
E. Tokgoz, M.A. Carro, Applications of Artificial Intelligence, Machine Learning, and Deep Learning on Facial Plastic Surgeries. Springer: Cosmetic and Reconstructive Facial Plastic Surgery pp 281–306, 2023.
B. Wang, J.F. Gomez-Aguilar, Z. Sabir, M.A.Z. Raja, W. Xia, H. Jahanshahi, M.O. Alassafi, F. Alsaadi, Surgery Using The Capability Of Morlet Wavelet Artificial Neural Networks, Fractals30, 2240147, 2023.
M. Ahmed, R. Seraj, S.M.S. Islam, The k-means algorithm: A comprehensive survey and performance evaluation, Electronics 9, 1295, 2020. [CrossRef]
P. Kaelo, M.M. Ali, Integrated crossover rules in real coded genetic algorithms, European Journal of Operational Research 176, pp. 60-76, 2007. [CrossRef]
I.G. Tsoulos, Modifications of real code genetic algorithm for global optimization, Applied Mathematics and Computation 203, pp. 598-607, 2008. [CrossRef]
M.J.D Powell, A Tolerant Algorithm for Linearly Constrained Optimization Calculations, Mathematical Programming 45, pp. 547-566, 1989. [CrossRef]
J. Kennedy, R. Eberhart, Particle swarm optimization, In: Proceedings of IEEE International Conference on Neural Networks pp. 1942-1948, 1995.
H.G. Beyer, H.P. Schwefel, Evolution strategies–A comprehensive introduction, Natural Computing 1, pp. 3-52, 2002.
Y. LeCun, Y. Bengio, G. Hinton, Deep learning, Nature 521, pp. 436-444, 2015.
J.H. Holland, Adaptation in Natural and Artificial Systems, University of Michigan Press, 1975.
D. Whitley, The GENITOR algorithm and selection pressure: Why rank-based allocation of reproductive trials is best, In Proceedings of the Third International Conference on Genetic Algorithms, pp. 116-121, 1994.
A.E. Eiben, J.E. Smith, Introduction to Evolutionary Computing, Springer, 2015.
S. Lloyd, Least squares quantization in PCM, IEEE Transactions on Information Theory 28, pp. 129–137, 1982.
J.B. MacQueen, Some Methods for classification and Analysis of Multivariate Observations. Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability. Vol. 1. University of California Press. pp. 281–297. MR 0214227. Zbl 0214.46201, 1967.
A.K. Jain, M.N. Murty, P.J. Flynn, Data clustering: A review, ACM Computing Surveys 31, pp. 264-323, 1999.
C.M. Bishop, Pattern recognition and machine learning, Springer, 2006.
T. Hastie, R. Tibshirani, J. Friedman, The elements of statistical learning: Data mining, inference, and prediction, Springer, 2009.
M. Montaz Ali, Charoenchai Khompatraporn, Zelda B. Zabinsky, A Numerical Evaluation of Several Stochastic Algorithms on Selected Continuous Global Optimization Test Problems, Journal of Global Optimization 31, pp 635-672, 2005. [CrossRef]
C.A. Floudas, P.M. Pardalos, C. Adjiman, W. Esposoto, Z. Gümüs, S. Harding, J. Klepeis, C. Meyer, C. Schweiger, Handbook of Test Problems in Local and Global Optimization, Kluwer Academic Publishers, Dordrecht, 1999. [CrossRef]
M. Gaviano, D.E. Ksasov, D. Lera, Y.D. Sergeyev, Software for generation of classes of test functions with known local and global minima for global optimization, ACM Trans. Math. Softw. 29, pp. 469-480, 2003.
J.E. Lennard-Jones, On the Determination of Molecular Fields, Proc. R. Soc. Lond. A 106, pp. 463–477, 1924.
W. Gropp, E. Lusk, N. Doss, A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing 22, pp. 789-828, 1996.
R. Chandra, L. Dagum, D. Kohr, D. Maydan,J. McDonald and R. Menon, Parallel Programming in OpenMP, Morgan Kaufmann Publishers Inc., 2001.

Figure 1. Statistical comparison of function calls with different distribution

Figure 2. Comparison of function calls of ELP function with different distributions and dimensions

Figure 3. Comparison of function calls of CM function with different distributions and dimensions

Table 1. The values for the parameters used in the experiments.

PARAMETER	MEANING	VALUE
$N_{c}$	Number of chromosomes	200
$N_{m}$	Initial samples for k-means	2000
k	Number of centers in k-means	200
$N_{g}$	Maximum number of allowed generations	200
$p_{s}$	Selection rate	0.9
$p_{m}$	Mutation rate	0.05
$ϵ$	Small value used in comparisons	$10^{- 6}$

Table 2. Comparison of function calls and success rates with different distributions

PROBLEM	UNIFORM	TRIANGULAR	KMEANS
BF1	5731	5934	4478
BF2	5648(0.97)	5893	4512
BRANIN	4680	4835	4627
CM4	5801	5985	4431
CAMEL	4965	5099	4824
EASOM	5657	7089	4303
EXP4	4934	4958	4539
EXP8	5021	5187	4689
EXP16	5063	5246	4874
EXP32	5044	5244	5016
GKLS250	4518	4710	4525
GKLS350	4650	4833	4637
GOLDSTEIN	8099	8537	7906
GRIEWANK2	5500(0.97)	5699(0.97)	4324
GRIEWANK10	6388(0.70)	7482(0.63)	4559
HANSEN	5681(0.93)	6329	6357
HARTMAN3	4950	5157	4998
HARTMAN6	5288	5486	5258
POTENTIAL3	5587	5806	5604
POTENTIAL5	7335	7824	7450
RASTRIGIN	5703	5848	4481
ROSENBROCK4	4241	4441	4241
ROSENBROCK8	41802	41965	4523
ROSENBROCK16	42196	42431	4962
SHEKEL5	5488(0.97)	5193(0.97)	5232(0.97)
SHEKEL7	5384	5711(0.97)	5695(0.97)
SHEKEL10	6360	5989	6396
TEST2N4	5000	5179	5047
TEST2N5	5306	5309	5039
TEST2N6	5245	5492	5107
TEST2N7	5282(0.93)	5583	5216
SINU4	4844	5046	4899
SINU8	5368	5503	5509
SINU16	6919	5583	5977
TEST30N3	7215	8115	5270
TEST30N4	7073	7455	6712
Total	273966(0.98)	282176(0.985)	186217(0.998)

Table 3. Objective function ELP. Comparison of function calls with different distributions and dimensions

dimension	Calls (uniform 200 samples)	Calls (kmeans 200 centers)
5	15637	4332
10	24690	4486
15	39791	4743
20	42976	5194
25	43617	7152
30	44502	6914
35	45252	15065
40	46567	13952
45	47640	15193
50	49393	22535
55	50062	23692
60	52293	25570
65	52546	25678
70	53346	28153
75	54110	28328
80	57209	29320
85	60970	29371
90	65319	32121
95	68097	35721
100	66803	35396
TOTAL	980820	392916

Table 4. Objective function CM. Comparison of function calls and success rates with using different distributions.

dimension	Calls (uniform 200 samples)	Calls (kmeans 200 centers)
2	5665	4718
4	6212	4431
6	7980	4390
8	9917	4449
10	12076(0.97)	4481
12	14672	4565
14	18708(0.87)	4685
16	23251(0.77)	4687
18	24624(0.77)	4766
20	30153(0.80)	4848
22	35851(0.77)	15246(0.97)
24	43677(0.93)	7865(0.93)
26	41492(0.77)	5627
28	38017(0.73)	10566(0.97)
30	47538(0.83)	24803(0.90)
TOTAL	359833(0.84)	110127(0.98)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

MDPI Initiatives

Important Links

Choose an area of interest and we will send you notifications of new preprints at your preferred frequency.

Disclaimer

An intelligent technique for initial distribution in stochastic optimization methods

Abstract

1. Introduction

2. The proposed method

2.1. Proposed initialization Distribution

2.2. Chromosome rejection rule

2.3. The proposed sampling procedure

3. Experiments

3.1. Test functions

3.2. Experimental results

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Institutional Review Board Statement

Informed Consent Statement

Acknowledgments

Conflicts of Interest

Sample Availability

References

MDPI Initiatives

Important Links

Subscribe