Genetic Algorithms

When searching a large space with many dimensions, greedy search algorithms find locally good results that are often much worse than the best possible solutions. Genetic Algorithms (GAs) are an efficient means of finding near-optimum solutions by exploring a large, many-dimensional search space more uniformly. Genetic algorithms are a type of heuristic search where the heuristic combines a fitness function with efficient search techniques inspired by biology.

Genetic algorithms can be said to be inspired by biology because they deal with mutation, crossover, and selection. Each possible solution to a problem is encoded in a chromosome that represents a point in a many-dimensional search space.

GA computer simulations evolve a population of chromosomes in the hope that it will come to contain at least some fit individuals. Fitness is specified by a fitness function that rates each individual in the population (of chromosomes); these ratings make it possible to use selection to choose the best candidate chromosomes to mutate, to use in crossover operations, or to save as-is for the next generation. We make copies of the selected chromosomes and slightly perturb the copies with random mutations. Furthermore, pairs of selected chromosomes are cut at the same random gene index and the cut pieces of the two chromosomes are swapped (a process called crossover).

Setting up a GA simulation is fairly easy: we need to represent (or encode) the state of a system in a chromosome that is usually implemented as a set of bits. GA is basically a search operation: searching for a good solution to a problem where the solution is a very fit chromosome. The programming technique of using GA is useful for AI systems that must adapt to changing conditions because “re-programming” can be as simple as defining a new fitness function and re-running the simulation. An advantage of GA is that the search process will not often “get stuck” in local optima because the genetic crossover process produces radically different chromosomes in new generations, while occasional mutations (flipping a random bit in a chromosome) cause small changes.

As you can imagine, performing a crossover operation can move a chromosome to a very distant point in the search space. Mutating a single bit, on the other hand, only moves a chromosome (i.e., a point in the search space) along one dimension.

Another aspect of GA is supporting the evolutionary concept of “survival of the fittest”: by using the fitness function we will preferentially “breed” chromosomes with higher fitness values.

It is interesting to compare how GAs are trained with how we train neural networks (see the next chapter on Neural Networks). In both cases we need to manually “supervise” the training process: for GAs we need to supply a fitness function, while for the neural network models used in the chapter Neural Networks we supply training data with desired sample outputs for sample inputs.

Theory

GAs are typically used to search very large and usually very high dimensional search spaces. If we want to find a solution as a single point in an N-dimensional space where a fitness function has a near maximum value, then we have N parameters to encode in each chromosome. In this chapter we will be solving a simple problem in which we only need to encode a single number (a floating point number for this example) in each chromosome. We are effectively quantizing a floating point number to an integer, and the bit representation of that integer is a chromosome. Using a GA toolkit like the one developed in a later section requires two problem-specific customizations:

  • Characterize the search space by a set of parameters that can be encoded in a chromosome (more on this later). GAs work with the coding of a parameter set, not the parameters themselves (Genetic Algorithms in Search, Optimization, and Machine Learning, David Goldberg, 1989).
  • Provide a numeric fitness function that allows us to rate the fitness of each chromosome in a population. We will use these fitness values in the selection process to determine which chromosomes in the population are most likely to survive and reproduce using genetic crossover and mutation operations.

The GA toolkit developed in this chapter treats each gene as a single bit; while you could consider a gene to be an arbitrary data structure, the approach of using single-bit genes and specifying the number of genes (or bits) in a chromosome is very flexible. A population is a set of chromosomes. A generation is defined as one reproductive cycle in which some elements of the chromosome population are replaced with new chromosomes produced by a genetic crossover operation, followed by optionally mutating a few chromosomes in the population.
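
To make this representation concrete, here is a minimal, self-contained sketch (an illustration only, not part of the library, whose Chromosome class appears later) of a population of single-bit-gene chromosomes stored as boolean arrays:

    // Illustration only: a population as an array of chromosomes, where each
    // chromosome is an array of 1-bit genes (booleans), initialized randomly.
    boolean[][] randomPopulation(int numChromosomes, int numGenesPerChromosome) {
      java.util.Random random = new java.util.Random();
      boolean[][] population = new boolean[numChromosomes][numGenesPerChromosome];
      for (int c = 0; c < numChromosomes; c++) {
        for (int g = 0; g < numGenesPerChromosome; g++) {
          population[c][g] = random.nextBoolean();
        }
      }
      return population;
    }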

We will describe a simple example problem (that can be better solved using Newton’s method) in this section, write a general purpose library in the section Java Library for Genetic Algorithms, and finish the chapter in the section Java Genetic Algorithm Example by solving this problem.

Example Function

For a sample problem, let’s suppose that we want to find the maximum value of the function F with one independent variable x in the following equation, as seen in the last figure:

F(x) = sin(x) * sin(0.4 * x) * sin(3 * x)

The problem that we want to solve is finding a value of x that yields a value of F(x) close to the maximum possible. To be clear: we encode a floating point number as a chromosome made up of a specific number of bits, so any chromosome with randomly set bits will represent some random number in the interval [0, 10]. The fitness function is simply the function in the last equation.
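
To get a rough feel for the search space before running the GA, here is a throwaway brute-force scan over the interval [0, 10] (an illustration only, not part of the GA library):

    // Brute-force scan of F(x) = sin(x) * sin(0.4 * x) * sin(3 * x) over [0, 10]
    // to see approximately where the maximum lies (illustration only).
    public class BruteForceScan {
      public static void main(String[] args) {
        double bestX = 0.0;
        double bestF = Double.NEGATIVE_INFINITY;
        for (double x = 0.0; x <= 10.0; x += 0.001) {
          double f = Math.sin(x) * Math.sin(0.4 * x) * Math.sin(3.0 * x);
          if (f > bestF) { bestF = f; bestX = x; }
        }
        System.out.println("approximate maximum F(x) = " + bestF + " at x = " + bestX);
      }
    }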

Crossover Operation

This figure shows an example of a crossover operation that we will implement later in the program example. A random chromosome bit index is chosen, the two chromosomes are “cut” at this index, and the cut parts are swapped. The two original chromosomes in generation n are shown on the left of the figure; after the crossover operation they produce two new chromosomes in generation n + 1, where n is the current generation number. The two new chromosomes are shown on the right of the figure.
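
Conceptually, single-point crossover on two bit-string chromosomes can be sketched as follows (a minimal illustration; the library’s actual doCrossovers method, shown in a later section, chooses the parents and the cut index randomly):

    // Minimal sketch of single-point crossover: genes before the cut index (locus)
    // come from one parent, the rest from the other parent.
    static boolean[][] crossover(boolean[] parent1, boolean[] parent2, int locus) {
      int n = parent1.length;
      boolean[] child1 = new boolean[n];
      boolean[] child2 = new boolean[n];
      for (int g = 0; g < n; g++) {
        child1[g] = (g < locus) ? parent1[g] : parent2[g];
        child2[g] = (g < locus) ? parent2[g] : parent1[g];
      }
      return new boolean[][] { child1, child2 };
    }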

In addition to using crossover operations to create new chromosomes from existing chromosomes, we will also perform genetic mutation by randomly flipping bits in chromosomes. A fitness function that rates the fitness value of each chromosome allows us to decide which chromosomes to discard and which to use for the next generation. We will use the most fit chromosomes in the population for producing the next generation using crossover and mutation.

We will implement a general purpose Java GA library in the next section and then solve the example problem just posed in the section Java Genetic Algorithm Example at the end of this chapter.

Java Library for Genetic Algorithms

The full implementation of the GA library is in the Java source file Genetic.java. The following code snippets show the method signatures defining the public API for the library. Note that there are two constructors: the first uses default values for the fraction of chromosomes on which to perform crossover and mutation operations, while the second allows setting explicit values for these parameters:

1     abstract public class Genetic {
2       public Genetic(int num_genes_per_chromosome,
3                      int num_chromosomes)
4       public Genetic(int num_genes_per_chromosome,
5                      int num_chromosomes,
6                      float crossover_fraction,
7                      float mutation_fraction)

The method sort is used to sort the population of chromosomes in most fit first order. The methods getGene and setGene are used to fetch and change the value of any gene (bit) in any chromosome. These methods are protected because you may need to override them in derived classes.

1       protected void sort()
2       protected boolean getGene(int chromosome,
3                                 int gene)
4       protected void setGene(int chromosome,
5                              int gene, int value)
6       protected void setGene(int chromosome, 
7                              int gene,
8                              boolean value)

The methods evolve, doCrossovers, doMutations, and doRemoveDuplicates are utilities for running GA simulations. These methods are protected but you will probably not need to override them in derived classes.

1       protected void evolve()
2       protected void doCrossovers()
3       protected void doMutations()
4       protected void doRemoveDuplicates()
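
One plausible way these utilities fit together for a single generation is sketched below; this is an assumption about the flow rather than the actual body of evolve in Genetic.java:

    // Hypothetical sketch of one reproductive cycle (not the actual library code):
    protected void evolve() {
      calcFitness();        // rate every chromosome with the application-supplied fitness function
      sort();               // order the population most fit first
      doRemoveDuplicates(); // keep the population diverse
      doCrossovers();       // breed new chromosomes from fit parents
      doMutations();        // randomly flip a few genes
    }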

When you subclass class Genetic you must implement the following abstract method calcFitness that will determine the evolution of chromosomes during the GA simulation.

1       // Implement the following method in sub-classes:
2       abstract public void calcFitness();
3     }

The class Chromosome represents an ordered bit sequence with a specified number of bits and a floating point fitness value.

1     class Chromosome {
2       private Chromosome()
3       public Chromosome(int num_genes)
4       public boolean getBit(int index) 
5       public void setBit(int index, boolean value)
6       public float getFitness()
7       public void setFitness(float value)
8       public boolean equals(Chromosome c)
9     }

The class ChromosomeComparator implements the Comparator<Chromosome> interface and is application specific. It is used to sort a population in “best first” order:

1     class ChromosomeComparator
2           implements Comparator<Chromosome> {
3       public int compare(Chromosome o1,
4                          Chromosome o2)
5     }

The last class ChromosomeComparator is used when calling the static sort method of the Java Collections class.
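
For example, assuming the population is held in a List<Chromosome> field named chromosomes (as suggested by the calcFitness example later in this chapter), the library’s sort method might be as simple as this sketch:

    protected void sort() {
      // Sort the population in "best first" order using ChromosomeComparator
      // (assumes a List<Chromosome> field named chromosomes):
      java.util.Collections.sort(chromosomes, new ChromosomeComparator());
    }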

The class Genetic is an abstract class: you must subclass it and implement the method calcFitness that uses an application specific fitness function (that you must supply) to set a fitness value for each chromosome.

The following UML class diagram provides an overview of the Java classes and their public APIs as well as the class MyGenetic that will implement a fitness function for our example of finding a maximum value in an equation and the test class TestGenetic:

UML Class Diagram for library and test program

This GA library provides the following behavior:

  • Generates an initial random population with a specified number of bits (or genes) per chromosome and a specified number of chromosomes in the population

  • Ability to evaluate each chromosome based on a numeric fitness function

  • Ability to create new chromosomes from the most fit chromosomes in the population using the genetic crossover and mutation operations

The two class constructors for Genetic set up a new GA experiment. Both constructors require the number of genes (or bits) per chromosome and the number of chromosomes in the population. The second constructor additionally allows you to set the fractions for mutation and crossover operations.

The Genetic class constructors build an array of integers named rouletteWheel that is used to weight the most fit chromosomes in the population when choosing the parents of crossover and mutation operations. When a chromosome is being chosen, a random integer is selected to be used as an index into the rouletteWheel array; the values in the array are all integer indices into the chromosome array. More fit chromosomes are heavily weighted in favor of being chosen as parents for the crossover operations. The algorithm for the crossover operation is fairly simple; here is the implementation:

 1      public void doCrossovers() {
 2        int num = (int)(numChromosomes * crossoverFraction);
 3        for (int i = num - 1; i >= 0; i--) {
 4          // Don't overwrite the "best" chromosome
 5          // from current generation:
 6          int c1 = 1 + (int) ((rouletteWheelSize - 1) *
 7                              Math.random() * 0.9999f);
 8          int c2 = 1 + (int) ((rouletteWheelSize - 1) *
 9                              Math.random() * 0.9999f);
10          c1 = rouletteWheel[c1];
11          c2 = rouletteWheel[c2];
12          if (c1 != c2) {
13            int locus = 1+(int)((numGenesPerChromosome-2) *
14                                Math.random());
15            for (int g = 0; g<numGenesPerChromosome; g++) {
16              if (g < locus) {
17                setGene(i, g, getGene(c1, g));
18              } else {
19                setGene(i, g, getGene(c2, g));
20              }
21            }
22          }
23        }
24      }
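
The roulette wheel construction itself is not shown here; a hypothetical linear-ranking version (an illustration of the idea, not the code in Genetic.java, which may weight chromosomes differently) could look like this:

    // Hypothetical sketch: give the most fit chromosomes (lowest index after
    // sorting) the most slots in the roulette wheel, so they are chosen as
    // parents more often.
    protected void buildRouletteWheel() {
      rouletteWheelSize = 0;
      for (int i = 0; i < numChromosomes; i++) {
        rouletteWheelSize += numChromosomes - i;
      }
      rouletteWheel = new int[rouletteWheelSize];
      int slot = 0;
      for (int i = 0; i < numChromosomes; i++) {
        // chromosome i gets (numChromosomes - i) slots:
        for (int k = 0; k < numChromosomes - i; k++) {
          rouletteWheel[slot++] = i;
        }
      }
    }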

The method doMutations is similar to doCrossovers: we randomly choose chromosomes from the population and for these selected chromosomes we randomly “flip” the value of one gene (a gene is a bit in our implementation):

 1       public void doMutations() {
 2         int num = (int)(numChromosomes * mutationFraction);
 3         for (int i = 0; i < num; i++) {
 4           // Don't overwrite the "best" chromosome
 5           // from current generation:
 6           int c = 1 + (int) ((numChromosomes - 1) *
 7                              Math.random() * 0.99);
 8           int g = (int) (numGenesPerChromosome *
 9                          Math.random() * 0.99);
10           setGene(c, g, !getGene(c, g));
11         }
12       }

We developed a general purpose library in this section for simulating populations of chromosomes that can evolve to a more “fit” population given a fitness function that ranks individual chromosomes in order of fitness. In the next section we will develop an example GA application by defining the size of a population and the fitness function that we saw earlier.

Finding the Maximum Value of a Function

We will use the Java library from the last section to develop an example application that finds the maximum of the test function shown in the figure showing the sample function, plotted over the interval [0, 10].

While we could find the maximum value of this function by using Newton’s method (or even a simple brute force search over the range of the independent variable x), the GA method scales very well to similar problems of higher dimensionality. The GA also helps us to find better than locally optimum solutions. In this example we are working in one dimension so we only need to encode a single variable in a chromosome. As an example of a 20-dimensional space, we might have a financial model with 20 independent variables x1, x2, ..., x20, and a single chromosome would still represent a point in this 20-dimensional space. To continue this example, if we used 10 bits to represent the value range in each of the 20 dimensions, then the chromosome would be represented as 200 bits.
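
If we did pack several parameters into one chromosome, decoding might look like the following hypothetical helper (the names and the scaling to [0, 10] are assumptions for illustration; the single-parameter version actually used in this chapter appears below):

    // Hypothetical sketch: decode a chromosome that packs numParameters values,
    // each stored as bitsPerParameter bits and scaled to the range [0, 10].
    float[] decodeParameters(boolean[] genes, int numParameters, int bitsPerParameter) {
      float[] params = new float[numParameters];
      for (int p = 0; p < numParameters; p++) {
        int value = 0;
        int base = 1;
        for (int j = 0; j < bitsPerParameter; j++) {
          if (genes[p * bitsPerParameter + j]) value += base;
          base *= 2;
        }
        // Scale the integer in [0, 2^bitsPerParameter - 1] to a float in [0, 10]:
        params[p] = 10.0f * value / (float)((1 << bitsPerParameter) - 1);
      }
      return params;
    }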

To generalize, our first task is to characterize the search space as one or more parameters. In general when we write GA applications we might need to encode several parameters in a single chromosome. As another example, if a fitness function has three arguments we would encode three numbers in a single chromosome.

Let’s get back to our 1-dimensional example seen in the figure showing the sample function. This is a simple example showing you how to set up a GA simulation. In this example problem we have only one parameter, the independent variable x. We will encode the parameter x using ten bits (so we have ten 1-bit genes per chromosome). A good starting place is writing a utility method for converting the 10-bit representation to a floating-point number in the range [0.0, 10.0]:

1     float geneToFloat(int chromosomeIndex) {
2       int base = 1;
3       float x = 0;
4       for (int j=0; j<numGenesPerChromosome; j++)  {
5          if (getGene(chromosomeIndex, j)) {
6             x += base;
7          }
8          base *= 2;
9       }

For each bit at index j with a value of 1, we add the current value of base (that is, 2^j) to the running sum x.

We need to normalize this sum x, which is an integer in the range [0, 1023], to a floating point number in the approximate range [0, 10]:

1       x /= 102.4f;
2       return x;
3     }
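
For example, a chromosome with all ten bits set encodes the integer 1023, which maps to 1023 / 102.4 ≈ 9.99, while a chromosome of all zero bits maps to 0.0.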

Note that we do not need the reverse method! We use our GA library from the last section to create a population of 10-bit chromosomes. In order to evaluate the fitness of each chromosome in a population, we only have to convert its 10-bit representation to a floating-point number and evaluate the fitness function we showed earlier (the figure showing the sample function):

1     private float fitness(float x) {
2       return (float)(Math.sin(x) *
3                      Math.sin(0.4f * x) *
4                      Math.sin(3.0f * x));
5     }

The following table shows some sample random chromosomes and the floating point numbers that they encode. The first column shows the gene indices where the bit is “on,” the second column shows the chromosome as an integer represented in binary notation, and the third column shows the floating point number that the chromosome encodes. Note that the center column shows the bits in order where index 0 is the left-most bit and index 9 is the right-most bit. This is the reverse of the normal order for writing integers, but the GA does not care: it works with any encoding we use as long as it is consistent.

1   “On bits” in chromosome   As binary   Number encoded
2   -----------------------   ---------   --------------
3   2,  5,  7,  8,  9         0010010111  9.1015625
4   0,  1,  3,  5,  6         1101011000  1.0449219
5   0,  3,  5,  6,  7, 8      1001011110  4.7753906
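
For example, in the first row the bits at indices 2, 5, 7, 8, and 9 are set, so the encoded integer is 4 + 32 + 128 + 256 + 512 = 932, and 932 / 102.4 = 9.1015625.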

Using the methods geneToFloat and fitness, we now implement the abstract method calcFitness from our GA library class Genetic so that the derived class TestGenetic is not abstract. This method has the responsibility for calculating and setting the fitness value for every chromosome stored in an instance of class Genetic:

1     public void calcFitness() {
2       for (int i=0; i<numChromosomes; i++) {
3         float x = geneToFloat(i);
4         chromosomes.get(i).setFitness(fitness(x));
5       }
6     }

While a separate geneToFloat method makes this example clearer, it would also have been reasonable to inline the bit-to-float conversion and the fitness formula directly in the implementation of the abstract (in the base class) method calcFitness.
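
As a rough sketch (an alternative, not the code in the example source files), that combined version might look like this:

    // Hypothetical combined version: bit-to-float conversion and fitness formula
    // inlined directly in calcFitness:
    public void calcFitness() {
      for (int i = 0; i < numChromosomes; i++) {
        float x = 0;
        int base = 1;
        for (int j = 0; j < numGenesPerChromosome; j++) {
          if (getGene(i, j)) x += base;
          base *= 2;
        }
        x /= 102.4f;  // scale the integer in [0, 1023] to approximately [0, 10]
        chromosomes.get(i).setFitness((float)(Math.sin(x) *
                                              Math.sin(0.4f * x) *
                                              Math.sin(3.0f * x)));
      }
    }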

In any case we are done with coding this example. You can compile the two example Java files Genetic.java and TestGenetic.java, and run the TestGenetic class to verify that the example program quickly finds a near maximum value for this function. The project Makefile has a single target that builds the library and runs the example test program:

1 test:
2  mvn install
3  mvn exec:java -Dexec.mainClass="com.markwatson.geneticalgorithm.TestGenetic"

You can try setting different numbers of chromosomes in the population and try non-default values such as a crossover rate of 0.85 and a mutation rate of 0.3. We will look at a run with a small number of chromosomes in the population, created with:

 1       genetic_experiment =
 2                      new MyGenetic(10, 20, 0.85f, 0.3f);
 3       int NUM_CYCLES = 500;
 4       for (int i=0; i<NUM_CYCLES; i++) {
 5         genetic_experiment.evolve();
 6         if ((i%(NUM_CYCLES/5))==0 || i==(NUM_CYCLES-1)) {
 7           System.out.println("Generation " + i);
 8           genetic_experiment.print();
 9         }
10       }

In this experiment 85% of chromosomes will be “sliced and diced” with a crossover operation and 30% will have one of their genes changed. We specified 10 bits per chromosome and a population size of 20 chromosomes. In this example, I have run 500 evolutionary cycles. After you determine a fitness function to use, you will probably need to experiment with the size of the population and the crossover and mutation rates. Since the simulation uses random numbers (and is thus non-deterministic), you can get different results by simply rerunning the simulation. Here is example program output (with much of the output removed for brevity):

 1     count of slots in roulette wheel=55
 2     Generation 0
 3     Fitness for chromosome 0 is 0.505, occurs at x=7.960
 4     Fitness for chromosome 1 is 0.461, occurs at x=3.945
 5     Fitness for chromosome 2 is 0.374, occurs at x=7.211
 6     Fitness for chromosome 3 is 0.304, occurs at x=3.929
 7     Fitness for chromosome 4 is 0.231, occurs at x=5.375
 8     ...
 9     Fitness for chromosome 18 is -0.282 occurs at x=1.265
10     Fitness for chromosome 19 is -0.495, occurs at x=5.281
11     Average fitness=0.090 and best fitness for this
12     generation:0.505
13     ...
14     Generation 499
15     Fitness for chromosome 0 is 0.561, occurs at x=3.812
16     Fitness for chromosome 1 is 0.559, occurs at x=3.703
17     ...

This example is simple but is intended to show you how to encode parameters for a problem where you want to search for values to maximize a fitness function that you specify. Using the library developed in this chapter you should be able to set up and run a GA simulation for your own applications.

The important takeaway is that if you can encode a problem space as a chromosome and you have a fitness function to rate the numerical effectiveness of a chromosome, then Genetic Algorithms are an effective alternative to greedy search algorithms.