Kernel machine to doing logic programming in Hopfield network for solve non horn problem-3sat

doi:10.15406/mojabb.2017.01.00001

MOJ

eISSN: 2576-4519

Applied Bionics and Biomechanics

Review Article Volume 1 Issue 1

Kernel machine to doing logic programming in Hopfield network for solve non horn problem-3sat

Shehab Abdulhabib Saeed Alzaeemi,

Verify Captcha

Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.

Saratha Sathasivam, Salaudeen Abdulwaheed Adebayo, Mohd Shareduwan M Kasihmuddin, Mohd Asyraf Mansor

School of Mathematical Sciences, Universiti Sains, Malaysia

Correspondence: Shehab Abdulhabib Saeed Alzaeemi, School of Mathematical Sciences, Universiti Sains, Malaysia

Received: July 03, 2017 | Published: April 28, 2017

Citation: Alzaeemi SAS, Sathasivam S, Adebayo SA, et al. Kernel machine to doing logic programming in Hopfield network for solve non horn problem-3sat. MOJ App Bio Biomech. 2017;1(1):1–6. DOI: 10.15406/mojabb.2017.01.00001

Download PDF

Abstract

Kernel machine is computationally efficient and has the capability to function on high dimensional analysis data with random complex structure. The aim of this research is to provide a new insight on the kernel machines integrated with Hopfield Network in doing logic programming (KHNN). The newly proposed method emphasized on non horn clauses logic. Kernel machine reduced the computational burden by intelligently define the embedded memory pattern in high dimensional feature space. Since KHNN is able to formulate the estimation of neuron states efficiently, computation in high dimensional feature space of the network can be reducing dramatically. The simulation of KHNN will be executed by using Dev C++ software. The robustness of KHNN in doing non horn clause 3 sat will be evaluated based on global minima ratio, root mean square error (RMSE), and sum of squared error (SSE), mean absolute error (MAE), mean percentage error (MAPE) and computation time. The result obtained from the computer simulation demonstrates the effectiveness of KHNN in doing non horn clause problem-3 sat.

Keywords: linear kernel machine, logic programming, non-horn clause

Introduction

A neural network which is recognized as artificial neural network is a mathematical model or computational model that tries to simulate the structure and functional aspect of biological neural networks. It can solve complicated recognition solve optimization problems and analysis problems. It is because it composed of huge amount of interconnected neurons to solve specific problems.¹ Hopfield Network is a recurrent neural network investigated by John Hopfield in the early 1980s. Hopfield network serves as a content-addressable memory system with binary threshold units.² Logic is deals with false and true while in the logic programming, a set of Non Horn clauses 3 sat that formed by atoms are represented to find the truth values of the atoms in the set. It is using neurons to store the truth value of atoms to write a cost function for minimization when all the clauses are satisfied.³ Moreover, a bi-directional mapping between propositional logic formulas and energy functions of symmetric neural networks had defined by Gadi Pinkas⁴ and Wan Abdullah.⁵ Further detail can refer to the references. The advantages by using Wan Abdullah’s method are it can revolves around propositional non Horn clauses 3 sat and learning ability of the Hopfield network and hunts for the best solutions, given the clauses in the logic program, and the corresponding solutions may change as new clauses added. This research focus in kernel Hopfield neuron network, Kernel machine are powerful, computational effective analytical tools that are qualified for working on high dimensional data with complex structure.⁶ In kernel methods, the data are mapped from their original space to a higher dimensional feature space.⁷ Any operation, that can be represented through dot products has a kernel evaluation and called kernelization.^8,9

The rest of the paper is organized as follows

Section II Linear Kernel Machine, In section III Kernel Hopfield Neural Network, IV Logic Programming in Kernel Hopfield Neural Network, V Simulation and discussion and section VI to conclude the study.

Linear kernel machine

In their study Kernel Methods in Machine Learning,¹⁰ Suppose we are given empirical data:

$(x_{1}, y_{1}), (x_{2}, y_{2}), ........., (x_{n}, y_{n}) \in X \times Y$ (1)

Here, the domain $X$ is some nonempty set that the inputs (predictor variables) $x_{i}$ are taken from; the $y_{i} \in Y$ are called targets (the response variable). Here and below, $i, j \in [n]$ where we use the notation $[n] : = {1, . . ., n}$ . Note that we have not made any assumptions on the domain $X$ other than it being a set. In order to study the problem of learning, we need additional structure. In learning, we want to be able to generalize to unseen data points. In the case of binary pattern recognition, given some new input $x \in X$ , we want to predict the corresponding $y \in {\pm 1}$ (more complex output domains $Y$ will be treated below). Loosely speaking, we want to choose $y$ such that $(x, y)$ is in some sense similar to the training examples. To this end, we need similarity measures in $X$ and in {±1}. The latter is easier, as two target values can only be identical or different. For the former, were quires a function?

$k : X \times X \to R, (x, x') \to k (x, x')$ (2)

(Figure 1) A simple geometric classification algorithm: given two classes of points (depicted by “o” and “+”), compute their means $c_{+}, c_{_}$ and assign a test input $x$ to the one whose mean is closer. This can be done by looking at the dot product between $x - c$ [where $c = (c_{+} + c_{-}) / 2$ ] and $(w : = c_{+} - c_{-}),$ which changes sign as the enclosed angle passes through $π / 2$ . Note that the corresponding decision boundary is a hyper plane (the dotted line) orthogonal to w.¹⁰ Satisfying, for all:

$X, X' \in X,$ $K(X,X')= 〈 Φ (x), Φ (x') 〉$ (3)

Figure 1 Kernel Methods in Machine Learning.

where $Φ$ maps into some dot product space $H$ , sometimes called the feature space. The similarity measure $k$ is usually called a kernel, and $Φ$ is called its feature map. The advantage of using such a kernel as a similarity measure is that it allows us to construct algorithms in dot product spaces. It is interesting to note that the above optimization problem can be entirely formulated in terms of dot products hence it can be solved in an arbitrary feature space induced by a kernel function. That is the kernel represents a dot product on a different space $H$ called feature space into which the original vectors are mapped. With the introduction of a suitable kernel function the above learning procedure of the Adatron can be carried out in an arbitrary feature space where on the one hand the linear separability of the problem be guaranteed for every processing unit and on the other, the high dimensionality of the feature space produces an increment on the capacity of the network. In this way a kernel function defines an embedding of memory patterns into (high or infinite dimensional) feature vectors and allows the algorithm to be carried out in feature space without the need of representing it explicitly. With the introduction of a suitable kernel function the working of the network is mapped into a high dimensional feature space with the consequence that the capacity of the network is increased where $K (Y^{v}, S)$ is the kernel function and the $α_{v}$ are the Lagrange multipliers equal to 1.
In this study we will use kernels are the linear, shown in relation (4). The linear kernel is the simple dot product in input space whereas the other kernel functions represent dot products in arbitrary feature space.¹¹
The linear kernel is

$K_{l} (Y, S) = 〈 Y \cdot S 〉$ (4)

Kernel Machine technique is used as it can solve the combinatorial optimization problems that always occur in Hopfield network, also can reduce parameter processing time and sensors for robots.¹²

Kernel hopfield neural network

In kernel methods, the data are mapped from their original space to a higher dimensional feature space. The basic idea behind the kernel machine is that under certain conditions the kernel function can be interpreted as an inner product in a high dimensional feature space.^13,14

A linear kernel helps us to do calculation faster which would involve computations in higher dimensional space, also to induce a configuration of synaptic weights that maximize the stability of the processing unit.
Expression the updating rule in Hopfield neural network can be written:

$S_{i} (t + 1) = sgn [h_{i} (t)]$ (5)

There local field is given by the equation below

$h_{i} (t) = \sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} J_{i j}^{(2)} S_{j} + J_{i}^{(1)}$ (6)

then

$S_{i} (t + 1) = s i g n (\sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} J_{i j}^{(2)} S_{j} + J_{i}^{(1)})$ (7)

to obtain the dual representation of the local field of the HNN in feature space which can be written as:

$h_{i} = (\sum_{v = 1}^{p} (\sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} J_{i j}^{(2)} S_{j} + J_{i}^{(1)}) K (Y^{v}, S (t)))$ (8)

Nevertheless rewriting the definitions (4) of the kernel functions in terms of the products of components of the involved vectors in input space an expression equivalent to (3) for the weight vectors can be given as a generalized product of functions of the memory components. It is shown in (8) that for the linear kernel this is readily done. As an example let us consider a simple the linear kernel when $S_{i} \in {- 1, 1} .$

Equation (7) can be written

$S_{i} (t + 1) = s i g n (\sum_{v = 1}^{p} (\sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} J_{i j}^{(2)} S_{j} + J_{i}^{(1)}) (Y^{v} \cdot S (t)))$ (9)
$a n d S_{i} = {\begin{matrix} 1 & , h_{i} \geq 0 \\ - 1 & , O t h e r w i s e \end{matrix}$ (10)

Also
$K_{l} (Y, S) = 〈 Y \cdot S 〉 = \sum_{K}^{N} y_{k}^{v} S_{k}$ (11)

Inserting (9) to (11)

$S_{i} (t + 1) = s i g n (\sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} (\sum_{v = 1}^{p} (Y^{v} \cdot S (t))) + \sum_{j} J_{i j}^{(2)} S_{j} (\sum_{v = 1}^{p} (Y^{v} \cdot S (t))) + J_{i}^{(1)} (\sum_{v = 1}^{p} (Y^{v} \cdot S (t))))$ (12)

Or can write that:

$S_{i} (t + 1) = s i g n (K_{l} \sum_{j} \sum_{k} J_{i j k}^{(3)} S_{j} S_{k} + K_{l} \sum_{j} J_{i j}^{(2)} S_{j} + K_{l} J_{i}^{(1)})$ (13)

From the above derivation it can be seen that the right hand side expression of (13) is a generalized inner product of the functions of the components of vectors S all the neuron’s final state for the local minima that stored and Y is a learned interpretation stored in CAM. In conclusion the kernel machine allows a straight reroute generalization of the Hopfield network to a higher dimensional feature space. The important advantage of this procedure is that in principle all processing units can be trained to optimal stability.

Logic programming in kernel hopfield neural network

Kernel Hopfield neural network is a numerical procedure to minimize energy function to find membership grade. We will lead the way in using the neural network to solve optimization problems. It is a well-known technique used based on Lyapunov energy function and it is useful for solving combinatorial optimization problems as a content addressable memory or an analog computer. Combinatorial optimization consists of looking for the combination of choices from a discrete set which produces an optimum value for some related cost function. Use Kernel Hopfield network that can handle non-monotonicity of logic to model and solve combinatorial optimization problems.¹⁵ Kernel Hopfield neural network algorithm revolves around propositional non-Horn clauses and learning the ability of the Hopfield network.
Khnn-Non-Horn Clauses 3sat algorithm is as followed

Given a logic program, translate all Non- Horn clauses in the logic program into basic Boolean algebraic form:

$P = (A \lor B \lor \neg C) \land (\neg D \lor \neg B) \land \neg C$ (18)

2. Identify a neuron to each ground neuron.
3. Initialize all synaptic weight to zero.
4. Derive a cost function that is associated with the negation of all the clauses, such that $\frac{1}{2} (1 + S_{x})$ represents the logical value of a neuron x, where $S_{x}$ is the state of neuron corresponding to x, $S_{x} \in {1, - 1}$ . Negation (neuron x does not occur) is represented by $\frac{1}{2} (1 - S_{x})$ ; A conjunction logical connective is represented by multiplication whereas a disjunction connective is represented by addition.
5. Compute the synaptic weights by comparing cost function $C_{p}$ with energy function $E_{p}$ (wan Abdullah's method).
6. Training by checking clauses satisfaction of non-horn clauses by applying artificial bee colony and exhaustive search method. Satisfied assignment will be fed to the Hopfield network as CAM as $Y_{i}$ . If the network met the inconsistencies, the network will reset the search space (by using Wan Abdullah method).
7. We implemented Sathasivam's relaxation technique to ensure the network relaxed to achieve final state (Find the final state of the neurons, if the state of the neurons remains unchanged after five loops, we consider it as stable state). Hence, the information exchange between neurons will be updated based on the following equation: $\frac{d h_{i}^{n e w}}{d t} = R \frac{_{{d h}_{i}}}{d t}$ (19)
Where R denotes the relaxation rate and $h_{i}$ refers to the local field of the network.(Neuron relaxation).
8. Randomize the states of the neurons.
9. Find the corresponding local field by using the following equation

$h_{i} = .... + \sum_{j} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} J_{i j}^{(2)} S_{j} + J_{i}^{(1)}$ (20)

10. Classify the final state of the neurons by using Hyperbolic Activation Function

$g (h_{i}) = \frac{e^{h_{i}} - e^{- h_{i}}}{e^{h_{i}} + e^{- h_{i}}}$ (21)

Where

$S_{i} = {\begin{matrix} 1 & , g {(h)}_{i} \geq 0 \\ - 1 & , O t h e r w i s e \end{matrix}$ (22)

11. Calculate the final energy by using the following equation

$E_{P} = ... + - \frac{1}{3} \sum_{i} \sum_{j} \sum_{k} J_{[i j k]}^{(3)} S_{i} S_{j} S_{k} - \frac{1}{2} \sum_{i} \sum_{j} J_{[i j]}^{(2)}_{} S_{i} S_{j} - \sum_{i} J_{i}^{(1)} S_{i}$ (23)

If $| E_{P} - E_{s} | < T O L$
$E_{P} is global minima solution$
else
$E_{P} is l o c al minima solution t h e n e t w o r k w i l l u s e kernel m a c h i n e$
12. Extract all the neuron’s final state for the local minima. Store it at $S_{i}^{*}$ .
13. Calculate the corresponding kernel function by using the following equation

$K_{f} (Y, S_{i}^{*}) = 〈 Y . S_{i}^{*} 〉$ (24)

Where Y is a learned interpretation stored in CAM.

14. Find the corresponding kernelized local field by using the following equation

$h_{i}^{k} = .... + \sum_{j} K_{f} J_{i j k}^{(3)} S_{j} S_{k} + \sum_{j} K_{f} J_{i j}^{(2)} S_{j} + K_{f} J_{i}^{(1)}$ (25)

15. Classify the final state of the neurons by using Hyperbolic Activation Function

$g (h_{i}^{k}) = \frac{e^{h_{i}^{k}} - e^{- h_{i}^{k}}}{e^{h_{i}^{k}} + e^{- h_{i}^{k}}}$ (26)

Where

$S_{i} = {\begin{matrix} 1 & , g {(h_{i}^{k})}_{i} \geq 0 \\ - 1 & , O t h e r w i s e \end{matrix}$ (27)

16. Calculate the kernelized final energy by using the following equation

$E_{P}^{k} = - \frac{1}{3} \sum_{i} \sum_{j} \sum_{k} J_{[i j k]}^{(3)} S_{i} S_{j} S_{k} - \frac{1}{2} \sum_{i} \sum_{j} J_{[i j]}^{(2)} S_{i} S_{j} - \sum_{i} J_{i}^{(1)} S_{i}$ (28)

If $| E_{P}^{k} - E_{s} | < T O L$
$E_{P}^{k} is global minima solution$
else
$E_{P}^{k} is l o c al minima solution$
17. Find the corresponding MSE, RMSE, SSE, MBE, MAPE, SMAPE, Global minima ratio, CPU time.

Simulation and discussion in linear kernel hopfield neural network

In order to obtain the results, computer simulations have been tested using Microsoft Visual Dev C++ 2015 Express for Windows 7 to demonstrate the ability of the kernel machine in doing the logic programs for a Hopfield network based on Wan Abdullah’s method to solve non-horn clauses problems. The number and the order of the clauses are chosen by the user using try and error technique.¹⁶ The numbers of neurons involved are increased in each training. The maximum number of neurons is 120 to find the corresponding RMSE, MAE, SSE, MBE, MAPE, SMAPE, Global minima ratio, CPU time. The relaxation was run for 100 trials and 100 combinations of neurons so as to reduce statistical error. The selected tolerance value is 0.001. All these values are obtained by try and error technique, where several values are tried as tolerance values, and the value which gives better performance than other values are selected. The comparison between Hopfield neural network and linear kernel Hopfield neural network. From (Figures 2-7) as presented above represent the performance comparison of the errors for both Hopfield neural network and Linear Kernel Hopfield neural network. noted when the NN are increasing, the errors to be increasing due to the model become more complicated because there are several local minima are trying to stuck. During the kernel machine prevents that from happening we see the errors equals to zero because the kernel machine does harmony between the data and the model of the Hopfield. The results from the (Figure 8), showed that the ratio of global solutions with different number of neurons (NN=6 to NN=120) for KHNN-Non-Horn Clauses SAT and HNN-Non-Horn Clauses 3SAT. It is clear from figure 8 that the global minimum ratio of KHNN-Non-Horn Clauses 3SAT closer to one compared HNN-Non-Horn Clause 3SAT. The kernel machine shows that the global solutions obtained are nearly or 1 for all values of NN, even though the network become more complex by increasing the NN, this does not affect the results significantly. However, the results of global minima ratio obtained, for kernel machine is more stable and consistent even as NN increases. From the (Figure 9) we compare computation time of Hopfield neural network with Kernel machine and computation time of Hopfield neural network without Kernel machine, it was observed that as NN increase, the running time of KHNN's method better from HNN's method. In addition, the difference in the computational time between KHNN's method and HNN’s method turn out to be more visible as the network complexity increase. These results justify consistency and stability of Hopfield neural network with Kernel machine since they are less susceptible to get stuck at the local minima as compared to Hopfield neural network without Kernel machine.^17-19

Figure 2 Rmse Error.

Figure 3 Mae Error.

Figure 4 SSE Error.

Figure 5 Mbe Error.

Figure 6 Mape Error.

Figure 7 Smape Error.

Figure 8 ZM.

Figure 9 CT.

Conclusion

The research utilized the linear kernel (KHNN) in finding optimal neuron states, which portrayed an improved method of doing logic programming in Hopfield network. The performances of these two methods KHNN and HNN were compared based on RMSE, MAE, SSE, MBE, MAPE, SMAPE, Global minima ratio, CPU time. Results obtained indicate that the KHNN improved the efficiency in finding global solutions. Besides, the computational time using KHNN is better than the HNN. Furthermore, this poses an indication that KHNN is well adapted to a complex network compared to HNN, whose effect is obviously seen as the complexity of the network increases simply increases the number of neuron (NN). The results obtained also certified that KHNN always converges to the optimal solution or nearly to the optimal solutions and maintains the population of the candidate solutions for the problem to be solved. In addition to this, KHNN is less prone to get trapped in the local optima or in any sub-optimal solutions. In contrast, HNN exhibits slow convergence to the desired solutions (global solutions) and takes longer computational time as the network gets larger and complex. Thus, from the simulation results obtained, it can be concluded that KHNN is a promising machine for solving optimization problems and is a useful technique when dealing with a large and complex search space, while the results of the errors of RMSE, MAE, SSE, MBE, MAPE, SMAPE to zero in KHNN strengthening the assertion of global solutions in the process.