Current state of quantum theory

doi:10.15406/paij.2017.01.00021

eISSN: 2576-4543

Physics & Astronomy International Journal

Review Article Volume 1 Issue 4

Vatsya SR

648 Inverness Ave, Canada

Correspondence: Anthony D.M. Curtis, Institute of Science and Technology in Medicine, Keele University, Keele, Staffordshire, UK, Tel 519 474 1183

Received: September 14, 2017 | Published: October 17, 2017

Citation: Vatsya SR. Current state of quantum theory. Phys Astron Int J. 2017;1(4):115-121. DOI: 10.15406/paij.2017.01.00021

Abstract

Standard and path integral formulations of quantum mechanics are reviewed briefly followed by the formulations based on the gauge transformations, which are Wheeler’s formulation and another one founded upon an extension of Hamilton’s action principle. Various interpretations of quantum mechanics are discussed and compared. Recent developments in quantum field theory are commented on.

Keywords: quantum theory, Hamilton’s action principle, Nelson’s formulation, wave function

Introduction

The original formulation of quantum mechanics is based on a set of empirically deduced postulates available in the standard texts. This founding premise is discomforting, for a scientific discipline should be founded upon a conceptual ground and a systematic, logically consistent structure. The situation is improved, but only partially, by Feynman’s path integral formulation. Further disconcerting situations developed as a consequence of the original formulation, mainly concerning the quantum mechanical understanding of measurement, i.e., observation. The observed impact of the method of observation on its outcome presents an enigmatic situation, which is not well understood. This problem is best illustrated by the quintessential double slit experiment. With two slits open, individual microscopic entities such as electrons and photons arrive on the observation screen as classical particles but not at the classically prescribed locations. As the number of arrivals increases, they cluster about certain locations, and after a very large number of them have arrived on the screen, the density distribution resembles the intensity pattern produced by two interfering wavelets emanating from the slits. Such observations underlie the doctrine of the wave-particle duality, which fuses two contradictory concepts together. If an observation is made behind one of the slits that enables an extraction of “which path” information, i.e., which path some particles took, the density distribution becomes that produced by particles passing through the two slits without interference. If the “which path” information is erased, the distribution again becomes interference-like. Both behaviors can be observed in different parts of the data in a single experiment.⁵

A number of interpretations have been developed to address the above issue; they are described in this paper and commented upon in the appropriate sections. None of the interpretations developed to explain such phenomena is universally accepted; their popularity has varied with time. Despite a weak foundation, the theory has survived owing to its remarkable success in describing the observed physical phenomena. Nevertheless, the resulting issues of a fundamental nature are not yet fully resolved. This review describes the formulations and the weaknesses in the foundational structure of quantum mechanics.

Parallel efforts have continued to formulate quantum mechanics in terms of the concept of gauge transformations underlying Weyl’s geometry⁶ in the hope of resolving the foundational issues. The first such attempt was made by London⁷ but little progress was made. However, some recent developments based on this concept have been successful in making major advances.⁸,⁹ The formulation including such developments is referred to as the geometrical formulation. This brief review describes this formulation and its advantages over the standard ones.

Wheeler’s Wiener integral formulation in Weyl’s geometry¹⁰ supplemented with Nelson’s formulation¹¹ also relates the gauge transformations with quantum mechanics but in a limited way. This development, although noteworthy, is founded upon rather untenable assumptions and it has not been found to be very useful. This formulation has been included and commented upon for its relevance and historical value. Included also is a comparative discussion of the Aharonov-Bohm effect¹² that is geometrical in nature as understood in terms of the standard formulation but more so in terms of the geometrical formulation.

There are troubling issues with quantum field theory also, which are not discussed here in detail but briefly commented on.

Gauge transformations

In the Riemann spaces, the length of a vector remains constant under parallel transport. The Weyl geometry⁶ was developed by assuming that the length $l_{x}$ of a vector undergoes a change of $δ l_{x} = a {ϕ^{'}}_{μ} d x^{μ} l_{x}$ under parallel transport from an arbitrary point $x$ to a neighboring point $x + d x$ , where Weyl’s weight $a$ is a nonzero constant and ${ϕ^{'}}_{μ}$ , termed the Weyl gauge potentials, are the components of a vector, Weyl’s vector potential. It follows that the length $l_{y}$ at a point $y$ of a vector transported from $x$ to $y$ along a trajectory $ρ_{x y}$ is given by

$l_{y} = \exp [a \int_{ρ_{x y}} {ϕ^{'}}_{μ} d x^{μ}] l_{x}$ . (1)

Also, the gauge can be assigned essentially arbitrarily at every point. This is equivalent to recalibrating the lengths by redefining the metric, $\tilde{g} (x) \to κ^{2} (x) \tilde{g} (x)$ , which transforms the length $l_{x} \to κ (x) l_{x} = {l^{'}}_{x}$ . It follows from (1) that as a result of this recalibration,

$l_{y} \to κ (y) l_{y} = {l^{'}}_{y} = κ (y) \exp [a \int_{ρ_{x y}} {ϕ^{'}}_{μ} d x^{μ}] κ^{- 1} (x) {l^{'}}_{x} = \exp [\int_{ρ_{x y}} (a {ϕ^{'}}_{μ} + κ^{- 1} κ_{μ}) d x^{μ}] {l^{'}}_{x}$ , (2)

where $κ_{μ}$ denotes the derivative of $κ$ with respect to $x^{μ}$ . A gauge transformation defined by (2) contains a path-dependent part, a functional defined by (1) and termed the essential gauge, and a point function $κ (x)$ referred to as the assigned gauge.
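The equality of the two forms in (2), and the path dependence of the essential gauge in (1), can be checked numerically. The following is only a minimal sketch under stated assumptions: the two-dimensional potential `phi`, the assigned gauge `kappa`, the real weight `a = 0.3` and the two paths from $(0, 0)$ to $(1, 1)$ are arbitrary illustrative choices that do not come from the text, and the line integrals are approximated by a midpoint rule.

```python
import numpy as np

# All functions below are hypothetical choices made for illustration only.
def phi(p):
    """Assumed Weyl potential (phi_1, phi_2) = (-y/2, x/2); it is not a gradient."""
    x, y = p[:, 0], p[:, 1]
    return np.stack([-0.5 * y, 0.5 * x], axis=1)

def kappa(p):
    """Assumed positive assigned gauge kappa(x, y)."""
    x, y = p[:, 0], p[:, 1]
    return 1.0 + 0.1 * x**2 + 0.2 * y

def grad_log_kappa(p):
    """The term kappa^{-1} kappa_mu of (2) for the kappa defined above."""
    x = p[:, 0]
    k = kappa(p)
    return np.stack([0.2 * x / k, 0.2 / k], axis=1)

def straight(t):
    """Straight path from (0, 0) to (1, 1)."""
    return np.stack([t, t], axis=1)

def parabola(t):
    """Parabolic path from (0, 0) to (1, 1)."""
    return np.stack([t, t**2], axis=1)

def line_integral(field, path, n=20001):
    """Midpoint-rule approximation of the integral of field_mu dx^mu along path."""
    t = np.linspace(0.0, 1.0, n)
    pts = path(t)
    mid = 0.5 * (pts[1:] + pts[:-1])
    dx = np.diff(pts, axis=0)
    return float(np.sum(field(mid) * dx))

def both_sides_of_eq2(a, path):
    """Return the two expressions for the factor multiplying l'_x in (2)."""
    k_x, k_y = kappa(path(np.array([0.0, 1.0])))
    lhs = k_y * np.exp(a * line_integral(phi, path)) / k_x
    rhs = np.exp(line_integral(lambda p: a * phi(p) + grad_log_kappa(p), path))
    return lhs, rhs

a = 0.3
for path in (straight, parabola):
    lhs, rhs = both_sides_of_eq2(a, path)
    print(path.__name__, lhs, rhs)
```

In this sketch the two sides agree for each path, while the factor itself differs between the two paths because the assumed potential is not a gradient; the assigned gauge contributes only through its values at the end points.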

The gauge transformations are at times introduced differently in the literature. Weyl himself introduced them as length recalibrations, and the parallel transport rule arose out of the resulting necessity to define a torsion-free derivative. However, the metric and the affine connections that define the parallel transport are independent concepts and both are taken into account in all formulations. This and the other presentations are equivalent to the present one; the present one is better suited for the present purpose.

Quantum mechanics

Standard formulation

Standard formulation of quantum mechanics is available in the text books in detail. Here we discuss a few of the postulates for their relevance to the other formulations discussed here.

The state of a physical system is described completely by its wave function, or the amplitude $ψ = ψ (r)$ . If there are two possible states $ψ_{1}$ and $ψ_{2}$ for a system, then their linear combinations $c_{1} ψ_{1} + c_{2} ψ_{2}$ are also possible states; this is known as the Superposition Principle. The probability of a particle being in the space region between $r$ and $r + d r$ is proportional to $| ψ (r) |^{2} d r$ , which is known as the Born Rule. Physical observables are represented by mathematical operators. If a state $ψ$ of a system is a linear combination $ψ = c_{1} ψ_{1} + c_{2} ψ_{2}$ of two eigenstates $ψ_{1}$ and $ψ_{2}$ of an observable $A$ , then the outcome of a single measurement is one of the two corresponding eigenvalues of $A$ , and the probability of it being the eigenvalue corresponding to $ψ_{n}$ is equal to $| c_{n} |^{2}, n = 1, 2$ . This constitutes the basis of the Copenhagen Interpretation, i.e., upon measurement the wave function collapses into a single state. The representation of most of the observables can be determined from that of the momentum $p$ , which has a concrete representation by a differential operator, $p = - i \partial = - i \partial / \partial x$ , where $x$ is the position. Two canonically conjugate observables, e.g., the position and momentum, are assumed to satisfy a commutation relation. This yields the Uncertainty Principle: there is a limit to the accuracy with which two conjugate observables can be measured simultaneously. These postulates differ fundamentally from the classical formulation, where the physical observables completely describe the state of a system that is precisely determined. Evolution of a system is described by the Schrödinger equation

$i \frac{\partial ψ}{\partial t} = H ψ$ , (3)

where $H$ is the Hamiltonian operator obtained from its classical form by replacing its argument observables by their operator representatives. Here $t$ is the time treated as a parameter and (3) refers to the one-dimensional non-relativistic case. Generalization to the Riemannian case, which includes the relativistic case, entails letting $p^{μ} \to i \partial^{; μ}$ , where $p^{μ}$ and $\partial^{; μ}$ are the contravariant components of the momentum and the covariant derivative, respectively. In the relativistic formulation, time loses its character as an independent parameter; in general Riemannian spaces, it is replaced by the arc length, as will be discussed.
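The evolution postulate (3) and the measurement postulate can be illustrated together on a finite-dimensional system. The sketch below is an assumption-laden toy example rather than anything taken from the text: the two-level Hamiltonian `H`, the initial state `psi0` and the time `t = 0.3` are hypothetical, and units with $\hbar = 1$ are used as in (3). The state is expanded in the eigenstates of `H`, each coefficient acquires the phase $\exp (- i E t)$ , and the outcome probabilities follow from the squared moduli.

```python
import numpy as np

def evolve(hamiltonian, psi0, t):
    """Solve i d(psi)/dt = H psi for a constant Hermitian matrix H (hbar = 1)."""
    energies, vecs = np.linalg.eigh(hamiltonian)
    coeffs = vecs.conj().T @ psi0            # expansion in eigenstates of H
    return vecs @ (np.exp(-1j * energies * t) * coeffs)

# Hypothetical two-level system, chosen only for illustration.
H = np.array([[0.0, 1.0], [1.0, 0.0]])
psi0 = np.array([1.0, 0.0], dtype=complex)

psi_t = evolve(H, psi0, t=0.3)
probs = np.abs(psi_t) ** 2                   # |c_n|^2 in the measurement basis
print(probs, probs.sum())
```

For this particular choice the probabilities are $\cos^{2} t$ and $\sin^{2} t$ , and their sum remains equal to one at all times.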

Feynman’s path integral formulation

In Feynman’s path integral formulation of quantum mechanics (Section 4.3), a particle is assumed to take all alternative paths available to it and the wave function is defined as an aggregate of the phase factors associated with an infinity of trajectories. The quantum mechanical amplitude $K (x, x_{0})$ for a particle to go from $x_{0}$ to $x$ is defined to be the equiweighted sum of the phase factors $\exp \{ i S [ρ (t)] \}$ along all paths from $[x_{0}, t (x_{0})]$ to $[x, t (x)]$ , where $S [ρ (t)]$ is the classical action along a trajectory $ρ (t)$ joining these two points, i.e.,

$K [x, t (x); x_{0}, t (x_{0})] = \sum_{\begin{array}{l} \text{all paths} \\ \text{from } x_{0} \text{ to } x \end{array}} w (ρ) \exp \{ i S [ρ (t)] \}$ . (4)

The weight $w (ρ)$ is equal for all trajectories. The amplitude admits the following integral representation:

$K [x, t + ε; x_{0}, t (x_{0})] = \int d y v (y) \exp (i S [x, t + ε; y, t]) K [y, t; x_{0}, t (x_{0})]$ (5)

where $v (y)$ is a suitable measure and $S [x, t + ε; y, t]$ is the action along the classical trajectory from $(y, t)$ to $(x, t + ε)$ . Letting $(x_{0}, t_{0})$ vary over the entire extended manifold yields the following representation of the wave function:

$ψ (x, t + ε) = \int d y v (y) \exp (i S [x, t + ε; y, t]) ψ (y, t)$ . (6)

The differential equation for $ψ$ is obtained by expanding both sides of (6) in powers of $ε$ and by term by term comparison; only a few terms in the expansion are needed.

Feynman had initially obtained the Schrödinger equation. Since then, the Klein-Gordon equation in the electromagnetic field (Section 4.4.2) and generalized to the Riemann spaces has also been obtained (Section 4.4.3). The Dirac equation created some complications. However, spinors have been formulated to be the solutions of the generalized Klein-Gordon equation with some improvements over the Dirac equation.¹³

Feynman’s formulation has a clearer conceptual basis, which is basically a pure particle formulation but not entirely as a wavelike coherence on the trajectories is introduced in the definition of the wave function. As a result, the wave function appears somewhat of a superposition of infinitely many waves. The formulation yields useful methods to deduce various quantum mechanical results and it modifies the dynamical equations in curved spaces by a curvature term, which is known to be physically significant.

Extension to the curved Riemannian spaces involves some mathematical complexities but the resulting calculus is still quite straightforward (Sections 4.3 and 4.4.3). Technically, the path integrals on such spaces are constructed by patching infinitesimal trajectories in the tangent spaces. This construction results in replacing $d y$ with the invariant measure $\sqrt{| \tilde{g} (y) |} d y$ with $| \tilde{g} (y) |$ being the determinant of the metric $\tilde{g} (y)$ .

Wheeler’s Wiener integral in Weyl’s geometry

In the gauge transformations (Section 3.1), Weyl identified ${ϕ^{'}}_{μ}$ with $ϕ_{μ}$ , where $ϕ_{μ}$ are the electromagnetic potentials, and took $a$ to be a real constant. Later, London⁷ showed that with $a = - i e$ and $y$ varying along the classical trajectory of a particle of charge $e$ , $l_{y}$ is directly proportional to an extended de Broglie wave function associated with the particle. The Weyl-London gauge transformation is widely used in quantum mechanics in its original form as well as in its generalization to multidimensional cases. Further significant progress was made by formulating and extending Hamilton’s action principle in the framework of gauge transformations with $a {ϕ^{'}}_{μ} = - i e S_{μ}$ , where $S$ is the classical action, for a charged particle in an electromagnetic field.⁸ This formulation was later extended to include the multi-dimensional gauge fields.⁹ The original formulations have since undergone some further adjustments [Sec. 2.3.3]. In the meantime, Wheeler investigated the correspondence between the Weyl geometry and quantum mechanics in some detail.¹⁰

For lack of a unique trajectory, Wheeler assigned an infinity of trajectories and defined the Wiener integral representations of the amplitude $ψ^{+}$ and its conjugate $ψ^{-}$ , with the associated measurable probability density $ψ^{+} ψ^{-}$ . Specializing the formulation to a charged particle in an electromagnetic field in its nonrelativistic approximation, the probability density was shown to satisfy the Fokker-Planck diffusion equation, which had been used before to interpret the Schrödinger equation as a diffusion phenomenon.¹¹ This associates Wheeler’s formulation with quantum mechanics, albeit in a limited way.

While Feynman’s formulation has been quite successful in deducing the quantum mechanical equations, Wheeler’s has not been. Also, this formulation suffers from the encumbrance of somewhat arbitrary, intricate and not very tenable reasoning underlying its assumptions. Therefore, Wheeler’s formulation will not be discussed further here. It is mentioned above mainly for its historical significance and for its novel approach.

Formulation based on action principle

In this section, we describe a recently developed path integral formulation based on an extension of Hamilton’s action principle in the framework of gauge transformations (Section 1.1), referred to as the “Geometrical Formulation” (Section 4.3.3).

Extended action principle: Hamilton’s action principle assumes the action $S$ to be stationary about the particle path $ρ_{x y}$ , i.e., $ρ_{x y}$ is an extremal defined by this variational characterization. The principle is stated as

$δ S = δ \int_{ρ_{x y}} d S = \int_{ρ_{x y}^{c, \inf}} d S = S (ρ_{x y}^{c, \inf}) ≅ 0$ ,

where $≅$ indicates that the equality holds up to the first order in the area enclosed by each closed curve $ρ_{x y}^{c, \inf}$ obtained as the union of an arbitrary curve $ρ_{x z y}$ in a small neighborhood of $ρ_{x y}$ and another arbitrary curve $ρ_{y z^{'} x}$ in a small neighborhood of $ρ_{y x}$ with $x$ and $y$ fixed. Now, the classical action principle can be expressed as

$1 + a S (ρ_{x y}^{c, \inf}) ≅ \exp [a S (ρ_{x y}^{c, \inf})] ≅ 1$ (7)

for all closed curves $ρ_{x y}^{c, \inf}$ enclosing infinitesimally small areas with an arbitrary nonzero constant $a$ . From (2), the choice ${ϕ^{'}}_{μ} = S_{μ}$ defines the gauge group element associated with $ρ_{x y}^{c, \inf}$ to be $\exp [a S (ρ_{x y}^{c, \inf})]$ .

Thus, (7) formulates the classical action principle in terms of the gauge transformations. Classical characterization of the particle motion stated in (7) admits a natural extension:

$κ (y) \exp [a S (ρ_{x y})] κ^{- 1} (x) = 1$ , (8)

which is a founding assumption of this formulation. The left side of (8) is the gauge group element with ${ϕ^{'}}_{μ} = S_{μ}$ , associated with an arbitrary curve $ρ_{x y}$ , closed or unclosed, and the equality is assumed to hold exactly. For the curves $ρ_{x y}^{c, \inf}$ , (8) reduces to (7) up to the first order as an extension should. Infinitely many solutions of (8) facilitating the passage of a particle will be called the physical paths.

The assigned gauges in the Weyl geometry are arbitrary. We require that $κ (x) = κ (x^{'})$ if and only if the points $x$ and $x^{'}$ in the underlying manifold are physically equivalent; this constitutes another founding assumption. For illustration, consider a free particle collimated by a small aperture and detected at some distance. The particle experiences the same physical conditions at all points in the interior of any path it may take and thus at all points except about the aperture and the detection point. However, it is subjected to different interactions at the aperture and at the detector, which are also geometrically localized and separated from the other points. Thus, the aperture and detection point are both physically different from the other points and from each other. Therefore, the assigned gauges are different at the aperture, at the detector and at the other points of the relevant region where they are equal to a constant. The present assumptions about the assigned gauges thus take the experimental configuration and the involved interactions into account.

The distinction between the assigned and the essential gauges is not so clear in the literature, where they are considered essentially inconsequential by requiring meaningful physical quantities to be gauge invariant. In the present formulation, they assume a significant role as they are defined in terms of the physical conditions relevant to the physical system including the method of observation.

From (8), a constant value of the assigned gauges cancels out for a physical system confined to a region with all of its points being equivalent to each other, e.g., a free particle. In the case of closed curves, as in the action principle stated in (7) and the Bohr orbits, the assigned gauges are ineffective, again due to cancellations. In all such cases, (8) reduces to $\exp [a S (ρ_{x y})] = 1$ . Since nonzero real values of the action for such particle trajectories cannot be excluded, $a$ should be purely imaginary, and it can be set equal to $i$ . Thus, London’s assumption of $a$ being purely imaginary⁷ is deduced here as a result. Since $a$ is the same constant for all cases, (8) reduces to

$κ (y) \exp [i S (ρ_{x y})] κ^{- 1} (x) = 1$ . (9)

Left side of (8) is the length acquired by a unit vector transported along $ρ_{x y}$ . Thus, for a trajectory to be physical, i.e., an allowed particle path, (8) requires a vector attached to the particle to regain its length at some point along the path. Then it follows from (9) that the vector regains its length along such a path periodically.

It is clear from (8) and (9) that the assigned gauges participate in describing the observable effects.

Physical trajectories: Since there are no preferred physical paths, all are equally likely and a particle trajectory would be determined by random selection. Thus, while a particle follows a definite trajectory, it is not determinable due to the randomness.

If $ρ_{x y}$ and $ρ_{y z}$ are two physical paths, then the union $ρ_{x z}$ of $ρ_{x y}$ and $ρ_{y z}$ is also physical, termed a continuing union. The physical trajectories can be classified under two categories: Monotonic and nonmonotonic. Along a monotonic trajectory, the action increases (decreases) monotonically as the path is traversed. If a trajectory is not monotonic, it is nonmonotonic. Nontrivial monotonic physical paths with smallest allowed value of the action in magnitude will be called the elemental physical paths. For a constant $κ$ , the elemental physical paths $ρ_{x y}$ are defined by $S (ρ_{x y}) = \pm 2 π$ . Thus, for a monotonic trajectory, a particle travels along a randomly selected elemental from one point to the other and then along another elemental from its arrival point to the next. It follows that a general monotonic particle trajectory is constituted of a sequence of randomly selected elementals. Nonmonotonic physical trajectories can be considered the continuing unions of their monotonic segments.
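The characterization of the physical paths by (9) for a constant assigned gauge, and the closure of physical paths under continuing unions, can be illustrated with a few lines of code. This is a hedged sketch only: the function names and the tolerance `tol` are hypothetical, the gauge $κ$ is taken to be constant so that it cancels, and the action of a continuing union is assumed to be the sum of the actions of its segments.

```python
import cmath
import math

def gauge_factor(action, kappa_x=1.0, kappa_y=1.0):
    """Left side of (9): kappa(y) exp(i S) kappa(x)^{-1}."""
    return kappa_y * cmath.exp(1j * action) / kappa_x

def is_physical(action, kappa_x=1.0, kappa_y=1.0, tol=1e-9):
    """True if the path satisfies (9) to within the assumed tolerance."""
    return abs(gauge_factor(action, kappa_x, kappa_y) - 1.0) < tol

def elemental_count(action):
    """Number of elemental paths (|S| = 2 pi each) in a monotonic physical path."""
    return abs(round(action / (2.0 * math.pi)))

elemental = 2.0 * math.pi              # action along one elemental path
union = elemental + 2.0 * elemental    # continuing union of two physical paths

print(is_physical(elemental), is_physical(-elemental))  # both satisfy (9)
print(is_physical(math.pi))                             # S = pi does not
print(is_physical(union), elemental_count(union))       # the union is physical
```

With a constant gauge the condition depends only on the action modulo $2 π$ , which is why a monotonic physical trajectory decomposes into a whole number of elementals.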

A particle originating at a source and detected some distance away would travel along a continuing union of randomly selected elementals, which may or may not coincide with or be close to the extremal of the classical action principle. Thus, the particle would be detected as a localized entity, i.e., as a particle, but not necessarily at the location determined by the action principle. If such an experiment is conducted with a large collection of identical particles, or is repeated many times with a single identical particle each time, then each particle would travel along an independent continuing union of the elementals. The collection of the detected particles may or may not cluster about the classically determined location. This agrees with the observations.

For a macroscopic system, that of a classical scale, the trajectories available to a particle have been shown to be concentrated about an extremal. The basic argument is that there are many equiaction trajectories in a second order neighborhood of a classical trajectory because of a second order difference in the action along the trajectories in its neighborhood. Such trajectories in pairs can be shown to constitute nonmonotonic physical trajectories. Thus, each monotonic segment of each such trajectory can facilitate passage of a particle. Consequently, a particle would travel along an almost classical trajectory. This implies also that the major part of an expected observation can be gleaned by studying the classical trajectories.

The argument applies and the conclusion holds equally well for the monotonic segments of a classical magnitude of the nonmonotonic physical trajectories that are the continuing unions of their piecewise monotonic segments. The equiaction trajectories can be shown to exist also in a small neighborhood of a nonclassical trajectory but spread over a first order neighborhood with consequent reduced density.

Role of gauge in observations: Consider the double slit experiment where identically prepared particles, e.g., electrons, encounter two slits, at $x_{i}$ and $x_{f}$ , in a screen, and are observed at an arbitrary location $\hat{x}$ on an observation screen. A pair of trajectories meeting at $\hat{x}$ can be considered a continuing union of two monotonic trajectories $ρ_{x_{i} \hat{x}}$ and $ρ_{\hat{x} x_{f}}$ each of a classical extent constituting a nonmonotonic trajectory $ρ_{x_{i} \hat{x} x_{f}}$ from $x_{i}$ to $\hat{x}$ to $x_{f}$ . In this arrangement, the assigned gauge at $\hat{x}$ cancels out reducing (9) to

$κ (x_{f}) \exp \{ i [S (ρ_{x_{i} \hat{x}}) - S (ρ_{x_{f} \hat{x}})] \} κ^{- 1} (x_{i}) = 1$ . (10)

For identical slits, the points $x_{i}$ and $x_{f}$ are physically equivalent implying that $κ (x_{i}) = κ (x_{f})$ and hence, the solutions of (10) are given by

$S = S (ρ_{x_{i} \hat{x} x_{f}}) = S (ρ_{x_{i} \hat{x}}) - S (ρ_{x_{f} \hat{x}}) = 2 n π$

with $n$ being an arbitrary integer. An extremal joining any two points in this case is a straight line with the momentum $p$ of the particle(s) being almost constant. Thus, if the monotonic segments $ρ_{x_{i} \hat{x}}$ , $ρ_{\hat{x} x_{f}}$ of the corresponding nonmonotonic trajectory $ρ_{x_{i} \hat{x} x_{f}}$ are extremals of the action principle, then $S (ρ_{x_{i} \hat{x} x_{f}}) = p Δ r^{'}$ , where $Δ r^{'}$ is equal to the difference between the path lengths of the straight lines from $x_{i}$ to $\hat{x}$ and $x_{f}$ to $\hat{x}$ . It follows that for a union of two straight lines transmitting a particle of momentum $p$ to be physical, it is required that $Δ r^{'} = 2 n π / p = Δ r_{n}$ . Since the physical trajectories are concentrated about the extremals $ρ_{x_{i} \hat{x}}$ and $ρ_{\hat{x} x_{f}}$ , as explained in Sec. 4.4.2, a high concentration of the physical trajectories and hence of the particles would exist in small neighborhoods of such points on the observation screen, denoted by ${\hat{x}}_{n}$ , decreasing away from them. In between the maxima, particles can only reach along nonextremals with small associated density due to a smaller concentration of the physical trajectories there. The particle density distribution deduced above resembles the intensity distribution resulting from the interference of two identical wavelets emanating from $x_{i}$ and $x_{f}$ , as observed.
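The condition $Δ r^{'} = 2 n π / p$ fixes the locations ${\hat{x}}_{n}$ of the density maxima once a geometry is chosen. The sketch below works this out for an assumed planar arrangement that is not specified in the text: the slits are placed at $- d / 2$ and $+ d / 2$ on a line, the observation screen is parallel to it at a distance `L`, and the numerical values of `p`, `d` and `L` are hypothetical, in units with $\hbar = 1$ . The points of fixed path difference lie on a hyperbola with the slits as foci, which gives a closed form for ${\hat{x}}_{n}$ .

```python
import math

def path_difference(X, d, L):
    """Delta r' at screen position X: distance from the slit at -d/2 minus
    distance from the slit at +d/2 (assumed planar geometry)."""
    return math.hypot(L, X + d / 2.0) - math.hypot(L, X - d / 2.0)

def maximum_position(n, p, d, L):
    """Screen position where p * Delta r' = 2 n pi, from the hyperbola with
    foci at the slits and semi-axis Delta r_n / 2."""
    delta_r_n = 2.0 * math.pi * n / p
    if abs(delta_r_n) >= d:
        raise ValueError("no such maximum: |Delta r_n| must be below d")
    half = delta_r_n / 2.0
    b = math.sqrt((d / 2.0) ** 2 - half ** 2)
    return half * math.sqrt(1.0 + (L / b) ** 2)

# Hypothetical values chosen only for illustration.
p, d, L = 50.0, 1.0, 100.0
for n in range(-2, 3):
    X_n = maximum_position(n, p, d, L)
    print(n, X_n, p * path_difference(X_n, d, L) / (2.0 * math.pi))
```

For small path differences the positions reduce to the familiar estimate ${\hat{x}}_{n} ≈ (2 n π / p) L / d$ , i.e., equally spaced maxima separated by $(2 π / p) L / d$ .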

The above observation underlies the doctrine of the wave-particle duality, which does not make a distinction between the particle density distribution and the intensity pattern produced by the interfering waves, rendering the argument underlying this doctrine rather weak. In the geometrical formulation, the particle density distribution is obtained above from the definition of the physical trajectories.

If an intrusive observation is made behind a slit, e.g., by scattering photons from the particle beam, the path of the beam is altered, which shifts the location of the corresponding beam on the observation screen. This shift can be calculated precisely for the classical trajectories, which, as indicated above, yield a close approximation to the expected observation. Estimates (Section 4.4.1) show that for small values of the shift $s$ , the particle beams can still be considered almost correlated since there would still be many pairs of almost extremals with a common point on the observation screen in the small neighborhoods of ${\hat{x}}_{n}$ . This implies the existence of a shifted and smeared density pattern. As the scattering effects increase, the shift increases, shifting and smearing the density distribution further. When the shift increases still further, the correlation between the beams is almost completely lost. Consequently, $κ (\hat{x})$ cancels out for almost no trajectories, invalidating (10). Instead, the beams travel independently along almost classical particle trajectories, as discussed in Section 4.4.2.

It can be inferred from the above discussion that a set of trajectories transmitting the particles to an observation region would produce a density distribution resembling an interference pattern if and only if the net gauge effect is insufficient to enable an extraction of “which path” information. If “which path” information is available at the terminal point $\hat{x}$ of a trajectory, it cannot be common to the corresponding pairs of the beams.

The particle behavior here is described in terms of the correlation of trajectories, which is a geometrical concept, in contrast with the entangled physical systems, or particles, in standard quantum mechanics. In the above example of the double slit experiment, the assigned gauges make a significant contribution to the observation. Essential gauges also impact upon the observations, as in the Aharonov-Bohm effect, predicted by Aharonov and Bohm,¹² which has been experimentally verified.¹⁴,¹⁵

To describe the Aharonov-Bohm effect, consider an experimental setup in which a controlled magnetic field is generated by a long vertical solenoid by controlling the current through its coil mimicking a magnetic monopole. An electron beam is split in two at a point $x_{i}$ . One of the beams travels along a trajectory $ρ_{1}$ on one side of the cylinder and the other along $ρ_{2}$ on the opposite side, both shielded from the magnetic field. Then the beams meet at another point $\hat{x}$ . Thus, the union $\hat{ρ}$ of $ρ_{1}$ and $ρ_{2}$ encloses the magnetic field but neither of the beams passes through it. Since the potentials were considered only mathematical auxiliaries and the observable effects were still assumed to be produced by the fields, the Aharonov-Bohm effect was considered an anomalous behavior predicted by quantum mechanics.

In the above arrangement, the assigned gauges can be seen to become ineffective. As in the case of the double slit arrangement with no gauge effect, where the physical trajectories are given in (10), the physical trajectories for the present arrangement are given by $S = 2 n π$ , where

$S = S (ρ_{1}) - S (ρ_{2}) = S_{0} + e \int_{\hat{ρ}} ϕ_{μ} d x^{μ} = S_{0} + e f$ , (11)

with $S_{0}$ being the free particle part of the action. Again, as in the case of the double slit arrangement, this produces a particle density distribution resembling the interference pattern. As $f$ is varied, it follows from (11) that the density pattern shifts, repeating itself with period $2 π / e$ with respect to $f$ . This agrees with the observations.
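The periodicity in $f$ follows directly from the condition $S_{0} + e f = 2 n π$ obtained from (11), and can be made explicit with a short sketch. The function names and the numerical values of `e` and `f` below are hypothetical and serve only as an illustration; the free particle action $S_{0}$ at the $n$-th maximum is treated as the quantity that locates the maximum on the screen.

```python
import math

def free_action_at_maximum(n, e, f):
    """S_0 at the n-th density maximum, from S_0 + e f = 2 n pi."""
    return 2.0 * math.pi * n - e * f

def pattern_offset(e, f):
    """Offset e f of the pattern, reduced modulo 2 pi."""
    return (e * f) % (2.0 * math.pi)

# Hypothetical charge and enclosed flux values, for illustration only.
e, f = 2.0, 0.7
period = 2.0 * math.pi / e

# Raising f by one period relabels the maxima (n -> n - 1) without moving them.
print(free_action_at_maximum(3, e, f + period), free_action_at_maximum(2, e, f))
print(pattern_offset(e, f), pattern_offset(e, f + period))
```

Increasing $f$ by $2 π / e$ thus maps the set of maxima onto itself, while any smaller change displaces the whole pattern by the offset $e f$ modulo $2 π$ .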

This effect was predicted¹² based on the solutions of the Schrödinger equation but the argument is valid for the Klein-Gordon equation also. Since the underlying manifold in this case is multiply connected, the wave function on one sheet differs from that on the other by a phase unless $f = 2 π / e$ , resulting in a constructive interference and the periodicity; elsewhere the density pattern resembles the interference pattern of two interfering waves differing by a nonzero phase difference modulo $2 π$ . Thus, in the geometrical formulation, the Aharonov-Bohm effect is described solely by the definition of the physical trajectories, in sharp contrast with the standard argument based on the interfering waves.

The double-slit experiment and the Aharonov-Bohm effect share an essential similarity. In both cases, the observed effect depends on the fact that the two beams are related with each other through a gauge factor resulting from an alteration to the underlying geometry. In the former, this alteration results from the two slits generating an assigned gauge distribution and in the latter, from a hole punctured by the field confined almost to a point, generating an essential gauge factor. In any case, the gauge factors, both assigned and essential, participate in producing the observable effects. Details of the above including the estimates are available elsewhere (Sections 4.4.1 and 4.4.2).

Extended path integral formulation: With $a {ϕ^{'}}_{μ} = i S_{μ}$ and constant gauge, for a unit vector, i.e., $l_{x} = 1$ , ${l^{'}}_{y}$ in (2) is a typical phase factor term $\exp \{ i S [ρ (τ)] \}$ in Feynman’s representation of the amplitude, or wavefunction (Section 4.2). In view of this observation, Feynman’s quantum mechanical amplitude for a particle to go from $[x_{0}, τ (x_{0})]$ to $[x, τ (x)]$ is naturally adjusted to read

$K \{ κ; [x, τ (x)], [x_{0}, τ (x_{0})] \} = \sum_{\begin{array}{l} \text{all paths} \\ \text{from } x_{0} \text{ to } x \end{array}} w (ρ) κ (x) \exp \{ i S [ρ (τ)] \} κ^{- 1} (x_{0})$ , (12)

which is just a slightly adjusted form of Feynman’s formulation but with a significant difference in that it includes the assigned gauges, which incorporate the impact of an observation on the system, as long as an intrusion altering the assigned gauge distribution is not strong enough to decohere the system of trajectories. Otherwise, the representation of (12) remains valid in each region where the stated condition is satisfied and (12) would yield a different amplitude in each such region. Thus, the present formulation improves significantly over the others.

Since the wavefunction is defined in (12) with $x_{0}$ being arbitrary, it defines the amplitude or wavefunction as an aggregate of the Weyl lengths at $[x, τ (x)]$ of a unit vector transported to this point along all trajectories from everywhere, defined in (2) with $a {ϕ^{'}}_{μ} = i S_{μ}$ , providing a geometrical interpretation of the wavefunction. Following the standard procedure, (12) is expressed as

$κ^{- 1} (x) K^{'} (κ; x, τ + ε) = \int d \hat{m} (y) \exp [i \hat{S} (x, τ + ε; y, τ)] κ^{- 1} (y) K^{'} (κ; y, τ);$ (13)

where $K^{'} (κ; x^{'}, τ^{'}) = K \{ κ; [x^{'}, τ^{'}], [x_{0}, τ (x_{0})] \}$ and $\hat{m} (y)$ is a suitable measure. This yields the following representation of the wavefunction:
$κ^{- 1} (x) ψ (κ; x, τ + ε) = \int d \hat{m} (y) \exp [i \hat{S} (x, τ + ε; y, τ)] κ^{- 1} (y) ψ (κ; y, τ)$ . (14)

For a constant gauge, (14) reduces to Feynman’s representation of the quantum mechanical wavefunction $ψ$ given by (6), which is the same as (14) with $ψ$ replacing $κ^{- 1} ψ (κ)$ , implying that $κ^{- 1} ψ (κ) = ψ$ . The wavefunction $ψ (κ)$ representing the basic physical system together with the impact of the observing system can thus be constructed by multiplying $ψ$ by the applicable assigned gauge. As discussed in Section 4.4.3, close approximations to the assigned gauges can be obtained for a given physical system, and these can be improved further. It is clear that the wavefunction and the characterization of the physical trajectories given in (9) together describe the motion more completely than either one alone. This adds a significant element to the prevailing formulations of quantum mechanics.
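
The substitution behind this statement can be written out explicitly. The following is only a sketch in the notation of (14), assuming the gauge enters as $ψ (κ; x, τ) = κ (x) ψ (x, τ)$ ; equation (6) itself is not reproduced here. Inserting this form into (14) gives

$κ^{- 1} (x) κ (x) ψ (x, τ + ε) = \int d \hat{m} (y) \exp [i \hat{S} (x, τ + ε; y, τ)] κ^{- 1} (y) κ (y) ψ (y, τ)$ ,

and since $κ^{- 1} κ = 1$ at each point, the gauge factors cancel on both sides, leaving

$ψ (x, τ + ε) = \int d \hat{m} (y) \exp [i \hat{S} (x, τ + ε; y, τ)] ψ (y, τ)$ ,

which is (14) with $ψ$ in place of $κ^{- 1} ψ (κ)$ .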

In the geometrical formulation, an act of observation assigns certain gauges to points in the underlying manifold. Thus, making a measurement on $ψ$ alters it to $ψ (κ)$ , which depends on the observing system and the interactions involved. Clearly, the wavefunction can be altered by altering the details of an observation. Thus, the concept of an unambiguous measurement cannot be assigned to a wavefunction. To eliminate the ambiguity, an associated gauge invariant quantity should be constructed and measured for a physical measurement to be objective and meaningful. Wheeler constructed the conjugates of tensors in Weyl’s geometry with the product of a tensor and its conjugate being a quantity of vanishing Weyl weight, which was arbitrarily assumed to be physically measurable.

In the present formulation, it can be concluded that if a physical trajectory exists, then $| κ (x) | = | κ (y) |$ and, since a constant factor in the assigned gauge cancels out, $| κ (x) | = 1$ , implying that $κ (x) = \exp [- i σ (x)]$ with some $σ (x)$ for all $x$ . This shows that the assigned gauges are reducible to phase factors. It follows that $κ^{- 1} (x) = κ^{*} (x)$ , the complex conjugate of $κ (x)$ . Thus, $ψ^{*} ψ = {(κ ψ)}^{*} (κ ψ)$ is a real gauge invariant quantity, rendering $ψ^{*}$ a natural conjugate of $ψ$ . Born’s probability density $ψ^{*} ψ = | ψ |^{2}$ is clearly a uniquely defined gauge invariant quantity with a clear outcome of any measurement regardless of the gauge, but at the expense of the observable information contained in the gauge factors, both essential and assigned.
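
The gauge invariance of Born’s density under phase-factor gauges can be checked numerically. The sketch below is illustrative only: the sample values of $ψ$ and $σ$ are arbitrary made-up numbers, and the helper names `assigned_gauge` and `born_density` are hypothetical.

```python
import cmath


def assigned_gauge(sigma):
    """Phase-factor gauge kappa = exp(-i sigma), as deduced in the text."""
    return cmath.exp(-1j * sigma)


def born_density(psi):
    """Born's probability density psi* psi, returned as a real number."""
    return (psi.conjugate() * psi).real


# Arbitrary sample values of psi at three points, with arbitrary gauge phases.
psi_values = [0.6 + 0.2j, -0.1 + 0.9j, 0.0 - 0.4j]
sigmas = [0.0, 1.3, -2.5]

# Gauged wavefunction psi(kappa) = kappa * psi at each point.
gauged_values = [assigned_gauge(s) * p for s, p in zip(sigmas, psi_values)]

densities = [born_density(p) for p in psi_values]
gauged_densities = [born_density(p) for p in gauged_values]

# The density is unchanged by the gauge, although the wavefunction itself is not.
for d, gd in zip(densities, gauged_densities):
    assert abs(d - gd) < 1e-12
```

The wavefunction values change under the gauge while the densities do not, which is the sense in which the density discards the information carried by the gauge factors.
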

Quantum measurement

There are several interpretations of quantum mechanics, each with its own explanation of the impact of the method of observation, or the observer, on what is observed. None of them has gained universal acceptance; each one has its supporters and critics with varied views, which have changed with time. The interpretations together with their critiques are described in detail in the literature, which has grown into a massive body. Here we briefly discuss a few major ones that have gained prominence from time to time, and compare the understanding of the indicated phenomenon according to these interpretations with that in the geometrical formulation. For now, the considerations are restricted to the double slit experiment discussed in Section 4.4.3.

The Copenhagen interpretation holds that the system is represented completely by a wavefunction, which, consequent to a measurement, collapses into one particular state, the state that is observed. This view assigns a dual nature, both wave and particle, to each of the entities, which have no definite form until observed. Thus, the method of observation in a sense “bestows” the form upon the entity observed. In comparison, the geometrical formulation assumes a definite form, particle, for each of these entities and each one is expected to be observed as a particle individually. In bulk, the entities’ behavior in the gauge neutral case resembles that of a wave in the sense that the particle density distribution resembles the intensity distribution of two interfering wavelets. If the intrusion by the observing system is sufficiently strong to yield a definite outcome, it destroys the correlation between the trajectories from the two slits, breaking them up into more than one set, one corresponding to the observation region and the others covering its complementary region. The deduced distribution then compares with that of a collection of classical particles, in agreement with the observation. Separate wavefunctions can be constructed for each of the regions. Thus, the wavefunction $ψ$ representing the particles in the gauge neutral case “breaks up” into more than one fragment. The fragment corresponding to the observation region can be considered the “collapsed” form of $ψ$ , providing an interpretation of the “wavefunction collapse.”

The Bohm formulation¹⁶ or the pilot wave interpretation, initially proposed by de Broglie, assumes that these entities are particles and that each particle follows a definite trajectory but is guided by a wave determined by the state of the system. Thus, a particle passes through one of the slits but the wave passes through both, and the probability of a particle being at a location on the observation screen is proportional to the resultant intensity of the interfering pilot wavelets at that location. Thus, the particle density pattern resembles the intensity distribution of two interfering wavelets without the particles assuming a wave form. The particle trajectories, which remain unknown, are considered the hidden variables. Concerning the collapse of a wavefunction, the Bohm theory considers a universal wavefunction incorporating the observed and the observing systems together as a composite system, which never collapses; collapse occurs only in the phenomenological sense applicable to a subsystem, which evolves separately because of the decoherence resulting from an intrusion by a measurement.

According to the geometrical formulation, each one of the entities, assumed to be a particle, passes through one of the slits; which one remains unknown, but only because of randomness. Also, while a definite trajectory cannot be assigned to a particle, sets of the physical trajectories can be computed from the information about the observed and the observing systems and thus, they are not completely hidden. The probability of finding a particle at a location is proportional to the density of the paths available for its passage, which for identical slits and weak intrusions is shown to resemble the intensity distribution of two interfering wavelets originating at the slits. The two systems, the observed and the observing one, are considered together to determine the assigned gauges, which for strong intrusions “decohere” the sets of trajectories. This decoherence causes the wavefunction to decompose into fragments, the observed one of them being the “collapsed” wavefunction, as discussed above. The wavefunction of the composite system, the observed and the observing one together, does not enter the considerations. The impact of the observing system is incorporated in the wavefunction through the assigned gauges.
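
The contrast between the interference-like and the decohered distributions can be illustrated with the standard two-wavelet formula. The sketch below is not the trajectory-density computation of the geometrical formulation; it only evaluates the reference pattern that the density is said to resemble. The wavelength, slit separation, screen distance and function names are all assumed, illustrative choices, and the wavelets are idealized as having equal, constant magnitude.

```python
import cmath
import math

# Illustrative geometry in arbitrary units (assumed values).
WAVELENGTH = 1.0
SLIT_SEPARATION = 10.0
SCREEN_DISTANCE = 1000.0
WAVENUMBER = 2.0 * math.pi / WAVELENGTH


def wavelet(x, slit_y):
    """Unit-magnitude wavelet from a slit at height slit_y, normalized by 1/sqrt(2)."""
    r = math.hypot(SCREEN_DISTANCE, x - slit_y)
    return cmath.exp(1j * WAVENUMBER * r) / math.sqrt(2.0)


def coherent_density(x):
    """Weak intrusion: amplitudes from both slits add before squaring."""
    a1 = wavelet(x, +SLIT_SEPARATION / 2.0)
    a2 = wavelet(x, -SLIT_SEPARATION / 2.0)
    return abs(a1 + a2) ** 2


def decohered_density(x):
    """Strong intrusion: the two sets decohere and their densities add."""
    a1 = wavelet(x, +SLIT_SEPARATION / 2.0)
    a2 = wavelet(x, -SLIT_SEPARATION / 2.0)
    return abs(a1) ** 2 + abs(a2) ** 2


screen = [float(x) for x in range(-200, 201)]
coherent = [coherent_density(x) for x in screen]
decohered = [decohered_density(x) for x in screen]

print("coherent density range: ", min(coherent), max(coherent))
print("decohered density range:", min(decohered), max(decohered))
```

The coherent density oscillates between bright and dark fringes across the screen, while the decohered density is flat in this idealization, mirroring the classical-particle distribution described above.
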

The many worlds view or the relative state formulation¹⁷ assigns a purely wave nature to these entities. It was developed by coupling the observing system with the observed one through the wavefunction of the composite system. Each measurement leaves each subsystem in a relative state with respect to the other. There is no quantification of such alterations. With each observation, the system’s state branches off to different non-communicating worlds, one for each of the possible outcomes of an observation. What is observed depends on the world the observer enters. Such branching off occurs at each subsequent observation. Thus, the outcome of a measurement depends on all previous measurements.

In the extended formulation, the method of observation couples the two systems through the assigned gauges. The basic physical system and the assigned gauges determine the observation. Each act of observation assigns a computable multiplicative gauge factor altering the “state” of the observed system, which is altered further by each subsequent measurement. If an observation causes a strong intrusion, the system of trajectories changes in a drastic way. As discussed above, one “branch” of the trajectories corresponds to the observation in the lab and the other to its complement, but in the same “world,” not in a different one, and what is observed depends on which branch one observes. Each one of these branches has an associated wavefunction. Such “branching” off occurs with each measurement yielding a definite outcome. This provides an interpretation of the “many worlds” view.

The concept of a wave, which is basic to the interpretations of quantum mechanics discussed here, is irrelevant to the present formulation. Conversely, the concept of gauge transformations, which is fundamental to the geometrical formulation, does not enter the standard formulations and interpretations of quantum mechanics. Thus, the geometrical formulation differs radically from standard quantum mechanics and its interpretations in its underpinnings as well as its implications, although it has parallels.

In the framework of the geometrical formulation, the expected observation in experiments involving a large number of particles can be determined to a large extent from the usual information about the observed and the observing systems, prior to making a measurement. This distinguishes it fundamentally from the interpretations, which do not have such predictive ability: in them, the outcome of an observation is known only after it has occurred, and they only attempt to describe how the observation occurred.

Concluding remarks

The present paper gives a brief review of the earlier formulations of quantum mechanics and a more recent, geometrical formulation, together with a critical comparative study of them. An extension of Hamilton’s action principle in the framework of Weyl’s gauge transformations constitutes the basic founding element of the geometrical formulation. The consequent clear characterization of the particle trajectories augments the standard formulations and interpretations of quantum mechanics. The fundamental underlying principles of the geometrical formulation, stated in (7) and (8) for classical and quantum mechanics, respectively, show that the two are more closely related to each other than known previously and that both are intertwined with the gauge transformations. Further, the observed impact of the method of a physical measurement on its outcome is formulated in terms of the assigned gauges determined by the configuration of the observing system and the interactions between the observed and observing systems. Thus, the assigned gauges that were largely ignored, and thus wasted, in earlier related studies are used effectively in the geometrical formulation to develop a view of quantum measurement that differs radically from the views in the prevailing interpretations of quantum mechanics. This provides a novel and clearer understanding of quantum measurement, which improves upon the earlier explanations. The geometrically based theory remains indeterministic and nonlocal, which appear to be facts of nature as supported by the observations, but with a greatly improved understanding of the indeterministic and nonlocal attributes of physical phenomena.

In the geometrical formulation, the extended wavefunction admits an interpretation as an aggregate of the lengths acquired by a unit vector transported from everywhere along a collection of trajectories, providing a clearer geometrical view of the wavefunction. The impact of a measurement on the wavefunction is shown to be directly related to its impact on the physical trajectories, adding to the clarity of the process of physical measurements. The geometrical formulation also renders some justification to Born’s probability postulate, which was a founding postulate in the other formulations. The wavefunction and the probability density supplemented with the physical trajectories (Sections 4.4.1 and 4.4.2) describe the behavior of a system more completely than otherwise. This constitutes a significant addition to the prevailing formulations of quantum mechanics. This formulation also enables one to determine the expected outcome of an observation from the information about the systems, the observing and the observed, albeit to a limited extent. Other formulations and interpretations of quantum mechanics do not have such predictive ability; they only “explain” the observation if it is known.

The comparative study of the formulations and interpretations of quantum mechanics in this paper indicates that the geometrical formulation improves significantly upon the earlier ones and that its further study is likely to provide a clearer understanding of the “mysteries” of quantum mechanics.

There are issues with the prevailing formulation of quantum fields also, which have been omitted in the present paper. Recent developments addressing such matters are available elsewhere.³ In this recent formulation, which is more streamlined, quantization is deduced from the periodicity of the phase factors appearing in (4) and (12), providing a unified framework for somewhat disjointed theories. A consequent result of major significance is the elimination of the unphysical infinite vacuum energy, in contrast with the prevailing second quantization. A novel interpretation of time results in the process.