Standard formulation
Standard formulation of quantum mechanics is available in the text books in detail. Here we discuss a few of the postulates for their relevance to the other formulations discussed here.
The state of a physical system is described completely by its wave function, or the amplitude
. If there are two possible states
and
for a system, then their linear combinations
are also possible states; this is known as the Superposition Principle. Probability of a particle being in the space region between
and
is proportional to
, which is known as the Born Rule. Physical observables are represented by mathematical operators. If a state
of a system is a linear combination
of two eigenstates
and
of an observable
, then the outcome of a single measurement is one of the two corresponding eigenvalues of
, and the probability of it being the eigenvalue corresponding to
is equal to
. This constitutes the basis of the Copenhagen Interpretation, i.e., upon measurement the wave function collapses in a single state. Representation of most of the observables can be determined from that of the momentum
, which has a concrete representation by a differential operator, which is,
, where
is the position. Two canonically conjugate observables, e.g., the position and momentum, are assumed to satisfy a commutation relation. These yields the Uncertainty Principle: There is a limit to the accuracy of two conjugate observables in their simultaneous measurement. These postulates differ fundamentally from the classical formulation where the physical observables completely describe the state of a system that is precisely determined. Evolution of a system is described by the Schrödinger equation.
, (3)
Where
is the Hamiltonian operator obtained from its classical form by replacing its argument observables by their operator representatives. Here
is the time treated as a parameter and (3) refers to the one-dimensional non-relativistic case. Generalization to the Riemannian case, which includes the relativistic case, entails in letting
, where
and
is the contra variant components of the momentum and the covariant derivative. In the relativistic formulation, time loses its independent parameter character and in general Riemannian spaces; it is replaced by arc length as will be discussed.
Feynman’s path integral formulation
In Feynman’s path integral formulation of quantum mechanics (Section 4.3), a particle is assumed to take all alternative paths available to it and the wave function is defined as an aggregate of the phase factors associated with infinity of trajectories. Quantum mechanical amplitude
for a particle to go from
to
is defined to be the equiweighted sum of the phase factors
along all paths from
to
, where
is the classical action along a trajectory
joining these two points, i.e.,
. (4)
The weight
is equal for all trajectories. The amplitude admits the following integral representation:
(5)
Where
is a suitable measure and
is the action along the classical trajectory from
to
. Letting
vary over the entire extended manifold yields the following representation of the wave function:
. (6)
The differential equation for
is obtained by expanding both sides of (6) in powers of
and by term by term comparison; only a few terms in the expansion are needed.
Feynman had initially obtained the Schrödinger equation. Since then, the Klein-Gordon equation in the electromagnetic field (Section 4.4.2) and generalized to the Riemann spaces has also been obtained (Section 4.4.3). The Dirac equation created some complications. However, spinors have been formulated to be the solutions of the generalized Klein-Gordon equation with some improvements over the Dirac equation.13
Feynman’s formulation has a clearer conceptual basis, which is basically a pure particle formulation but not entirely as a wavelike coherence on the trajectories is introduced in the definition of the wave function. As a result, the wave function appears somewhat of a superposition of infinitely many waves. The formulation yields useful methods to deduce various quantum mechanical results and it modifies the dynamical equations in curved spaces by a curvature term, which is known to be physically significant.
Extension to the curved Riemannian spaces involves some mathematical complexities but the resulting calculus is still quite straightforward (Sections 4.3 and 4.4.3). Technically, the path integrals on such spaces are constructed by patching infinitesimal trajectories in the tangent spaces. This construction results in replacing
with the invariant measure
with
being the determinant of the metric
.
Wheeler’s wiener integral in weyl’s geometry
In the gauge transformations (Section 3.1), Weyl identified
with
, where
are the electromagnetic potentials, and took
to be a real constant. Later London7 showed that with
and
varying along the classical trajectory of a particle of charge
is directly proportional to an extended de Broglie wave function associated with the particle. The Weyl-London gauge transformation is widely used in quantum mechanics in its original as well as generalized to multidimensional cases. Further significant progress was made by formulating and extending Hamilton’s action principle in the framework of gauge transformations with
, where
is the classical action, for a charged particle in an electromagnetic field.8 This formulation was later extended to include the multi-dimensional gauge fields.9 Original formulations have undergone through some further adjustments [Sec. 2.3.3]. In the meantime, Wheeler investigated the correspondence between the Weyl geometry and quantum mechanics in some detail.10
For lack of a unique trajectory, Wheeler assigned infinity of trajectories and defined the Wiener integral representations of the amplitude,
and its conjugate
with the associated measurable probability density
. Specializing the formulation for a charged particle in an electromagnetic field in its non relativistic approximation, the probability density was shown to satisfy the Fokker-Planck, diffusion equation, which had been used before to interpret the Schrödinger equation as a diffusion phenomenon.11 This associates Wheeler’s formulation with quantum mechanics albeit in a limited way.
While Feynman’s formulation has been quite successful in deducing the quantum mechanical equations, Wheeler’s, has not been. Also, this formulation suffers from an encumbrance of somewhat arbitrary, intricate and not all that tenable reasoning underlying its assumptions. Therefore, Wheeler’s formulation will not be discussed futher here. It is mentioned above mainly for its historical significance and for its novel approach.
Formulation based on action principle
In this section, we describe a recently developed path integral formulation based on an extention of Hamilton’s action principle in the framework of gauge transformations (Section 1.1), referred to as the “Geometrical Formulation” (Section 4.3.3) .
Extended action principle: Hamilton’s action principle assumes the action
to be stationary about the particle path
, i.e.,
is an extremal defined by this variational characterization. The principle is stated as
,
where
indicates that the equality holds up to the first order in the area enclosed by each closed curve
obtained as the union of an arbitrary curve
in a small neighborhood of
and another arbitrary curve
in a small neighborhood of
with
and
fixed. Now, the classical action principle can be expressed as
(7)
for all closed curves
enclosing infinitesimally small areas with an arbitrary nonzero constant
. From (2), the choice
defines the gauge group element associated with
to be
.
Thus, (7) formulates the classical action principle in terms of the gauge transformations. Classical characterization of the particle motion stated in (7) admits a natural extension:
, (8)
which is a founding assumption of this formulation. The left side of (8) is the gauge group element with
, associated with an arbitrary curve
, closed or unclosed, and the equality is assumed to hold exactly. For the curves
, (8) reduces to (7) up to the first order as an extension should. Infinitely many solutions of (8) facilitating the passage of a particle will be called the physical paths.
The assigned gauges in the Weyl geometry are arbitrary. We require that
if and only if the points
and
in the underlying manifold are physically equivalent, constituting another founding assumption. For illustration, consider a free particle collimated by a small aperture and detected at some distance. The particle experiences the same physical conditions at all points in the interior of any path it may take and thus at all points except about the aperture and the detection point. However, it is subjected to different interactions at the aperture and at the detector, which are also geometrically localized and separated from the other points. Thus, the aperture and detection point are both physically different from the other points and from each other. Therefore, the assigned gauges are different at the aperture, at the detector and at the other points of the relevant region where they are equal to a constant. Present assumptions about the assigned gauges thus, take the experimental configuration and the involved interactions into account.
Disctinction between the assigned and the essential gauges is not so clear in literature, where they are considered essentially inconsequential by requiring meaningful physical quantities to be gauge invariant. In the present formulation, they assume a significant role as they are defined in terms of the physical conditions relevant to the physical system including the method of observation.
From (8), a constant value of the assigned gauges cancels out for a physical system confined to a region with all of its points being equivalent to each other, e.g., a free particle. In case of the closed curves as the action principle stated in (7) and the Bohr orbits,, the assigned gauges are ineffective, again due to cancelations. In all such cases, (8) reduces to
. Since nonzero real values of the action for such particle trajectories cannot be excluded,
should be purely imaginary, which can be set equal to
. Thus, London’s assumption of
being purely imaginary [7], is deduced here as a result. Since
is the same constant for all cases, (8) reduces to
. (9)
Left side of (8) is the length acquired by a unit vector transported along
. Thus, for a trajectory to be physical, i.e., an allowed particle path, (8) requires a vector attached to the particle to regain its length at some point along the path. Then it follows from (9) that the vector regains its length along such a path periodically.
It is clear from (8) and (9) that the assigned gauges participate in describing the observable effects.
Physical trajectories: Since there are no preferred physical paths, all are equally likely and a particle trajectory would be determined by random selection. Thus, while a particle follows a definite trajectory, it is not determinable due to the randomness.
If
and
are two physical paths, then the union
of
and
is also physical, termed a continuing union. The physical trajectories can be classified under two categories: Monotonic and nonmonotonic. Along a monotonic trajectory, the action increases (decreases) monotonically as the path is traversed. If a trajectory is not monotonic, it is nonmonotonic. Nontrivial monotonic physical paths with smallest allowed value of the action in magnitude will be called the elemental physical paths. For a constant
, the elemental physical paths
are defined by
. Thus, for a monotonic trajectory, a particle travels along a randomly selected elemental from one point to the other and then along another elemental from its arrival point to the next. It follows that a general monotonic particle trajectory is constituted of a sequence of randomly selected elementals. Nonmonotonic physical trajectories can be considered the continuing unions of their monotonic segments.
A particle originating at a source and detected some distance away, would travel along a continuing union of randomly selected elementals, which may or may not coincide with or be close to the extremal of the classical action principle. Thus, the particle would be detected as a localized entity, i.e., as a particle, but not necessarily at the location determined by the action principle. If such an experiment is conducted with a large collection of identical particles or many times, each with a single particle with particles in all repeats being identical, then each particle would travel along an independent continuing union of the elementals. The collection of the detected particles may or may not cluster about the classically determined location. This agrees with the observations.
For a macroscopic system, that of a classical scale, the trajectories available to a particle have been shown to be concentrated about an extremal. The basic argument is that there are many equiaction trajectories in a second order neighborhood of a classical trajectory because of a second order difference in the action along the trajectories in its neighborhood. Such trajecories in pairs can be shown to constitute non-monotonic physical trajectories. Thus, each monotonic segment of each such trajectory can facilitate passage of a particle. Consequently, a particle would travel along an almost classical trajectory. This implies also that the major part of an expected observation can be gleaned by studying the classical trajectories.
The argument applies and the conclusion holds equally well for the monotonic segments of a classical magnitude of the nonmonotonic physical trajectories that are the continuing unions of their piecewise monotonic segments. The equiaction trajectories can be shown to exist also in a small neghborhood of a nonclassical trajectory but spread over a first order neighborhood with consequent reduced density.
Role of gauge in observations: Consider the double slit experiment where identically prepared particles, e.g., the electrons, encounter two slits, at
and
, in a screen, and observed at an arbitrary location
on an observation screen. A pair of trajectories meeting at
can be considered a continuing union of two monotonic trajectories
and
each of a classical extent constituting a nonmonotonic trajectory
from
to
to
. In this arrangement, the assigned gauge at
cancels out reducing (9) to
. (10)
For identiacal slits, the points
and
are physically equivalent implying that
and hence, the solutions of (10) are given by
with
being an arbitrary integer. An extremal joining any two points in this case is a straight line with the momentum
of the particle(s) being almost constant. Thus, if the monotonic segments
,
of the corresponding nonmonotonic trajectory
are extremals of the action principle, then
, where
is equal to the difference between the path lengths of the straight lines from
to
and
to
. It follows that for a union of two straight lines transmitting a particle of momentum
to be physical, it is required that
. Since the physical trajectories are concentrated about the extremals
and
, as explained in Sec. 4.4.2, a high concentration of the physical trajectories and hence of the particles would exist in small neighborhoods of such points on the observation screen, denoted by
, decreasing away from them. In between the maxima, particles can only reach along nonextremals with small associated density due to a smaller concentration of the physical trajectories there. The particle density distribution deduced above resembles the intensity distribution resulting from the interference of two identical wavelets emanating from
and
as observed.
The above observation underlies the doctrene of the wave-particle duality, which does not make a distinction between the particle density distribution and the intensity pattern produced by the interfering waves, rendering the argument underlying this doctrene rather weak. In the geometrical formulation, the particle density distrbution is obtained above from the defintion of the physical trajectories.
If an intrusive observation is made behind a slit, e.g., by scattering the photons from the particle beam, path of the beam is altered which would shift the location of the corresponding beam on the observation screen. This shift can be calculated precisely for the classical trajectories, which is indicated above to yield a close approximation to the expected observation. Estimates (Section 4.4.1) show that for small values of the shift
, the particle beams can still be considered almost correlated since there would still be many pairs of almost extremals with a common point on the observation screen in the small neighborhoods of
. This implies the existence of a shifted and smeared density pattern. As the scattering effects increase, the shift increases shifting and smearing the density distribution further. When the shift increases still further, the correlation between the beams is almost completely lost. Consequently,
cancels out for almost no trajectories invalidating (10). Instead, the beams travel independently, which are almost classical particle trajectories as discussed in Section 4.4.2.
It can be inferred from the above discussion that a set of trajectories transmitting the particles to an observation region would produce a density distribution resembling an interference pattern if and only if the net gauge effect is insufficient to enable an extraction of “which path” information. If “which path” information is available at the terminal point
of a trajectory, it cannot be common to the corresponding pairs of the beams.
The particle behavior here is described in terms of the correlation of trajectories, which is a geometrical concept, in contrast with the entangled physical systems, or particles, in standard quantum mechanics. In the above example of the double slit experiment, assigned gauges make significant contribution to the observation. Essentail gauges also impact upon the observations as in the Aharonov-Bohm effect, predicted by Bohm effect, predicted by Bohm12, which has been experimentally verified.14,15
To describe the Aharonov-Bohm effect, consider an experimental set up in which a controlled magnetic field is generated by a long vertical solenoid by controlling the current through its coil mimicking a magnetic monopole. An electron beam is split in two at a point
. One of the beams travels along a trajectory
on one side of the cylinder and the other along
on the opposite side, both shielded from the magnetic field. Then the beams meet at another point
. Thus, the union
of
and
encloses the magnetic field but neither of the beams passes through it. Since the potentials were considered only the mathematical auxiliaries and the observable effects were still assumed to be produced by the fields, the Aharonov-Bohm effect was considered an anomalous behavior predicted by quantum mechanics.
In the above arrangement, the assigned gauges can be seen to become ineffective. As in case of the double slit arrangement with no gauge effect, where the physical trajectories are given in (10), the physical trajectories for the present arrangement are given by
, where
(11)
with
being the free particle part of the action. Again as in case of the double slit arrangement, this produces a particle density distribution resembling the interference pattern. As
is varied, it follows from (11) that the density pattern shifts repeating itself with period
with respect to
. This agrees with the observations.
This effect was predicted12 based on the soluions of the Schrödinger equation but the argument is valid for the Klein-Gordon equation also. Since the underlying manifold in this case is multiply connected, the wave functions on one sheet differs from that on the other by a phase unless
resulting in a constructive interference and the periodicity; elsewhere the density pattern resembles the interference pattern of the two interfering waves differing by a nonzero phase fifference modulo
. Thus, in the geometrical formulation, the Aharonov-Bohm effect is described solely by the definition of the physical trajectories in a sharp contrast with the standard argument based on the interfering waves.
The double-slit experiment and the Aharonov-Bohm effect share an essential similarity. In both of the cases, the observed effect depends on the fact that the two beams are related with each other through a gauge factor resulting from an alteration to the underlying geometry. In the former, this alteration results from the two slits generating an assigned gauge distribution and in the later, from a hole punctured by the field confined almost to a point, generating an essential gauge factor. In any case, the gauge factors, both assigned and essential, participate in producing the observable effects. Details of the above including the estimates are available elsewhere (Sections 4.4.1 and 4.4.2).
Extended path integral formulation: With
and constant gauge, for a unit vector, i.e.,
,
in (2) is a typical phase factor term
in Feynman’s representation of the amplitude, or wavefunction (Section 4.2). In view of this observation, Feynman’s quantum mechanical amplitude for a particle to go from
to
is naturally adjusted to read
, (12)
which is just a slightly adjusted form of Feynman’s formulation but with a significant difference in that it includes the assigned gauges that incorporate the impact of an observation on the system as long as an intrusion altering the assigned gauge distribution is not strong enough to decohere the system of trajectories. If not, the representation of (12) remains valid in each region where the stated condition is satisfied and (12) would yield a different amplitude in each such region. Thus, the present formulation improves significantly over the others.
Since the wavefunction is defined in (12) with
being arbitrary, it defines the amplitude or wavefunction as an aggregate of the Weyl lengths at
of a unit vector transported to this point along all trajectories from everywhere, defined in (2) with
, providing a geomerical interpretation of the wavefunction. Following the standard procedure, (12) is expressed as
(13)
where
and
is a suitable measure. This yields the following representation of the wavefunction:
. (14)
For a constant gauge (14) reduces to Feynman’s representation of the quantum mechanical wavefunction
given by (6), which is the same as (14) with
replacing
implying that
. The wavefunction
representing the basic physical system together with the impact of the observing system can be constructed by multiplying
by the applicable assigned gauge. As discussed in Section 4.4.3., close approximations to the assigned gauges can be obtained for a given physical sytem, which can be improved further. It is clear that the wavefunction and the characterization of the physical trajectories given in (9) together describe the motion more completely than either one alone. This adds a significant bit to the prevailing formulations of quantum mechanics.
In the geometrical formulation, an act of observation assigns certain gauges to points in the underlying manifold. Thus, making a measurement on
alters it to
, which depends on the observing system and the interactions involved. Clearly, the wavefunction can be altered by altering the details of an observation. Thus, the concept of an unambiguous measurement cannot be assigned to a wavefunction. To eliminate the ambiguity, an associated gauge invariant quantity should be constructed and measured for a physical measurement to be objective and meaningful. Wheeler constructed the conjugates of tensors in Weyl’s geometry with the product of a tensor and its conjugate being a quantity of vanishing Weyl weight, which was arbitrarily assumed to be physically measurable.
In the present formulation, it can be concluded that if a physical trajectory exists, then
and since a constant factor in the assigned gauge cancels out
, implying that
with some
for all
. This shows that the assigned gauges are reducible to phase factors. It follows that
, the complex conjugate of
. Thus,
is a real gauge invariant quantity, rendering
a natural conjugate of
. Born’s probability density
is clearly a uniquely defined gauge invariant quantity with a clear outcome of any measurement regardless of the gauge but at the expense of the observable information contained in the gauge factors, both essential and assigned.
Quantum measurement
There are several interpretations of quantum mechanics, each with its own explanation of the impact of method of observation, or the observer, on what is observed. Neither one of them has gained universal acceptance; each one has its supporters and critics with varied views, which have changed with time. The interpretations together with their critiques are described in detail in literature, which has grown into a massive body. Here we discuss a few major ones briefly, which have gained prominence from time to time and compare the understanding of the indicated phenomenon according to these interpretations with that in the geometrical formulation. For now, the considerations are restricted to the double slit experiment discussed in Section 4.4.3.
The Copenhagen interpretation holds that the system is represented completely by a wavefunction, which collapses in one particular state consequent to a measurement, in the state that is observed. This view assigns dual nature, both wave and particle, to each of the entities, having no definite form until observed. Thus, the method of observation in a sense “bestows” the form upon the entity observed. In comparison, the geometrical formulation assumes a definite form, particle, for each of these entities and each one is expected to be observed as a particle individually. In a bulk, the entities’ behavior in the gauge neutral case resembles that of a wave in the sense that the particle density distribution resembles the intensity distribution of two interfering wavelets. If the intrusion by the observing system is sufficiently strong to yield a definite outcome, it destroys the correlation between the trajectories from two slits breaking them up into more than one sets, one corresponding to the observation region and the others, covering its complementary region. The deduced observation then compares with a collection of classical particles in agreement with the observation. Separate wavefunctions can be constructed for each of the regions. Thus, the wavefunction
representing the particles in the gauge neutral case “breaks up” into more than one fragments. The fragment corresponding to the observation region can be considered the “collapsed” form of
providing an interpretation of the “wavefunction collapse.”
The Bohm formulation16 or the pilot wave interpretation initially proposed by de Broglie, assumes that these entities are particles and each particle follows a definite trajectory but guided by a wave determined by the state of system. Thus, a particle passes through one of the slits but the wave passes through both and the probability of a particle being at a location on the observation screen is proportional to the resultant intensity of the interfering pilot wavelets at that location. Thus, the particle density pattern resembles the intensity distribution of two interfering wavelets without the particles assuming a wave form. Particle trajectories remain unknown, which are considered the hidden variables. Concerning the collapse of a wavefunction, the Bohm theory considers a universal wavefunction incorporating the observed and the observing systems together as a composite system, which never collapses; collapse occurs only in the phenominological sense applicable to a subsystem, which evolves separately caused by the decoherence resulting from an intrusion by a measurement.
According to the geometrical formulation, each one of the entities, assumed to be a particle, passes through one of the slits, which remains unknown but due to randomness. Also, while a definite trajectory cannot be assigned to a particle, sets of the physical trajectories can be computed from the information about the observed and the observing systems and thus, they are not completely hidden. The probability of finding a particle at a location is proportional to the density of the paths available for its passage, which for the identical slits and weak intrusions is shown to resemble the intensity distribution of two interfering wavelets originating at the slits. Two systems, the observed and the observing one, are considered together to determine the assigned gauges, which for strong intrusions, “decohere” the sets of trajectories. This decoherence causes the wavefunction to decompose into fragments, the observed one of them being the “collapsed” wavefunction, as discussed above. Wavefunction of the composite system, the observed and the observing one together, does not enter the considerations. Impact of the observing system is incorporated in the wavefunction through the assigned gauges.
Many worlds view or the relative state formulation17 assigns purely wave nature to these entities. It was developed by coupling the observing system with the observed one through wavefunction of the composite system. Each measurement leaves each subsystem in a relative state with respect to the other. There is no quantification of such alterations. With each observation, system’s state branches off to different non-communicating worlds, each one corresponding to each of the possible outcomes of an observation. What is observed depends on the world the observer enters. Such branching off occurs at each subsequent observation. Thus, the outcome of a measurement depends on all previous measurements.
In the extended formulation, the method of observation couples the two systems through the assigned gauges. Basic physical system and the assigned gauges determine the observation. Each act of observation assigns a computable multiplicative gauge factor altering the “state” of the observed system, which is altered further by each subsequent measurement. If an observation causes a strong intrusion, the system of trajectories changes in a drastic way. As discussed above, one “branch” of the trajectories corresponds to the observation in the lab and the other, to its complement but in the same “world,” not in a different one, and what is observed depends on which branch one observes. Each one of these branches has an associated wavefunction. Such “branching” off occurs with each measurement yielding a definite outcome. This provides an interpretation of the “many worlds” view.
The concept of a wave is basic to the interpretations of quantum mechanics discussed here, which is irrelevant to the present formulation. The concept of gauge transformations does not enter the formulations and interpretations of quantum mechanics, which is fundamental to the geometrical formulation. Thus, the geometrical formulation differs radically from the standard quantum mechanics and its interpretations in its underpinnings as well as implications, although it has parallels.
In the framework of the geometrical formulation, the expected observation in the experiments involving large number of particles can be determined to a large extent, with the usual information about the observed and the observing systems prior to making a measurement, distinguishing it fundamentally from the interpretations as they do not have such predictive ability; outcome of an observation is known only after it has occurred; they only attempt to describe how the observation ocurred.