Review Article Volume 2 Issue 3
Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Russia
Correspondence: Aidagulov Rustem Rimovich, Faculty of Mechanics and Mathematics, Lomonosov Moscow State University, Moscow 119991, Russia
Received: April 27, 2018 | Published: June 6, 2018
Citation: Aidagulov RR, Glavatsky ST. Metric physics. Phys Astron Int J. 2018;2(3):204-211. DOI: 10.15406/paij.2018.02.00087
Different axiomatics of metric spaces define different types of geometries. Standard axiomatics defines elliptic–type geometries, which are subdivided into Archimedean and non–Archimedean. The former are used in classical physics, the latter–completely discontinuous–are more correlated with quantum physics. Replacing axioms of metric spaces by other, with opposite conditions, we obtain the axioms of hyperbolic geometries underlying relativistic physics. In our work, the notion of hyperbolicity is given a certain meaning, namely, the mathematical expression of the physical principle of cause–effect through the axioms of the metric.
Keywords: metric spaces, geometries, relativistic physics, special theory of relativity, axioms, triangle, velocity, light, binary relation, planck constant, topology
02.40.Ft
Initially, the article was created for the presentation of the special theory of relativity (STR) on the basis of the Berwald–Moor metric. The operations of addition of unidirectional velocities and the Doppler Effect for radially moving objects in this geometry coincide with their values in the Minkowski geometry. However, there are also significant differences that have a direct relationship to the interpretation of cosmological observations. We have decided to separate (this) geometric part from the physical (exposition of STR based on the Berwald–Moore metric) because of the need for a large amount of material, so we will concentrate here only on the expression of cause–effect in terms of the axioms of the metric.
Attempts to present a special and general theory of relativity on more general, more fundamental principles have been made since the very beginning of the theory of relativity. Initially, the generalizations were based on the replacement of the Riemannian metric by Finsler one. Such an approach was used in the works of Asanov et al.,1,2 Bogoslovsky et al.,3–5 and others. However, the Finsler geometry in these works was used rather to describe the anisotropy of the space–time, and not as a special form of ordering (cause–effect) and signature. In some works, the causality was introduced explicitly, with the metric6 or without it,7–9 through special axioms of the order. The order corresponding to the cause–effect there was introduced by replacing the triangle inequality by the opposite one. Connectivity and the elimination of the complete discontinuity of space were expressed through the property of divisibility of the metric, called its Archimedean property. In mathematics, hyperbolic geometry is often understood as the geometry of Lobachevsky. In our work this concept obtained a slightly different meaning, namely, the mathematical expression of the physical principle of cause–effect through the axioms of the metric. Here is an analogy with hyperbolicity with respect to the time parameter in differential equations that determines the cause–effect in their solutions and the finite rate of diffusion of perturbations. Replacing the triangle inequality in the axioms of the metric by the opposite one also determines the order (cause–effect) and the finiteness of the perturbation velocity (the speed of light). The infinite velocity of the light, which exists in the Galileo–Newton geometry, in this sense refers to parabolic geometry. The analogue of Lobachevsky’s geometry also rather refers to the geometry induced from a hyperbolic space to a sphere of imaginary radius. Non–Archimedean completely discontinuous spaces are more suitable for describing the quantum world.
It is quite possible that in the future the quantumness of space–time will also be expressed by the divisibility of the metric with certain accuracyε>0ε>0 with an additional modification of the triangle inequality (opposite inequality):
ρ(x,y)≤ (≥) ρ(x,z)+ρ(z,y)−ε,ρ(x,y)≤(≥)ρ(x,z)+ρ(z,y)−ε,
With a small constantεε , similarly the Planck constant. Recall that a metric on the setXX is defined by specifying pairwise distances:
r(x,y):X×X→ℝ+,r(x,y):X×X→R+,
Which determine the binary relation. In mathematics by a binary relation on the setAA any subset of the Cartesian squareA×AA×A is usually understood. This is equivalent to specifying the functionA×A→ℤ2={0,1}A×A→Z2=0,1{} . In physics under the relation a more fuzzy notion is more often understood, when instead of a value fromℤ2Z2 is taken a certain numerical value fromℝR . In particular in physics a unary relation means a function fromAA toℝ+R+ (for example, in determining a density or a fuzzy subset) or a function A→ℂA→C with complex values (in determining a wave function). The metric specification, from the physics point of view, is the definition of a binary relation with values inℝ+R+ . From the mathematical point of view, the metric is not a relation, but a function defined on the Cartesian productA×A→ℝ+A×A→R+ , which takes non–negative values and satisfies certain conditions.
Next, we consider the basic mathematical concepts from geometry with giving them a certain physical meaning.
A metric on the setXX is defined by the functionr(x,y):X×X→ℝ+r(x,y):X×X→R+ , satisfying the following conditions:
The first property is the property of reflexivity, the second–of symmetry. The last property, called the triangle inequality, corresponds to the relation of nearness transitivity.
Here the defining property is the triangle inequality. Indeed, if only this one is satisfied for the functionrr , then we can symmetrise the distance as follows:ρ(x,y)=12(r(x,y)+r(y,x))ρ(x,y)=12(r(x,y)+r(y,x)) (the triangle inequality is not violated in this case) and, if necessary, to factorize by the equivalence relationx≅y⇔ρ(x,y)=0x≅y⇔ρ(x,y)0= . Further, we will assume that the condition(2)(2) for the functionrr is satisfied.
Then from the conditionr(x,y)=0r(x,y)0= it follows thatr(y,x)=0r(y,x)0= , and from the conditionsr(x,y)=0, r(y,z)=0r(x,y)0,=r(y,z)0= , according to the triangle inequality, it follows thatr(x,z)=0r(x,z)0= . Therefore, we can factorize the set by the relationx∼y⇔r(x,y)=0x∼y⇔r(x,y)0= . Moreover, according to the triangle inequality, we get thatx∼y⇒r(x,z)=r(y,z)x∼y⇒r(x,z)=r(y,z) , hence the distance between the equivalent points is uniquely determined and the triangle inequality is not violated.
The triangle inequality here determines the fact that the open balls{y | ρ(x,y)<ε, ε>0}{y|ρ(x,y)<ε,ε>0} form a basis of the induced topology. The distance function determines the closeness of points for which a property of the kind of transitivity is true: "close plus close" is "close". According to this approach, "far plus far" may be "close" , but "far plus close" remains "far".
The triangle inequality is also a nonlinear an a log of the ball convexity condition. Indeed, if the distance function is defined on a linear space, then the ball convexity condition is equivalent to the triangle inequality. Another reason for using the axiom of triangle inequality is the possibility of measuring the objective distances by integrating (summing) of small distances along some path. Let the two farmers divide the land plot and measure the appropriate distances using a certain standard, for example a common bipedal, fastened with a third stick so that the distance between the ends of the two–knife remains constant. For one of them (we denote him the first) it is advantageous for the measured value–the distance between the pointsAA andBB –to be as large as possible, and for the second–as small as possible. When the triangle inequality is satisfied, the second will never trust the measurement of the first, since that can move deviating very far from side to side and so get an arbitrarily large value of the measured distance between the pointsAA andBB . But the first can entrust the measurement to the second, since, according to the triangle inequality, the objective distance does not exceed the distance measured by the second as a sum of distances:∑ni=1r(Ai−1,Ai)=n∑ni=1r(Ai−1,Ai)=n , whereA0=A, An=BA0=A,An=B . Respectively, for the first is beneficial (at least not to the detriment of) any measurement by summing the distances. Thus, by measuring, they can measure the objective distances between points if the triangle inequality holds (convexity of the balls holds). In essence, this means the presence of a variational principle of measuring distances along paths asρ(A,B)=infA=A0,...,An=B∑iρ(Ai,Ai+1)ρ(A,B)=infA=A0,...,An=B∑iρ(Ai,Ai+1) .
The variational principle also works in the case of concavity of balls, i.e. an objective measurement of distances is possible even in the case when the opposite inequality is satisfied instead of the triangle inequality. In this case, the first farmer will not trust the second, since he can significantly understate the measured value in comparison with the objective one. The second farmer can entrust the measurement to the first, since the distance measured first, which is the sum of the distances∑ni=1r(Ai−1,Ai)∑ni=1r(Ai−1,Ai) , is always not greater than the objective value, which is the upper limit of the measured distances. Here another type of variational principle is fulfilled. In this case, distances, generally speaking, do not define a topology. Apparently, therefore, this type of metrics has not been studied in detail: here the principle of transitivity of range is fulfilled–"far plus something" is "far". But here "close plus close" can also turn out to be "far". Correspondingly, in linear spaces the functionx+yx+y of a sum of two arguments will not be continuous when the closeness on the product of spaces is consistent with closeness as in the categorical product of spaces. Apparently, this is the main reason why mathematicians did not pay much attention to geometry of another type, where the triangle inequality is satisfied in the opposite direction, which corresponds to the concavity of the ball (convexity of the complement). We call such geometry concave or hyperbolic (the hyperbola restricts the concave region, the ellipse–convex). In this case, we must abandon the condition(2)(2) for the distance function, and also on the condition that distances are defined for any pairs of points. Correspondingly, we define a concave (hyperbolic) metric as a distance function on the Cartesian product taking values fromℝ+∪{*}R+∪{*} , where** means that in this case the distance between points is not defined (its value is some unknown negative number). Thus, a hyperbolic metric on the spaceXX satisfies the following conditions:
The last condition means that ifr(x,y)>0r(x,y)0> andr(y,z)>0r(y,z)0> are defined, thenr(x,z)>0r(x,z)0> is also defined. Here the condition(1)(1) corresponds to the anti–reflexivity (and can be obtained from condition (2)), the condition(2)(2) corresponds to anti-symmetry. The distance function determines here not a "physical" distance, but something corresponding to a relativistic interval. The distance can be corresponded to the inverse values that are not defined for the zero value. This justifies the anti–reflexivity condition. With such axioms, all distances can be considered positive. Any way, it is more convenient to work with zero values for distances, changing the first two axioms the following way:
For relativistic events, the conditionr(x,y)>0r(x,y)0> can be interpreted as follows: the eventyy occurs after the eventxx (performed in any inertial frame of reference) on the "time” r(x,y)r(x,y) . This determines the order: (independent of the frame of reference)y>x⇔r(x,y)>0y>x⇔r(x,y)0> . The inequality of a triangle means the fulfilment of the axioms of order for such a relation "later".
In the first case, the distance between the pointsx,yx,y is determined by the variational principleinf(∑iρ(xi−1,xi) | x0=x, xn=y)inf(∑iρ(xi−1,xi)|x0=x,xn=y) , in the second–bysup(∑iρ(xi−1,xi) | x0=x, xn=y)sup(∑iρ(xi−1,xi)|x0=x,xn=y) , wheresupsup is computed over all measurable paths. If both the triangle inequality and the opposite inequality are not satisfied, in general, it would not be possible to measure the distance by summing the standard (meter). As it will be seen below, in this case it is possible thatinf=0, sup=∞inf=0,sup=∞ .
In hyperbolic space, the open balls do not define a topology.10 In such a space, however, one can determine the interval topology by taking Uxy={z | r(x,z)>0, r(z,y)>0, x<z<y}Uxy={z|r(x,z)0,>r(z,y)0,>x<z<y} as the base of open sets. This can be interpreted as follows: the eventzz occurs later thanxx , but beforeyy . However, such a topology turns out to be stronger than the continuous structure determined by the distance function: one can choose a sequencexixi of points for whichlimn→∞r(x,xn)=0limn→∞r(x,xn)0= , but this sequence does not converges in the proposed interval topology. The interval topology for hyperbolic metrics is not defined by a uniform structure (as for usual metrics) and, accordingly, is not suitable for testing of completeness or for the completion of the space.
We can define a base of neighbourhoods of the pointxx in terms of the system of setsUx,ε={y | 0≤r(x,y)<ε}Ux,ε={y|0≤r(x,y)<ε} . Such neighbourhoods do not induce (in general) a topology, since they can contain no neighbourhoods of other points because of the hyperbolicity of neighbourhoods (with "tails" leaving to infinity). But they form a quasi–topology10 and are suitable for determining many characteristics defined in topological spaces.
Note that geodesic lines can be determined even without introducing the Finsler metric. By definition, the curve x(τ)x(τ) is geodesic if it is covered by open subintervals, and the distance between two points coincides with the limit of sums of distances in the decomposition. For a hyperbolic geometry, a measurable (geodesic) curve has an orientation (in the other direction it is immeasurable), respectively, the subdivisions into small intervals are also oriented, and the curve in a certain "sense" is smooth.
With such definition of geodesics, non–Archimedean geometry (a subclass of convex elliptic geometries) looks in a special way, where instead of the triangle inequality the stronger inequality holds:
r(x,z)≤max(r(x,y), r(y,z)).r(x,z)≤max(r(x,y),r(y,z).) (1)
The interval (geodesic) between two pointsA,BA,B with measurableρ(A,B)ρ(A,B) is defined as the set of pointsCC for which satisfy the condition:ρ(A,B)=ρ(A,C)+ρ(C,B)ρ(A,B)=ρ(A,C)+ρ(C,B) . The interval can consist only of the start and end points. This happens in the case of non–Archimedean spaces. Indeed, the inequality (1) implies that there is no pointyy between the pointsxx andzz such thatr(x,y)<r(x,z)r(x,y)<r(x,z) and r(y,z)<r(x,z)r(y,z)<r(x,z) . Such spaces look like discrete, indivisible (quantized). However, they can be nondiscrete in the topological sense. Here, another term is more suitable: a completely discontinuous space.
In non–Archimedean space, if Achilles has a step length of not more than 1, he can never leave a distance more than 1 from the starting position, i.e. Achilles in the non–Archimedean space will never catch up with the Turtle, if the initial distance between them is greater than the maximum of the steps of Achilles and the Turtle. In general, Achilles will not be able to move away from his initial position by a distance greater thanr0r0 if each step does not exceedr0r0 . Archimedes’ principle is to deny such a statement: “For anyr0>0, R>0r0>0,R>0 , from any initial positionxx it is possible to attain some pointyy by a distancer(x,y)r(x,y) greater than RR (that is,r(x,y)>Rr(x,y)>R ) in a finite number of steps, moving only by steps that do not exceedr0r0 ”. This means the asymptotic connection of space. However, a space with the metric defined above can be disconnected, for example, such is the set of hyperbolasxy=n, x>0, n=1,2,3,...xy=n,x>0,n=1,2,3,... . Therefore, by analogy with,11 under the Archimedean metric we mean a narrower class of metrics when is fulfilled the condition of uniform divisibility: “for any two pointsxx andyy , wherer(x,y)>0r(x,y)0> , and for anyε>0ε>0 there exists a pointzz such that |r(x,z)−12r(x,y)|<ε|r(x,z)−12r(x,y)|<ε and|r(z,y)−12r(x,y)|<ε|r(z,y)−12r(x,y)|<ε ”.
From this fact that distances can be divided by 2 with any accuracy follows that it is possible to divide them into nn parts with arbitrary accuracy. In addition, the condition of uniform divisibility also works for hyperbolic geometry. For a locally compact groupGG , even from the weak Archimedean condition (there exists a neighbourhoodUU of the identity such that for any elementg∈Gg∈G which is not identity, exists a positive integernn such thatgn∉Ugn∉U ) it follows that on such a group one can introduce the Lie group structure.11 Here the uniformity is ensured by the degrees of non–identity elements. The local compactness of a topological group corresponds to its completeness and finite dimensionality. In this case, the powers of the element gg (starting from zero) correspond to direct paths from the identity point to some final point. Therefore, restricting our considering to complete spaces, by the Archimedeanness of the metric we mean the existence for any pair of pointsAA andBB an isometryφφ of the interval[0,l], l=ρ(A,B)[0,l],l=ρ(A,B) , fromℝ+R+ to the given metric space, with the conditions:φ(0)=Aφ(0)=A and φ(l)=Bφ(l)=B .
The traditional non–Archimedean metrics arepp –adic metrics. Another, also common non–Archimedean metric, is the distance between the vertices of the tree (graph): the maximum distance from the vertices to the closest common for both vertices of the ancestor vertex is taken as the distance between these vertices. Note that thepp –adic distance between the numbers
x=∞∑i=−∞xipi, y=∞∑i=−∞yipix=∞∑i=−∞xipi,y=∞∑i=−∞yipi
Is also defined as a maximum of distances to the common "ancestor". Such ancestor here is the following rational number (common prefix):z=∑ki=−∞xipiz=∑ki=−∞xipi , wherekk is the maximal integer for whichxi=yixi=yi for∀i≤k∀i≤k (remember that for pp –adic numbers only a finite number of coefficients with negative indexes are different from 0).
Movings in spaces with a non–Archimedean metric have a jump type with a change in direction, like in the Markov processes. This is quite suitable for describing quantum processes. In one space, two types of metrics are not considered simultaneously. Therefore, to combine the classical approach with the quantum one, an adelic approach from the theory of numbers is usually used, which includes all compatible Archimedean and non–Archimedean completions.
In elliptic geometry, an inequality that is an immediate consequence of the triangle inequality is also true:
r(x,y)≥r(x,z)−r(y,z).r(x,y)≥r(x,z)−r(y,z).
If the pointsx,yx,y in non–Archimedean geometry are at a distancer>0r>0 (r(x,y)=rr(x,y)=r ) and the pointzz is closer to the point yy (r(y,z)<rr(y,z)<r ), then the distance to the point xx is equal tor(x,z)=rr(x,z)=r , i.e., the pointzz locates on a sphere with center at the pointxx . Thus, the sphere of radiusrr contains all open balls of radiusrr with center at any point on the sphere.
When determining the distance between living species of animals through the distance on the tree, defined as the maximum distance to the nearest common ancestor, we get a non–Archimedean distance. Defining the distances between species as the natural distance between the states of the Markov process of mutations, we also obtain a non–Archimedean distance. In general, the metric is naturally defined in the state space of the Markov process, and it is non–Archimedean. It should be noted that it is difficult to clearly determine the distances here. Their values can be calculated with some accuracy by simulating the Markov process. The calculations themselves are in fact calculations of Feynman integrals over all possible paths for the evolution of the Markov process. A different approach to the use of non–Archimedean (p–adic) metrics in quantum physics develops in the works of Volovich et al.,12 and his followers.
In a sense, non–Archimedean distances appear more often than Archimedean ones. So, in the theory of numbers, every prime ideal defines a non–Archimedean norm, and there are only a finite number of Archimedean norms (not greater than the degree of extension). An example of a non–Archimedean metric on words or texts is the following distance:
ρ(x,y)=(0, x=y,c−d, x≠y.
Hered≥0 is the length of the common prefix, c>1 is a constant, for example, the number of letters in the alphabet.
Remark: In mathematics, instead of the term "non–Archimedean metric" the term "ultrametrician" is used, and the term "ultrametric relation" is used for the defining relation. At the same time, when it comes to the norming of numbers, such as the p –adic or β –adic valuation (β is a prime ideal, which is maximal by virtue of the Dedekind property), the term "non–Archimedean valuation" is used, despite the absence of order on the numbers. Some authors use the term "non–Archimedean" only for orders in ordered groups. A section of mathematics that uses instead of the field of real numbersℝ its ordered extensionˉℝ is called a non–standard analysis. In this case, the order onˉℝ , consistent with the order ofℝ , is necessarily non–Archimedean. The most interesting among such extensions is the set of surreal numbers of Conway. The topology on such extensions is almost discrete, in the sense that the (countable) sequence{xi} converges tox if and only if all members of the sequence, starting with some, coincide withx . In this case, the distance between the points should be determined not inℝ+ , but in the extensionˉℝ+ . From mathematical logic it is known that any theorem of standard analysis, proved using nonstandard analysis, can be proved without it. Therefore, we will consider the use of non–standard analysis and the term "non–archimedean" (in the sense indicated) in physical applications as non–purposeful.
As we remarked above, the principal property of the ordinary metric is the triangle inequality, and for the hyperbolic metric–the opposite triangle inequality. So, a function defined on a Cartesian product and taking real–valued values (not necessarily only positive) we’ll call elliptic if
f(x,y)+f(y,z), ∀x,y,z;
call non–Archimedean if
f(x,z)≤max(f(x,y),f(y,z)), ∀x,y,z;
and call hyperbolic if
f(x,z)≥f(x,y)+f(y,z), ∀x,y,z.
We list some simple properties of such classes of functions.
Hence, as a consequence, the following remark about elliptic metrics is obtained. Let{fi(x,y)} be a family of elliptic functions, and all elements of the family {Mi−fi(x,y)} (with positive values{Mi} ) satisfy the triangle inequality. Then a functionF=∏iMi−∏i(Mi−fi(x,y)) satisfies the triangle inequality and, in the case of positivity of all its values, the usual metric is obtained from it. Such way, in particular, in13 he sensitivity of Jacquard metric has been enlarged forfi=f, Mi=1 .
Physicists in special theory of relativity (SRT) use translationally invariant metrics: here the distance from the pointx to the pointy is determined by the norm of the displacement vector|y−x| . In this case, the natural condition is that|λa|=λ|a| for any positive numberλ and for any vectora . Obviously, such metrics cannot be non–Archimedean. They are Archimedean even for a stronger condition: because of the Archimedean metric inℝ and the fact thatλ∈ℝ+ .
Metrics inℝn are defined by function–norms defined in a vector space for which the following conditions are satisfied:
In a linear space, the concepts of convexity and concavity of metrics can also be expressed in another way, and they are equivalent to the properties of the triangle inequality and the opposite triangle inequality, respectively. The notion of strict convexity (or concavity) is also naturally defined here–when it follows from the equality in the indicated inequalities that the pointsx, y, z lie on the same line.
Note that the hyperbolic metric is defined only for the interior of some cone and has the character of "directivity to the future", which specifies the ordering: if the distancey−x is measurable, then the eventy occurs later. If also the distancez−y is measurable, thenz−x is also measurable, and|z−x|≥|y−x|+|z−y| . This corresponds to the (strengthened) transitivity property of the order "later".
Usually, the cone of measurable vectors, where a hyperbolic (concave) norm is defined, is a set of vectors for which all coordinates are non–negative, or it contains such a set. The metric onℝn also defines metric in the ad joint space of linear functional. The ad joint space is also a vector spaceℝn . For the hyperbolic metric defined in the coneℝn+ , the adjoint metric is also defined inℝn+ . Namely:
|p|=sup|x|=1px,
In the elliptic case, and
|p|=inf|x|=1px
In the hyperbolic case. As a general example for both types of the metric we can consider the metric defined by the norm inLp in an n –dimensional space:
|x|=(1nn∑i=1|xi|p)1/p. (2)
Here, we took the positive coefficients for convenience, in the case of a hyperbolic metric (taking into account the orientation) that the metric is defined only in the case when all the coordinatesp≥1 are positive.
Theorem 3.1: Forp≥1 , the metric (2) defines an elliptic norm, forp<1 –a hyperbolic. The adjoint space has the same metricLq , wherep+q=pq, (q=pp−1) .
Proof: Forp>1 , the triangle inequality (and its strictness) follows from the Minkowski inequality, and the statement for the adjoint space is a consequence of Holder’s inequality.1 In this case, a strict inequality holds. Forp=1 , the triangle inequality is not strictly satisfied, and there exist many paths from the pointO (the origin) to the pointx with length|x| . The adjoint space in this case is the spaceL∞ .
Now consider the casep<1 andxi>0 for alli . The opposite triangle inequality for0<p<1 is also a direct consequence of Minkowski’s theorem for this case.14 Forp<0 , this can be proved using the inequalities for the casep>0 and the Holder’s inequality. Fortunately, in Edvin et al.,14 there are already corresponding generalizations, the direct consequences of which are our assertions for vectors with positive coordinates. This completes the proof.
In the casep=1 , the metric will be both ordinary and hyperbolic. The latter is defined only in the case when each coordinatexi is nonnegative.
By continuity, we can define the metric (2) and in the casep=0 . Moreover, the adjoint metric corresponds to the caseq=0 , i.e. coincides with the original one. This self–adjoint hyperbolic metric is called the Berwald–Moore metric:
|x|n=x1*x2...*xn.
Here the norm value is the geometric mean of the family of positive numbers. Note that the norm on the set of functional corresponds to the norm with another indexLq:1p+1q=1⇒pq=p+q . Self–adjoint (Hilbert) spaces are spaces withp=2 (for an elliptic metric) andp=0 (the Berwald–Moore metric for the hyperbolic metric). In part, so, some physicists consider the Berwald–Moore metric to be the proper metric of space–time, and not the Minkowski metric. In fact, these metrics are close in the following sense. Newton’s geometry, based on Galileo transformations, coincides with these geometries in the first approximation, differing only by orders ofO((vc)2) , wherev is a speed, and c –the speed of light. The geometries of Minkowski and Berwald–Moore themselves coincide in the second order of accuracy, differing only in the third order. And in dimension 2, these geometries are completely identical. Indeed, by changing the variablesx1=t−x, x2=t+x , the Minkowski metric ds2=dt2−dx2 becomes the Berwald–Moore metricds2=dx1dx2 .
Consider now quadratic pseudometrics:
|x|2=x21+...+x2k−x2k+1−...−x2n,n≥k≥1.
(3)
Prove the following theorem:
Theorem 3.2: The pseudo metric (2) defines an elliptic metric fork=n
, hyperbolic–fork=1<n
, and none in other cases. In the latter case, for1<k<n
, for any two points of the space there exist smooth paths of any positive length connecting them.
Proof: The casek=n is well known. The casek=1<n is considered in Edvin et al.,14 where a more general inequality is proved: letφ(x)=(xp1−xp2−...−xpn)1/p, p>1 , then
φ(x+y)≥φ(x)+φ(y), x∈ℝn+, y∈ℝn+.
This inequality in the casep=2 is equivalent to the opposite triangle inequality for the Minkowski metric. In the casek=0 , the pseudometric is negative definite and does not represent a hyperbolic metric.
For the remaining case1<k<n , we find smooth measurable paths issuing from the origin and ending at another pointr=(x1,..,xk,xk+1,...,xn) . Denote byr1=(x1,x2,..,xk,0,...,0) andr2=(0,...,0,xk+1,...,xn) the decomposition of the position vector into two parts. Leta=√x21+x22+...+x2k, b=√x2k+1+...+x2n . Choosing a path of the form(x1(t),x2(t),...,xn(t))=(x1(a+b)2at,x2(a+b)2at,...,xk(a+b)2at,xk+1(a+b)2bt,...,xn(a+b)2bt), 0≤t≤1 , we get to the point (y1,...,yk,yk+1,...,yn) for a distance of zero.
Further, connecting this point with r by a segment of zero length(y1+x1(ab)2at,y2+x2(ab)2at,...,yk+xk(ab)2at,yk+1+xk+1(ba)2b,...,yn+xn(ba)2b),0≤t≤1 , we obtain a path of zero length. Since the end point does not coincide with the origin, thena+b>0 . Whena=b , the second segment is not needed. Ifab=0 , then by a small perturbation we makeab≠0 . We smooth the path at the junction point with a slight perturbation and obtain a measurable path with any small length.
The path with any great length is obtained by making circles in the planer2=0 . Note that for k=1 we cannot go through closed paths, and for an achievable point there exists a maximal length (a segment) among the paths connecting this point to the origin. This completes the proof.
Note that the usual symmetric polynomialsσk on positive numbersxi also give the hyperbolic metric |x|k=σk(x1,x2,...,xn) . In this caseσ2 corresponds to the Minkowski metric,σn –to Berwald–Moore metric. The fulfilment of the opposite triangle inequality for these cases is proved in Edvin et al.,14.
For a hyperbolic metric, expressed as a homogeneous symmetric function of coordinates, the maximum of the norm for a fixed valueσ1=x1+x2+...+xn is attained when all variables are equal. So, it is more convenient to write the norm in terms of the variables:
y0=anσ1, yi=αzi+β∑j≠izj, zi=(xi+1−σ1n), i=1,...,n−1. (4)
Then the norm is written in the form:
|x|=y0f(v1,v2,...,vn−1), vi=yiy0. (5)
In accordance with the above,f(0,0,...,0)=1=maxf(v1,v2,...,vn−1) . In this casey0 expresses the time coordinates, vi –speed coordinates.
Consider the adduction of the metric|x|k=σk(x1,...,xn), n≥k to the form?? Using the coordinate transformation (4). Let’s start with the casek=2. We get:
σ2=12(σ21−s2)=12σ21−12∑i(xi−σ1n)2−σ21(1n−12n2)=
=12((1−1n)σ1)2−12(z1+...+zn−1)2−12n−1∑i=1z2i=12((1−1n)σ1)2−n−1∑i=1(αzi+β∑j≠izj)2.
Here, the values of the parametersα andβ are computed so that the equality to be satisfied. Thus, in the variables
y0=n−1n√2n∑i=1xi, yi=(α−β)zi+βn−1∑j=1zj, zi=xi+1−σ1n
The metric is adducted to the form of the Minkowski metric if the numbersα,β satisfy the system of equations:
α2+β2(n−2)=1, 4αβ+(n−3)β2=1.
Here we assume thatn≥3
.
The casen=2
is solved immediately:σ2=x1x2=(x1+x22)2−(x1−x22)2.
Suchα,β
are the following:
α=√7n−13±4√3(n−2)n2+10n−23, β=√n+5∓4√3n2+10n−23.
For k=3 we obtain:
σ3=16σ31−12σ1s2+13s3.
The metric reduces to
σ3=y30−y0n−1∑i=1y2i+a∑iy3i.
Fork=4=n enter the coordinates:
2y0=x1+x2+x3+x4, 2y1=x1+x2−x3−x4, 2y2=x1−x2−x3+x4, 2y3=x1−x2+x3−x4.
The reverse transition has an analogous form:
2x1=y0+y1+y2+y3, 2x2=y0+y1−y2−y3, 2x3=y0−y1−y2+y3, 2x4=y0−y1+y2−y3.
This transformation is similar to the transformation in the casek=2=n at the transition from the metric |x|2=x1x2 to 12(y20−y21) by the transformationy0=1√2(x1+x2), y1=1√2(x1−x2) preserving the sums of squares y20+y21=x21+x22 ,x21+x22+x23+x24=y20+y21+y22+y23. Herey0 plays the role of the time coordinate, and the rest–the role of spatial coordinates, and take arbitrary values. A positive value of the norm have vectors for whichy0 is greater than the maximal root of the corresponding polynomial. In our case:
x1x2x3x4=116(y40−2y20r2+8y0y1y2y3+2(y41+y42+y43)−r4), r2=y21+y22+y23.
At first glance this metric is anisotropic. As will be seen later, this does not mean anisotropy of space, but reduces only to the possible different compression of space in the direction of motion for different directions of velocity. And the speed of light will remain the same, independent of direction. This compression is apparent from the side of another observer.
An analogue of these transformations for the casek=n is the transformation using the Hadamard matrix with the normalization1√n . In this case, the role of the time coordinates playsy0=1√n(x1+x2+...+xn) . However, not for alln there is a Hadamard matrix. Correspondingly, the orthonormal system of coordinatesyi must be sought as in the casek=2 .
LetEk(x)=1Cknσk(x1,x2,...,xn) . An analogue of the Holder inequality:
(1nn∑i=1xiyi)k≥Ek(x)Ek(y)
Is not satisfied without the additional requirement of the same ordering:
(xi−xj)(yi−yj)≥0 ∀i,j.
Only fork=n this inequality holds for of non–negative numbers, which means that the Berwald–Moore metric is self–adjoint.
In general, any metric (norm) inRn can be adducted to the form:
|x|=E1φ(e2,e3,...,en),
whereei=Ei(x)Ei1(x) . This corresponds to dimension theory in mechanics. Removing the value ofE1(x) we get a dimensionless formula, and the multiplier is expressed as a function of the dimensionless variablesei(x) . For a hyperbolic metric, the opposite triangle inequality reduces to the negative definiteness of the quadratic form∂2s∂xi∂xjdxidxj on the indicatrix. The latter condition is often easier to verify.
Physicists often seem strange to have hyperbolic metrics defined in the regionxi≥0 . But really, there are no grounds for this. We can always take the coordinate systemy0,...,yn−1 , obtained by a linear transformation from the initial coordinates, wherey1,y2,...,yn−1 are arbitrary, and the restrictiony0>0 is superimposed on measurable vectors directivity to the future). And the strict opposite triangle inequality imposes on the indicatrix given by the homogeneous equationf(y0,...,yn−1)=1 of the first degree, or arbitraryy0=φ(y1,...,yn−1) , which is the solution of the first, one condition:
The quadratic form is positive–definite.
∑ij∂2φ∂yi∂yjxixj (6)
In a hyperbolic metric, there is always a(n−1) –dimensional light cone, the set of vectors with the norm 0. Any vector with a positive norm can be represented as the sum of two vectors with zero norms. The latter follows from the fact that two light cones with vertices at the pointO (origin) and at the pointX at the end of the measurable vector, intersect.
One of the problems considered in the paper is a minimization of the set of axioms for geometries describing the physical space–time continuum. We were convinced that hyperbolicity determines the cause–effect and does not require a special definition of the signature of space. The continuity property follows from the Archimedean property of the standard topology onℝ and from homogeneity axiom. For a quantum description of space–time, where the cause–and–effect can be not valid at short distances, probably it would be better to replace the standard topology onℝ with a quasi–topology: when all the sequences (filters){xi} converging to the pointx are defined by the fact that, starting from somei , their elements fall into anε –neighborhood of the pointx , with the fixed value ε as a quantum number.
In Guts et al.,8 is presented a justification for STR, expressed in the principles of the space uniformity and the identity of the finite light speed in all inertial reference systems, like in the Minkowski geometry. In the future, we plan to show that STR represents itself a hyperbolic homogeneous and isotropic geometry. The property of isotropy corresponds more closely to the Berwald–Moore metric, rather than Minkowski one. The isotropy of hyperbolic geometries corresponds more closely to the invariance property with respect to the choice of scale for all possible variants, different in various directions. The role of generalized conformal transformations, as shown at the end of the second section of this article, is played by transformations of the metric with the hyperbolic functions. The Berwald–Moore geometry fully corresponds to the homogeneity property when replacing scales in all possible directions.
The authors are grateful to Dr. Yu. V. Popov for useful discussions and help.
Author declares there is no conflict of interest.
©2018 Aidagulov, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.