The optimal control problem of spacecraft reorientation from arbitrary known initial attitude into the required angular position was solved. Spacecraft motion around the center of mass is given by quaternion of attitude.1 Designing the optimal rotation program is based on quaternion models, Pontryagin’s maximum principle, and universal variables.2 In present time, spacecrafts are used in many areas of industry and scientific occupations. For example, astrophysical researches and other scientific discoveries would be impossible without spacecrafts for which success of mission and duration of performance in a working point of orbit (orbital position) are determined by successful control of motion, by an efficiency of attitude control (an improved system of spacecraft attitude is especially important for the spacecrafts with instruments and devices for astronomy measurements and for satellites of Earth supervision).3–5 In particular, in April 2018, NASA launched the Transiting Exoplanet Survey Satellite (TESS), a space telescope that helps to study the exoplanets, or NASA’s James Webb Space Telescope measuring atmospheric properties and compositions of small planets, and also Atmospheric Remote-sensing Infrared Exoplanet Large-survey (ARIEL) space telescope of European Space Agency.5
Many works have been dedicated to investigating optimal solutions in problems of controlling the angular position of a rigid body.1,6–26 Most solutions correspond for the case when the spacecraft rotates around a motionless axis1,6–10 (including the use the algorithm of fuzzy logic7 or the method of the inverse dynamic problem),8 time-optimal maneuvers is topical also.1,9–14 Particular solutions were found obtained for axi-symmetric spacecraft.14–16 For the spacecraft with arbitrary mass distribution, an analytical solution to the three-dimensional turn problem with arbitrary boundary conditions for spacecraft’s angular position is not known except some special cases of reorientation problem.1 An analytical solution to the optimal turn problem in a closed form, if it were obtained, is of great practical interest, since it allows the finished laws of programmed control and variation of the optimal trajectory of spacecraft motion to be applied onboard of the spacecraft.11,17 Principal difference of the presented research work consists in use of new minimized index which characterizes consumption of energy and time for spacecraft reorientation and combines the duration of maneuver and integral of kinetic rotation energy. Issues of cost-efficiency also remain relevant for the time being for spacecraft motion control. The method designed in present article is universal control; it does not depend on a ratio (proportion) of moments of inertia or final position of a spacecraft. Also, the developed algorithm is useful for of control of the spacecrafts with inertial actuators (specific features of system with gyrodins were considered earlier).18–21
Formulation of optimization problem
Spacecraft’s angular rotation around the center mass is determined by the dynamic equations:6
(1)
where Ji are the spacecraft’s principal central moments of inertia, Mi are projections of torque M onto the principal central axes of the spacecraft’s inertia ellipsoid (these axes form body basis E), are projections of the absolute angular velocity vector ω onto the axes of the body basis E (i=
). The mathematical formalism of quaternions (the Euler–Rodrigues parameters) is used for describing the spatial motion of a spacecraft. Angular orientation of the spacecraft coordinate system is defined relative to the reference basis I which we assume inertial coordinate system. Motion of the body basis E relative to the reference basis I will be specified by the quaternion Λ.1 We assume that the quaternion Λ is normalized (||Λ||=1), and the basis I is inertial. Therefore, the following kinematic equation is true
(2)
The symbol is a sign of multiplication of quaternions. For simplicity, the quaternion specifying the current spacecraft orientation is assumed the normalized quaternion (||Λ ||= 1). The spacecraft motion control relative to its center of mass is done by changing the torque M (external or internal, if spacecraft orientation control is done with use of inertial actuators, i.e. powered gyroscopes). Region of admissible values for the vector M is determined by the condition
(3)
where u0>0 is positive value specifying power of actuators. Angular positions of the initial and final spacecraft attitude are given by the quaternions Lin & Lf respectively. In many practical modes of reorientation, initial state satisfies condition ω (0)=0 and final angular velocity must be absent ω (T)=0 (these cases occur very frequently, especially if attitude control is done relative to inertial coordinate system). The boundary conditions are:
, ω (0)=0 (4)
, ω (T)=0 (5)
where T is the time of ending the reorientation process, and the quaternions
and
which specify the orientation of spacecraft-bound axes at the initial and final time instants have arbitrary predefined values satisfying the condition ||
||=||
||=1. We estimate efficiency of control using the index
(6)
where k0 > 0 is the constant positive coefficient; Т is the time of ending the turn.
The reorientation optimal control problem is formulated in following statement: spacecraft must be transferred from state (4) into state (5) according to equations (1), (2) and restriction (3) with minimal value of the functional (6) (time T, when the spacecraft reorientation maneuver should end, is not fixed, and it is optimized simultaneously with value G). The solution М(t) is sought in the class of piecewise continuous functions. The chosen criterion of optimality guarantees the spacecraft’s motion with a kinetic energy of rotation not exceeding the required value (it provides turn of a spacecraft with the bounded rotation energy because the coefficient k0 ≠ 0). It allows for estimation of an energetically advantageous angular motion trajectory along which the spacecraft will turn from its initial position Lin into the required final angular position Lf, and for finding the corresponding control mode. Presence of time factor in the assumed index (6) limits the duration T of the optimal turn to some finite time Topt.
Solution procedure of the optimal control problem
The formulated control problem (1)–(6) is a classical dynamic problem of the optimal turn,1 and the torques Мi are the control variables, and angular velocity projections
are controllable variables (for minimization of index (6)). For solving the formulated problem, we use Pontryagin’s maximum principle.27 The restriction on the phase variables L is insignificant, since it is satisfied during any motions relative to the center of mass; ||Λ(t)||=const due to equation (2); we assumed ||Λ(0)||=||Λin||=1, and therefore ||Λ(t)||=1 at any time tÎ[0, T]. Introduce the conjugate variables
corresponding to the angular velocities
(i=
). The minimized functional (6) does not include the position coordinates, therefore we use universal variables ri,2 and to introduce the conjugate variables corresponding to the phase variables (components of quaternion Λ) is not obligatory. The Hamiltonian H of problem (1)-(6) looks like the following:
H = j1 ( М1 + (J2 - J3 )w2 w3 ) /J1 + j2 ( М2 + (J3 - J1 )w1 w3 ) /J2 + j3 ( М3 + (J1 - J2 )w1 w2 ) /J3 +
+ (7)
where the functions ri and λj are related by the following relations:2
,
,
and
are the conjugate variables corresponding to components of quaternion
(j=
).
The first terms (they contains the conjugate variables
) comprise the dynamical part of the Hamiltonian H, the second terms (they contains universal variables ri) comprise the kinematical part of the Hamiltonian H which is responsible for the geometric properties of optimal motion. Other terms corresponds to the chosen optimality criterion. The function H does not take into account the phase constraint ||Λ||=1, since || Λ(0)||=1. For the universal variables ri, the following system of equations are true:2
(8)
Change in vector r formed by the universal variables ri is solution of the following equation
(it is vector form of the equations (8) if variables ri is assumed as assume projections of vector r on the axes of the body basis E.2 The symbol × denotes the vector product of two vectors. It is know that the vector r turns out to be motionless relative to the inertial basis I, and |r|=const¹0.2 The equations for conjugate variables
have the form
The conjugate system of equations is
(9)
where n1 = (J2 – J3 )/J1 , n2 = (J3 – J1 )/J2 , n3 = (J1 – J2 )/J3 are the constant coefficients.
Thus, the problem of finding an optimal control is reduced to solving the system of equations of spacecraft’s angular motion (1), (2), and equations (8), (9) under the condition that the control itself is chosen by maximizing the Hamiltonian. The optimal function r(t) is calculated through the quaternion (t) using the formula:1,2
, where cE=
=const
(
is the quaternion conjugate to quaternion Λ). The direction of vector cE depends on the initial and final spacecraft positions. In order for the spacecraft to have the required orientation at the right-hand end
, the vector cE (or the value of vector r at the initial instant) by the corresponding solution of equation (2) should be determined. The system of differential equations (8), (9), together with the maximality condition of the Hamiltonian H, is necessary conditions of optimality. Constraint equations are given by the system of equation (2) which describes the spacecraft’s motion relative to its center of mass. The maximum conditions of function H determine sought solution M(t). Boundary position conditions Λ(0) and Λ(T) determine solutions Λ(t) and r(t). The boundary problem of the maximum principle is to find the value of the vector r(0) for which the solution of the system of differential equations (1), (2), (8), (9) together with simultaneous maximization, at every current moment of time, of the Hamiltonian H satisfies reorientation conditions (4), (5).
In the problem being solved, the time of the spatial turn is not fixed; therefore, the following transversality condition H=0 must be satisfied28 (since the Hamiltonian H does not explicitly depend on time). The system of to equations (8), (9), together with the requirement of maximality of the Hamiltonian H and the conditions r(0)≠0, H=0, are the necessary conditions of optimality. To find the control function M(t) (the optimal control program) and the optimal vector r, we define the conditions for the maximality of the Hamiltonian H, which we rewrite in the form
H = φ1М1 /J1 + φ2М2 /J2 + φ3М3 /J3 + Hinv
where Hinv does not explicitly depend on the control functions Mi . Let φ be the vector with components φi. If φ=;0, the maximum of the function H for the controls Мi (t) under restriction (3) is achieved when
Mi =
(10)
(the case of φ=0, in which the Hamiltonian does not explicitly depend on the control M, requires additional consideration). Further we will demonstrate that M=0 if Φ=0 (and φ = 0). The optimal solution is determined by the closed system of equations (1), (2), and (8)-(10) considering the conditions (4) and (5). Let us find the characteristic properties of the optimal motion using the normalized vector р = r/|r|, |p|=1 (due to the fact that |r| =const=|r(0)|¹0). For the vector p, we have
, or
(11)
In what follows, the components pi of the vector p will be used. Note that ri=|r(0)|pi. The solution to the system of equations (1) and (8)-(10) (under the requirement ω (0) = ω (Т) = 0) has the form
(12)
(13)
where b is a scalar value; а(t) is scalar function of time with £0 (b³0 for optimal motion ω (t)).
After substituting solution (12), (13) into system (9) and considering the equations (11) for the derivatives , we obtain the identity expressions if p=(2k0b – r0)p, where r0=|r(0)|. Therefore, the optimal functions а(t) and b(t) satisfy the condition =2k0b – r0 (since |p|¹0), from which two features follow:
(0)=
(T)= –r0 and b(0)=b(T)=0 (due to the requirement ω (0) = ω (Т) = 0). At initial instant t=0, а(0)>0; otherwise M×p<0 and b<0 due to the equations (1), (10), (12), and <0, a<0 for any t>0. However, in such a scenario (when а (0)£0), the switching is absent (since b<0 and
<0); the torque М acts in one direction, spinning the spacecraft until ω→∞. Accordingly, at end of reorientation maneuver, condition M×L<0 is necessary (and M×p<0 also) and а (T)<0. The scalar function a(t) is the continuous function of time. Therefore, moment of time when a(t)=0 exists. If
=0 then a(t)=0,Φ=0, φ=0 (otherwise the value a(t) does not change a sign, i.e. a(t)>0 and M×L>0 during interval of control [0, T], but such rotation does not satisfy the condition ω (T)=0); L is angular momentum of a spacecraft (the multiplication sign × means the scalar product of vectors).
From (1) and (13), we have
= M×p. If a(t)>0, we have acceleration process. If a(t)<0, we have braking. For acceleration phase [0, t1], we have
>0; for braking [t2, T], we have
<0 (since
=2
). For functions a(t) and b(t), we have the following properties: a(Т-t)= -a(t) and b(Т-t)=b(t).
The equalities (12), (13) are satisfied together. For spin-up, optimal controlling moment M is calculated by the formula
(14)
Optimal torque M and angular momentum L are parallel during acceleration phase. Differentiation of left and right parts of equalities (14) gives the following differential equations (angular accelerations
are taken from dynamic equations (1)):
(15)
Rewrite last equations in vector form
The obtained differential equation, for the controlling moment M, means its immobility relative to inertial coordinate system. As consequence, |M|=const during acceleration stage. For optimal braking, the torque M is
(16)
After differentiation of equalities (16) we obtain differential equations (15) from which the property |M|=const appears for the entire braking stage. Thus, equality |M|=const is satisfied for optimal rotation during acceleration and braking phases. It is very important property of optimal motion and optimal control. Both at imparting the calculated angular momentum, and at damping of rotation, the torque M has a constant magnitude (direction of vector M is not changed relative to inertial basis I); i.e. within acceleration and braking segments, optimal torque M is the fixed vector relative to inertial coordinate system. If relations (13) are fulfilled, then
To be certain of this, we differentiate the left-hand part of the given equality with respect to time considering the equations (11) for pi and the dependences (13) for components wi of angular velocity.
= w1p2p3 /J3 - w1p2p3 /J2 + w1p2p3 /J2 - w2p1p3 /J3 + w2p1p3 /J3 - w1p2p3 /J3 º 0
For optimal solution (12), the dependences (10) can be rewritten in the form
Mi =
Thus, when a(t)≠0 and j≠0, the statement |M|=const≠0 is true, and, therefore,
=const, as well. The torque M satisfies the condition (3). Therefore, we can write the following relation
M=m0signa(t)p , where m0=u0/C
,
For time interval when a(t)=const=0, the system (9) is transformed to the equations
, and the relations
(17)
are satisfied. Let us find the controlling moments necessary to support the spacecraft’s optimal rotation mode in time interval t1<t<t2 during the rotation with a(t)=const=0. This situation is named special control regime, since the Hamiltonian does not explicitly depend on the controlling moments, and control functions Mi cannot be found from the condition of maximality for function H when a(t)=const=0 and φ=const=0. But we can calculate optimal values Mi using (17). Let us substitute the functions
(t) computed by expressions (17) into dynamic equations (1) with taking into account the fact |r| =const¹0. As result, all components of torque M are Mi=0. Thus, between acceleration phase and braking of spacecraft rotation, we have M=0 and b(t)=r0/(2k0). This follows from the analysis of equations (17) that show a relation between the angular momentum L and the vector r of universal variables. The fact that L=r/(2k0) and |L|=const, keeping in mind the immobility of vector r in the inertial basis I, implies that the spacecraft’s angular momentum vector is constant relative to inertial coordinate system during the reorientation. The kinetic energy Ek is constant also (
since M=0).
From formula (13), we see that b=|L|. Left-hand sides of equations (13) are projections of the spacecraft’s angular momentum vector onto the axes of the body basis E. Expressions (13) lead to the conclusion that spacecraft rotation during optimal motion is done with a constant direction of angular momentum relative to the inertial coordinate system. The value of b equals the modulus of the spacecraft’s angular momentum L. The triple p1, p2, p3 represents directional cosines of the vector L relative to the axes of the basis E. Equations (13) clearly demonstrate that in the geometric representation, the vector p is simply the unit vector of the spacecraft’s angular momentum vector L in the spacecraft’s coordinate system. Thus, the optimal (in the sense of minimizing the functional (6)) spacecraft reorientation is performed along the “trajectory of free motion”. Equations (11), together with equalities (13), form a closed system of equations which determine unique properties of optimal motion. The optimal function b(t) is a non-negative piecewise-linear function of time. At t=0 and t=T, b(t)=0; at t=t1 and t=t2, b(t)=r0/(2k0). The time T equals T=t2+t1 (since |M|=m0 for acceleration segment and segment of braking, and t1=r0 /(2k0m0). Concrete solution is determined by close this system of equations (11), (13) by equation (2) with conditions (4), (5) for solution Λ(t).
Here and in what follows, it is assumed that the “trajectory of free motion” is a multitude of angular positions (the values of L) that a rigid body occupies during its rotation by inertia. In the geometric interpretation, the “trajectory of free motion” is a trace of the representing point Λ (t), where Λ (t) is a solution of the system of differential equations (1), (2) when M1=M2=M3=0 and
≠0. Let us find the proportion between the kinetic energy Ek and angular momentum L during optimal slew maneuver. The kinetic energy Ek and the value b are related by expression
Therefore, the proportion
=const.
For segments when a(t)≠0 and M≠0, we have
(see formulas (14), (16)), i.e.
(since |M|=const=m0 if a(t)≠0). If a(t)=0, then M=0, |L|=const and Ek=const. The quantities |L| and Ek are the continuous functions, therefore, Ek/|L|2 is continuous function of time, and it is constant within all three segments of control. Thus, it is shown that this proportion Ek/|L|2 is constant on the entire motion interval 0≤t≤T. It is key property of optimal motion for criterion (6), Ek/|L|2=const.
The Hamiltonian H is independent of time in explicit form, therefore, H=const inside the entire interval of control 0≤t≤T.28 From the formula (7), the equality
=const
is obtained for time interval when φ=0 (i.e. in special control regime). After substitution the values
calculated by equations (17) in this equality (taking into account the equalities ri=r0pi), the conditions
, and
are satisfied for the moments of time when a(t)=0 because |r| =const (at segments of acceleration and braking, the above mentioned conditions are satisfied automatically, it follows from the equations (14), (15), (16)).
Thus, the problem of constructing the optimal control ω(t) has been mainly reduced to finding such a value of vector p(0) that as a result of the spacecraft’s motion, according to equations (2), (11), and (13) with initial conditions (4), the equalities (5) will be satisfied. It is virtually impossible to find a general solution of this system of equations. The problem is to find boundary conditions on p(0) and p(T) which are related by expression
(18)
The time of ending the reorientation process is not fixed, therefore H(T)=0; the Hamiltonian H is independent of time in explicit form, hence, H=0 inside the entire interval of control 0≤t≤T.28 Maximum value of the kinetic energy Ek and modulus of angular momentum (and values a(0) and r0) are determined by condition H=0. At instant t=0, angular velocity is zero, the function H is equal to
Hence, the value a(0) for optimal function a(t) is a(0)= 1/(u0C). Accordingly, a(T) is a(T)= –1/(u0C). At instants when a(t)=0, the function H is
and b=r0/(2k0) if
=0 (this follows from (17)). From last equation, we find the optimal value
. It is obvious that modulus of angular momentum |L| has maximal value Lmax between acceleration and braking.
Thus, Lmax is determined unambiguously
. The found magnitude Lmax corresponds to the maximal kinetic rotation energy Emax= 1 /(2k0). Respectively,
if phase of uncontrolled motion (when M=0) is not absent.
Note, the vectors ω and p are related as
(19)
The task of the onboard control system for realization of optimal control is to impart the initial motion conditions to the spacecraft (namely the calculated angular velocity at time moment t=0) and to suppress the kinetic energy to zero at time moment t=T, when
(after the spacecraft reaches its final position
).
From the moment of reaching the necessary initial angular velocity
and until the reorientation is finished, when the spacecraft will be in the neighborhood of the required position
, there is no torque M acting on the spacecraft’s body; the spacecraft performs uncontrolled rotation (M=0), i.e. free motion. Creating the initial angular velocity and damping the final rotation happens in an impulse (as fast as the spacecraft’s actuators will allow). Between the imparting of angular momentum and the suppressing of angular momentum we have (if special control regime is present):
= 1 /(k0C) , = 1 / k0 (20)
(these equalities (20) follow from condition Н=0 and dependences (7), (17)).
Main types of optimal control
In many practical tasks, reorientation is made in situation when initial state satisfies condition (0)=0 and final angular velocity must be absent ω(T)=0 (these cases occur very frequently, especially if attitude control is done relative to inertial coordinate system). It is obvious, in moments of time t=0 and t=T, angular velocity calculated according to the formula (19), corresponding to nominal rotation (when Ek= 1/(2k0)), is not equal to zero. Therefore, segments of acceleration and braking at the beginning and the ending of turn maneuver are inevitable. But segment of special control regime can be absent. Presence of this regime is determined by the values of Λin ,
for known J1 , J2 , J3 . The most important characteristic is the integral
(21)
which we name path functional, and it is minimum for motion according to the law (11), (13)22 (the integral S depends exclusively on the values of Λin , Λf and the moments of inertia J1 , J2 , J3). The value S specifies type of optimal control and number of switching points for concrete values of J1 , J2 , J3 , Λin and Λf).
Let Lmax be the maximum value of the angular momentum modulus within the control interval [ 0 , T ]; Lnom is the module of the angular momentum in the interval of time with a special control regime (between acceleration and braking); Limp and Timp are the modulus of the angular momentum and the optimal time under impulse control (when the angular momentum at times t=0 and t=T changes abruptly from çL½= 0 to çL½= Lnom and from çL½= Lnom to çL½= 0, respectively). If special control regime is present during the optimal turn then the following equations must be satisfied:
t1 =
, t2 Lnom = S/C , t1 < t2 , t1 + t2 = Т, where Lnom = m0 t1 =
From this system of equations we obtain t2 =
= Timp. From the requirement t1 < t2 we find the condition for the existence of a special control regime: k0u0S > 1. Thus, taking into account the prominence value of integral (21) we arrive at the following conclusion:
(а) if k0u0S > 1, then the optimal value is
, and maximal energy is Emax = 1/(2k0) (see (20)), relay control with two switching points is optimum;
(b) if k0u0S £ 1, then ❘M❘= const > 0, the special control regime is absent, and t1 =
; the function b (t) has a derivative
≠0, the constant r0 satisfies the condition
, and the optimal value of r0 is r0 =
; maximal energy is
≤1/(2k0); relay control with one switching points is optimum.
Essentially, the requirement of optimality (minimization of index (6)) is equivalent to restriction
≤ 2Eadd =1 / k0 (22)
where Eadd is the maximum allowable energy of spacecraft rotation.
For the optimal turn (regardless of the presence or absence of a special control regime) t1 ≤ t2 , t1 + t2 = Т, t2 Lmax= S/C, Lmax= m0t1 , and therefore t1 ≤
always. If a segment with a special control regime occurs, then t 2=Timp=
, and Lmax=Lnom=Limp=
; braking duration τ= t1, since at the stages of acceleration and braking ❘M❘= m0 and М
L. If k0u0 S ≤ 1, then there is no segment with a special control regime and t2=t1=
, Lmax=
. Regardless of u0 , we have bmax≤Limp , so Emax≤1/(2k0), and if u0 ≥uкр , then bmax=Limp and Тopt=Timp+Limp /m0 , and if u0 < uкр , then bmax < Limp and Тopt=Тfast ; the critical value of uкр is uкр=1/(k0S ). If u0 = uкр, then Тopt=2Timp, G=
, and the relative loss in optimality (the growth of the functional (6)) is δG=G/Gimp‑ 1=1/ 3.
If time of reaching the calculated angular velocity which is equal
and duration of suppressing the angular velocity to zero are infinitesimal, then duration of reorientation is T=
because modulus of angular momentum during uncontrolled motion (between acceleration and braking) is |L|=
, where the integral (21) is calculated by the formula
where tpr is the predicted time of achieving the condition
during free rotation from the position
with initial angular velocity ω(0)=
≠ 0 (according to the equations (1), (2) in which all values Mi=0). Note, the value S and the vector p0, which satisfy optimal motion, are computed together. Remind
For a spherically symmetric spacecraft and for a dynamically symmetric spacecraft, key characteristics and the constants of control law are determined straightforwardly, without integration of motion equations (1), (2). For a spherically symmetric spacecraft, the integral S is calculated as
Accordingly, optimal modulus of angular momentum during uncontrolled rotation is
and kinetic energy is
For an axially symmetric spacecraft (when J2=J3), the integral S is equal
optimal modulus of angular momentum during uncontrolled rotation is
and kinetic energy is
The optimal controls Mi and angular velocities wi change according to the following laws:
Mi=0.5m0 [sign (t1 –t) + sign (t2– t)]pi (23)
Jiwi=0.5m0 (T-çt–t1½–çt–t2½)pi (24)
where
; t2=max(
,
); Т=t1+t2 ; t1 is moment of acceleration ending; t2 is moment of the beginning of braking. If t2>t1 (i.e. when k0>1/(u0S), then we have control with two points of switching when phase of rotation with M=0 (between acceleration and braking) is not absent, t0=t1 also. If t1=t2 (i.e. when k0≤1/(u0S), then moments of time when M=0, are absent, we have control with one switching (braking follows acceleration at once). In section 3, we demonstrated earlier that kinetic energy Ek= 1 /(2k0) if =0. It is obvious, Ek< 1 /(2k0) at acceleration and braking. Hence, Ek≤ 1 /(2k0) during the entire interval of time [0, T] (if u0< 1/(k0S), then Emax=u0S/2< 1 /(2k0) also). Thus, for optimal control (in sense (6)), the property Emax≤ 1 /(2k0) is satisfied always.
Key properties and features of optimal solution
We can show that the found control (12), (13) is indeed optimum (since the functions and wi calculated by formulas (12), (13) are single solution of the system (1), (9), (10), (11) if ω(0)= ω (Т)=0 and ri=r0pi). For zero boundary conditions ω (0)= ω (Т)=0, in general case, slew maneuver includes two phases during which magnitude of the torque М is maximal possible: acceleration and braking, and phase of uncontrolled motion at which equations (20) are satisfied. This type of control is called as control with two points of switching (t1 is moment of time when kinetic energy Ek achieves level k0/2, and t2 is moment of beginning of braking). A detailed analysis of the main reorientation stages: speedup, braking, and uncontrolled spacecraft rotation with the constant kinetic energy and angular momentum (relative to inertial coordinate system), shows that all three stages have a common property, namely, spacecraft rotates along the “trajectory of free motion”. It is characteristic for the “trajectory of free motion” that the direction of spacecraft’s angular momentum remains constant in inertial coordinate system. Taking into account that during the entire reorientation (on the entire time interval [0, T]) the torque M is parallel to the angular momentum vector L (i.e., the controlling moment acts in the same direction as L, or in the opposite direction, or equals zero), we can conclude that M×L=0, and, therefore, there are no reasons for a rotation of the angular momentum L in inertial coordinate system. Also, the integral (21) does not depend on the character of variation of the function |L(t)|,22 and S is equal to the value (21) for ideal mode (for motion according to (11), (17), (20)). However, there are such situations (under certain values S, u0 , k0) when the stage of uncontrolled motion is absent and M¹0 within the entire interval of control 0≤t≤T (braking replaces acceleration at once). This type of control is named control with one switching.
If
, then t2>t1 (t2≠t1), optimal motion includes regime of special control; time of optimal reorientation is
;
If k0u0S =1, then t2=t1, time of optimal reorientation is
; the derivative
(T/2)=0 but
(T/2)=0.
If k0u0S ≤ 1, then we have control with one switching when maximal energy of rotation Emax< 1 /(2k0), and duration of optimal reorientation is T=2
; point of switching is t0=
, maximal energy of rotation is Emax=u0S/2, maximal modulus of angular momentum is
, therefore, the derivative
(T/2)<0 (
<0 on the entire interval of time 0≤t≤T).
For spacecraft reorientation with limited control (by restriction (3)), key property of optimal motion remains valid is independent of number of switching, the proportion Ek/|L|2 between the kinetic energy Ek and angular momentum L is constant on the entire interval of time 0≤t≤T, independently of duration of acceleration and braking (independently of presence or absence of the uncontrolled stage with M=0). Angular momentum L and rotation energy Ek are continuous functions of time. Therefore, the proportion ρ=Ek/|L|2 is continuous function. For acceleration (or braking), the equalities (14) (or (16)) and |M|=const are satisfied. Therefore, the proportion ρ is constant within acceleration and braking. Between acceleration and braking, the equations (20) are satisfied; as result, ρ=const during free rotation. Hence, the proportion ρ=Ek/|L|2=const within time interval [0, T] because of a continuity of functionr. As consequence, modulus of torque M is identical for acceleration and braking, and it is equal to same magnitude
=u0 /C
Angular momentum L does not change the direction relative to inertial coordinate system during acceleration, braking and at rotation between acceleration and braking (when spacecraft rotates by inertia), so, a direction of angular momentum in inertial coordinate system invariably on the entire interval of time [0, T]. Hence, in the presence of restriction (3), the equations (11) and (13) are satisfied at the entire interval of rotation from t=0 to t=Т. At acceleration and braking |М|=const=m0, where m0 > 0 is maximal admissible magnitude of the torque М in direction of angular momentum L. Since on entire interval of control Ji /|L|=pi , the value m0 is identical during both phases of acceleration and braking, and it is equal u0 /C, where C is the earlier introduced constant which is unambiguously specified by the vector p0 and moments of inertia J1, J2 , J3. The condition |M|≤m0 is satisfied within the entire interval of control, where m0=u0/C. Optimal torque M is parallel to motionless line relative to inertial coordinate system. Direction of this motionless line is determined by direction of the vector p (since the directions of angular momentum L and the vector p coincide). Hence, M=m(t)p. The scalar function m(t) is specified as m(t)=(M1L1+M2L2+M3L3)/|L| or m(t)=M1p1+M2p2+M3p3. Control function m(t) is three-positional relay or two-positional relay if optimum is control with one switching. The function m(t) can be written in the following form: m(t)=m0 if |L|<Lopt and t<T/2; m(t)= –m0 if
; m(t)=0 if
(it is obvious that situation m(t)=0 is absent if τ=T/2, because the condition
is not satisfied). Here,
, and Lopt is modulus of angular momentum at moment of time t=T/2 (or during free rotation if phase of uncontrolled motion takes place); τ is duration of acceleration (braking). Note the spacecraft’s angular momentum satisfies the inequality |L|≤Lopt for any time t. If optimal control program has two points of switching Lopt=
; if optimum is control with one switching Lopt=
. For optimal control, spacecraft acceleration continues until angular momentum equals the target level
Thus, it is proven the following conclusion: spacecraft’s reorientation occurs with the minimal value of the index (6) if and only if the spacecraft rotates according to the law (11), (23), (24). If allow a step like change of the angular velocity vector , then the proposed optimal control problem (the kinematic reorientation problem) can be considered solved: equations (2), (11), and (19) completely define the necessary motion (t), the main moment of forces is zero (i.e., the spacecraft’s rotation is an Euler–Poinsot motion of rigid body) within interval 0<t<T in which Ek=1 /(2k0).
Topicality of the solved problem consists in the fact that by minimizing functional (6) the energy spent to perform spacecraft reorientation from position
into position
is bounded and maneuver time T is minimum. Narrow-mindedness of rotation energy was proven earlier (in section 4). If u0→∞, then the value of integral (6) is G=2T, consequently, the time of the optimal turn is T=G/2, and, minimizing integral (6), we obtain the turn of the spacecraft for the minimum time. If the moment M is limited, some non-zero time is required for imparting the required angular momentum to the spacecraft and for suppressing the existing angular momentum to zero. A restriction on the magnitude of feasible controlling moment leads to the appearance of intervals with non-zero duration when spacecraft increases and decreases its angular velocity. Duration of optimal maneuver is
(25)
if the special control regime is present (when k0u0S > 1). The first term is duration of so-called kinematic control (or ideal maneuver) when u0→∞, t1→0, and the equalities (17), (20) are satisfied within entire interval of time 0<t<T. This duration is minimal since S is minimal possible value since optimal maneuver (in sense (6)) satisfies equations (11), (13). Second term is duration of braking under restriction (3) (when u0<¥ and u0¹0). This time is minimal for control (16).23 Hence, the value (25) is minimal possible value of reorientation's time with restriction (3) and condition Ek≤ 1 /(2k0) for kinetic energy Ek during spacecraft maneuver. If optimum is control with one switching (because k0u0S ≤ 1), then energy Ek do not achieve the level Eadd=1/(2k0) under time-optimal control.13 Optimization in accordance with criterion (6) minimizes the time of spacecraft’s rotation from the position Λ(0)=Lin to the position Λ(T)=Λf in the presence of restrictions (3) and (22).
If
is much less than
, the beginning of braking will be determined from the fact that the angular momentum magnitude |L(t)| changes linearly when angular velocity is reduced to zero. During braking, the modulus of the controlling moment is constant, and the time moment from which braking will be started is specified by the following condition:
where m0 is the maximal controlling moment magnitude that can be provided by the actuators of attitude control system; δj are the components of the discrepancy quaternion
; К=|L(t)| is the current magnitude of the angular momentum. The said condition for finding the start moment of braking phase allows the onboard control system to form a signal of angular velocity reduction based on the information on current spacecraft orientation and measurements of angular velocity. Use of this condition increases the precision of the spacecraft reorientation into final position Λf .
Construction of optimal control in specific cases
We assume that the control non-limited by any restrictions is ideal mode (in this case, u0→∞ and t1→0, the braking is momentary process also). For the optimal motion, spacecraft reorientation from one angular position
to another position
is done by impulsive imparting the necessary angular velocity (the nominal value of the angular momentum vector) to the spacecraft, rotation of the spacecraft with the constant kinetic energy and modulus of angular momentum, and short-term (impulse) reduction of the rotation energy to zero. In ideal motion optimal with respect to criterion (6), the spacecraft’s reorientation between impulsive acceleration and impulsive braking is carried out with zero controlling moment M=0. Constructing the optimal reorientation regime with minimal value of functional (6) is non-trivial task. In the optimal reorientation problem (in constructing the optimal programmed motion ω(t)), it is crucial to find the initial vector ω(0) and the corresponding angular velocity ω(0+) (the angular velocity (0+) is calculated by formulas (19)). The vector p(0) depends on reorientation parameters
and the spacecraft characteristics J1, J2, J3. For arbitrary values J1 ≠ J2 ≠ J3, it is hard to find the solution of the considered problem of spacecraft’s three-dimensional reorientation for arbitrary values
and
. The difficulty is to find the vectors p(0) and p(T) which are related by (18). Analytical solution of the system of equations (2), (11), and (19) exists for dynamically spherical and dynamically symmetric bodies only. For a spherically symmetric spacecraft (when J1=J2=J3), the solution p(t), ω(t) have elementary form: p(t)=const and ω(t)=const, or in detail
, and
where
are components of the reorientation quaternion
.
For a dynamically symmetric spacecraft (for example, when J2=J3), the optimal control problem can be solved completely also. For this distribution of spacecraft’s mass, the following differential equations
,
are satisfied under condition . Last system of differential equations describes the oscillator (with the parameter
), for which
and
are harmonic functions of time. Therefore, р1=const=р10 and harmonic oscillations of the functions р2 and р3 are observed. In this special case, the optimal motion is the simultaneous rotation of the spacecraft as a rigid body around its axial axis OX and around spacecraft’s angular momentum L which is constant in the inertial space and which constitutes a certain constant angle with the spacecraft’s axial axis. Angular velocities with respect to OX and p axises have a constant ratio (as is shown above, the vectors L and p are parallel). The solution of system (2), (11), (19), necessary for solving the control problem, is regular precession. For the regular precession case
where p0=p(0); e1 is the unit vector of the spacecraft’s axial axis; α is the spacecraft’s rotation angle around its axial axis; β is the spacecraft’s rotation angle around the vector p, e is the quaternion exponential.1 It is assumed that |α|≤ π, 0≤β≤ π. For a spacecraft with moments of inertia
, the solution p(t) is written as follows:
(26)
In this case, dependences (26), together with equalities (19), form a solution of the system of equations (2), (11) under condition (13). At the same time, the vector p also generates a cone around the axial axis OX in the spacecraft’s coordinate system. The specific value of р0 is determined exclusively by the requirement that, according to equations (2), (11), (19), boundary conditions (4) and (5) must be satisfied. In this type of control, the spacecraft’s angular momentum preserves a constant direction in the inertial reference basis I, while the axially symmetric body moves along a “conic trajectory”. To move the spacecraft from position
into position
, it rotates simultaneously around the vector cE, which is constant relative to the inertial basis I, by the angle β, and around its own longitudinal axis by the angle α. Using the mathematical formalism of quaternions to describe rotations of rigid body about the center of mass, relations reflecting a dependence between the values p0, α, and β are written. The dependence of parameters p0, α, and β on the boundary angular positions
and
is given by the following system of equations:
For a known reorientation time T, angular rotation velocities around the OX and p axises are equal to
, and
(for ideal mode
). The magnitude of angular momentum during optimal rotation is |L|=J2 β/T. The programmed values of controllable functions ωi (projections of the angular velocity vector ω) have the following form:
where
. Optimal values p0, α, and β corresponding to solution of last system of five transcendent equations and which correspond to free motion from position Λin into position Λf can be determined with using the device.29
For an asymmetric spacecraft (when J1 ≠ J2 ≠ J3), the system (2), (11), (19) can be solved by numerical methods only (e.g., using the method of successive approximations or iterations methods with consecutive approach to true solution). To find the vector p0, it is necessary the solving the boundary problem
,
, taking into account the equations (1), (2) imposed upon the motion, in which Mi=0. As a result, the value of the angular velocity vector at the initial time moment
, for which the spacecraft is moved by its free rotation with respect to the center of mass (M=0) from the state
, ω(0)=
into the state Λ(T)=
, will be found (ω(T) is arbitrary here). In particular, the method of solving the boundary problem and determining the vector p0 was described in detail in the article.13 The value of vector p0 relates to
as
The known algorithms presented in patent24 and system25 can be used for finding calculated values
and p0 also. These algorithms13,24,25 are reliable and provide asymptotic approaching for sought value p0. Other calculation schemes30,31 can be useful only in some specific cases.
Thus, key results are the following: optimal control program of spacecraft reorientation was found; it was demonstrated that two-impulse control when spacecraft rotates by inertia between acceleration and braking is optimum in general case. Key characteristic properties of the obtained optimal motion are determined. All conclusions are absolutely true since well-known mathematical methods were used, and all mathematical formulas are based on the checked theories. For a dynamic symmetric spacecraft, a complete solution of the reorientation problem in closed form is presented; optimal values of control law parameters can be found by the device.29 The obtained control method is differs from all other known solutions. Main difference consists in new form of minimized functional which allows to turn a spacecraft with the bounded rotation energy (maneuver time is minimized also). This useful quality is advantage of presented control mode because it significantly saves the controlling resources and increases the possibilities of spacecraft control. Furthermore, rotation with energy not exceeding the given value allows us to stop rotation of a spacecraft within known time what is very important in safety sense in critical situations.
Example and results of mathematical simulation
Spacecraft reorientation with minimal functional (6) is performed along the “trajectory of free motion”, on which the direction of the spacecraft’s angular momentum remains constant in inertial coordinate system on the entire time interval from t=0 to t=T. Optimal torque M is collinear to the vector p which is unit vector of angular momentum L. For cases when
, the strategy “acceleration of rotation, uncontrolled rotation, damping of rotation” is optimum for optimal control problem (1)-(6) if ω(0)=ω (T)=0 in slew maneuver. The assumed criterion of optimality supports motion of a spacecraft with the bounded kinetic energy of rotation during reorientation maneuver. The universality of the designed control method is proved by the following factors: it does not depend from actuators type, mass and size of spacecraft, spacecraft’s moments of inertia, altitude of working orbit (and from others, for example, from periodicity of reorientation, angle of a turn). Let us give a numerical example of solving the optimal control problem for spacecraft reorientation with minimal value of the integral (6). As an example, let us consider spacecraft reorientation for 180 degree from initial position
, when body axes coincide with the axes of reference basis I, into the target position
. It is assumed that initial and final angular velocities are zero, ω(0)= ω(T)=0. Values of the elements of quaternion
that characterizes the target attitude of a spacecraft are:
λ0=0, λ1=0.707107, λ2=0.5, λ3=0.5
Let us find the optimal control program for angular velocity ω(t) for transferring the spacecraft from the state
, ω(0)=0 to the state
, ω(T)=0 if the coefficient k0 is equal to k0=0.5 joules–1. The constant u0 which characterizes power of actuators is u0=0.05 N kg–1/2. The spacecraft’s mass-inertial characteristics are as follows:
J1=12801.6 kg m2 , J2=45747.3 kg m2 , J3=40331.1 kg m2
As a result of solving the turn boundary problem from position
into position
(the optimal reorientation problem in the impulse setting), we obtained the calculated value of the vector p0= {0.4469347; -0.1861273; 0.8749891} and integral S=471.1 m
. Iterations method guaranteeing successive approach to true value p0 was used13 (in most cases, this method provides asymptotic approaching). The maximal value of the control torque is m0=8.41 N m. The obtained value S shows that k0u0S > 1 and optimal program is control with phase of uncontrolled rotation. The durations of speedup and braking are the same and equal to τ=28.3 s, the angular momentum magnitude within stage of rotation by inertia equals Lopt=238 N m s. Optimal changing the controlling moment M is described by the law
М=u0 [sign (
) + sign (
)]
Λin p0
L / (2C)
Results of the mathematical modeling of the reorientation process under optimal control are given on Figures 1–3. Figure 1 shows the character of changing the angular velocities ω1(t), ω2(t), ω3(t) with respect to time. The reorientation time is T=361.4 s. Between acceleration and braking, the spacecraft rotates with a constant energy equal to Ek=1 joules. The value of index (6), which characterizes the cost-efficiency of the rotation trajectory Λ(t), ω(t) after spacecraft’s angular motion from position
into position
, equal to G=685 s. Figure 2 shows the changes in the components of quaternion Λ(t) that defines current attitude of a spacecraft in process of rotation maneuver: λ0(t), λ1(t), λ2(t), λ3(t). Finally, Figure 3 shows the dynamics of the components p1(t), p2(t), p3(t) of the unit vector p during optimal maneuver. The variables λj and pi are dimensionless quantities. It is characteristic that the change in the projection p1 is very small in comparison with changes in the projections p2 and p3 (the angular velocity component ω1 also changes a lot less on the interval of free rotation than angular velocity components ω2 and ω3). This confirms the fact that the OX axis is longitudinal axis. Unlike variables ωi, variables pi and λj are smooth functions of time.
Figure 1 Changing the angular velocities during optimal reorientation maneuver.
Figure 2 Changing the components of orientation quaternion Λ(t) during optimal reorientation.
Figure 3 Changing the components of unit vector p under optimal control.