Symmetrized Covariant Derivatives

Introduction

A symmetrized derivative covariant derivative is symmetrization of a number of covariant derivatives:
	∇_((a_1...a_n))= ∇_((a_1)∇_(a_2)... ∇_(a_(n-1))∇_(a_n)) .
The main advantage of symmetrized derivatives is that they have a greater degree of symmetry than non-symmetrized (or ordinary) derivatives. For instance, it is obvious that the following is zero:
	F^([ab])∇_((ab))T^c= 0 .
However, when we write out the symmetrized derivative, it is not immediately clear that the expression is zero:
	F^([ab])∇_((ab))T^c=F^([ab]) (∇_a∇_bT^c+1/2R_(abd)^cT^d)
Of course, when we write F^([ab]) ∇_a∇_b as a commutator, we recover zero. But the point is that eliminating commutators is something the xTensor canonicalizer does not do. So it is often necessary to explicitly symmetrize covariant derivatives in order to fully canonicalize an expression.

Implementation

SymmetrizeCovDssymmetrizes covariant derivatives
ExpandSymCovDsde-symmetrizes covariant derivatives
$AutoSymmetrizeCovDswhen set to True, derivatives are automatically symmetrized

Functions for symmetrizing and de-symmetrizing covariant derivatives.

Symmetrized covariant derivatives are implement in xTras as follows. Whenever a covariant derivative CD is defined to be symmetrizable, CD[-a,-b] will mean ∇_((ab)) (and likewise for more indices). A covariant derivative can be defined to symmetrizable by giving the option SymCovDQ -> True to DefCovD or DefMetric:
First define a manifold.
In[1]:=
Click for copyable input
Now define a metric and covariant derivative. We need to specify SymCovDQ in order to make the covariant derivative symmetrizable.
In[2]:=
Click for copyable input
Derivatives with more than one index are now interpreted as symmerized derivatives:
In[20]:=
Click for copyable input
Out[20]=
Out[21]=
We can expand the symmetrized derivative with a call to ExpandSymCovDs:
In[22]:=
Click for copyable input
Out[22]=
Single derivatives can be symmetrized with SymmetrizeCovDs:
In[23]:=
Click for copyable input
Out[23]=
Out[24]=
Symmetrized derivatives are automatically Leibnitz:
In[25]:=
Click for copyable input
Out[25]=
Integrating by parts can be done with VarD:
In[29]:=
Click for copyable input
Out[29]=
Notice that two symmetrized derivatives acting on each other are not automatically symmetrized further. We need to call SymmetrizeCovDs again to do so explicitly:
In[31]:=
Click for copyable input
Out[31]=
However, when we set $AutoSymmetrizeCovDs to True, all derivatives that are not fully symmetrized will automatically get symmetrized:
In[32]:=
Click for copyable input
Out[32]=
In[34]:=
Click for copyable input
Out[34]=
By setting $AutoSymmetrizeCovDs to False we return to the default behaviour:
In[35]:=
Click for copyable input
Out[35]=
In[36]:=
Click for copyable input
Out[36]=
Because symmetrizing derivatives is a an operation the grows exponentially in complexity (see the section on performance below), the function SymmetrizeCovDs stores its results in the variable $SymCovDCache. The cache can be cleared with ClearSymCovDCache:
In[43]:=
Click for copyable input
In[44]:=
Click for copyable input
Out[44]=
By default, symmetrizations are cached:
In[45]:=
Click for copyable input
Out[45]=
Subsequent symmetrizations are then faster:
In[46]:=
Click for copyable input
Out[46]=
But the first symmetrization was slower compared to when no cache is used:
In[47]:=
Click for copyable input
Out[47]=
This is because SymmetrizeCovDs spents some time on store the results in the cache. If you plan on symmetrizing a lot of derivatives, using the cache is faster in the long run.

Performance

The implementation of symmetrizing derivatives is exponentially complex in the number of derivatives. SymmetrizeCovDs has an optimized algorithm in the case when there is no torsion. Whenever there is torsion, a slower algorithm is used.

Properties of symmetrized derivatives

Below we list some useful formulas and properties of symmetrized derivatives.

Symmetrizing

Because it is always possible to commute adjacent covariant derivatives by introducing curvature, a symmetrized covariant derivative can in general be expressed as
	∇_((a_1...a_n))= ∇_(a_1)...∇_(a_n) + polynomial in curvature ,
or vice-versa:
	∇_(a_1)... ∇_(a_n)=  ∇_((a_1...a_n))- polynomial in curvature .
For example, we have
	∇_((ab))T^c= 1/2(∇_a∇_b+∇_b∇_a)T^c		= 1/2(∇_a∇_b+∇_a∇_b+∇_b∇_a-∇_a∇_b)T^c		= (∇_a∇_b+1/2[∇_b,∇_a])T^c		= ∇_a∇_b T^c+1/2R_(abd)^cT^d+1/2T_(ab)^d∇_dT^c .
By induction, we can deduce that a single ordinary derivative can be eaten by a symmetrized derivative as follows:
	∇_((a_1...a_n))∇_b = ∇_((a_1...a_nb))-D[i/(n+1)]_(a_1...a_n, b) ,
	∇_b ∇_((a_1...a_n)) = ∇_((a_1...a_nb))-D[(n+1-i)/(n+1)]_(a_1...a_n, b) ,
where the general commutator operator D is given by
	D[f(i)]_(a_1...a_n, b) = D[f(i)]_((a_1...a_n), b) = ∑_( i=1)^nf(i)∇_((a_1...a_(i-1)))[∇_b,∇_(a_i)] ∇_((a_(i+1)...a_n)) .
Although it is not explicitly indicated after the last equal sign because it would clutter notation, all a indices above are symmetrized over.
This expression for the operator D is not completely closed; it contains terms of the form
	∇_((a_1...a_n))∇_((b_1...b_m))
for which it is quite hard to find a closed formula relating it to ∇_((a_1...a_nb_1...b_m)).
However, whenever there is no torsion, the operator D can be expressed as
	D[f(i)]_(a_1...a_n, b) = ∑_(i=0)^(n-1) g_1(i,n) (∇_(a_1... a_i)[∇_b,∇_(a_(i+1))])∇_(a_(i+2)... a_n)		+ ∑_(i=0)^(n-2)g_2(i,n)(∇_(a_1... a_i)R_(ba_(i+1)a_(i+2))^c)∇_(a_(i+3)... a_nc)	-∑_(i=0)^(n-3)(∇_(a_1... a_i)R_(ba_(i+1)a_(i+2))^c)D[g_3(n,i)(j)]_(a_(i+3)...a_n, c)
where the commutator in the first term does not act on the indices of ∇_(a_(i+2)... a_n), but only on the expression on which the operator D acts. Furthermore, the numeric functions g_i are given by
	g_1(i,n)        = ∑_(k=i+1)^(n-1)f(k)(k-1 ; i )	g_2(i,n)        = ∑_(k=i+1)^(n-1)f(k)(n-k)(k-1 ; i )	g_3(n,i)(j) = ∑_(k=i+1)^(n-1)f(k)(n-k)(k-1 ; i )(j/(n-i-1)-Theta(i+j-k)(i+j-k+1)/(n-k))
where is the UnitStep function. Note that g_3(n,i)(j) with fixed n and i gets fed into the next iteration of D as its f (j).
A similar closed form for D with torsion should also exist.

Other properties

Distributivity is inherited from the ordinary derivative:
	∇_((a_1...a_n))(X+Y)= ∇_((a_1...a_n))X+∇_((a_1...a_n))Y .
If the ordinary derivative is Leibnitz, we have the following generalization of the Leibnitz rule:
	∇_((a_1...a_n))(X Y)= ∑_( i=0)^n(n ; i )∇_((a_1...a_i)X  ∇_(a_(i+1)...a_n))Y .
Counting the number of terms on boths sides provides a consistency check on this formula; the left-hand-side has 2^n terms, the same as the right-hand-side, namely .
Integrating by parts also carries over nicely:
	Y ∇_((a_1...a_n))X=(-1)^n  X ∇_((a_1...a_n))Y+ total derivative.
Lastly, the action of a Leibnitz symmetrized derivative on a generic function F with arguments X^mu (mu = 1,…,p) can be written as the following generalized chain rule:
	∇_((a_1⋯ a_n))F(X^mu) = ∑_(i=1)^n(∂_(mu_1)⋯∂_(mu_i)F(X^mu)∑_({k_1,…, k_i}∈P(n,i))M(k_1,…,k_i) ∇_((a_1...a_(k_1))X^(mu_1)⋯∇_(a_(n-k_i+1)⋯ a_n))X^(mu_i)).
This a generalization of Faà di Bruno's formula. Here , and the contracted mu indices are summed over. Furthermore, P(n,i) denotes the set of partitions of n into i integers {k_1,…,k_i} (to be precise, P(n,i) is the result of IntegerPartitions[n, {i}]). And M is the weighted multinomial
	M(k_1,…,k_i)  = (Multinomial(k_1,…,k_i))/(∏_(p=1)^nCount({k_1,…,k_i},p)!).
Note that
	∑_({k_1,…, k_i}∈P(n,i))M(k_1,…,k_i)  = S_n^((i)),            ∑_(i=1)^n S_n^((i))= B_n,
The generalized Leibnitz rule above is actually a limiting case of the generalized chain rule, as we can see by setting F(X^mu) = X Y:
	∇_((a_1⋯ a_n))(X Y) = ∂_X(X Y)M(n) ∇_((a_1...a_n))X  + ∂_Y(X Y)M(n) ∇_((a_1...a_n))Y 			 + ∂_X∂_Y(X Y)∑_(k=1)^(⌊n/2⌋)M(k,n-k)∇_((a_1...a_k)X∇_(a_(k+1)⋯ a_n))Y			 + ∂_Y∂_X(X Y)∑_(k=1)^(⌊n/2⌋)M(k,n-k)∇_((a_1...a_k)Y∇_(a_(k+1)⋯ a_n))X			=  Y ∇_((a_1...a_n))X  + X ∇_((a_1...a_n))Y + ∑_(k=1)^(n-1)(n ; k )∇_((a_1...a_k)X∇_(a_(k+1)⋯ a_n))Y			=∑_(k=0)^n(n ; k )∇_((a_1...a_k)X∇_(a_(k+1)⋯ a_n))Y .
Similarly, the distributivity property can also be derived from the generalized chain rule by setting F(X^mu) = X+Y.