Coarse graining procedure for the Ising model

To make things clearer, let us see how the coarse graining procedure works for the Ising model. If we call $S_i$ the local magnetization at the $i$-th site and $d$ the dimensionality of the system, every "block" will have volume $\ell^d$; for every block of the system centered in $\vec{r}$ we define the coarse-grained magnetization as:

$$m_\ell(\vec{r}) = \frac{1}{N_\ell} \sum_{i \in \vec{r}} S_i$$

where $N_\ell$ is the number of spins (degrees of freedom in general) which belong to the block centered in $\vec{r}$; this definition is reasonable as long as $N_\ell$ is large. Since it has been built as an average, $m_\ell(\vec{r})$ does not fluctuate much on microscopic scales but varies smoothly in space. Of course, in general we need to specify $\ell$ in order to determine $m_\ell(\vec{r})$, but the coarse graining procedure we are applying will be useful only if the final results are independent of $\ell$ (at least on the spatial scales considered).
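
As a concrete illustration, here is a minimal numerical sketch of this block-averaging step, assuming a two-dimensional square lattice of $\pm 1$ spins and a hypothetical block side b (all names and values are illustrative):

```python
import numpy as np

def coarse_grain(spins: np.ndarray, b: int) -> np.ndarray:
    """Block-average a 2D lattice of +-1 spins over b x b blocks.

    Each entry of the returned array is the coarse-grained magnetization
    m_l(r) = (1/N_l) * (sum of spins in the block centered at r),
    with N_l = b**2 degrees of freedom per block.
    """
    L = spins.shape[0]
    assert L % b == 0, "lattice side must be a multiple of the block size"
    n = L // b
    # reshape into (n, b, n, b) and average over the two block axes
    return spins.reshape(n, b, n, b).mean(axis=(1, 3))

# usage: a random high-temperature configuration on a 64x64 lattice
rng = np.random.default_rng(0)
spins = rng.choice([-1, 1], size=(64, 64))
m = coarse_grain(spins, b=8)        # 8x8 array of block magnetizations
print(m.shape, m.min(), m.max())    # values fluctuate weakly around 0
```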


We now must express the partition function in terms of $m_\ell(\vec{r})$, and as we have stated before:

$$Z = \int \mathcal{D} m_\ell(\vec{r})\; e^{-\beta \mathcal{H}_{\text{eff}}[m_\ell(\vec{r})]}$$

so we must now compute $\mathcal{H}_{\text{eff}}[m_\ell(\vec{r})]$. Since we now have a system made up of "blocks", this effective Hamiltonian will be composed of two parts: a bulk component relative to the single blocks and an interface component relative to the interaction between the blocks; let us consider them individually.

Bulk component
Suppose that every block of volume $\ell^d$ is separated from the rest of the system; inside every one of them the magnetization is uniform (since the linear dimension $\ell$ of the blocks is much smaller than the correlation length), so we can use Landau theory for uniform systems. In the case of the Ising model, this led to:

The total bulk energy is thus obtained summing over all the blocks:
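Explicitly, a sketch with assumed coefficient conventions (the symbols $\bar{a}$, $\bar{b}$, the reduced temperature $t$ and the numerical factors $1/2$, $1/4$ are notational assumptions, chosen to be consistent with the remark on the numeric factors made below): the Landau expansion for a single uniform block of volume $\ell^d$, summed over all the blocks, gives

$$\beta\mathcal{H}_{\text{bulk}} = \sum_{\vec{r}} \ell^d \left( \frac{\bar{a}}{2}\, t\, m_\ell^2(\vec{r}) + \frac{\bar{b}}{4}\, m_\ell^4(\vec{r}) \right)$$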


Interaction component
We now must take into account the fact that adjacent blocks do interact. In particular, since as we have stated $m_\ell(\vec{r})$ does not vary much on microscopic scales, the interaction between the blocks must be such that strong variations of the magnetization between neighbouring blocks are energetically unfavourable. If we call $\vec{\delta}$ a vector of magnitude $\ell$ that points from one block to a neighbouring one, the simplest analytic expression that we can guess for such a term is a harmonic one[1]:

(the numeric factor multiplying $k$, just like the numeric factors multiplying $\bar{a}$ and $\bar{b}$, has been inserted for future convenience). We can also think of this as a first approximation of a general interaction between the blocks, namely as the first terms of a Taylor expansion of the real interaction energy.
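
A sketch of this harmonic term and of how it reduces to a gradient term for a slowly varying field (the name $k$ for the coupling constant is a notational assumption, consistent with the factor mentioned above):

$$\beta\mathcal{H}_{\text{int}} = \frac{k}{2} \sum_{\vec{r}} \sum_{\vec{\delta}} \left[ m_\ell(\vec{r}+\vec{\delta}) - m_\ell(\vec{r}) \right]^2 \simeq \frac{k}{2} \sum_{\vec{r}} \sum_{\vec{\delta}} \left[ \vec{\delta}\cdot\vec{\nabla} m_\ell(\vec{r}) \right]^2 \propto \frac{k}{2}\,\ell^2 \sum_{\vec{r}} \left|\vec{\nabla} m_\ell(\vec{r})\right|^2$$

where $m_\ell(\vec{r}+\vec{\delta}) - m_\ell(\vec{r}) \simeq \vec{\delta}\cdot\vec{\nabla} m_\ell(\vec{r})$ for a slowly varying field, and the pure number counting the neighbouring directions can be absorbed into $k$.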

Now, since the linear dimension $\ell$ of the blocks is much smaller than the characteristic length of the system, we can treat $\vec{r}$ as a continuous variable and thus substitute the sum over $\vec{r}$ with an integral:

(while the sum over $\vec{\delta}$ remains a sum, since for every $\vec{r}$ there is only a finite number of nearest neighbours). Therefore:

Keeping in mind that $m_\ell(\vec{r}+\vec{\delta}) - m_\ell(\vec{r}) \simeq \vec{\delta}\cdot\vec{\nabla} m_\ell(\vec{r})$, the interaction term can be rewritten in terms of $\vec{\nabla} m_\ell$:

where we have called $\partial_\alpha m_\ell$ the components of $\vec{\nabla} m_\ell$. Thus, if we now define for the sake of simplicity:
we will have:
Therefore, the (functional) partition function of the system will be:
Let us now make a couple of considerations:

  • If $\vec{\nabla} m_\ell = 0$, the energy of the system has the same structure as the one used in Landau theory
  • The term proportional to $|\vec{\nabla} m_\ell|^2$ is completely new, but we could have introduced it intuitively in a Landau-like mean field functional, since the introduction of spatial variations in the order parameter has an energetic cost which must depend on how it varies in space, i.e. it depends on the gradient of $m_\ell$. In particular, it must involve $|\vec{\nabla} m_\ell|^2$ because of the symmetries of the model: since the system is isotropic and invariant under $m_\ell \to -m_\ell$, we must use combinations of derivatives that are invariant under rotations and parity, and $|\vec{\nabla} m_\ell|^2$ is the simplest of them[2].
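
Putting bulk and interaction terms together and taking the continuum limit, the resulting functional and partition function take the standard Ginzburg-Landau form (a sketch with the coefficient conventions assumed above; the subscript $\ell$ is dropped for brevity):

$$\beta\mathcal{H}_{\text{eff}}[m] = \int d^d r \left[ \frac{k}{2} \left|\vec{\nabla} m(\vec{r})\right|^2 + \frac{\bar{a}}{2}\, t\, m^2(\vec{r}) + \frac{\bar{b}}{4}\, m^4(\vec{r}) \right], \qquad Z = \int \mathcal{D}m(\vec{r})\; e^{-\beta\mathcal{H}_{\text{eff}}[m]}$$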

If there is also an external magnetic field $h(\vec{r})$, we must add to the Hamiltonian the term:

so that the partition function becomes:
which is a functional of $m(\vec{r})$ and $h(\vec{r})$. As usual, all the thermodynamics of the system can be obtained from $Z$, provided that now we take functional derivatives instead of usual derivatives.
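
Concretely, under the common convention that the external field couples linearly to the order parameter, the added term and the corresponding functional-derivative relation read (a sketch, not necessarily the exact normalization used here):

$$\beta\mathcal{H}_{\text{eff}}[m,h] = \beta\mathcal{H}_{\text{eff}}[m] - \beta\int d^d r\; h(\vec{r})\, m(\vec{r}), \qquad \langle m(\vec{r}) \rangle = \frac{1}{\beta}\,\frac{\delta \ln Z[h]}{\delta h(\vec{r})}$$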

Saddle point approximation: Landau theory

We can now compute $Z$, as a first approach, using the saddle point approximation; as we will see, this will reproduce a Landau-like mean field theory which also takes into account the presence of inhomogeneities. In particular, thanks to the new term involving $|\vec{\nabla} m|^2$ we will be able to compute the fluctuation correlation function[3] and so also to determine the critical exponents $\nu$ and $\eta$.


Therefore we approximate $Z$ with the leading term of the functional integral, i.e. we must determine the function $m_0(\vec{r})$ that maximizes the exponent, namely minimizes:

and then compute $Z$ as:
where $m_0(\vec{r})$ is determined by imposing the stationarity of the functional with respect to $m$:
This leads to the state equation of the system:
where we have defined for brevity. If we now call $f$ the integrand of the functional, since[4]:
we have:
Note that if $h$ and $m_0$ are uniform (i.e. the system is uniform) we get the same state equation that we obtained with Landau theory:
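
For reference, with the functional sketched above, the stationarity condition is the Euler-Lagrange equation of the integrand, which gives (again under the assumed coefficient conventions):

$$-k\,\nabla^2 m_0(\vec{r}) + \bar{a}\, t\, m_0(\vec{r}) + \bar{b}\, m_0^3(\vec{r}) = h(\vec{r}), \qquad \text{uniform case:}\quad \bar{a}\, t\, m_0 + \bar{b}\, m_0^3 = h$$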

Correlation function in the saddle point approximation

We can now proceed to compute the correlation function within our approximations. In order to do that, we take the (functional) derivative of the state equation with respect to $h(\vec{r}\,')$, so that $\frac{\delta m_0(\vec{r})}{\delta h(\vec{r}\,')}$ appears:

Now, from the fluctuation-dissipation theorem we know that:
so that:
Note that this means that the correlation function can be interpreted as the Green's function of the operator written between the square brackets.
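
As a numerical illustration of this Green's function interpretation, the following sketch assumes the operator has the form $-\nabla^2 + \xi^{-2}$ (up to an overall constant) and solves for $G$ on a periodic grid with FFTs, checking the expected exponential decay; all parameter values are illustrative:

```python
import numpy as np

# Hypothetical parameters: grid size, lattice spacing, correlation length xi
N, dx, xi = 64, 0.5, 3.0

# In Fourier space the operator (-laplacian + 1/xi^2) is diagonal: k^2 + 1/xi^2,
# so its Green's function is simply G(k) = 1/(k^2 + 1/xi^2).
k = 2 * np.pi * np.fft.fftfreq(N, d=dx)
kx, ky, kz = np.meshgrid(k, k, k, indexing="ij")
k2 = kx**2 + ky**2 + kz**2

G_k = 1.0 / (k2 + 1.0 / xi**2)

# Back to real space: G(r) is the response to a delta source placed at the origin
G_r = np.fft.ifftn(G_k).real / dx**3

# Check the decay along one axis: log(r * G(r)) should fall off linearly as -r/xi
r = np.arange(1, N // 4) * dx
decay = np.log(G_r[1:N // 4, 0, 0] * r)
slope = np.polyfit(r, decay, 1)[0]
print(f"fitted decay rate: {-slope:.2f}  (expected 1/xi = {1/xi:.2f})")
```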


In the case of translationally invariant (i.e. uniform) systems, $m_0$ is constant and equal to the equilibrium value given by Landau theory for the Ising model; in particular, depending on the sign of $t$ there are two possible situations:

If $t > 0$: in this case $m_0 = 0$, so the last equation becomes:

Defining:
this can be rewritten as:


If $t < 0$: in this case the magnetization is:

so the differential equation for the correlation function becomes:
This can be rewritten in a form similar to the previous case; in fact, if we define:
we get:

We will shortly see that these two quantities are indeed the expressions of the correlation length for $t > 0$ and $t < 0$, respectively. We can therefore see that in both cases we get:


Thus, for both the cases $t > 0$ and $t < 0$ the correlation function can be obtained by solving the differential equation:

which can be done with Fourier transforms. If we use the following conventions for the Fourier transform:

then transforming both sides of the equation we get:

where $k = |\vec{k}|$[5]. From this last equation we can also foresee that when $T = T_c$, since $\xi \to \infty$ we have $\widetilde{G}(\vec{k}) \sim 1/k^2$, from which we have that the critical exponent $\eta$ is null (we will see that explicitly once we have computed $G(\vec{r})$). Therefore, renaming $\vec{r} - \vec{r}\,'$ simply $\vec{r}$, we can now determine $G(\vec{r})$ with the Fourier antitransform:
This integral is a bit tedious to compute, and in general its result depends strongly on the dimensionality of the system; the general approach used to solve it is to shift to spherical coordinates in $\vec{k}$-space and then use complex integration for the remaining part, which involves $k = |\vec{k}|$. In order to do some explicit computations, let us consider the case $d = 3$; we will then have:
Therefore:
This last integral can be computed, using the residue theorem, extending it to the complex plane:
Now, the integrand exhibits two poles at $k = \pm i/\xi$; we choose as the contour of integration the upper semicircle of the complex plane, which contains only the pole at $k = +i/\xi$, and so using the residue theorem we will have:
Therefore, in the end we have:
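A sketch of this computation, writing the antitransform with an overall constant $A$ that absorbs the normalization of the operator (an assumption of notation):

$$G(\vec{r}) = A \int \frac{d^3 k}{(2\pi)^3}\, \frac{e^{i \vec{k}\cdot\vec{r}}}{k^2 + \xi^{-2}} = \frac{A}{2\pi^2 r} \int_0^{\infty} dk\, \frac{k \sin(k r)}{k^2 + \xi^{-2}} = \frac{A}{4\pi^2 r}\, \mathrm{Im} \int_{-\infty}^{+\infty} dk\, \frac{k\, e^{i k r}}{k^2 + \xi^{-2}}$$

Closing the contour in the upper half-plane picks up only the pole at $k = +i/\xi$, whose residue is $e^{-r/\xi}/2$, so the last integral equals $2\pi i \cdot e^{-r/\xi}/2$ and

$$G(\vec{r}) = \frac{A}{4\pi^2 r}\,\pi\, e^{-r/\xi} = \frac{A}{4\pi}\,\frac{e^{-r/\xi}}{r}$$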

We see now clearly that the correlation function has indeed an exponential behaviour (as we have stated also in Long range correlations) and that $\xi$ is really the correlation length; furthermore, at $T_c$ (where $\xi \to \infty$) we have $G(\vec{r}) \sim 1/r$, and from the definition of the exponent $\eta$ we have $G(\vec{r}) \sim 1/r^{d-2+\eta}$, so since $d = 3$ we indeed have $\eta = 0$.


Therefore, we have seen that for the Ising model $\eta = 0$. If we also consider the values of the other critical exponents, we see that the upper critical dimension for this model is $d_c = 4$. In other words, mean field theories are actually good approximations for the Ising model if $d \geq 4$. We will later see some other confirmations of this fact.

Gaussian approximation

Until now, even if we have introduced the Ginzburg-Landau theory, we are still neglecting the effects of fluctuations, since we are regarding the mean field approximation for non-homogeneous systems as a saddle point approximation of a more general theory; in other words, since we are approximating $Z$ with the value of the integrand at the stationary point, we are still regarding the magnetization as non-fluctuating over the system. In order to include the fluctuations we must do more and go beyond the simple saddle point approximation. The simplest way we can include fluctuations in our description is to expand $Z$, expressed as a functional integral, around the stationary solution and keep only quadratic terms; this means that we are considering fluctuations that follow a normal distribution around the stationary value. The important thing to note, however, is that in this approximation these fluctuations are independent, i.e. they do not interact with each other[6]. As we will see, with this assumption the values of some critical exponents will differ from the "usual" ones predicted by mean field theories.


Let us apply this approximation, going from easier cases to more complex ones (and finally to the one we are interested in).

Gaussian approximation for one degree of freedom

Let us consider a system with a single degree of freedom $x$, and call $\mathcal{H}(x)$ its Hamiltonian. Supposing that $x_0$ is a minimum for $\mathcal{H}$, i.e. $\mathcal{H}'(x_0) = 0$, expanding $\mathcal{H}$ around $x_0$ we get:

where $\delta x \equiv x - x_0$ is the fluctuation of $x$ around its stationary value $x_0$. The Boltzmann factor needed to compute the partition function will therefore be:
where for the sake of simplicity we have defined:
With this approximation, the partition function of the system is:
The last one is a Gaussian integral, which readily gives:
Therefore:
and since $F = -k_B T \ln Z$, the free energy of the system is:
We thus see that the introduction of the fluctuations in the degree of freedom has led to the appearance of an entropic term in the free energy.
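
Explicitly, a sketch of the Gaussian integral and of the resulting free energy (with the sign conventions assumed here):

$$Z \simeq e^{-\beta\mathcal{H}(x_0)} \int_{-\infty}^{+\infty} d(\delta x)\; e^{-\frac{\beta}{2}\mathcal{H}''(x_0)\,\delta x^2} = e^{-\beta\mathcal{H}(x_0)}\sqrt{\frac{2\pi}{\beta\,\mathcal{H}''(x_0)}}$$

$$F = -k_B T \ln Z = \mathcal{H}(x_0) + \frac{k_B T}{2}\,\ln\frac{\beta\,\mathcal{H}''(x_0)}{2\pi}$$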

Gaussian approximation for N degrees of freedom

This is a simple generalization of the previous case; the Hamiltonian will now be a function of the $N$-component vector $\vec{x} = (x_1, \dots, x_N)$, and calling $\vec{x}_0$ the minimum of $\mathcal{H}$, expanding around it we get:

Now, the Hessian matrix:
is of course symmetric (thanks to Schwarz's theorem), so it can be diagonalized. Calling $\lambda_i$ its eigenvalues, we can write:
where $\delta \tilde{x}_i$ is the component of the fluctuation along the eigenvector of the Hessian relative to the eigenvalue $\lambda_i$. We therefore have:
The last integrals are all independent and of the same form as in the previous case. Thus, in the end we have:
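
As a sanity check of this result, a small numerical sketch (for a hypothetical two-variable Hamiltonian; all names and values are illustrative) compares the Gaussian-approximation partition function, built from the Hessian eigenvalues at the minimum, with a brute-force numerical integration:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.integrate import dblquad

beta = 1.0

def H(x):
    # hypothetical two-variable Hamiltonian with a single minimum
    x1, x2 = x
    return (x1 - 1.0)**2 + 2.0 * x2**2 + 0.5 * x1 * x2 + 0.1 * x2**4

# find the minimum x0 and the Hessian eigenvalues there (finite differences)
res = minimize(H, x0=[0.0, 0.0])
x0 = res.x
eps = 1e-4
hess = np.zeros((2, 2))
for i in range(2):
    for j in range(2):
        e_i, e_j = np.eye(2)[i] * eps, np.eye(2)[j] * eps
        hess[i, j] = (H(x0 + e_i + e_j) - H(x0 + e_i - e_j)
                      - H(x0 - e_i + e_j) + H(x0 - e_i - e_j)) / (4 * eps**2)
lam = np.linalg.eigvalsh(hess)

# Gaussian approximation: Z ~ exp(-beta*H(x0)) * prod_i sqrt(2*pi/(beta*lambda_i))
Z_gauss = np.exp(-beta * H(x0)) * np.prod(np.sqrt(2 * np.pi / (beta * lam)))

# brute-force integration of exp(-beta*H) for comparison
Z_exact, _ = dblquad(lambda x2, x1: np.exp(-beta * H((x1, x2))),
                     -10, 10, -10, 10)

print(f"Gaussian approx: {Z_gauss:.4f}   numerical: {Z_exact:.4f}")
```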

Gaussian approximation for infinite degrees of freedom

Let us now move to the really interesting case, i.e. the case of infinite degrees of freedom. In general terms (we shall shortly see this explicitly for the Ising model) we want to compute a partition function of the form:

and the Gaussian approximation will be obtained, in analogy with the previous cases, by determining the extremal (uniform) solution $m_0$ and then expanding around it to second order in the fluctuation $\delta m(\vec{r}) = m(\vec{r}) - m_0$. Thus, we will in general obtain a Hamiltonian of the form:
where:
Therefore:

Gaussian approximation for the Ising model in Ginzburg-Landau theory

Let us now apply what we have just stated to a concrete case, i.e. the Ginzburg-Landau theory for the Ising model we were considering. In this case:

and we have seen that the stationary solution is such that:
Let us now also include the fluctuations of $m$, substituting $m(\vec{r}) = m_0 + \delta m(\vec{r})$. To second order in $\delta m$ we have:

and so setting :

Since , defining for simplicity and calling the volume of the system we get:
In order to compute this integral it is more convenient to shift to Fourier space.


Let us make some remarks on what happens when we apply Fourier transformations in this case. If our system is enclosed in a cubic box of volume $V = L^d$, we can define the Fourier components of the magnetization as:

where $\vec{k} = \frac{2\pi}{L}\vec{n}$ and $\vec{n}$ is a vector whose components are integer numbers. We can therefore expand the magnetization in a Fourier series:
Substituting this expression of $m(\vec{r})$ into the definition of the Fourier components, we obtain an integral representation for the Kronecker delta; in fact:
and this is true only if:
Let us now make two observations. First: since $m(\vec{r})$ is real, we have that $m_{-\vec{k}} = m_{\vec{k}}^*$. Second: our coarse graining procedure is based on the construction of blocks whose linear dimension cannot be smaller than the characteristic microscopic length of the system; this means that not all the $\vec{k}$ are allowed, and in particular we must have $|\vec{k}| \leq \Lambda$, with the cutoff $\Lambda$ of the order of the inverse of this microscopic length. Now, thinking about the functional integral form of the partition function, what does the trace become in Fourier space? Since $m(\vec{r})$ is expressed in terms of the Fourier modes $m_{\vec{k}}$, which are in general complex, the measure of the integral becomes:
However, since $m(\vec{r})$ is real (i.e. $m_{-\vec{k}} = m_{\vec{k}}^*$), the real and imaginary parts of the Fourier modes are not independent, because we have:
This means that if we used the trace we have written above we would integrate twice over the complex plane; we must therefore change the measure so as to avoid this double counting. We can for example simply divide everything by 2, or restrict the integration to the region where, for example, the last coordinate of $\vec{k}$, let us call it $k_z$, is positive. Therefore:
where the last step defines the symbolic notation . In the end, we have:
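
The conjugate-symmetry constraint behind this double counting is easy to verify numerically; a minimal sketch with NumPy's FFT (any real field on a periodic grid will do):

```python
import numpy as np

rng = np.random.default_rng(1)
m = rng.normal(size=(16, 16))          # a real field m(r) on a periodic grid

m_k = np.fft.fftn(m)                   # complex Fourier modes m_k

# For a real field, m_{-k} must equal the complex conjugate of m_k.
# On the discrete grid, -k corresponds to reversing (and wrapping) the indices.
idx = (-np.arange(16)) % 16
m_minus_k = m_k[np.ix_(idx, idx)]
print(np.allclose(m_minus_k, np.conj(m_k)))   # True: only half the modes are independent
```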


Let us now compute the partition function of the system in the simpler case $T > T_c$[7], so that in the end we can determine the free energy of the system. In this case $m_0 = 0$, so the fluctuation coincides with the field itself, and therefore substituting:

into:

we get (renaming $\vec{q}$ the coordinate in Fourier space, so as not to confuse it with the constant $k$):
Therefore, substituting into the expression of the partition function, the exponentials factorize, and in the end:
Since $m_{-\vec{q}} = m_{\vec{q}}^*$, changing variables to:
the integration over these variables gives:
Thus:
We therefore have that the free energy of the system is:


We can now compute the specific heat of the system, and so determine its critical exponent $\alpha$. We therefore want to compute:

The derivatives are straightforward, and in the end we get:
Let us now consider the two terms separately and study their behaviour for $t \to 0^+$ (we are in fact considering $T > T_c$). Neglecting the proportionality constants that we don't need, we can rewrite the first contribution as:
where we have substituted the sum with an integral[8], since the density of states in Fourier space is high (it is proportional to $V$, see the footnote). Now, using the definition of $\xi$ that we have previously seen, we have:
In order to understand the divergent behaviour of the integral for $t \to 0^+$, we use a "scaling trick"; we change variable by defining:
so that the integral becomes:
and for $t \to 0^+$ we know that $\xi \to \infty$, so the integral is computed for all values of $x$. Now, this integral must be computed shifting to spherical coordinates, so we will have $d^d x \propto x^{d-1}\, dx$ (apart from numerical factors that involve all the angles, which are integrated trivially since the integrand only depends on $x = |\vec{x}|$). Therefore the integrand of the rescaled integral is $x^{d-1}/(1+x^2)^2$, and its behaviour for large or small $x$ is:
where of course in the first case we have $x \ll 1$; for large $x$, since in general:
then the integral will converge if $d - 5 < -1$, i.e. $d < 4$. To sum up, the integral that appears in the first contribution converges for $d < 4$ (since in this case the integrand does not diverge in the domain of integration); of course, when this integral converges its result is simply a number. We therefore have that:
and since $\xi \to \infty$ for $t \to 0^+$, we see that this term brings a diverging contribution[9] for $d < 4$. We can also wonder what happens for $d > 4$. From what we have stated about the rescaled form of the integral we could think that it diverges, but we must also take into account the prefactor $\xi^{4-d}$, which tends to zero for $d > 4$ as the transition is approached (since $\xi \to \infty$). The net result is finite, as can also be argued from the original (unscaled) form of the contribution; in fact if $t \to 0$ then (in spherical coordinates) the integrand is proportional to $q^{d-5}$, and since:
then the integral converges if $d - 5 > -1$, i.e. $d > 4$. To sum up, we can say that the first contribution to the specific heat behaves as:
where we have also used the definition of the exponent $\nu$, i.e. $\xi \sim t^{-\nu}$. Note that this also means that in the Gaussian approximation this term brings no corrections to the exponent $\alpha$ above four dimensions.


Let us now consider the second contribution to the specific heat. In particular, as we have done before, we rewrite it substituting the sum with an integral and also using the definition of $\xi$, so that:

Again, we are interested in its behaviour for $t \to 0^+$, and as we have already noted it is only the behaviour of the integrand for $q \to 0$ that can cause any divergence. As previously done, changing variable to $x = \xi q$:
and using spherical coordinates:
The integrand behaves as:
where again in the first case $x \ll 1$. Therefore, the integral in the second contribution (not the contribution itself) in the limit $t \to 0^+$ converges if $d - 3 < -1$, i.e. $d < 2$; this means that for $d < 2$ and $t \to 0^+$ it behaves as:
For the case $d > 2$ it is more convenient to consider the unscaled form of the contribution. In this case, in the limit $t \to 0^+$ we have (again in spherical coordinates):
and the integrand converges for $q \to 0$ if $d > 2$. Therefore for $d < 2$ the second contribution to the specific heat diverges, but in the same range of $d$ the divergence of the first contribution is more relevant; on the other hand, for $2 < d < 4$ only the first contribution diverges. It is therefore the term containing the first integral that determines the divergence of the specific heat, and in particular for $d < 4$ we see that in the Gaussian approximation the inclusion of the fluctuations has changed the value of the critical exponent to[10]:
In order to compute it, however, we still must determine $\nu$, so we now proceed to compute the two-point correlation function in order to determine both $\nu$ and $\eta$.
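
For reference, combining the scaling of the first contribution, $C \sim \xi^{4-d}$, with $\xi \sim t^{-\nu}$, the Gaussian-approximation exponent can be written as (a sketch consistent with the analysis above):

$$\alpha = (4-d)\,\nu \;\xrightarrow{\ \nu = 1/2\ }\; \alpha = 2 - \frac{d}{2} \qquad (d < 4)$$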

Two-point correlation function in the Gaussian approximation

We know that the (simple) correlation function is defined as:

so we first have to determine:
Shifting to Fourier space, we have:
where:
It is clear that in $Z$ all the integrals factorize, since the Fourier modes are all independent (they are decoupled); therefore, all the integrals in the numerator that do not involve $m_{\vec{k}}$ or $m_{\vec{k}\,'}$ simplify with the same integrals in the denominator, so that in the end we are left with:
(where the factor $2$ in the denominator of the exponent has disappeared because we must remember that every independent mode appears twice in the original sum, since $m_{-\vec{k}} = m_{\vec{k}}^*$). There are now two possible cases:

In this case (which can be re-expressed as $\vec{k}\,' \neq \pm\vec{k}$) the two coefficients $m_{\vec{k}}$ and $m_{\vec{k}\,'}$ are distinct, and in the numerator the double integral factorizes into two integrals of the form:

since the integrand is odd. We therefore have:

In this case (equivalent to $\vec{k}\,' = \pm\vec{k}$) we can either have $\vec{k}\,' = \vec{k}$, so that the integral involves $m_{\vec{k}}^2$, or $\vec{k}\,' = -\vec{k}$, so that it involves $|m_{\vec{k}}|^2$.

Let us first consider the case $\vec{k}\,' = \vec{k}$. Using polar coordinates, we define $m_{\vec{k}} = \rho\, e^{i\theta}$ so that the measure in the complex plane becomes:

Thus:
We are therefore left with the last case $\vec{k}\,' = -\vec{k}$: now, shifting to polar coordinates, the integrals in both the numerator and denominator involving the angle $\theta$ factorize and simplify, so in the end, renaming the integration variable for simplicity:
Noting that $|m_{\vec{k}}|^2 = \rho^2$, if we change variable to $u = \rho^2$ and integrate we get[11]:

Therefore, since the correlation function in Fourier space is non-null only when $\vec{k}\,' = -\vec{k}$, in general we can write:

Going back to real space we have:
We see that, appropriately substituting the sum with an integral (see footnote [8]) and defining:
this correlation function acquires the same form as the one computed in mean field theory. This means that the critical exponents $\nu$ and $\eta$ have the same values predicted by mean field theory, namely:

$$\nu = \frac{1}{2}, \qquad \eta = 0$$

Interaction between fluctuations: expansion to the fourth order

We have therefore seen that mean field theories can be improved by including the fluctuations of the order parameter around its extremal values; in particular, with the Gaussian approximation we have stopped the expansion at the second order, and this led to a change in the critical exponent of the specific heat, which now really diverges instead of exhibiting a jump discontinuity as simple mean field theories predict. However, the quartic term that we ignore within the Gaussian approximation (and which basically represents the interactions between the fluctuations) becomes crucial when we approach a critical point. We could thus wonder if the Gaussian approximation can be improved. In particular, reconsidering the expression of the effective Hamiltonian with $h = 0$:

then keeping all the terms in $\delta m$ when expanding around $m_0$, remembering that $m = m_0 + \delta m$, and considering that the odd terms in $\delta m$ give no contribution (since we integrate an odd function over an even domain), we have:
A natural approach to compute the partition function would now consist in using a perturbative method, expanding the quartic term in powers of its coefficient. This is of course a reasonable approach if this parameter is small; however, with a simple dimensional analysis we can show (and this is what we are going to do in the following) that for $d < 4$ and $t \to 0$ the relevant dimensionless parameter diverges. The approach of the Gaussian approximation is therefore inconsistent, at least from this point of view.

Dimensional analysis of Landau theory

We know that the partition function of the system is:

where, in the case :
It is now convenient to rescale the order parameter so that the term proportional to $|\vec{\nabla} m|^2$ has only a numerical coefficient. This can be done by defining:
so that:
where we have defined the dimensionless[12] effective Hamiltonian. Now, since it is dimensionless, all the three integrals that appear in it must be so; this means that the rescaled field and the coefficients of the quadratic and quartic terms must have precise dimensions. In fact, from the first contribution we have that:
and similarly:
from which we have that:
We can therefore use the coefficient of the quadratic term to define a length scale independent of the dimension of the system. Since by definition $\xi \sim t^{-\nu}$ and since $\nu = 1/2$ within mean field theories and the Gaussian approximation, we see that this choice is equivalent to measuring lengths in units of the correlation length (which we know is independent of the dimension of the system). We now rescale all the variables by setting:
where . This way, the partition function can be written in the form:
where:
and the remaining factor, which depends on the quartic term in the rescaled field, is the contribution due to the interaction between the fluctuations. It is the presence of this term that prevents us from computing $Z$ exactly and forces us to resort to approximations.
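
To make the dimensional argument concrete, a sketch with the conventions assumed above (rescaled field, lengths measured in units of $\xi \sim t^{-1/2}$, and $u$ as an assumed name for the coupling) gives a dimensionless quartic coupling of the form

$$u \;\sim\; \bar{b}\;\xi^{\,4-d} \;\sim\; \bar{b}\;t^{\,(d-4)/2},$$

which vanishes as $t \to 0$ for $d > 4$ and diverges for $d < 4$: this is the parameter controlling the perturbative expansion discussed below.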


The most standard procedure to apply in this case would be a perturbative method, namely to consider the dimensionless parameter multiplying the quartic term as small, and to expand the exponential term containing it:

Written out explicitly, we have:
We thus immediately see that if $d < 4$ this parameter diverges when $t \to 0$, making the perturbative approach infeasible. On the other hand, if $d > 4$ it indeed vanishes when $t \to 0$, so the Gaussian approximation is actually a good one. Now, we can say that the perturbative expansion is reasonable if this parameter is small, which gives:
This, using also the definition of $\xi$, can be rewritten as:
which is therefore a criterion that tells us whether the perturbative approach is valid. Note that from this analysis we have determined the upper critical dimension of the system ($d_c = 4$), so we can say that this last criterion is equivalent to the Ginzburg criterion.


A final remark. The argument we have shown is not really convincing: in fact, we have only shown that every term of the perturbation series diverges as $t \to 0$; however, this does not necessarily mean that the whole perturbation series is divergent. For example, consider the exponential series:

$$e^{-x} = \sum_{n=0}^{\infty} \frac{(-x)^n}{n!}$$

This is convergent for every $x$ (and tends to zero as $x \to \infty$), but each term of the sum diverges in the limit $x \to \infty$. We thus understand that we must be careful when handling perturbation expansions, since the appropriate resummation of divergent terms can lead to a convergent series.
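
A quick numerical illustration of this point, using exact rational arithmetic (naive floating-point summation would be spoiled by the huge cancellations between terms):

```python
from fractions import Fraction
import math

x = 30  # each term (-x)^n / n! is huge for n ~ x, yet the series sums to e^(-30)

terms, term = [], Fraction(1)        # term for n = 0
for n in range(1, 201):              # build terms n = 0 ... 200 exactly
    terms.append(term)
    term *= Fraction(-x, n)          # recursion: t_n = t_(n-1) * (-x) / n
terms.append(term)

print(f"largest |term|  ~ {float(max(abs(t) for t in terms)):.3e}")  # ~ 7.8e+11
print(f"exact sum       = {float(sum(terms)):.6e}")
print(f"math.exp(-30)   = {math.exp(-x):.6e}")                       # ~ 9.36e-14
```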

  1. Sometimes this approximation is called elastic free energy.
  2. At this point we could wonder why the interaction part of the Hamiltonian does not contain other terms, like $m \nabla^2 m$: in fact this is in principle perfectly acceptable, since it is of second order in $m$ and is invariant under rotations and parity. However, we have that:
    $m \nabla^2 m = \vec{\nabla} \cdot (m \vec{\nabla} m) - |\vec{\nabla} m|^2$
    and so when we integrate over $\vec{r}$, the first term vanishes in the thermodynamic limit, supposing that the magnetization or its gradient goes to zero sufficiently rapidly as $|\vec{r}| \to \infty$. Therefore, we are left only with $|\vec{\nabla} m|^2$: the two terms are perfectly equivalent.
  3. We will do our computations on the Ising model, as usual.
  4. This follows from the integral form of . In fact, if in general a functional is of the form:
    then from the definition of functional derivative we have:
    where $\delta m(\vec{r})$ is an arbitrary function that vanishes on the boundary of integration. Integrating by parts we get:
    and so finally:
  5. In the following, for the sake of simplicity we will indicate the magnitude of a vector simply removing the arrow sign.
  6. In solid state physics this assumption is often called the random phase approximation, while in field theory it is called the free field approximation.
  7. We could have equivalently considered the case $T < T_c$, but it is a bit more complicated since $m_0 \neq 0$, and so there is another term that contributes to the free energy. In other words, in the case we are considering there is no additional contribution coming from a non-zero $m_0$ (which would be exactly the mean field term).
  8. The substitution:
    can be justified as follows:
    Now, $\vec{k}$ is quantized since $V$ is finite and we have periodic boundary conditions, and in particular the spacing between allowed values of each component of $\vec{k}$ is $2\pi/L$. Therefore:
  9. A little remark: the origin of this divergence does not come from the behaviour of the integral at large $q$. In fact, in the original definition the integral has an upper limit, $\Lambda$, so it cannot diverge because of the large-$q$ behaviour; if it diverges, it must be because of the behaviour for $q \to 0$, which corresponds to large wavelengths (this is also why this divergence is sometimes called an infrared divergence).
  10. We stress again that the same calculations could have been done in the case $T < T_c$, but we have not done so only for the sake of simplicity.
  11. In order to compute the integral in the numerator we have used a standard "trick":
  12. Remember that $\beta$ has the dimensions of the inverse of an energy.