Combinatorial formulation of Ising model revisited

Costa, G.A.T.F.da; Maciel, A. L.

doi:10.1590/S1806-11172003000100007

Abstracts

In 1952, Kac and Ward developed a combinatorial formulation of the two-dimensional Ising model, which provides another way of obtaining Onsager's famous formula for the free energy per site in the thermodynamic limit of the model. Feynman made an important contribution to this formulation by conjecturing a crucial mathematical relation which completed the ideas of Kac and Ward. In this paper, the method of Kac, Ward and Feynman for the free field Ising model in two dimensions is reviewed in a self-contained way and Onsager's formula is computed.



G.A.T.F. da Costa; A. L. Maciel (supported by a PIBIC/CNPq - BIP/UFSC fellowship)

Departamento de Matemática. Universidade Federal de Santa Catarina 88040-900, Florianópolis, SC, Brasil

Correspondence address: G.A.T.F. da Costa. E-mail: gatcosta@mtm.ufsc.br


I Introduction

The aim of statistical physics is to understand the macroscopic behaviour of a system formed by a very large number of particles from information about how they interact with each other. One way to gain insight into this problem, and thus into complex systems, is to construct idealized models which exhibit some of the interesting features of real systems, such as phase transitions. Perhaps the most studied of these idealized models is the Ising model, named in honor of its first investigator, Ernst Ising (1900-1998).

The model was originally proposed as a simple model of ferromagnetism. In ref. [1] Ising investigated the model in one dimension and computed its partition function exactly. In 1944, Onsager [2] considered the free field model (no external magnetic field) in two dimensions and succeeded in computing the partition function exactly. His method became known as the algebraic formulation of the model. In 1952, Kac and Ward [3] developed a quite different method of obtaining Onsager's results, known as the combinatorial formulation of the Ising model. Feynman developed the method further and conjectured an identity relating functions defined on graphs and functions defined on paths on a square lattice [4, 7]. This identity is a crucial element in the combinatorial formulation of Kac, Ward and Feynman of the Ising model. The identity was later formally proved by Sherman [4-6], followed by another proof by Burgoyne [7]. A somewhat similar treatment to the combinatorial formulation of Kac, Ward and Feynman can be found in refs. [12-14]. An important variant of the combinatorial formulation using the so-called Pfaffians was developed by Green and Hurst [10].

The bibliography on the Ising model is vast and giving a full list of references is virtually impossible. A nice introduction to the model, though, is the paper by B. Cipra given in ref. [17]. Older but still useful surveys of the distinct formulations of the Ising model in two dimensions and of its history can be found in refs. [10-11, 15-16], together with full lists of original references.

The objective of the present paper is to review in a self-contained way the calculation of Onsager's formula for the two-dimensional free field Ising model in the combinatorial formulation of Kac, Ward and Feynman. Our presentation follows chapter 5, section 5.4, of Feynman's book [9] and the paper by Burgoyne [7], although we have tried to be more careful with the mathematics involved than these references are.

The paper is organized as follows. In section 2, the Ising model is defined. In section 3 and through its various subsections the combinatorial formulation of Kac, Ward and Feynman of the partition function is given. In section 4, Onsager's formula for the free energy per site in the thermodynamic limit is computed.

II Definition of the model

The model is defined on a finite planar square lattice L which mimics a regular arrangement of atoms in two dimensions. Suppose the lattice is embedded in the plane with sites having coordinates in Z × Z. To each site i of L are assigned two possible states, also called "spins" and denoted by $s_i$, where $s_i = +1$ or $s_i = -1$. The interaction energy between two particles located at the i-th and j-th sites and in the states $s_i$ and $s_j$, respectively, is postulated to be

$$E(s_i,s_j) = \begin{cases} -J\, s_i s_j & \text{if } i, j \text{ are n.n.}, \\ 0 & \text{otherwise}, \end{cases} \tag{2.1}$$

where "n.n." stands for nearest neighbors; hence, in the Ising model it is assumed that the energy depends only on short range interactions. The energy is $-J$ if the nearest neighbors are in the same state and $+J$ if the states are distinct. The constant J, which can be positive or negative, is a parameter of the model.

Suppose L has $N^2$ sites. Since each spin has 2 possible states, there are $2^{N^2}$ configurations $s = (s_1,\dots,s_{N^2})$ of the system. Call $S = \{s\}$ the set of possible configurations of the system. The energy of each configuration $s \in S$ is given by

$$E(s) = -J \sum_{\text{n.n.}} s_i s_j. \tag{2.2}$$

Suppose as well that the system is in equilibrium at temperature T. According to statistical mechanics, the probability $p_s$ of finding the system in the configuration s is

$$p_s = \frac{e^{-\beta E(s)}}{Z(\beta)}, \tag{2.3}$$

where $\beta = 1/(k_B T)$, $k_B$ is Boltzmann's constant, and

$$Z(\beta) = \sum_{s \in S} e^{-\beta E(s)} \tag{2.4}$$

is the so-called partition function of the model. This simple-looking function is easy to compute exactly only in one dimension; in two dimensions the exact computation is difficult but possible. In three dimensions no exact solution is known.

The exact knowledge of $Z(\beta)$ allows one to obtain information about the global behaviour of the system. The quantities relevant to understanding the physics of the system are all defined in terms of $\ln Z$ or its derivatives. For instance, the free energy per lattice site f in the thermodynamic limit is defined by

$$-\beta f = \lim_{N\to\infty} \frac{1}{N^2} \ln Z(\beta). \tag{2.5}$$

A basic problem is to find a closed-form, analytic expression for f. Phase transitions appear as singularities in f or in one of its derivatives.
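For very small lattices the partition function (2.4) can be evaluated by direct enumeration, which is a useful check on everything that follows. The sketch below is an illustration only, not part of the original paper: it assumes free (open) boundary conditions on an N × N lattice, and the function names `lattice_bonds` and `partition_function` are hypothetical.

```python
import itertools
import math

def lattice_bonds(n):
    """Nearest-neighbour bonds of an n x n square lattice with free boundaries."""
    bonds = []
    for x in range(n):
        for y in range(n):
            if x + 1 < n:
                bonds.append(((x, y), (x + 1, y)))
            if y + 1 < n:
                bonds.append(((x, y), (x, y + 1)))
    return bonds

def partition_function(n, beta, J=1.0):
    """Brute-force Z(beta) = sum over all 2^(n*n) spin configurations."""
    sites = [(x, y) for x in range(n) for y in range(n)]
    bonds = lattice_bonds(n)
    z = 0.0
    for spins in itertools.product((+1, -1), repeat=len(sites)):
        s = dict(zip(sites, spins))
        # Energy of the configuration: -J times the sum of s_i s_j over n.n. pairs.
        energy = -J * sum(s[a] * s[b] for a, b in bonds)
        z += math.exp(-beta * energy)
    return z
```

The cost grows as $2^{N^2}$, so this is only practical for N up to about 4; the combinatorial formulation reviewed below is what makes the thermodynamic limit tractable.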

III The combinatorial formulation

In the combinatorial formulation the partition function is expressed as a sum over special subsets of the lattice L called admissible graphs. Next, using a relation first conjectured by R. Feynman, the resulting expression is converted into a product over paths. The final step towards Onsager's formula, to be accomplished in section 4, consists in deriving an integral representation for this product.

III.1 The partition function as a sum over graphs

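Let's rewrite the partition function (2.4) as

$$Z = \sum_{s} \prod_{\text{n.n.}} e^{K s_i s_j} \tag{3.1}$$

with $K = \beta J$. Noting that $s_i s_j = \pm 1$, it follows that

$$e^{K s_i s_j} = \cosh K + s_i s_j \sinh K = \cosh K\,(1 + u\, s_i s_j) \tag{3.2}$$

and

$$Z = (\cosh K)^{\xi} \sum_{s} \prod_{\text{n.n.}} (1 + u\, s_i s_j), \tag{3.3}$$

where $u = \tanh K$ and $\xi = 2N(N-1)$ is the number of bonds in L. Notice that $|u| < 1$ for any finite K.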

Definition 3.1. An admissible graph is a connected or disconnected subset of L whose sites have even valence.

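Definition 3.2. Given an admissible graph G, define

$$I_G(u) = \prod_{i \in G} u, \tag{3.4}$$

where the product is over the bonds i of G, that is, $I_G(u)$ is u raised to the number of bonds of G.

Theorem 3.1. Call A the set of all admissible graphs G of L. Then,

$$Z = 2^{N^2} (\cosh K)^{\xi} \sum_{G \in A} I_G(u), \tag{3.5}$$

with $\xi = 2N(N-1)$ the number of bonds of L.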

Proof: To each pair i,j of nearest neighbors of L there corresponds a term $u s_i s_j$ and a bond. Since the number of pairs i,j of n.n. coincides with the number $\xi = 2N(N-1)$ of bonds of L, the product on the RHS of (3.3) is a polynomial of degree $\xi$ in u, that is,

$$\prod_{\text{n.n.}} (1 + u\, s_i s_j) = 1 + \sum_{p=1}^{\xi} u^p \sum (s_{i_1}s_{j_1})(s_{i_2}s_{j_2})\cdots(s_{i_p}s_{j_p}). \tag{3.6}$$

The second summation is over all possible products of p pairs $(s_i s_j)$ of n.n. of L, where a pair is not to occur twice in the same product. To each pair $(s_i s_j)$ there is associated a bond connecting the neighbors i and j, so to each product of p pairs there corresponds a graph (connected or disconnected). So, the second summation is over all graphs with p bonds. The graphs may have sites with valence 1, 2, 3 or 4. The summations over the spins $s_i$ eliminate graphs having sites with odd valence because $\sum_{s_i=\pm 1} s_i = 0$ and $\sum_{s_i=\pm 1} s_i^3 = 0$. The graphs left are those whose sites have valence 2 or 4, thus admissible. If $V_G$ is the number of sites in an admissible graph G then there is a factor $2^{V_G}$ associated to it, because each site of G contributes a factor 2 coming from $\sum_{s_i=\pm 1} s_i^2 = 2$ or $\sum_{s_i=\pm 1} s_i^4 = 2$. The sum over s includes all the $s_i$ and not only those with sites i in G. The summation over the sites not in G gives a factor $2^{V - V_G}$, where $V = N^2$ is the number of sites in the lattice; hence, in the end one gets the factor $2^V$.

III.2 The partition function as a product over paths

Let's orient and number the bonds of L with distinct positive integers i and call L with this indexation a colored lattice.

Definition 3.3. A path p over L is an ordered sequence of bonds, each starting at the site where the previous one ended, with the last bond ending at the site from which the first one started. Thus, p is closed. The path is subject to the constraint that it never goes backwards over the previous bond. A path p is given by a word, that is, an ordered sequence of symbols $D_i$ where i distinguishes the bonds of L. A path p is then of the form

$$p = D_{j_1}^{e_1} D_{j_2}^{e_2} \cdots D_{j_l}^{e_l} \tag{3.7}$$

for some l, where $e_i = +1$ ($-1$) if the path traverses bond $j_i$ following the direction (opposite direction) assigned to it. Because a path is closed it is defined to within its circular order, so that

$$D_{j_1}^{e_1} D_{j_2}^{e_2} \cdots D_{j_l}^{e_l} = D_{j_2}^{e_2} \cdots D_{j_l}^{e_l} D_{j_1}^{e_1} = \cdots = D_{j_l}^{e_l} D_{j_1}^{e_1} \cdots D_{j_{l-1}}^{e_{l-1}}. \tag{3.8}$$

The inversion $p^{-1}$ of p is given by

$$p^{-1} = D_{j_l}^{-e_l} \cdots D_{j_2}^{-e_2} D_{j_1}^{-e_1}. \tag{3.9}$$

We take p and $p^{-1}$ to be equivalent. Given p, denote by [p] the set of all paths equivalent to p, that is, its circular permutations and their inversions.

Definition 3.4. A periodic path is one which has the word representation

$$p = \left(D_{j_1}^{e_1} D_{j_2}^{e_2} \cdots D_{j_l}^{e_l}\right)^{w} \tag{3.10}$$

for some l and $w \ge 2$, where the subword in between brackets is nonperiodic.

Definition 3.5. A path p has assigned to it a sign given by

$$s(p) = (-1)^{1+t}, \tag{3.11}$$

where t is the number of $2\pi$-angles turned by a tangent vector while traversing p. A positive (negative) angle is assigned to a counterclockwise (clockwise) rotation.

Example 1. See Figure 1a). A tangent vector starting at point e and traversing the path shown in Figure 1a) turns a total angle of $4\cdot\frac{\pi}{2} = 2\pi$ by the time it returns to e, so in this case t = 1 and s(p) = +1. For the path in Figure 1b), the total angle turned is $3\cdot\frac{\pi}{2} - 3\cdot\frac{\pi}{2} = 0$, so t = 0 and s(p) = $-1$.

Remark. In section 4, instead of assigning an angle $\pm\pi/2$ to a turn, we will count the contribution to the sign by assigning $\alpha = e^{i\pi/4}$ and $\bar\alpha = e^{-i\pi/4}$ to each counterclockwise and clockwise turn, respectively, and then in the end multiplying the result by $-1$. In the example above, one gets in this manner $(e^{i\pi/4})^4 = -1$ and $(e^{i\pi/4})^3 (e^{-i\pi/4})^3 = +1$. Multiplying both results by $-1$, one recovers the correct sign for each path.
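The rule in the Remark is easy to mechanize. The sketch below is illustrative only (the function name `path_sign` and the representation of a path as a list of unit steps are assumptions, not the paper's notation): it multiplies one factor $\alpha$ or $\bar\alpha$ per turn of a closed lattice path, including the turn that closes the path, and then multiplies by $-1$.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4)      # factor for a counterclockwise turn
ALPHA_BAR = ALPHA.conjugate()            # factor for a clockwise turn

def path_sign(steps):
    """Sign s(p) of a closed lattice path given as unit steps (dx, dy)."""
    if sum(dx for dx, _ in steps) != 0 or sum(dy for _, dy in steps) != 0:
        raise ValueError("path is not closed")
    product = 1 + 0j
    n = len(steps)
    for k in range(n):
        (ax, ay), (bx, by) = steps[k], steps[(k + 1) % n]
        if ax * bx + ay * by == -1:
            raise ValueError("path reverses over the previous bond")
        cross = ax * by - ay * bx        # +1: left turn, -1: right turn, 0: straight
        if cross == 1:
            product *= ALPHA
        elif cross == -1:
            product *= ALPHA_BAR
    # The product of turn factors is (-1)^t; the extra -1 gives (-1)^(1+t).
    return -round(product.real)

# A counterclockwise unit square (t = 1) and a figure-eight (t = 0).
square = [(1, 0), (0, 1), (-1, 0), (0, -1)]
figure_eight = [(1, 0), (0, 1), (-1, 0), (0, -1),
                (0, -1), (-1, 0), (0, 1), (1, 0)]
```

The square has four left turns and the figure-eight three left and three right turns, mirroring the two cases of Example 1.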

The sign of periodic paths. Suppose the sign of the nonperiodic path in between brackets in (3.10) is $(-1)^{1+t_s}$. Then, the sign of the periodic path with period w is $(-1)^{1+w t_s}$. Hence, the sign of a periodic path is $-1$ if its period is an even number, and it equals the sign of the nonperiodic subpath if the period w is an odd number.

Definition 3.6. To each path p is assigned the function $I_p(u)$ given by

$$I_p(u) = \prod_{i=1}^{k} u^{m_i} = u^{l}, \tag{3.12}$$

where $l = m_1+\dots+m_k$, for some k, is the length of p, $m_i$ being the number of times bond i is covered by p, and the function $W_p(u)$, "the amplitude of p", defined as follows:

$$W_p(u) = s(p)\, I_p(u). \tag{3.13}$$

Theorem 3.2. The functions $I_G(u)$ and $W_p(u)$, $|u| < 1$, defined above satisfy the following relation:

$$\sum_{G \in A} I_G(u) = \prod_{[p]} \left[1 + W_p(u)\right]. \tag{3.14}$$

The product is over all inequivalent classes [p] of closed nonperiodic paths. The summation is over all admissible graphs of the finite N × N planar square lattice L.

Relation (3.14) is a simpler version suitable for the Ising model of a more general relation investigated by Sherman and Burgoyne in refs. [4-7]. The difference is that they assign to the bonds i of the lattice distinct parameters d_i, hence, in this case the functions I_G and W are given in terms of these parameters. In the Ising model context under consideration these are all equal to u and |u| < 1.

According to references [4,7,10,11], relation (3.14) first appeared as a conjecture in lecture notes by Feynman ( ref. [9], published only in 1972 and already mentioning ref. [4]). The first proof of it was achieved by Sherman in refs. [4,6] followed by another one later on by Burgoyne in ref. [7]. The simplest nontrivial case of the general relation is investigated in ref. [8].

Below, Burgoyne's proof is essentially reproduced for the case |u| < 1.

Proof: Expand the product over the distinct classes of nonperiodic paths [p] as 1 (one) plus an infinite sum of terms of the form

$$W_{p_1} W_{p_2} \cdots W_{p_k} = s \prod_{i} u^{r_i} \tag{3.15}$$

for some k, where $p_1,\dots,p_k$ is a set of nonperiodic paths over L. The product on the r.h.s. of (3.15) is over the bonds i traversed by $p_1, p_2, \dots, p_k$, and $r_i$ says how many times. If $p_1, p_2, \dots, p_k$ traverse bond i, say, $m_1(i),\dots,m_k(i)$ times, $m_j \ge 0$, respectively, then $r_i = \sum_{j=1}^{k} m_j(i)$. The sign s is the product of the signs of $p_1, p_2, \dots, p_k$.

Let's prove, first, that those terms having $r_i = 1$, $\forall i$, add up to $\sum_G I_G(u)$. Consider one of these terms with associated paths $p_1, p_2, \dots, p_k$. Each bond in the set of bonds traversed by $p_1, p_2, \dots, p_k$ is traversed only once by one of these paths. Thus, any intersection between two of these paths can occur only at a site of valence 4, where they cross each other as in Fig. 2a. Otherwise, they are disjoint. Thus, the set of bonds traversed by paths $p_1,\dots,p_k$ constitutes a graph whose vertices have valence 2 or 4. This is an admissible graph. Therefore, to each term of the form (3.15) with $r_i = 1$, $\forall i$, one can associate an admissible graph. This graph can be disconnected. This happens if the set of paths can be split into completely disjoint subsets which generate admissible graphs without any bonds or vertices in common.

Now, given an admissible graph G one can in general associate more than one term of the form of (3.15) with r_i = 1, each associated with a distinct set of paths. Let's see how this follows. The sites of an admissible graph have valence 2 or 4. When a path strikes a site of valence 4 it has only 3 possible directions to follow. See Figures 2a, 2b, 2c. (The case in Fig. 2d is forbidden.) Then, any two terms associated to a given admissible graph G will differ only in the types of crossings at the sites of G. Since there are 3 types of crossing per valence 4 site, the number of possible terms associated to G is 3^V where V is the number of sites of G with valence 4.

A term has a sign which results from the signs of the paths associated to that term. Let's see how the sign of a term comes out. A term with $t_1$ crossings of type 1 (Fig. 2a) has a sign which can be expressed as $(-1)^{t_1}$, where $t_1$ includes self-crossings of single paths plus crossings between different paths. Indeed, since distinct closed paths always intersect in an even number of crossings, $(-1)^{t_1}$ gives the correct sign of the term, which is the product of the signs of the individual paths. Let's associate to the crossings of type j = 2,3 the signs $(+1)^{t_2}$ and $(+1)^{t_3}$, so that a term with $t_j$ crossings of type j = 1,2,3 has a sign given by $(-1)^{t_1}(+1)^{t_2}(+1)^{t_3}$.

There are V! ways of distributing $V = t_1+t_2+t_3$ crossings among the sites of G, but since there are $t_j$ crossings of type j, j = 1,2,3, one has to divide V! by $t_1!\,t_2!\,t_3!$, so that the number of distinct terms with $t_j$ crossings of type j is

$$\frac{V!}{t_1!\, t_2!\, t_3!}. \tag{3.16}$$

These terms have the same factor $I(G) = u^L$, where L is the number of bonds of G. Summing all these terms arising from a given G and summing over all admissible graphs G of L, the result is

$$\sum_{G} I(G) \sum_{\{t\}} \frac{V!}{t_1!\, t_2!\, t_3!}\, (-1)^{t_1}(+1)^{t_2}(+1)^{t_3}, \tag{3.17}$$

where $\sum_{\{t\}}$ means summation over all $t_1, t_2, t_3$ such that $t_1+t_2+t_3 = V$. Using the multinomial theorem, the summation over {t} gives $(-1+1+1)^V = 1$ and one gets the result $\sum_G I(G)$.
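The multinomial step can be confirmed directly for small V. The following sketch (the function name is hypothetical) sums the signed multinomial coefficients over all $t_1+t_2+t_3 = V$ and should return $(-1+1+1)^V = 1$ for every V.

```python
from math import factorial

def signed_crossing_sum(V):
    """Sum of V!/(t1! t2! t3!) * (-1)^t1 over all t1 + t2 + t3 = V."""
    total = 0
    for t1 in range(V + 1):
        for t2 in range(V - t1 + 1):
            t3 = V - t1 - t2
            coeff = factorial(V) // (factorial(t1) * factorial(t2) * factorial(t3))
            # Type-1 crossings carry a minus sign, types 2 and 3 a plus sign.
            total += coeff * (-1) ** t1
    return total
```

So each admissible graph G, which gives rise to $3^V$ terms of mixed sign, contributes exactly one net copy of I(G).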

If G is disconnected with l components $G_i$, i = 1,2,...,l, each of them with $t_j$, j = 1,2,...,l, sites of valence 4 and $\sum_j t_j = V$, then applying the previous argument to each component gives $I(G_1)I(G_2)\cdots I(G_l) = I(G)$.

In view of the above result, the theorem could be equivalently stated by saying that the sum of terms with r_i > 1 for at least one of the i converges to zero. Let's prove this.

Let $\mathcal{G}$ be the set of all colored connected or disconnected subgraphs g of the colored lattice without valence 1 sites and such that if g is connected then g is not a polygon, that is, a graph having valence 2 sites only. A disconnected graph is allowed to have some, but not all, of its components as polygons. The reason for excluding graphs which are polygons, or whose components are all polygons, is that closed paths with repeated bonds over them are necessarily periodic and these are forbidden. The coloring of g is that inherited from the colored lattice.

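Given $g \in \mathcal{G}$, call $i_1,\dots,i_l$ the bonds of g. A term $w_g$ associated to g is of the form

$$w_g = W_{p_1} W_{p_2} \cdots W_{p_k} = s(w)\,|w_g| \tag{3.18}$$

for some k and set of paths $p_1,\dots,p_k$, which traverse the bonds of g only, where

$$|w_g| = \prod_{j=1}^{l} u^{r_{i_j}} \tag{3.19}$$

and $r_{i_j}$ is the number of times bond $i_j$ is traversed by $p_1,\dots,p_k$, that is, if $p_1,\dots,p_k$ traverse the i-th bond $m_1(i),\dots,m_k(i)$ times, $m_j \ge 0$, respectively, then

$$r_i = \sum_{j=1}^{k} m_j(i). \tag{3.20}$$

Some but not all of the m's can be zero, so that $r_{i_j} \ge 1$ with at least one $r_{i_j} > 1$.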

Let's consider the set of all terms with the same effective set of bonds {i} traversed, hence, the terms associated to a given g. Within this set it's possible in general to find terms with the same powers {r} and the same |w_g| although having distinct associated paths and possibly with different effective sign.

Let's group together those terms which cover the same bonds of g the same number of times. Denote by $\mathcal{W}_{g,N}(r)$ the set of terms $w_g$ with the same powers $\{r_{i_j}\}$ and such that $\sum_j r_{i_j} = N$, for fixed N. The summation over all terms with repeated lines can now be expressed as

$$\sum_{g \in \mathcal{G}} \; \sum_{N} \; \sum_{r(N)} \; \sum_{w_g \in \mathcal{W}_{g,N}(r)} w_g, \tag{3.21}$$

where $\sum_{g\in\mathcal{G}}$ means summation over all elements in $\mathcal{G}$; $\sum_N$ means summation over all positive integers N compatible with the given graph g and such that $N \ge l(g)+1$; $\sum_{r(N)}$ means summation over the sets of positive integers $r_1,\dots,r_l$ such that $r_1+\dots+r_l = N$ and which are also compatible with g; and, finally, the last sum is over all terms $w_g \in \mathcal{W}_{g,N}(r)$.

The following remarks are now in order. In the second summation, the case N = l is excluded, for it implies that $r_i = 1$ and in this case there can be no repeated bonds. The case N < l corresponds to another element $g' \in \mathcal{G}$. Whether the equality N = l(g)+1 can occur depends on the graph g. For instance, take the graph shown in Fig. 1b, where l + 1 = 9. No nonperiodic closed path with repeated bonds can have length N = 9, because l(g) = 8. The length N can only be even and its minimum is N = 12. Hence, for this particular graph g the summation is over all even numbers greater than or equal to 12. In any case, the set $\{N\}_g$ is always infinite. Given g and $N \in \{N\}_g$, not all partitions of N are allowed in the third sum. For instance, given the graph in Fig. 1b and N = 12, the partition with $r_k = 1$, $\forall k \ne 1$, and $r_1 = 5$ cannot be associated to any allowed path. So, the set of integers {N} and the partitions of N must be suitable to each g.

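Given g, let's consider now the partial sums

$$s_n = \sum_{N=l(g)+1}^{n} \; \sum_{r(N)} \; \sum_{w_g \in \mathcal{W}_{g,N}(r)} w_g. \tag{3.22}$$

The goal is to show that in the limit $n \to \infty$, $s_n$ goes to zero. In ref. [7] it is proved that $s_n = 0$. The argument of the proof goes as follows.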

Since the bonds of g are covered the same number of times by all elements in the group, choose a bond of g, say b, which is traversed $r_b > 1$ times by all elements in the group $\mathcal{W}_{g,N}(r)$. This choice has to be made for each partition r(N). Denote by P the set of paths associated to $w_g$. Then, $P = P' \cup P''$, where $P'$ is the set of those paths which traverse bond b whereas $P''$ is the set of those paths which do not traverse b.

Given a path $p \in P'$, let $p_c$ be the set of path segments obtained from p upon removal of b. Given $P' = \{p, p', \dots\}$ define

$$P'_c = \{p_c, p'_c, \dots\}. \tag{3.23}$$

This set has exactly $r_b$ path segments. Collect under the same subgroup S the elements $w_g \in \mathcal{W}_{g,N}(r)$ having the property that line b is covered exactly $r_b$ times by all elements in S and they all have the same subset $P'_c$ with $r_b$ path segments and the same subset $P''$. The set $\mathcal{W}_{g,N}(r)$ is the union of such subsets, that is,

$$\sum_{w_g \in \mathcal{W}_{g,N}(r)} w_g = \sum_{S} \; \sum_{w_g \in S} s(w)\,|w_g|, \tag{3.24}$$

where s(w) is the sign of $w_g$ and $|w_g| = u^N$. Recall that |u| < 1, so that $|w_g| < 1$.

The elements inside any given S cancel each other. Denote by q and e the elements of S that are in $P'_c$ and $P''$, respectively. Suppose that the segments $q_1,\dots,q_{r_b}$ are all distinct. (For the case with repeated segments, see [7].) The terms in S are precisely those which can be obtained by joining the ends of the segments, and this can be done in exactly $r_b!$ ways. This gives the possible terms $w_g$ in the subgroup. From the properties of the permutation group, half of the $r_b!$ permutations are odd and half are even, so the signs of half of the terms are positive and half are negative; hence, a cancellation takes place.

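Using (3.14), the partition function of the two-dimensional Ising model can now be expressed as a product over paths as follows:

$$Z = 2^{N^2} (\cosh K)^{\xi} \prod_{[p]} \left[1 + W_p(u)\right], \tag{3.25}$$

with $\xi = 2N(N-1)$ the number of bonds of L.

The next step consists in expressing the product over [p] as an integral. This will be achieved in the next section.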

IV Paths amplitudes and Onsager's formula

Consider all paths that start at a fixed site $P_1$, which we take as the origin with coordinates (0,0), and end at the site $P_{n+1}$ with coordinates (x,y) in n steps. Starting at (0,0), whenever a site is reached there are four possible directions which a path can take (see Figure 2 and the Remark below). The path a) continues forward in the same direction as the previous step; b) turns left 90° relative to the previous step; c) turns right 90° relative to the previous step; d) turns 180°. To each of these possibilities an "amplitude" is assigned: A) $u$ for case a); B) $u\alpha$ for case b); C) $u\bar\alpha$ for case c); and D) 0 (zero) for case d), where $u = \tanh K$, $\alpha = e^{i\pi/4}$ is the contribution to the sign of p each time it turns left (counterclockwise) relative to the previous step, and $\bar\alpha = e^{-i\pi/4}$ is the contribution when it turns right (clockwise). See the Remark after Example 1, sec. 3.2.

Remark. Since the lattice is finite it has a border, so that when a path strikes a site on the border it may have only two or three possible directions to follow there. In the spirit of refs. [7,9] we shall neglect the border and derive the relevant formulas as if there were no border at all, with the justification that in the limit $N \to \infty$, which we shall take at the end of the calculation, border effects disappear. Of course, another approach would be to do everything on a toroidal lattice. In this case, however, relation (3.14) must be replaced by another, more involved identity appropriate for the toroidal lattice (given in refs. [4, 10]). We shall restrict the presentation to the planar case only.

Call U_n(x,y) the amplitude of arrival at (x,y) moving upward in the n-th step, D_n(x,y) the amplitude of arrival at (x,y) moving downwards in the n-th step, L_n(x,y) the amplitude of arrival at (x,y) moving from the left in the n-th step, and R_n(x,y) the amplitude of arrival at (x,y) moving from right in the n-th step.

If the path arrives at (x,y) moving upward in the n-th step then

$$U_n(x,y) = u\left[U_{n-1}(x,y-1) + 0\cdot D_{n-1}(x,y-1) + \alpha\, L_{n-1}(x,y-1) + \bar\alpha\, R_{n-1}(x,y-1)\right], \tag{4.1}$$

where $U_{n-1}$, $D_{n-1}$, $L_{n-1}$ and $R_{n-1}$ are the amplitudes associated to the four possibilities to reach site $(x,y-1)$, with $\alpha = e^{i\pi/4}$ and $\bar\alpha = e^{-i\pi/4}$. Relation (4.1) can be understood as follows. If $(x,y-1)$ is reached going up a bond in the $(n-1)$-th step, the amplitude there is $U_{n-1}(x,y-1)$, so in the n-th step, as the path follows the same direction as the previous step, by the rules a) and A) above the amplitude $U_{n-1}(x,y-1)$ is multiplied by a factor u. See Figure 3.

If the site $(x,y-1)$ is reached from the left in the $(n-1)$-th step (Figure 4), the path has to make a counterclockwise rotation to go to (x,y) in the n-th step. By the rules b) and B) the amplitude $L_{n-1}(x,y-1)$ should then be multiplied by a factor $u\alpha$.

The case in which the path goes down to $(x,y-1)$ in the $(n-1)$-th step and goes up to (x,y) in the n-th step corresponds to a 180° rotation. By rules d) and D) the amplitude should be $0\cdot D_{n-1}(x,y-1)$. If the site $(x,y-1)$ is reached from the right, the path has to make a clockwise rotation to go to (x,y) (Figure 5). By the rules c) and C) the amplitude $R_{n-1}(x,y-1)$ should then be multiplied by a factor $u\bar\alpha$.

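Analogously, if a path arrives at (x,y) in the n-th step going down, the amplitude is given by the relation

$$D_n(x,y) = u\left[0\cdot U_{n-1}(x,y+1) + D_{n-1}(x,y+1) + \bar\alpha\, L_{n-1}(x,y+1) + \alpha\, R_{n-1}(x,y+1)\right]. \tag{4.2}$$

If it arrives at (x,y) coming from the left then the amplitude is given by

$$L_n(x,y) = u\left[\bar\alpha\, U_{n-1}(x-1,y) + \alpha\, D_{n-1}(x-1,y) + L_{n-1}(x-1,y) + 0\cdot R_{n-1}(x-1,y)\right]. \tag{4.3}$$

At last, if it arrives at (x,y) coming from the right the amplitude is

$$R_n(x,y) = u\left[\alpha\, U_{n-1}(x+1,y) + \bar\alpha\, D_{n-1}(x+1,y) + 0\cdot L_{n-1}(x+1,y) + R_{n-1}(x+1,y)\right]. \tag{4.4}$$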
Of course, to compute an amplitude using the above recursion relations one needs the amplitude in the zeroth step. We shall follow the convention of reference [10], namely, that in the zeroth step a path arrives at the origin moving upward, so that $U_0(x,y) = \delta_{x,0}\,\delta_{y,0}$ and $D_0 = R_0 = L_0 = 0$. The amplitude to arrive in zero steps is one if the path arrives going upward at the origin and zero for any other point or any other direction of arrival.

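Example 2. See Figure 6. Let's compute the amplitude of arrival at site (2,1) in 3 steps moving upward in the third step. Only one path is possible in this case. Using the recursion (4.1),

$$U_3(2,1) = u\left[U_2(2,0) + 0\cdot D_2(2,0) + \alpha\, L_2(2,0) + \bar\alpha\, R_2(2,0)\right]. \tag{4.5}$$

In the second step, the path moves to site (2,0) coming from the left, so $U_2 = D_2 = R_2 = 0$ and $U_3(2,1) = u\alpha L_2(2,0)$. From (4.3),

$$L_2(2,0) = u\left[\bar\alpha\, U_1(1,0) + \alpha\, D_1(1,0) + L_1(1,0) + 0\cdot R_1(1,0)\right] \tag{4.6}$$

with $U_1 = D_1 = R_1 = 0$, so that $U_3(2,1) = u^2\alpha L_1(1,0)$, where

$$L_1(1,0) = u\bar\alpha\, U_0(0,0) = u\bar\alpha, \tag{4.7}$$

implying that $U_3(2,1) = u^3\alpha\bar\alpha = u^3$.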

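Example 3. Let's now compute the amplitude of arrival at (2,1) in 3 steps moving from the left in the third step. In this case, the possible paths are shown in Figure 7a) and 7b).

Using relation (4.3), the amplitude is

$$L_3(2,1) = u\left[\bar\alpha\, U_2(1,1) + \alpha\, D_2(1,1) + L_2(1,1) + 0\cdot R_2(1,1)\right]. \tag{4.8}$$

Using (4.1),

$$U_2(1,1) = u\left[U_1(1,0) + 0\cdot D_1(1,0) + \alpha\, L_1(1,0) + \bar\alpha\, R_1(1,0)\right]. \tag{4.9}$$

Since $U_1 = D_1 = R_1 = 0$ at (1,0), one finds that $U_2(1,1) = u\alpha L_1(1,0) = u\alpha\, u\bar\alpha = u^2$. Using (4.3), with $D_1 = L_1 = R_1 = 0$ at (0,1),

$$L_2(1,1) = u\bar\alpha\, U_1(0,1) = u\bar\alpha\, u = \bar\alpha u^2. \tag{4.10}$$

Therefore, $L_3(2,1) = u\left[\bar\alpha u^2 + \bar\alpha u^2\right] = 2\bar\alpha u^3$.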

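Definition 4.1. The partial amplitude of a path p of length n is given by

$$W_{p(n,P_1)}(u) = u^n\, \alpha^{a}\, \bar\alpha^{b}, \tag{4.11}$$

where a and b are the numbers of counterclockwise and clockwise turns made by p, respectively.

Definition 4.2. The amplitude $\sum_p W_{p(n,P_1)}(u)$ of arrival at $P_{n+1}(x,y)$ from any direction in n steps is given by

$$\sum_p W_{p(n,P_1)}(u) = U_n(x,y) + D_n(x,y) + L_n(x,y) + R_n(x,y). \tag{4.12}$$

Example 4. The partial amplitudes for the paths in Figure 6, 7a) and 7b) are $u^3$, $\bar\alpha u^3$ and $\bar\alpha u^3$, respectively. The amplitude of arrival at (2,1) from any direction in 3 steps is, then, $u^3 + 2\bar\alpha u^3$.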
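The recursion relations (4.1)-(4.4) and Examples 2 to 4 can be reproduced with a few lines of code. The sketch below is an illustration under the assumptions stated in the text (no border, zeroth step arriving at the origin moving upward); the names `step` and `amplitudes` are hypothetical. Amplitudes are stored per site as a tuple (U, D, L, R).

```python
import cmath
import math
from collections import defaultdict

ALPHA = cmath.exp(1j * math.pi / 4)   # counterclockwise turn
ALPHA_BAR = ALPHA.conjugate()         # clockwise turn

def step(prev, u):
    """One application of the recursion: prev maps (x, y) -> (U, D, L, R)."""
    new = defaultdict(lambda: [0j, 0j, 0j, 0j])
    for (x, y), (U, D, L, R) in prev.items():
        # Arrive at (x, y+1) moving up; a reversal (previous step down) gets 0.
        new[(x, y + 1)][0] += u * (U + ALPHA * L + ALPHA_BAR * R)
        # Arrive at (x, y-1) moving down.
        new[(x, y - 1)][1] += u * (D + ALPHA_BAR * L + ALPHA * R)
        # Arrive at (x+1, y) coming from the left.
        new[(x + 1, y)][2] += u * (ALPHA_BAR * U + ALPHA * D + L)
        # Arrive at (x-1, y) coming from the right.
        new[(x - 1, y)][3] += u * (ALPHA * U + ALPHA_BAR * D + R)
    return {site: tuple(vals) for site, vals in new.items()}

def amplitudes(n, u):
    """Amplitudes after n steps, starting from U_0 = 1 at the origin."""
    current = {(0, 0): (1 + 0j, 0j, 0j, 0j)}
    for _ in range(n):
        current = step(current, u)
    return current
```

With u = 0.5, `amplitudes(3, 0.5)[(2, 1)]` has first component $u^3$ and third component $2\bar\alpha u^3$, in agreement with Examples 2 and 3.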

Definition 4.3. Fix n and call $C_n(x,y)$ the set of all paths of length n starting at (0,0) and arriving at (x,y). Given $p \in C_n(x,y)$, let $F_n(x,y)$ denote any one of the amplitudes $U_n(x,y)$, $D_n(x,y)$, $L_n(x,y)$, $R_n(x,y)$, where

Define the extension of F_n(x,y), denoted by the same symbol, so as to include sites (x,y) which can be reached only by a number m > n of steps but in this case set F_n(x,y) = 0.

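Lemma. The transform of $F_n$, the function $\hat F_n(\varepsilon,\eta)$, $0 \le \varepsilon \le 2\pi$ and $0 \le \eta \le 2\pi$, given by

$$\hat F_n(\varepsilon,\eta) = \sum_{x=-\infty}^{\infty} \sum_{y=-\infty}^{\infty} F_n(x,y)\, e^{i\varepsilon x}\, e^{i\eta y}, \tag{4.14}$$

is well defined and

$$F_n(x,y) = \frac{1}{(2\pi)^2} \int_0^{2\pi}\!\!\int_0^{2\pi} \hat F_n(\varepsilon,\eta)\, e^{-i\varepsilon x}\, e^{-i\eta y}\, d\varepsilon\, d\eta. \tag{4.15}$$

Proof: $F_n(x,y) = 0$ for $|x| > n$ and/or $|y| > n$. Then, for fixed n the sums in (4.14) have only a finite number of terms.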

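Using (4.14), the transform of $U_n(x,y)$ is:

$$\hat U_n(\varepsilon,\eta) = \sum_{x}\sum_{y} U_n(x,y)\, e^{i\varepsilon x}\, e^{i\eta y}. \tag{4.16}$$

Upon substitution of (4.1), and making the change $\bar y = y - 1$, it follows that

$$\hat U_n(\varepsilon,\eta) = u\, e^{i\eta}\left[\hat U_{n-1} + 0\cdot\hat D_{n-1} + \alpha\, \hat L_{n-1} + \bar\alpha\, \hat R_{n-1}\right]. \tag{4.17}$$

Similarly, we obtain $\hat D_n(\varepsilon,\eta)$, $\hat L_n(\varepsilon,\eta)$ and $\hat R_n(\varepsilon,\eta)$:

$$\hat D_n(\varepsilon,\eta) = u\, e^{-i\eta}\left[0\cdot\hat U_{n-1} + \hat D_{n-1} + \bar\alpha\, \hat L_{n-1} + \alpha\, \hat R_{n-1}\right], \tag{4.18}$$

$$\hat L_n(\varepsilon,\eta) = u\, e^{i\varepsilon}\left[\bar\alpha\, \hat U_{n-1} + \alpha\, \hat D_{n-1} + \hat L_{n-1} + 0\cdot\hat R_{n-1}\right], \tag{4.19}$$

$$\hat R_n(\varepsilon,\eta) = u\, e^{-i\varepsilon}\left[\alpha\, \hat U_{n-1} + \bar\alpha\, \hat D_{n-1} + 0\cdot\hat L_{n-1} + \hat R_{n-1}\right], \tag{4.20}$$

where all transforms on the right-hand sides are evaluated at $(\varepsilon,\eta)$.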

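Call $\psi_n(\varepsilon,\eta)$ the (line) matrix

$$\psi_n(\varepsilon,\eta) = \left(\hat U_n,\; \hat D_n,\; \hat L_n,\; \hat R_n\right). \tag{4.21}$$

Then, from (4.17-20) we obtain that

$$\psi_n(\varepsilon,\eta) = u\, \psi_{n-1}(\varepsilon,\eta)\, M, \tag{4.22}$$

where

$$M = \begin{pmatrix} \upsilon & 0 & \bar\alpha\bar h & \alpha h \\ 0 & \bar\upsilon & \alpha\bar h & \bar\alpha h \\ \alpha\upsilon & \bar\alpha\bar\upsilon & \bar h & 0 \\ \bar\alpha\upsilon & \alpha\bar\upsilon & 0 & h \end{pmatrix} \tag{4.23}$$

with $\upsilon = e^{i\eta}$, $\bar h = e^{i\varepsilon}$, $h = e^{-i\varepsilon}$, $\bar\upsilon = e^{-i\eta}$ and $\alpha = e^{i\pi/4}$. (The Greek letter $\upsilon$ is not to be confused with $u = \tanh K$.)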

Call 1, 2, 3 and 4 the directions shown in the Figure 8 below:

Notice that the subindices i,j of $M_{ij}$ are in one-to-one correspondence with the directions. Indeed, $uM_{1j}$ corresponds to the amplitude of arrival at (x,y) (in $(\varepsilon,\eta)$ space) coming up in the $(n-1)$-th step, $\forall j$, but going up if j = 1, down if j = 2, coming from the left if j = 3 and coming from the right if j = 4 in the n-th step. Therefore, $uM_{1j}$ is the amplitude of arrival at (x,y) (in $(\varepsilon,\eta)$ space) following directions 1 and j in the $(n-1)$-th and n-th steps, respectively. More generally, $uM_{ij}$ is the amplitude of arrival at (x,y) following directions i and j in the $(n-1)$-th and n-th steps, respectively. From now on only closed paths starting at (0,0) and arriving at (0,0) in n steps will be considered. From (4.22) it follows that

$$\psi_n = u^n\, \psi_0\, M^n. \tag{4.24}$$

Denote by $\psi_{0,i}$, $1 \le i \le 4$, the line matrix whose only nonzero element is equal to 1 and sits in the i-th column. Let $\psi_0 \equiv \psi_{0,i}$ according to whether the path arrives at the origin moving up (i = 1), down (i = 2), from the left (i = 3) or from the right (i = 4), respectively. Then

where i = 1,2,3,4 if F = U,D,L,R, respectively, and Y^T is the transpose of Y and

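Given a 4 × 4 matrix A, $\psi_{0,i}A$ is the line matrix formed by the elements in the i-th line of A, that is,

$$\psi_{0,i}A = \left(A_{i1},\; A_{i2},\; A_{i3},\; A_{i4}\right), \tag{4.27}$$

so $\psi_{0,i}\,A\,\psi_{0,i}^T = A_{ii}$. Therefore, the sum over i equals the trace of A. Thus,

$$\sum_{i=1}^{4} \psi_{0,i}\, M^n\, \psi_{0,i}^T = \mathrm{Tr}\, M^n. \tag{4.28}$$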
The total partial amplitude of arrival at (0,0) of closed paths moving in any direction in n steps given by (4.12) can be expressed compactly as

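From (4.15), (4.26) and (4.28), it follows that

$$\sum_p W_{p(n,P_1)}(u) = \frac{u^n}{(2\pi)^2} \int_0^{2\pi}\!\!\int_0^{2\pi} \mathrm{Tr}\, M^n\, d\varepsilon\, d\eta. \tag{4.30}$$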

To better understand relation (4.30), consider the matrix $u^nM^n$, for some n. An element $(u^nM^n)_{i_1 i_{n+1}}$ of this matrix is given as

$$(u^nM^n)_{i_1 i_{n+1}} = u^n \sum_{i_2,\dots,i_n} M_{i_1 i_2} M_{i_2 i_3} \cdots M_{i_n i_{n+1}}. \tag{4.31}$$

Recall that $uM_{ij}$ is the partial amplitude of a path arriving at a site coming from direction i and going to the next site in one step following direction j. Thus, each term in the r.h.s. of (4.31) is the amplitude of a path of length n starting at $P_1$ coming from direction $i_1$, going to $P_2$ following direction $i_2$, etc., and arriving at site $P_{n+1}$ following direction $i_{n+1}$ after n steps. The element $(u^nM^n)_{i_1 i_{n+1}}$ gives the total partial amplitude of arrival at $P_{n+1}$ in n steps in $(\varepsilon,\eta)$ space.

The terms in (uⁿMⁿ) describe open as well as closed paths. Let's see some examples.

Example 5. Take n = 5, i₁ = 2 and i₆ = 1. The term M₂₃M₃₁M₁₁M₁₄M₄₁ describes a path beginning at P₁ where it arrived coming from direction i₁ = 2, going to P₂, P₃, P₄, P₅ and to P₆ following directions i₂ = 3, i₃ = 1, i₄ = 1, i₅ = 4 and i₆ = 1, respectively. See Figure 9a).

Example 6. Take n = 6, $i_1 = i_7 = 2$ and the term $M_{23}M_{31}M_{11}M_{14}M_{42}M_{22}$ of $(M^6)_{22}$. This term describes the closed path in Fig. 9b. The elements of $M^n$ outside the diagonal have associated to them only open paths, simply because these elements have $i_1 \ne i_{n+1}$. Closed paths are to be found only in the diagonal elements, since there $i_1 = i_{n+1}$. However, open paths can also be associated to some terms in the diagonal elements. Let's see some examples.

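Example 7. Take n = 2, $i_1 = i_3 = 1$ and the element $(M^2)_{11} = M_{11}M_{11}+M_{12}M_{21}+M_{13}M_{31}+M_{14}M_{41}$ with

$$M_{11}M_{11} = \upsilon^2, \quad M_{12}M_{21} = 0, \quad M_{13}M_{31} = (\bar\alpha\bar h)(\alpha\upsilon) = \bar h\upsilon, \quad M_{14}M_{41} = (\alpha h)(\bar\alpha\upsilon) = h\upsilon. \tag{4.32}$$

To the terms of $(M^2)_{11}$ there correspond the paths (a), (b), (c) and (d), respectively, shown in Figure 10.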

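Example 8. Take n = 4 and consider the following terms in $(M^4)_{11}$:

a) The term $u^4M_{11}M_{11}M_{11}M_{11} = u^4\upsilon^4$ is the amplitude of the open path shown in Figure 11 below.

b) The term $M_{11}M_{11}M_{13}M_{31} = \upsilon\upsilon(\bar\alpha\bar h)(\alpha\upsilon) = \upsilon^3\bar h$, whose associated open path is shown in Figure 12.

c) The term $u^4M_{13}M_{32}M_{24}M_{41} = u^4(\bar\alpha\bar h)(\bar\alpha\bar\upsilon)(\bar\alpha h)(\bar\alpha\upsilon) = u^4\bar\alpha^4(\bar h h)(\bar\upsilon\upsilon) = u^4\bar\alpha^4$ is the amplitude of the closed path shown in Figure 13.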

In order to restrict to the elements of Mⁿ having closed paths we must take the trace of Mⁿ. A closed path begins at and return to P₁ after n steps. Since it is closed it has to cover n/2 horizontal bonds in one direction and n/2 horizontal bonds following the opposite direction. The same is true for the vertical bonds traversed by p. So, if the term , i_n+1 = i_n, describes a closed path, then the number of h's (u's) equals the number of 's ('s) appearing in it. In this case, it's possible to organize the term into a product of pairs h = 1 and u = 1 and the double integral in and h will give (2p)² times a product of a's and 's. More precisely, the double integral over a closed path in TrMⁿ equals

where the first product is over all counterclockwise rotations and the second is over all clockwise rotations, so

where t(p) is the number of complete 2π revolutions performed by a tangent vector traversing the closed path p. Recall that one still has to multiply (4.34) by (−1) in order to get the complete sign s(p) of p.

If a path is open, the h's and h̄'s (v's and v̄'s) do not match up into pairs. There remain integrals of the form

$$\int_0^{2\pi} e^{\pm ik\theta}\, d\theta = 0, \tag{4.35}$$

where θ stands for η or ε and k ≥ 1; hence the integrals in η and ε completely remove the terms describing open paths. Let us see some examples.

Example 9. Take n = 1. In this case there are only open paths and

Example 10. Using (4.35), in ex. 7,

Example 11. It is clear that

$$\int_0^{2\pi}\!\!\int_0^{2\pi} \mathrm{Tr}\, M^n \, d\varepsilon\, d\eta = 0 \tag{4.38}$$

if n = 1, 2, 3, which is guaranteed by the fact that on a square lattice closed paths are possible only if n ≥ 4. In the case n = 2 the path in Figure 10(b) is closed, but it traverses the same edge back and its amplitude is therefore zero.

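Example 12. Using (4.35) in Example 8, for the term M₁₁M₁₁M₁₁M₁₁ = v⁴,

$$\int_0^{2\pi}\!\!\int_0^{2\pi} v^4 \, d\varepsilon\, d\eta = 0. \tag{4.39}$$

For the term M₁₁M₁₁M₁₃M₃₁ = v³h̄,

$$\int_0^{2\pi}\!\!\int_0^{2\pi} v^3 \bar{h} \, d\varepsilon\, d\eta = 0. \tag{4.40}$$

For the term M₁₃M₃₂M₂₄M₄₁ = ᾱ⁴, which describes a closed path, the result

$$\int_0^{2\pi}\!\!\int_0^{2\pi} \bar{\alpha}^4 \, d\varepsilon\, d\eta = (2\pi)^2\, \bar{\alpha}^4 \tag{4.41}$$

follows, which has the form (4.33)-(4.34) with ᾱ⁴ ≡ −1.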

Given a closed path in (Mⁿ)_ii, the inverse path is present in some (Mⁿ)_jj, j ≠ i. For instance, in (M⁴)₁₁ there are the closed paths shown in Fig. 14 given by the terms M₁₄M₄₂M₂₃M₃₁ and M₁₃M₃₂M₂₄M₄₁.

In (M⁴)₂₂ there are the terms M₂₄M₄₁M₁₃M₃₂ and M₂₃M₃₁M₁₄M₄₂ , with associated closed paths shown in Fig. 15c and 15d, respectively:

In (M⁴)₃₃, there are the terms M₃₂M₂₄M₄₁M₁₃ and M₃₁M₁₄M₄₂M₂₃ with associated closed paths shown in Figure 16e) and 16f), respectively.

In (M⁴)₄₄, there are the terms M₄₂M₂₃M₃₁M₁₄ and M₄₁M₁₃M₃₂M₂₄ with the associated closed paths shown in Figure 17g) and 17h), respectively.

Note that (e) is the inversion of (a), (f) is the inversion of (c), (g) of (b), and (h) of (d).
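The count of closed paths above can be cross-checked numerically. The following is a minimal sketch, not the paper's own construction: it assumes directions ordered right, up, left, down (which may differ from the labelling 1 to 4 used here), a phase e^{±iε} or e^{±iη} for each step, a factor e^{±iπ/4} for each quarter turn and zero for an immediate reversal. Averaging Tr Mⁿ over an equally spaced grid of angles gives the double integral divided by (2π)². Under these assumptions the average should vanish for n = 1, 2, 3 and equal −8 for n = 4, one contribution of −1 from each of the eight closed paths listed above. The function names are illustrative.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4)  # assumed factor for a counterclockwise quarter turn


def transition_matrix(eps, eta):
    """M[i][j]: arrive along direction i, then step along direction j.

    Directions are indexed 0=right, 1=up, 2=left, 3=down (an assumption).
    """
    step_phase = [cmath.exp(1j * eps), cmath.exp(1j * eta),
                  cmath.exp(-1j * eps), cmath.exp(-1j * eta)]
    # Keyed by (j - i) mod 4: straight, left turn, reversal (forbidden), right turn.
    turn = {0: 1.0, 1: ALPHA, 2: 0.0, 3: ALPHA.conjugate()}
    return [[turn[(j - i) % 4] * step_phase[j] for j in range(4)] for i in range(4)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def trace_power(m, n):
    """Trace of the n-th power of the 4x4 matrix m."""
    p = m
    for _ in range(n - 1):
        p = matmul(p, m)
    return sum(p[i][i] for i in range(4))


def averaged_trace(n, grid=8):
    """Average of Tr M^n over a grid x grid set of angles.

    For grid > n this equals the double integral divided by (2*pi)^2, because
    Tr M^n is a trigonometric polynomial of degree at most n in each angle.
    """
    total = 0.0
    for a in range(grid):
        for b in range(grid):
            m = transition_matrix(2 * math.pi * a / grid, 2 * math.pi * b / grid)
            total += trace_power(m, n)
    return total / grid**2


for n in range(1, 5):
    print(n, round(averaged_trace(n).real, 10))
```

Dividing the n = 4 value by 2 (inversions) and by n = 4 (starting points), and including the extra factor (−1), leaves a weight of +1 per site, that is, one unit square per lattice site.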

So, restricting to the diagonal terms of Mⁿ, which amounts to taking the trace of this matrix, then performing a double integration over the angles to eliminate open paths, dividing the result by 2 to eliminate inversions, and multiplying by uⁿ gives the total amplitude (with the right signs) to arrive back at P₁ in n steps moving in any direction. We have thus obtained the following relation:

The above result is restricted to a fixed site P₁. For the finite N × N lattice with N² sites and disregarding boundary effects, the total (independent of site) amplitude of closed paths of length n is:

Taking all N² sites into account implies that, given a closed path p(n), the summation Σ_{p(n)} W(p) includes all circular permutations of p. To eliminate these, the previous relation has to be divided by n. The amplitude is then given by

We notice that a nonperiodic path appears n times in the sum but a periodic path of length n and period w has n/w distinct starting points only and for this reason it appears n/w times in the sum over paths. For instance, the periodic path (Dj₁Dj₂)(Dj₁Dj₂)(Dj₁Dj₂) of length n = 6 and period w = 3 has only two distinct starting points. The other equivalent periodic path is Dj₂(Dj₁Dj₂)(Dj₁Dj₂)Dj₁. After division by n, periodic paths with period w will show up in the sum with a weight 1/w. Thus, the above relation includes all closed paths of length n over the N × N lattice, periodic and nonperiodic, and excludes inversions and circular permutations. The total amplitude of closed paths of any length is then given by the series

whose convergence will be investigated below. We note that since the lattice is square, closed paths with nonzero amplitude are possible only for n > 3 but in view of relation (4.38) in Ex. 11 we can write the series in (4.45) starting from n = 1.
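The counting of starting points for periodic paths can be illustrated with a short sketch. It represents a closed path as a list of step labels (the labels "D1", "D2", ... are placeholders standing in for the steps Dj₁, Dj₂, ...) and counts its distinct circular permutations. The helper names are hypothetical.

```python
def distinct_rotations(path):
    """Set of distinct circular permutations (choices of starting point) of a closed path."""
    return {tuple(path[k:] + path[:k]) for k in range(len(path))}


def repetitions(path):
    """Number of times the shortest repeating block occurs in the path (w in the text)."""
    return len(path) // len(distinct_rotations(path))


periodic = ["D1", "D2"] * 3              # (D1 D2)(D1 D2)(D1 D2), length n = 6
nonperiodic = ["D1", "D2", "D3", "D4"]   # no repeated block, so w = 1

for path in (periodic, nonperiodic):
    n = len(path)
    starts = len(distinct_rotations(path))
    # A path counted `starts` times and then divided by n carries weight starts / n = 1 / w.
    print(n, starts, repetitions(path), starts / n)
```

For the periodic example this gives two distinct starting points and a weight of 1/3, in agreement with the n/w and 1/w rules stated above.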

With the above remarks,

In Σ_{[p]} the first term is the sum of W_p(u) over all nonperiodic paths. The other terms give the sum over all periodic paths, since any periodic path is the repetition of some nonperiodic path p with period w = 2, 3, .... In Section 3.2 the sign of a periodic path was proved to be −1 if w is even and equal to the sign of its nonperiodic subpath if w is odd. This explains the signs on the r.h.s. of (4.46).

Since |u| < 1, we have |W| < 1, so the series between brackets converges to ln(1+W), and the r.h.s. of (4.46) equals

a result to be used below.

Theorem 4.1. Take |u| < r < 1/4. Then the series

converges uniformly.

Proof: We have |M_ij| ≤ 1 for all i, j = 1, 2, 3, 4, so from (4.31) we get

and

The series

converges for |u| < r < 1/4; hence, by the Weierstrass M-test, the series (4.48) converges uniformly for |u| < r < 1/4.

We may conclude that the series converges uniformly to the matrix ln(1-uM) in the same interval.

We may now integrate the series term by term to get the series (4.45) which likewise converges uniformly in the same interval. Interchanging integration and summation in (4.45) yields

From the previous analysis, |u| < r < 1/4. However, the r.h.s. of (4.52) is well defined in a larger domain. Using the relation

$$\mathrm{Tr}\,\ln(1-uM) = \ln\det(1-uM), \tag{4.53}$$

which is valid for det(1−uM) ≠ 0 [18], we get

The determinant can be computed directly, and one finds that

$$\det(1-uM) = (1+u^2)^2 - 2u(1-u^2)(\cos\varepsilon + \cos\eta). \tag{4.55}$$
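The determinant in (4.55) and the trace-log relation used to reach it can be cross-checked numerically. This is a hedged sketch rather than the paper's construction: it assumes the closed form det(1 − uM) = (1+u²)² − 2u(1−u²)(cos ε + cos η), builds M with an assumed ordering of directions (right, up, left, down) and a factor e^{±iπ/4} per quarter turn, and compares both the determinant and the partial sums of Σ uⁿ Tr Mⁿ / n, which should approach −ln det(1 − uM) for small |u|. All function names are illustrative.

```python
import cmath
import math

ALPHA = cmath.exp(1j * math.pi / 4)  # assumed factor for a counterclockwise quarter turn


def transition_matrix(eps, eta):
    """M[i][j]: arrive along direction i, then step along j (0=right, 1=up, 2=left, 3=down)."""
    step_phase = [cmath.exp(1j * eps), cmath.exp(1j * eta),
                  cmath.exp(-1j * eps), cmath.exp(-1j * eta)]
    turn = {0: 1.0, 1: ALPHA, 2: 0.0, 3: ALPHA.conjugate()}  # keyed by (j - i) mod 4
    return [[turn[(j - i) % 4] * step_phase[j] for j in range(4)] for i in range(4)]


def matmul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def det(m):
    """Determinant by cofactor expansion along the first row (adequate for 4x4)."""
    if len(m) == 1:
        return m[0][0]
    return sum((-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
               for j in range(len(m)))


def det_one_minus_uM(u, eps, eta):
    m = transition_matrix(eps, eta)
    return det([[(1.0 if i == j else 0.0) - u * m[i][j] for j in range(4)]
                for i in range(4)])


def closed_form(u, eps, eta):
    """Assumed closed form of det(1 - uM)."""
    return (1 + u**2) ** 2 - 2 * u * (1 - u**2) * (math.cos(eps) + math.cos(eta))


def trace_series(u, eps, eta, terms=80):
    """Partial sum of sum_{n>=1} u^n Tr(M^n) / n."""
    m = transition_matrix(eps, eta)
    power, total = m, 0.0
    for n in range(1, terms + 1):
        total += u**n * sum(power[i][i] for i in range(4)) / n
        power = matmul(power, m)
    return total


u, eps, eta = 0.2, 0.7, 1.9
print(abs(det_one_minus_uM(u, eps, eta) - closed_form(u, eps, eta)))
print(abs(trace_series(u, eps, eta) + cmath.log(det_one_minus_uM(u, eps, eta))))
```

Both printed differences should be at the level of floating-point rounding. The determinant does not depend on whether the step phase is attached to the incoming or the outgoing direction, which is why the assumed convention is harmless for this check.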

Taking the logarithm on both sides of relation (3.24) gives

or, using (4.45-47), (4.52,4.54),

Using (4.55) with u = tanhK and the relations

gives

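Onsager's formula follows from (4.57) after taking the limit N → ∞:

$$-\beta f = \ln 2 + \frac{1}{8\pi^2}\int_0^{2\pi}\!\!\int_0^{2\pi} \ln\!\left[\cosh^2 2K - \sinh 2K\,(\cos\varepsilon + \cos\eta)\right] d\varepsilon\, d\eta, \tag{4.62}$$

where f is the free energy per site in the thermodynamic limit (see (2.5)) and β = 1/(k_B T).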

The integral in (4.62) cannot be evaluated in terms of elementary functions. Its derivatives, however, can be expressed in terms of complete elliptic integrals [2,15,19].
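Although the integral has no elementary closed form, it is straightforward to evaluate numerically. A minimal sketch, assuming Onsager's formula in the form −βf = ln 2 + (1/(8π²)) ∫∫ ln[cosh²2K − sinh 2K (cos ε + cos η)] dε dη over 0 ≤ ε, η ≤ 2π, and using a midpoint rule whose grid size is an arbitrary choice. The comparison function uses the first terms of the high-temperature expansion, ln 2 + 2 ln cosh K + u⁴ + 2u⁶ with u = tanh K, which is only meaningful for small K.

```python
import math


def minus_beta_f(K, grid=200):
    """Midpoint-rule estimate of ln 2 + (1/(8 pi^2)) times the double integral of
    ln[cosh^2(2K) - sinh(2K) (cos eps + cos eta)] over [0, 2 pi]^2."""
    c2 = math.cosh(2 * K) ** 2
    s = math.sinh(2 * K)
    h = 2 * math.pi / grid
    cosines = [math.cos((a + 0.5) * h) for a in range(grid)]
    total = sum(math.log(c2 - s * (ce + ch)) for ce in cosines for ch in cosines)
    # (1 / (8 pi^2)) * h^2 * total simplifies to total / (2 * grid^2).
    return math.log(2) + total / (2 * grid**2)


def high_temperature_estimate(K):
    """ln 2 + 2 ln cosh K + u^4 + 2 u^6 with u = tanh K (leading terms, small K only)."""
    u = math.tanh(K)
    return math.log(2) + 2 * math.log(math.cosh(K)) + u**4 + 2 * u**6


for K in (0.0, 0.05, 0.2, 0.4):
    print(K, minus_beta_f(K), high_temperature_estimate(K))
```

At K = 0 the integrand vanishes and the result reduces to ln 2, the entropy of independent spins. The two columns agree closely for small K and drift apart as K grows, as expected for a truncated expansion.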

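Set 2k = tanh 2K / cosh 2K. Then,

$$-\beta f = \ln(2\cosh 2K) + \frac{1}{8\pi^2}\int_0^{2\pi}\!\!\int_0^{2\pi} \ln\!\left[1 - 2k(\cos\varepsilon + \cos\eta)\right] d\varepsilon\, d\eta. \tag{4.63}$$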

Expanding the logarithm in powers of k it follows that

The series converges for |2k(cos ε + cos η)| ≤ 4|k| < 1. For k > 0 (J > 0) and at k = 1/4, that is, at the critical value K = K_c (or at the corresponding critical temperature T_c) given by

it diverges. (Similarly for J < 0, which implies k < 0 and a divergence at k = −1/4.)
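The radius of convergence can be made concrete. Carrying out the angular integrals term by term in the expansion of ln[1 − 2k(cos ε + cos η)] gives, by a standard computation that may be arranged differently from the paper's own series, −βf = ln(2 cosh 2K) − Σ_{m≥1} C(2m,m)² k^{2m}/(4m), where C(2m,m) is the central binomial coefficient. The sketch below (function names are illustrative) sums this series and prints the ratio of successive terms, which approaches 16k² and is therefore consistent with convergence for |k| < 1/4.

```python
import math


def k_of(K):
    """k defined by 2k = tanh(2K) / cosh(2K)."""
    return 0.5 * math.tanh(2 * K) / math.cosh(2 * K)


def series_term(m, k):
    """m-th term C(2m, m)^2 k^(2m) / (4m) of the integrated expansion."""
    return math.comb(2 * m, m) ** 2 * k ** (2 * m) / (4 * m)


def minus_beta_f_series(K, terms=100):
    """ln(2 cosh 2K) minus the truncated series; terms is kept modest to avoid float overflow."""
    k = k_of(K)
    return math.log(2 * math.cosh(2 * K)) - sum(series_term(m, k)
                                                for m in range(1, terms + 1))


K = 0.2
k = k_of(K)
print(minus_beta_f_series(K))
print(series_term(51, k) / series_term(50, k), 16 * k**2)
```

The maximum of tanh 2K / cosh 2K over real K is 1/2, so |k| never exceeds 1/4; the boundary value is reached exactly at the critical point.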

The internal energy U is given by

From (4.63),

where

By performing one of the integrals it is found that

where k₁ = 4k and F(k₁) is the complete elliptic integral of the first kind defined by

The elliptic function F (see Ref. [19]) has the property that

as k₁ → 1. So it diverges logarithmically at k₁ = 1, that is, at the value K_c given by (4.65).
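The logarithmic divergence can be seen numerically. The sketch below assumes the standard definition F(k₁) = ∫₀^{π/2} dφ / √(1 − k₁² sin²φ), evaluates it through the arithmetic-geometric mean, and compares it with the standard asymptotic form ln(4/√(1 − k₁²)) near k₁ = 1 (see Ref. [19]). Whether this is exactly the form of the property quoted above is an assumption, and the function names are illustrative.

```python
import math


def agm(a, b, tol=1e-15):
    """Arithmetic-geometric mean of two positive numbers."""
    while abs(a - b) > tol * a:
        a, b = 0.5 * (a + b), math.sqrt(a * b)
    return a


def complete_elliptic_first_kind(k1):
    """F(k1) = pi / (2 * AGM(1, sqrt(1 - k1^2))) for 0 <= k1 < 1."""
    k_prime = math.sqrt((1.0 - k1) * (1.0 + k1))  # complementary modulus
    return math.pi / (2.0 * agm(1.0, k_prime))


for k1 in (0.9, 0.99, 0.9999, 0.999999):
    k_prime = math.sqrt((1.0 - k1) * (1.0 + k1))
    print(k1, complete_elliptic_first_kind(k1), math.log(4.0 / k_prime))
```

The two printed columns approach each other as k₁ tends to 1, while both grow without bound.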

In relation (4.69) for U the function F is multiplied by (2tanh²2K − 1), which is zero at the critical point K_c. Indeed, from the identity cosh²x = 1 + sinh²x, relation (4.65) implies that sinh(2K_c) = 1. Using this and (4.65), tanh²2K_c = 1/2 follows. Since this factor vanishes linearly in K − K_c while F diverges only logarithmically, their product tends to zero, so the function U is continuous at K_c.
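These critical values are easy to confirm numerically. A small sketch, taking sinh 2K_c = 1 as the defining condition (as derived above) and using illustrative variable names:

```python
import math

K_c = 0.5 * math.asinh(1.0)   # solves sinh(2 K_c) = 1, i.e. K_c = ln(1 + sqrt(2)) / 2
k_c = 0.5 * math.tanh(2 * K_c) / math.cosh(2 * K_c)   # from 2k = tanh(2K) / cosh(2K)
factor = 2 * math.tanh(2 * K_c) ** 2 - 1              # prefactor of F in the internal energy

print(K_c)      # approximately 0.4407
print(k_c)      # 1/4 up to rounding, so k1 = 4k = 1
print(factor)   # zero up to rounding
```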

The specific heat can be computed from the definition

It is given by

where

and E(k₁) is the complete elliptic integral of the second kind, defined by

which is well defined at k₁ = 1. From the exact result (4.73) it follows that the specific heat is logarithmically divergent at the critical point.

[1] E. Ising, Zeitschrift für Physik 31, 253 (1925).
[2] L. Onsager, Phys. Rev. 65, 117 (1944).
[3] M. Kac and J. C. Ward, Phys. Rev. 88, 1332 (1952).
[4] S. Sherman, J. Math. Phys. 1, 202 (1960).
[5] S. Sherman, Bull. Am. Math. Soc. 68, 225 (1962).
[6] S. Sherman, J. Math. Phys. 4, 1213 (1963).
[7] P. N. Burgoyne, J. Math. Phys. 4, 1320 (1963).
[8] G. A. T. F. da Costa, J. Math. Phys. 38, 1014 (1997).
[9] R. P. Feynman, Statistical Mechanics: A Set of Lectures, Benjamin/Cummings Publishing Co., 1972.
[10] H. S. Green and C. A. Hurst, Order-Disorder Phenomena, John Wiley and Sons.
[11] S. G. Brush, Rev. Mod. Phys. 39, 883 (1967).
[12] N. V. Vdovichenko, Soviet Phys. JETP 20, 477 (1965).
[13] L. D. Landau and E. M. Lifshitz, Statistical Physics, Addison-Wesley, 1969.
[14] M. L. Glasser, Am. J. Phys. 38, 1033 (1970).
[15] C. J. Thompson, Mathematical Statistical Mechanics, Princeton University Press, 1972.
[16] G. F. Newell and E. W. Montroll, Rev. Mod. Phys. 25, 353 (1953).
[17] B. A. Cipra, The American Mathematical Monthly 94, 937 (1987).
[18] F. Brauer and J. Nohel, The Qualitative Theory of Ordinary Differential Equations, W. A. Benjamin, Inc., 1969.
[19] M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions, Dover, 1972.


Publication Dates

Publication in this collection
21 May 2003
Date of issue
2003

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

[1] [1] E. Ising, Zeitschrift f. Physik 31, 253 (1925).

[2] [2] L. Onsager, Phys. Rev. 65, 117 (1944).

[3] [3] M. Kac and J. C. Ward, Phys. Rev. 88, 1332 (1952).

[4] [4] S. Sherman, J. Math. Phys. 1, 202 (1960).

[5] [5] S. Sherman, Bull. Am. Math. Soc. 68, 225 (1962).

[6] [6] S. Sherman, J. Math. Phys. 4, 1213 (1963).

[7] [7] P. N. Burgoyne, J. Math. Phys. 4, 1320 (1963).

[8] [8] G. A. T. F. da Costa, J. Math. Phys. 38, 1014 (1997).

[9] [9] R. P. Feynman, Statistical Mechanics. A set of lectures The Benjamin and Cummings Publishing Co., 1972.

[10] [10] H. S. Green and C. A. Hurst, Order-Disorder Phenomena, John Wyley and Sons.

[11] [11] S. G. Brush, Rev. Mod. Phys. ,39, 883 (1967).

[12] [12] N. V. Vdovichenko, Soviet Phys. JETP, 20, 477 (1965).

[13] [13] Landau and Lifschitz, Statistical Physics, Addison-Wesley, 1969.

[14] [14] M. L. Glasser, Am. J. Phys., 38, 1033, 1970.

[15] [15] C. J. Thompson, Mathematical Statistical Mechanics, Princenton University Press, 1972.

[16] [16] G. F. Newell and E. W. Montroll, Rev. Mod. Phys. , 25, 353 (1953).

[17] [17] B. A. Cipra, The American Mathematical Monthly, 94, , 937 (1987).

[18] [18] F. Brauer and J. Nohel, The qualitative Theory of ODE, W. A. Benjamin, INC., 1969.

[19] [19] M. Abromowitz and I. A. Stegun, Handbook of Mathematical Functions, Dover, 1972.