MULTAN78

*******************************************************************

* *

* M M U U L TTTTTTT AAA NN N 777777 8888 *

* MM MM U U L T A A NNN N 7 8 8 *

* M M M M U U L T A A N NN N 7 8888 *

* M M M U U L T AAAAAAA N NN N 7 8 8 *

* M M U U L T A A N NNN 7 8 8 *

* M M UUUUU LLLLLL T A A N NN 7 8888 *

* *

*******************************************************************

A system of computer programmes for the

automatic solution of crystal structures from

X-ray diffraction data.

Principal author: Peter Main

Department of physics, University of york, York, England.

Others closely associated with the development of MULTAN:

S. E. Hull, I. Lessinger, G. Germain, J-P. Declercq, M. M. Woolfson.

March 1978

1 page 2

************

* Contents *

************

Section page

Introduction . . . . 3

Preparation of data for NORMAL . . . . 5

Preparation of data for MULTAN . . . . 9

Preparation of data for EXFFT . . . . 14

Preparation of data for SEARCH . . . . 15

Running the programmes . . . . 17

Flow diagram of MULTAN78 . . . . 18

Computer dependence of MULTAN78 . . . . 19

On changing array sizes . . . . 20

Description of NORMAL . . . . 21

Description of SIGMA2 . . . . 23

Description of CONVERGE . . . . 25

Description of FASTAN . . . . 28

Figures of merit . . . . 29

Description of the fourier procedure . . . . 31

Check-list for difficult structures . . . . 33

1 page 3

****************

* Introduction *

****************

MULTAN78 is an updated version of the previously issued MULTAN77.

The major alteration is the ability to use knowledge of molecular structure as described by Main (1976, crystallographic computing techniques, ed. F. R. Ahmed, pp. 97 - 105, Munksgaard). Among other things, this allows the use of an oriented molecular fragment (possibly obtained from an E-map) to help in the structure determination. Even if the position of the fragment is not known, the orientation still provides useful information to the phase determination process. In addition, an imporved 'magic integer' phase permutation is used, the progmamme NORMAL has been rewritten and SEARCH contains more general molecular recognition routines.

The complete set of programmes will perform all the necessary calculations drom the normalisation of structure factors to the output of a 'picture' of the structure on the lineprinter. The programmes are developed on a dec system 10 computer at york and an IBM 370/158 at Louvain. Earlier development took place on an ICL 1906A at Leeds University, England, and an IBM 370/165 at the centre Europeen de Calcul Atomique et Moleculaire, Orsay, France, the use of whose computing facilities the authors gratefully acknowledge. Acknowledgement is also due to the many people who have passed on helpful suggestions and given constructive criticism of earlier versions of MULTAN. Without this feedback from users, development of the programmes would take very much longer.

The general aim in producing MULTAN78 has been to provide a system of programmes which will solve crystal structures in a single computer job. The automatic operation means the user need not be an expert in direct methods to use the system successfully, yet it is hoped that the programmes are sufficiently flexible to allow user intervention at certain critical points if necessary. In addition, the authors hope the programmes are easy to use and they will continue to develop the system towards these ends. There can be no guarantee that the programmes are free of bugs or crystallographer traps and, indeed, the authors will be pleased to hear of any misbehaviour or special difficulties in implementing MULTAN78 on a particular computer.

The programmes are written in FORTRAN IV and should run, with a minimum of alteration, on any computer with a 64k store of 32-bit words. If the operating system allows several programmes to be run in the same job, it is possible to execute the programmes sequentially and obtain a solution of the crystal structure in a single computer run.

To help the implementation of the programmes, they are usually distributed with test data and the lineprinter output the data is expected to produce. As MULTAN78 is not completely machine independent, the actual output obtained may not be identical in all details to the one supplied.

Phase determination is carried out using the tangent formula of Karle and Hauptman (1956, Acta Cryst., 9, 635) which is based on the SIGMA2 probability formula of Cochran (1955, Acta Cryst., 8, 473). The tangent formula as used in MULTAN works most satisfactorily with equal atom structures but the presence of a heavy atom does not invalidate the theory. In fact, the easiest way to determine a heavy-atom structure is probably to use MULTAN to determine the heavy atom positions and as much of the remainder of the structure as the E-map gives. The remaining atoms can then be found using the NORMAL weighted fourier techniques.

1 page 4

The complete system of programmes is made up of the following:

NORMAL computes normalised structure amplitudes from F(obs) values using >

****** either a K-curve, a Wilson plot or a Debye curve. Outputs intensity

statistics as a check on the scaling and an aid to space group determination.

Prepares a reflexion input file for MULTAN or can be used to prepare files for Karle

recycling or for weighted fourier calculations.

K-curve, Wilson plot, a Debye curveを用いてF(obs)から規格化構造因子 (E) の計算する．

MULTAN consists of three logical sections:

****** a) SIGMA2 finds sets of three reflexions whose indices are related as h, h' and h-h'.

h, h', h-h'関係の反射を求める．

b) CONVERGE finds the starting reflexions for the tangent formula by

application of the SIGMA1 formula and by assigning phase values to

reflexions which define the origin and enantiomorph. A small number

of other reflexions may be chosen which give a 'good start' to the

tangent formula. As the values of these phases are initially unknown,

a number of different phase values are assigned by 'magic integers',

thus producing a multiple starting point and hence the name of the programme.

Tangent 式のための原点になる反射を求める．

c) FASTAN determines phases for all reflexions from each starting set produced

by CONVERGE using the tangent formula. Figures of merit for each set of phases

are also computed.

FASTANは用いた反射の位相を決定する．任意の反射数に応じた多重解となる．

EXFFT computes an E-map from a set of normalised structure factors. This

***** programme uses the Cooley-Tukey fourier transform algorithm and is

adapted from a crystallographic programme written by L. F. Ten Eyck.

フーリエ合成を行い，電子密度図 E-mapを計算する．

SEARCH finds the coordinates of the highest peaks in the E-map and, for a

****** molecular structure, looks for groups of peaks which could form a

molecule or a molecular fragment consisting of at least five atoms.

Computes bond lengths and angles for each molecular fragment and outputs on

the lineprinter a properly scaled view of the peaks plotted on the least

squares plane. Other orthogonal projections can be output if required. Each

fragment is interpreted in terms of bonded atoms by applying simple

stereochemical constraints and alternative interpretations are output if found.

The interpreted fragments are compared with those input by the user and

corresponding atoms are identified. This part of the programme is a rewritten

version of the original by Michel Koch (Acta Cryst., 1974, B30, 67).

電子密度図 E-mapから原子位置を算出する．

1 page 5

**********************************

* Preparation of data for NORMAL *

**********************************

------------------------------------------------------------

Card 1 FORMAT(20A4) TITLE - any FORTRAN characters in cols. 1 - 80

The TITLE is reproduced at the head of a number of pages of output. It is

recommended that it include useful information such as name or mnemonic of

compound, chemical formula, space group and number of atoms per unit cell.

------------------------------------------------------------

Card 1'

col 5 ifile = 0 read data from cards

= 1 read data from file

------------------------------------------------------------

Card 2 FORMAT(I5,4X,A1,7I5,F5.0,I5,F5.0) programme parameters

Note that this card is changed from the MULTAN74 version.

col 5 ICENT = 0 for non-centrosymmetric space group

= 1 for centrosymmetric space group

col 10 LATT = p,a,b,c,i,f or r (lattice type)

if the lattice type is r, the space group symmetry may

be described by either rhombohedral or hexagonal axes.

cols 11-15 NSYM = number of symmetry cards to be input (see cards 5 below)

cols 16-20 NGP = number of atom groups of known geometry to be input (<10)

(see descriptions of NORMAL and MULTAN for explanation)

col 25 INORM = 0 to normalise using Debye curve (same as Wilson plot if

NGP = 0)

= 1 to normalise using K-curve, i.e. a non-analytical curve

is fitted to the Debye curve for scaling

cols 26-30 NB = number of points on Wilson (or Debye) plot. If left

blank, the default value of 8*log10(n/20) is used (5.GT.NB

.LT.20), where N is the total number of reflexions input

col 35 ISCALE = 0 to apply separate scale factor for each parity group or

other appropriate index group so that = 1.0

= 1 to apply a single overall scale factor

col 40 ILIST = 0 to suppress listing of complete set of calculated E's

= 1 to obtain listing of E's

cols 41-45 NREF = number of reflexions with the largest E-values to be

written on the MULTAN data file (channel 9).

= 0 gives the default value of 100 + 4*(number 0f

independent atoms).

see below for limits on the value of NREF.

cols 46-50 EMIN = minimum E-value accepted for the nref strong reflexions.

= 0 gives the default value of 1.2

cols 51-55 IZRO = number of smallest E-values to be written on the MULTAN

data file (IZRO < NREF) for the PSI(ZERO) figure of

merit (see section on 'figures of merit').

= 0 gives the default value of 50

cols 56-60 EMAX = largest E-value accepted for the IZRO weak reflexions.

= 0 gives the default value of 0.2

ISCALE = 0 is recommended unless there are specific reasons to use

ISCALE = 1. the index groups are as follows (the order is fixed):

Triclinic: parity groups eee, oee, eoe, ooe, eeo, oeo, eoo, ooo.

Monoclinic, Orthorhombic: same as triclinic.

Tetragonal: modified parity groups eee, oee+eoe, ooe, eeo, oeo+eoo, ooo.

Trigonal, Hexagonal: evaluate mod(h,3), mod(k,3), mod(h+k,3), mod(l,2).

Cubic: evaluate mod(l-h,3), mod(k-l,3), mod(h-k,3), mod(h+k+l,2).

Primitive rhombohedral: same as cubic

For trigonal and higher symmetries, reflexions are divided into the

following six groups: oooe, ooee+oeoe+eooe, eeee, oooo, ooeo+oeoo+eooo,

eeeo. where e denotes a zero and o denotes non-zero for the above

quantities.

1 page 6

If some or all of the molecular geometry is known, as much as possible

should be included in the NGP atomic groups.

It should not be necessary to use more than 10 reflexions per atom in the

asymmetric unit. Usually NREF may be set at about 7 or 8 times the number of

atoms for orthorhombic symmetry. More reflexions may be required for lower

symmetry and fewer reflexions for higher symmetry. MULTAN limits the number of

reflexions to the minimum of KUSER3 and KUSER2/NSYM. The parameters KUSER3

and KUSER2 are set at the beginning of MULTAN to 500 and 2500 respectively and

are related to array sizes. These may be changed by the user as explained in

the section 'on changing array sizes'. because of packing, NREF is limited to

417 for 24-bit word computers.

------------------------------------------------------------

Card 3 FORMAT(9F8.0) Temperature factor and scale factor card

If the programme is to calculate its own scale and temperature factors,

this card should be left blank or put temp = 0.0 (note that some computers

ignore completely blank cards).

cols 1- 8 TEMP = temperature factor B (must be supplied if scale factor(s)

are to be supplied, but temp only may be supplied and the

programme will calculate its own scale factor(s)).

cols 9-16 SCALE1 = overall scale factor if ISCALE = 1

= first of 8 (or 6) scale factors if ISCALE = 0, the

remaining scale factors being punched in cols 17-24 etc.

Because of default options, temp must not be exactly 0.0 if supplied.

The scale factors are for the appropriate index groups, and in the same

order, as listed under card 2 for ISCALE.

F**2(absolute) = k * F**2(observed) defines the scale factor to be used.

if INORM = 1, TEMP and SCALE1 etc. will always be calculated.

------------------------------------------------------------

Card 4 FORMAT(6F10.6) Unit cell parameters

a, b, c (angstroms) alpha, beta, gamma (degrees)

------------------------------------------------------------

Cards 5 Free FORMAT the NSYM symmetry cards

Each card contains one general equivalent position punched, for example,

as 1/4-x, y, 1/2+z

spaces are ignored on input and the three fields are separated by commas.

If the space group is centrosymmetric, do not include equivalent positions

related by the centre of symmetry.

If the space group is centred, include only those general equivalent

positions associated with one lattice point.

Example: C2/c NSYM = 2 and the 2 symmetry cards are x, y, z and

x, -y, 1/2+z

------------------------------------------------------------

Card 6 FORMAT(8(3X,A2,I5)) unit cell contents

cols 4- 5 NAT(1,1) = chemical symbol of atomic type 1

cols 6-10 NAT(1,2) = number of atoms of this type in the unit cell

etc. to a maximum of 8 atomic types

Input the total contents of the unit cell - include all atoms whether the

1 page 7

lattice is centred or primitive. Include all atoms whether input later as part

of a group or not.

The scattering factors for 50 different kinds of atom are stored in the

programme. In order of atomic number, their chemical symbols as punched are:

H, LI, Be, B, C, N, O, F, Na, Mg, Al, Si, P, S, Cl, K, Ca, Ti, V, Cr, Mn, Fe,

Co, Ni, Cu, Zn, As, Se, Br, Rb, Sr, Zr, Mo, Ru, Rh, Pd, Ag, Cd, Sn, Sb, I, Cs,

Ba, W, Os, Pt, Au, Hg, Pb, Bi.

If the atomic symbols punched on the card are not one of the above, the

constants for the analytical scattering factor should be read in from the

following cards.

------------------------------------------------------------

Cards 7 FORMAT(5F10.0) analytical scattering factor constants

These cards are included only if an atomic type does not correspond to

one of the above 50 and are input in the order of appearance of the

atoms on card 6

The scattering factor curve is represented as

f = a * exp(-a'*s**2) + b * exp(-b'*s**2) + c

where s = sin(theta)/lambda and the 5 constants are input in the order

a, a', b, b' and c.

For values of the constants see for example Frsyth and Wells (1959) Acta

Cryst.,12,412.

------------------------------------------------------------

Cards 8 FORMAT(1X,A4,2I5,F5.0,6F10.3/(80A1)) the NGP atomic groups

if NGP = 0 these cards are omitted. Note the parameter NINF additional

to the MULTAN77 version.

These are groups of atoms whose geometry, orientation or position are

known. For each group, the first card is:

cols 2- 5 IDENT = any four characters to identify the group, e.g. BENZ

cols 6-10 NINF = 2 for group of random orientation and position

= 3 for group of known orientation and random position

= 4 for known atomic positions

= 5 if atoms to be used for Karle recycling

groups of type 2 may be included with groups of type 5

= 6 if atoms to be used in weighted fourier calculations

for groups of type 6 iscale on card 2 must be 1,

and scale1 and bt must be specified on card 3 (values

can be taken from a previous run of NORMAL)

no other groups should be included with a group of type 6

cols 11-15 NT = number of atoms in this group

cols 16-20 GN = number of times this group is present in the unit cell

cols 21-30, 31-40, et seq. aa, bb, cc (a), alpha, beta, gamma (deg.)

the coordinate system to which the fractional coordinates

of the atoms in this group are referred. Leave blank for

default option (see below).

The remaining cards contain the chemical symbols and coordinates of the NT

atoms in the group in free FORMAT, separated by spaces. Each card contains:

NF = chemical symbol of atom. Note that this symbol must be identical to one

of those on card 6.

x ) fractional coordinates of atom referred to the specified system of axes.

y ) if NINF = 2, the origin and orientation of the group are arbitrary.

z ) if NINF = 3, the origin is arbitrary.

1 page 8

If NINF = 2, the spherically averaged molecular scattering factor is

calculated to aid the scaling either by the debye curve or K-curve method.

If the orientation or position of the group is known (NINF = 3 or 4), it is

automatically passed on to MULTAN to be used in the estimation of the phase and

variance of SIGMA2 relationships.

If NINF = 2, the default for the coordinate system to be used for the

atomic positions is aa = bb = cc = 1.0, alpha = beta = gamma = 90.0, i.e.

the coordinates are given in angstroms referred to an orthogonal system of

axes. If NINF = 3 or 4, the coordinate system is that on card 4.

------------------------------------------------------------

Card 9 FORMAT(I5,18A4) FORMAT for reflexion data

cols 1- 5 NNN = number of reflexions punched on each card (.LE. 12)

if NNN is negative, a file of all input reflexions is

created on channel 15.

cols 6-77 FMT = FORMAT for reading reflexion cards

example: -5 (5(3I3,F5.2,I2))

It is suggested that NNN>12 be used to signify input of reflexion data

from magnetic tape or disc (channel 15). Individual installations may arrange

the exact form of input for themselves, but the programmes as issued can read

the 'standard' file by putting NNN = 15. (Note that NNN = -15 will read

reflexions from the file then rewrite the file after calculating the E's.

------------------------------------------------------------

Cards 10 FORMAT as specified on card 9 reflexion data

include only if NNN .le. 12

The programme expects NNN reflexions per card. For each reflexion, the

programme expects h, k, l, F(obs), ID where all are integer except F(obs).

(note that the decimal point need not be punched, but may have its position

specified by the FORMAT description. This may save space in packing the

reflexions on to the cards.)

ID = 0 for observed reflexion; ID = 1 for unobserved. The programme will

not allow reflexions with ID = 1 to be used in MULTAN for phase

determination no matter how large the E-value.

Because of packing, the maximum magnitude of h, k or l (also h+k for

trigonal or hexagonal) is 49 for 24-bit word machines and 127 for 32-bit words.

The data are terminated by a dummy reflexion with a negative F(obs).

The programme expects a complete set (observed and unobserved) of unique

reflexions (those not related by symmetry). It assumes that systematically

absent reflexions (caused by screw axes, glide planes and lattice centrings)

will not be input. Failure to observe these points will usually result in

inaccurate scaling of the E's.

The highest NREF E's will be written onto the MULTAN data file (channel

9) with the restriction that E > Emin and the reflexion is observed.

the lowest izro E's will also be written onto the MULTAN data file for

the PSI(ZERO) test with the restriction that E < EMAX.

(see the section on figures of merit).

下記の例は，空間群P(-)の1非対称単位に２個の分子が入っている難解な例である．

前段階の規格化構造因子 (E)の計算の段階で，E>7.0を越える異常な反射が１個あり，

試行錯誤の結果，後段の位相決定の際，悪影響を及ぼしている可能性が示唆されたの

で，これを除外した． Eの数は，念のため500個に増やした．

MULTANの段階ではNANYを8-10個に増やして解析した．1024個の解の中から

Combined Figure of Meritが2.0を越える解（2.2415）が正解であった．

********** Input Data Example **********

cpcot64 adduct

0

1 P 1 0 0 0 0 1 500 0 0 0

16.197 20.990 9.505 134.32 91.72 90.35

X,Y,Z

H 96 C 116 O 20

1 (I3,2I4,F8.2,I1,F5.2)

0 0 1 256.73 0.80

0 0 2 259.48 1.14

0 0 3 138.70 1.44

0 0 4 15.56 1.75

0 0 5 92.86 1.04

0 0 6 17.23 2.39

0 0 7 7.40 5.28

0 0 8 12.85 3.64

0 1 -8 7.96 5.68

0 1 -7 13.51 3.18

0 1 -6 32.27 1.40

0 1 -5 16.74 1.99

0 1 -4 14.33 1.86

0 1 -3 56.45 0.73

0 1 -2 100.14 1.14

0 1 -1 119.49 0.74

0 1 0 2.52 1.80

0 1 1 188.00 0.95

0 1 2 75.46 0.81

0 1 3 50.44 0.76

0 1 4 27.34 1.22

･

(Total 4274 RD)

･

10 -9 -3 7.69 0.00

10 -9 -2 10.67 0.00

10 -9 -1 39.62 0.00

10 -9 0 18.82 0.00

10 -9 1 5.68 0.00

-10.0

****************************************

1 page 9

**********************************

* Preparation of data for MULTAN *

**********************************

------------------------------------------------------------

Card 1 FORMAT(20A4) TITLE - any FORTRAN characters in cols 1-80

------------------------------------------------------------

Card 2 FORMAT(I5) control parameter

col 5 IPATH = 0 to run SIGMA2, CONVERGE and FASTAN using default

values throughout, i.e. no further data need be supplied.

= 1 to run SIGMA2 (cards 3- 7)

= 2 to run CONVERGE (cards 8- 9)

= 4 to run FASTAN (cards 10-15)

= 8 to halt the programme, i.e. no calculation takes place.

Any sensible sum of the above codes is also accepted to run sections of

the programme in combination. IPATH = 5 is not allowed. Include only those

data cards needed for the sections of the programme asked for. If IPATH = 0,

no data cards at all are required and default values will be used throughout.

In this case, the data file written by NORMAL must be used.

------------------------------------------------------------

Card 3 FORMAT(16I5) unit cell contents

cols 1- 5 JN(1) = number of atoms of type 1 in the unit cell

cols 6-10 JZ(1) = atomic number of atom type 1.

etc. to a maximum of 8 different atomic types.

If the data file written by NORMAL is used for input to MULTAN, this card

should be left blank, otherwise the complete contents of the unit cell should

be given. Note that this differs from the input of unit cell contents to

NORMAL.

------------------------------------------------------------

Card 4 FORMAT(3I5,4X,A1,2I5,F10.0) SIGMA2 parameters

If the data file written by NORMAL is used, only ilist and nsrmax need be

supplied.

col 5 ILIST = 1 to request SIGMA2 listing

= 0 to suppress SIGMA2 listing

cols 6-10 NSRMAX = maximum number of unique phase relationships to be used.

NSRMAX can not exceed KUSER4/3.

= 0 gives the default value of 11 * NREF - 500.

col 15 ICENT = 0 for non-centrosymmetric space group

= 1 for centrosymmetric space group

col 20 LATT = p,a,b,c,i,f,r (lattice type)

cols 21-25 NSYM = number of symmetry cards to follow (= 0 if the data file

written by NORMAL is used).

cols 26-30 NGP = number of atomic groups to be input on cards 5a.

cols 31-40 EMIN = minimum value of E to be used. All E's less than this

are ignored on input.

The programme sets up all possible phase relationships and selects the

NSRMAX strongest. From experience, the optimum ratio of number of phase

relationships to number of reflexions is about 8. Fewer relationships than

this may result in failure to solve some structures, while more relationships

tends to use computer time unnecessarily. For difficult structures, use as

many relationships as possible.

1 page 10

------------------------------------------------------------

Cards 5 free FORMAT the NSYM symmetry cards

If the data file written by NORMAL is used, these cards are omitted.

Each card contains one general equivalent position punched as described

in the input to NORMAL e.g.

1/2 - x, -y, 1/2 + z

If the space group is centrosymmetric, do not include equivalent

positions related by the centre of symmetry.

If the space group is centred, include only those general equivalent

positions associated with one lattice point.

------------------------------------------------------------

Cards 5a FORMAT(2I5/(I5,3F10.6)) the NGP atomic groups

These are groups of atoms whose orientation or position is known and are

to be used in the estimation of the phase and variance of SIGMA2 relationships.

If these groups have already been input to NORMAL and the data file written by

NORMAL is used by MULTAN, they will be used automatically and should not be

input again here.

For each group, the first card is:

cols 1- 5 NINF = 3 for group of known orientation and random position

= 4 for known atomic positions

cols 6-10 NAG = number of atoms in the group.

the remaining cards contain the atomic number and coordinates of the NAG

atoms in the group. Each card contains:

cols 1- 5 NZ = atomic number

X ) fractional atomic coordinates

Y ) the origin is arbitrary if ninf = 3

Z )

------------------------------------------------------------

Cards 6 variable FORMAT reflexion data

If the data file written by NORMAL is used, these cards are omitted.

The first card contains the FORMAT statement for the data which follows,

one reflexion per card. example: (3I4,F8.2)

Each subsequent card contains h,k,l,E for one reflexion.

The last card contains a dummy reflexion with a negative E-value.

It should not be necessary to use more than 10 reflexions per atom in the

asymmetric unit and usually about 7 or 8 times the number of atoms may be used.

The programme limits the number of reflexions to the minimum of KUSER3-1 and

KUSER2/NSYM. The parameters KUSER3 and KUSER2 are set at the beginning of

MULTAN to 500 and 2500 respectively and are related to array sizes. These may

be changed by the user as explained in the section 'on changing array sizes'.

The reflexion cards may be input in any order as the programme decides

internally on the optimum order for phase determination. However, it should

be noted that the order of input of reflexions in the E-map data is the same as

the order of input here.

1 page 11

------------------------------------------------------------

Cards 7 Same FORMAT as cards 6 The PSI(ZERO) reflexions

(see 'description of SIGMA2' and 'figures of merit')

If the data file written by NORMAL is used, these cards are omitted.

each card contains h, k, l, E

The last card contains a dummy reflexion with a negative E-value. If no

PSI(ZERO) reflexions are included, the terminating card must still be present.

The number of PSI(ZERO) reflexions must be less than the number of

reflexions in cards 6. Normally, about 50 reflexions with very small or zero

E-value will produce a sensitive figure of merit. (Do not use systematically

absent reflexions).

------------------------------------------------------------

Card 8 FORMAT(6I5,F10.0) CONVERGE parameters

col 5 LIST = 1 to obtain complete listing of convergence map

= 0 to list bottom of convergence map only.

= -1 to suppress listing of convergence map

cols 6-10 NINPUT = number of reflexions to be input to converge on cards 9

cols 11-15 NSPEC = number of special reflexions required in starting set

cols 16-20 NGEN = number of general reflexions required in starting set

cols 21-25 NANY = number of reflexions of any sort required in starting set

cols 26-30 NSREQ = maximum number of phase sets to be generated by FASTAN.

= 0 gives the default value of 64.

NSREQ is only used if NSPEC = NGEN = NANY = NINPUT = 0.

cols 31-40 PROB = lower limit of probability for acceptance of a phase

determined by the SIGMA1 formula. (see 'description of

CONVERGE' for advice on the use of this parameter).

= 0 gives the default value of 0.95

The reflexions which appear in the starting set will be those needed for

origin definition, the SIGMA1 reflexions and the NINPUT + NSPEC + NGEN + NANY

others. See 'description of CONVERGE' for the number of phase sets these

reflexions will generate.

If NINPUT = NSPEC = NGEN = NANY = 0, the programme will choose the number

of starting reflexions required with the constraint that the number of phase

sets generated will not exceed nsreq.

Special reflexions are those whose phases are restricted to two values

such as zero or pi. General reflexions are those whose phases may take on any

value. Note that in centrosymmetric space groups there are no general

reflexions; all reflexions are special.

At least one phase must be permuted for FASTAN to work.

------------------------------------------------------------

Cards 9 FORMAT(5I5,F10.0) The NINPUT reflexions for CONVERGE

These reflexions allow the user to impose his own choice of starting point

on the programme. Each card contains information for one reflexion.

cols 1- 5 KODE = code number of reflexion as listed in SIGMA2 output

cols 6-10 H ) indices of reflexion in 'standard' form

cols 11-15 K ) as listed in the SIGMA2 output

cols 16-20 L )

cols 21-25 IFAZ = phase angle in degrees

1 page 12

cols 26-35 WATE = -2.0 for enantiomorph-fixing reflexion (IFAZ ignored)

= -1.0 for starting set reflexion (IFAZ ignored)

= 0.0 to 1.0 the weight of a known phase

= 2.0 for origin-fixing reflexion.

Either kode or h, k, l may be used to specify a reflexion. If KODE is

punched, h, k, l are ignored.

For an enantiomorph-fixing or starting set reflexion, IFAZ is ignored

since the programme will assign an appropriate phase value and permute it in

the normal way.

For a known phase, a weight greater than 0.99 will be interpreted as

meaning certainty and the phase will be fixed during tangent formula refinement

Apart from its obvious purpose, this facility of inputting reflexions to

CONVERGE can be used to persuade the programme to choose different origin

defining reflexions if the user is dissatisfied with its original choice.

One, two or three origin-defining reflexions of the user's own choice may be

read in here and, if necessary, the programme will choose the remaining reflexions

needed to define the origin completely.

------------------------------------------------------------

Card 10 FORMAT(5I5,3F10.0) FASTAN parameters

cols 1- 5 IMK = number of starting phases to be input from cards

= 0 if starting phases to be generated by the programme

col 10 IPRINT = 1 to obtain listing of final phase values

= 0 to suppress listing of final phase values

col 15 ISET = 0 no action.

= 1 read phase set numbers off cards 11 and generate these

phase sets only.

col 20 IFOUR = 0 to request preparation of data file (on channel 10)

for input to EXFFT to calculate E-maps

= 1 if no such data file is required

note that the significance of these values of IFOUR has

been reversed from those in MULTAN77.

cols 21-25 ISKIP = number of phase permutations to be skipped before

calculations begin. MULTAN begins with permutation

number iskip+1, so normally ISKIP = 0.

cols 26-35 WTFOM1 = weight for absolute figure of merit

cols 36-45 WTFOM2 = weight for PSI(ZERO) figure of merit

cols 46-55 WTFOM3 = weight for residual figure of merit

The maximum number of starting phases which can be introduced from cards

(parameter IMK) is KUSER3/2. The parameter KUSER3 is set at the beginning of

MULTAN and is normally 500. For more information see the section 'on changing

array sizes'.

When rerunning a job that ran out of time, ISKIP should be set to the

number of sets of phases previously written on channel 10. Note that this may

not correspond to the last set of phases listed on the lineprinter because of

buffering. To do this the files on channels 10 and 11 must be saved from the

previous run, and parts 1 and 2 of MULTAN must not be rerun.

The WTFOM parameters are used in the calculation of the combined figure

of merit (see section on 'figures of merit'). If left blank, these weights will

be set to their default values of 1.0. If the space group contains no

translational symmetry, better discrimination may be obtained with 0.8, 1.2

and 1.0 as the three weights.

1 page 13

------------------------------------------------------------

Cards 11 FORMAT(20I4) Phase set numbers

include only if ISET = 1

cols 1- 4 NUMSET = number of phase set to be generated

cols 5- 8 etc. phase set numbers should be in ascending order

= -1 to terminate input of numbers

As many cards 11 as desired may be input. This facility allows the

generation of particular sets of phases from a previous run of the programme.

------------------------------------------------------------

Cards 12-15 include only if IMK > 0. (maximum value of IMK is KUSER3/2).

Cards 12 FORMAT(20I4) code numbers of the IMK starting reflections

------------------------------------------------------------

Cards 13 FORMAT(20i4) markers of the IMK starting reflexions

= 1 if phase is to be allowed to vary from the start of phase

determination

= -1 if phase is to be kept fixed and only allowed to vary during the

last few cycles of tangent formula refinement

------------------------------------------------------------

Cards 14 FORMAT(20F4.0) weights of the IMK starting reflexions (0.0 to 1.0)

origin and enantiomorph defining reflexions should be given weights of 1.0

Cards 15 FORMAT(20I4) phases, in degrees, of the IMK starting reflexions

and NUMSET (in field IMK+1)

NUMSET = identification number of set of phases

As many cards 15 as desired may be input to produce different starting

points for the tangent formula. Data is terminated by a dummy set of phases

with NUMSET = 0.

MULTANの段階ではNANYを8-10個に増やして解析した．1024個の解の中から

Combined Figure of Meritが2.0を越える解（2.2415）が正解であった．

********** Input Data Example **********

cpcot

7

0

10

0

****************************************

MULTANの出力リストのまとめ部分を抽出

ABS FOM, PSI ZERO, RESIDから総合評価COMBINED FOMを算出した部分

cpcot

SUMMARY OF FIGURES OF MERIT OUTPUT BY TANGENT FORMULA

ABS FOM PSI ZERO RESID

MAXIMUM VALUE 2.0092 5.992 46.77

MINIMUM VALUE 0.4546 0.934 14.80

RELATIVE WEIGHTS 1.00 1.00 1.00

(FOR COMBINED FOM)

NUMSET ABS FOM PSI ZERO RESID COMBINED FOM

1 2.0092 5.992 14.80 2.0000 誤った解

446 1.1302 0.934 20.97 2.2415 第446目の解が正解

1 page 14

*********************************

* Preparation of data for EXFFT *

*********************************

------------------------------------------------------------

Card 1 FORMAT(2I5,5X,I5,4F5.0) E-map parameters

cols 1- 5 NOSET = serial number of the set of phases from which to

calculate the E-map

= 0 to choose set with highest combined figure of merit.

if EXFFT is rerun with NOSET = 0, the set with the next

highest combined figure of merit will be used.

cols 6-10 LEVEL all numerical values in the E-map greater than level are

underlined with **** in the lineprinter output to

facilitate contouring. If LEVEL = 0, the default value

of 100 is used. Note that the map is automatically

scaled between + & -999 (approximately) at the grid

points at which it is calculated.

col 20 IPRINT = 0 to suppress printing of the E-map

= 1 to obtain the printing of the E-map

cols 21-25 FACTOR the grid on which the E-map is evaluated will have a

spacing of approximately 1/factor angstroms.

= 0 gives the default value of 3.0

cols 26-30 XMAX )

cols 31-35 YMAX ) printing limits along the x, y, z directions

cols 36-40 ZMAX )

If IPRINT = 1, the E-map will be printed for the portion of the unit cell

running from 0.0 to xmax in x, 0.0 to ymax in y, 0.0 to zmax in z.

If xmax, ymax and zmax are all 0.0, the default values are 1.0, 1.0, 1.0

(complete unit cell).

The printing of the E-map will normally be used only rarely. The output

is in sections of constant y with x across the page and z running down it on a

grid of about 3 points per angstrom. The maximum number of grid points across

the page is 36 (i.e. about 12A in length). If more are needed, a new page is

started. All numerical values greater than level are underlined with **** to

facilitate contouring.

------------------------------------------------------------

Card 2 FORMAT(6F10.0) unit cell parameters

include only if not already supplied to NORMAL

a, b, c (angstroms), alpha, beta, gamma (degrees).

********** Input Data Example **********

446

****************************************

1 page 15

**********************************

* Preparation of data for SEARCH *

**********************************

------------------------------------------------------------

Card 1 FORMAT(4I5,5F10.0) Programme parameters

cols 1- 5 NPC = number of peaks to SEARCH for (<180).

= 0 gives the default of (11*n)/9 where n = number of

independent atoms (excluding h). If two fragments are

within 2.8a of each other, the default is automatically

increased to (3*n)/2.

cols 6-10 NA = number of additional atoms to be included in the

interpretation of the map (NPC+NA < 200)

= 0 if no additional atoms.

cols 11-15 MFRN > 0 if bond sequences to be input for molecular recognition

= 0 if no bond sequences input.

cols 16-20 NPROJ = number of orthogonal projections of each fragment to be

plotted (maximum of 3).

= 0 programme decides number for itself (maximum of 2).

= -1 no projections are plotted at all.

cols 21-30 DMT interpeak distances less than DMT are output (maximum of

25 distances for each peak).

= 0 gives the default value of 2.4A.

cols 31-40 DMN = minimum allowed bond length in the interpretation

= 0 gives the default of 1.1A.

cols 41-50 DMX = maximum allowed bond length in the interpretation

= 0 gives the default of 1.95A.

cols 51-60 ANGMN = minimum allowed bond angle in the interpretation

= 0 gives the default of 85 degrees.

cols 61-70 ANGMX = maximum allowed bond angle in the interpretation

= 0 gives the default of 145 degrees.

The three orthogonal projections of each fragment which can be requested

by the NPROJ parameter are on the least squares plane, the plane orthogonal to

the least and most squares plane and on the most squares plane, in that order.

If the default is used (NPROJ=0) the programme will always give the first

projection (on the least squares plane). It will output the second projection

if the fragment is cylindrical or globular. The third projection (on the most

squares plane) is output only if NPROJ = 3.

Note that the acceptable bond distances and angles given by the default

values do not include all possible chemical configurations of atoms.

------------------------------------------------------------

Card 2 FORMAT (20A4) FORMAT for atom cards (include only if NA>0).

cols 1-80 FORMAT for reading in coordinates of additional atoms on

cards 3. e.g. (3F10.5)

------------------------------------------------------------

Cards 3 FORMAT as specified on card 2 atomic coordinates (only for NA>0)

There are na cards, each containing x,y,z, for one additional atom.

The additional atoms input here are included in the bond-lengths and

angles calculation and are also output as part of the picture. This facility

will not normally be used for the interpretation of E-maps, but is very useful

in the later stages of crystal structure determination when a difference

fourier may be used to locate hydrogen atoms. If SEARCH is used for this task,

inclusion of the subtracted atoms at this point ensures they are included in

the bond-lengths and angles calculations and also that they appear in the

1 page 16

lineprinter picture of the structure.

------------------------------------------------------------

Cards 4 Free FORMAT Bond sequence codes (include only if MFRN>0)

The bond codes allow the programme to compare the molecular fragment it

finds with that input here. They are not used by the programme in its SEARCH

for a fragment and are therefore optional.

The bond codes for any molecule are found as follows:

(a) Draw the molecule or fragment whose chemical structure (not

necessarily geometry) is known.

e.g. 0--0 0

i i i i

0 0--0 0

i i i i

0--0 0

(b) Number the atoms in any order

e.g. 1--3 8

i i i i

4 2--6 10

i i i i

5--7 9

(c) Specify the connections to each atom as follows:

1 3 4/2 3 6 7/3 1 2/4 1 5/5 4 7/6 2 8 9/7 2 5/8 6 10/9 6 10/10 8 9//

which means atom 1 is bonded to atoms 3 and 4; 2 is bonded to 3, 6 and 7 etc.

Redundant information may be removed from the bond sequence

e.g. 1 3 4/2 3 6 7/4 5/5 7/6 8 9/8 10/9 10//

or 1 3 4/2 3 6 7/5 4 7/8 6 10/9 6 10//

in which each bond is specified only once.

(d) Type any of these descriptions in free FORMAT, ending the information

with //. The maximum number of atoms in the molecule is 200. Up to five

molecules can be input in this way and the complete data is terminated with ///

The comparison routines in this version of SEARCH are described by Main &

Hull (1978) Acta Cryst., in press. All identified peaks are put in a data file

(on channel 14) for use by NORMAL in one of the recycling procedures. No peaks

will be identified if no bond sequences are input. In this case, file 14 will

contain the peaks from the largest interpreted fragment. However, it is

strongly recommended that the output from SEARCH be examined carefully, since

it will most often be the case that user intervention will be either helpful or

necessary before recycling can be intelligently used.

********** Input Data Example **********

0 0 0

****************************************

ピークサーチプログラムで見つけられた分子座標はラインプリンタ用に投影図が書き出される．

+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

cpcot SET NO.446 FRAGMENT 1

PLOT OF PEAKS ON LEAST SQUARES PLANE SCALE = 2.50 CMS/A

+ 68

+ 27

+ 67

+ 90

+ 62

+ 33

+ 20

+ 59

+ 85

+ 31

+ 50

+ 79

+ 83

+ 43

+ 102

+ 16

+ 5

+ 34

+ 35

+ 81

+ 76

+ 23

+ 80

+ 58

+ 19

+ 10

+ 22

+ 26

+ 6

+ 64

+ 24

+ 57

+ 53

+ 32

+ 88

+ 97

+ 41

+ 52

+ 11

+ 100

+ 70

+ 1

+ 69

+ 93

+ 63

+ 51

+ 37

+ 74

+ 87

+ 66

+ 94

+ 96

+ 78

+ 61

+ 30

+ 39

+ 56

+ 75

+ 47

+ 17

+ 49

+ 2

+ 77

+ 9

+ 14

+ 60

+ 12

+ 44

+ 89

+ 28

+ 73

+ 55

+ 54

+ 3

+ 98

+ 4

+ 38

+ 15

+ 18

+ 29

+ 8

+ 7

+ 42

+ 91

+ 46

+ 99

+ 25

+ 86

+ 72

+ 40

+ 21

+ 92

+ 45

+ 13

+ 82

+ 84

+ 36

+ 71

+ 101

+ 65

+ 48

+ 95

同時にPEAKというファイルが作られるので，パソコン端末側に転送するかあるいはウインドウ間でCOPY&PASTEして，適当な分子表示プログラム（Chem3D等）で表示させる．

16.19700 20.99000 9.50500 134.31999 91.72000 90.35000

1 0.91654 0.10039 0.71643

5 0.98118 0.24933 0.83630

6 0.99326 0.18481 0.46550

10 0.81588 0.31442 0.67992

11 0.74400 0.23430 0.83691

16 1.04354 0.21386 0.80041

19 0.85892 0.29219 0.76030

20 0.98051 0.40226 0.96471

22 0.85445 0.25498 0.82697

23 0.91607 0.24852 0.93442

24 0.91673 0.16137 0.89727

26 0.90274 0.17101 1.04021

27 1.07441 0.42386 1.05675

31 0.92117 0.34026 1.16239

32 0.72206 0.27174 0.99548

33 1.00422 0.34941 1.24725

34 1.00112 0.26437 0.61226

35 0.77739 0.38666 0.79738

37 0.59825 0.20045 0.83239

41 0.76232 0.29823 0.42574

43 0.96009 0.29823 0.78069

50 1.04659 0.31097 0.61521

51 0.92052 0.01617 0.68339

52 1.00617 0.13851 0.26814

53 0.82126 0.27953 0.50422

59 0.89885 0.42113 1.21909

62 0.93749 0.45155 1.12982

63 0.69948 0.18876 0.67320

64 0.72201 0.36607 0.52053

66 0.62276 0.16103 0.66153

67 1.07733 0.38493 1.27752

68 1.11841 0.41669 1.17501

69 0.62304 0.24909 0.98994

79 0.78241 0.46866 0.78160

85 0.87975 0.38140 1.42638

88 0.69511 0.33536 0.68603

90 0.98498 0.39008 1.43519

94 0.93902 -0.06179 0.45468

97 0.74637 0.21819 1.06296

1 page 17

**************************

* Running the programmes *

**************************

The separate programmes in MULTAN78 communicate with each other through

data files. The programmes, together with the files used and their logical

numbers are set out on the next page. File 9 contains symmetry information,

the unit cell contents and the reflexions used by MULTAN for phase

determination and the PSI(ZERO) test. File 15 contains the complete data set

input to NORMAL and can be used for subsequent runs of NORMAL. File 11

contains all the reflexions used by MULTAN together with the SIGMA2

relationships, SIGMA1 information and the tangent formula starting reflexions

chosen by CONVERGE. File 10 contains the phase sets output by MULTAN which

are then passed on to EXFFT for the computation of E-maps. Files containing

formatted information (card images) are 9, 12 and 14. All other files are

unformatted (binary).

Normally, it will not be found necessary to output a SIGMA2 listing, any

of the phase sets produced by FASTAN or any of the E-maps. the 'pictures'

output by SEARCH together with their interpretations will be found satisfactory

for the elucidation of most structures. Several E-maps may be produced in a

single job by running EXFFT and SEARCH for each map required using default

values each time. The E-maps will be computed in order of the combined figure

of merit. Further E-maps can be obtained in subsequent jobs again by using

default values for EXFFT and SEARCH with the same file 10.

If only a partial structure appears in an E-map, it may be used to compute

starting phases for a rerun of the tangent formula. This is done by NORMAL

which prepares the necessary input file to MULTAN. Alternatively, if the

partial structure is in the wrong position in the cell, it can be used to

estimate the phase and variance of SIGMA2 relationships in a redetermination of

the structure. Such oriented groups of atoms can be input to NORMAL or MULTAN

as already described.

Note that if only a small number of atoms are missing from the E-map, It

may be better to find them using a weighted fourier synthesis than to use the

tangent formula recycling procedure. NORMAL can be used to prepare the input

needed by EXFFT to produce the fourier synthesis.

1 page 18

****************************

* Flow diagram of MULTAN78 *

****************************

********

standard data file (15)---><--*NORMAL*---<---input file or card reader (5)

(binary) ********

i NORMAL requires one binary

i scratch file (8)

i

transfer file (9) v

(formatted) i

i

MULTAN requires one binary ********

scratch file (11) *MULTAN*---<---input file or card reader (5)

********

i

binary phases file (10) v

i

EXFFT requires one binary *******

scratch file (8) *EXFFT*----<---input file or card reader (5)

*******

i

fourier map file (13) v

(binary) i

i

********

*SEARCH*---<---input file or card reader (5)

********

i

peak coordinate file (14) v

(formatted) i

All programmes use channel 6 for lineprinter output

Numbers in parentheses are the logical numbers by which the files are accessed.

Karle recycling

---------------

NORMAL produces a file on channel 12 (formatted) which should be used as input

to MULTAN on channel 5.

Weighted fourier

----------------

NORMAL produces a file on channel 10 (binary) for input to EXFFT. MULTAN may

be bypassed altogether or supplied with a file on channel 5 with ipath = 8.

1 page 19

***********************************

* Computer dependence of MULTAN78 *

***********************************

All programmes in the MULTAN system have been written in a fairly standard

FORTRAN IV in the hope that they will be largely computer independent.

Unfortunately, it is not possible to make them completely machine independent

and the authors will welcome information from users who have any difficulties

implementing the system on their own machine.

The programmes will work on machines with 24-bit words, but this imposes

limitations due to packing. Only two cards have to be changed in this case as

shown below. On 24-bit word machines, MULTAN will handle a maximum of 417

reflexions and this restriction is removed completely if the word size is

larger.

Serial number

MU005100 length of integer word (use 32 unless it is smaller)

EX002900 as above

SE006300 number of lines per inch on the line printer

1 page 20

***************************

* On changing array sizes *

***************************

To enable users to alter storage space for reflexions and phase

relationships, all relevant array sizes are defined by programme parameters at

the beginning of MULTAN:

Parameter Present Function Dimension of arrays Constraints

value

KUSER1 5000 number of SIGMA2 none = KUSER4 / 3 and

relationships mod(KUSER1,KUSER3)=0

KUSER2 3000 hemisphere of (IC,IOC,IOR) 3*KUSER2.le.KUSER4-1000

reflexions

KUSER3 500 number of (MKG,MKANG,PALF,IZ) none

reflexions IH,E,IPHAZ,WT,ALPHA, (but see KUSER5)

IORDE

dimension of LIM = KUSER3+1

dimension of KDSTOR,MKSTOR,

*************************

The programme NORMAL converts F(obs) values to E(obs) according to the

formula

E(obs)**2 = F(obs)**2 / (i)

where (i) is the expected intensity of that particular reflexion.

For randomly positioned atoms in the unit cell, (i) is given by Wilson's

statistics as

(i) = sum over i (F(i)**2) * (Debye-Waller factor))

where F(i) is the scattering factor of the ith atom.

This formula is modified slightly for particular reflexions by space group

symmetry elements as indicated by Wilson. For example, in P2 (i) the 0k0

reflexions are expected to have twice the average intensity of other

reflexions because of the two-fold screw axis. The above formula is then

modified to

(i) = 2.0 * sum over i (F(i)**2) * (Debye-Waller factor))

for these reflexions. NORMAL automatically uses the correct formula for the

space group.

If something is known about the stereochemistry of the structure, the

assumption of randomly distributed atoms need not be made in calculating(i).

instead, it can be assumed there are randomly distributed groups of atoms in

the unit cell. In this case, F(i) in the formula for (i) is replaced by G(i),

the spherically averaged scattering factor of the ith group. G(i) is

calculated from the Debye scattering formula

G(i)**2 = sum over j & k (F(j) * F(k) * sin(t*rjk) / (t * Rjk))

where t = 4 * pi * sin(THETA) / lambda

and Rjk = distance between the jth and kth atoms (in the ith group)

It is recommended that as much stereochemical knowledge as possible is

used in this way, the larger the atomic groups the better. Ideally, the

complete molecule should be included as a single group, but often the whole of

its stereochemistry is not known. However, the geometry of fragments of the

molecule may well be available and these fragments can be included as separate

groups in the data for NORMAL. As the group scattering factor is spherically

averaged, the orientation of the group as it appears in the data for NORMAL is

arbitrary.

Use of molecular scattering factors in the calculation of E-values is

especially valuable for planar molecules consisting of fused six-membered rings

These structures can be awkward to solve and the E-map may reveal a hexagonal

net rather than a picture of a discrete molecule. Making use of the known

stereochemistry of such molecules in the calculation of the E's has the effect

of reducing the height of the spurious peaks and sometimes completely removing

the central peak in each ring.

The Wilson plot is printed out with the symbols wwww following a smooth

curve through the points. The symbols dddd follow a smooth curve through the

'Debye' points, i.e. those obtained using molecular rather than atomic

scattering factors. If no atomic groups have been input to NORMAL, the two

curves wwww and dddd will coincide. If the correct stereochemistry has been

input, the Debye curve will be closer to a straight line than the Wilson plot.

if the complete stereochemistry has been input, the Debye curve should be a

straight line within experimental error.

1 page 22

The user may choose one of two methods of scaling the F's to obtain E's.

The programme always calculates the least squares straight line through the

debye points to obtain the Debye-Waller factor and the scale factor needed to

put (i) and F(obs)**2 on the same scale. The least squares line may also be

used to estimate (i) from the molecular scattering factors as a function of

sin(theta) in the calculation of E's. This is the recommended method of

scaling when the Debye curve does not deviate wildly from a straight line,

which ought to be the case if a lot of stereochemical information has been

used. If no stereochemistry is input to NORMAL, the Debye curve becomes

identical to the Wilson plot. In the case where the Debye curve is obviously

not a straight line, the K-curve may be used for scaling. In this method, a

non-analytical curve is fitted to the Debye points so that it goes exactly

through every point. The K-curve, instead of the least squares straight line,

is then used to estimate (i) in the calculation of E's. The K-curve is

extrapolated by drawing a straight line, parallel to the least squares straight line,

through the last Debye point at each end of the plot. Even if the

K-curve is used for scaling, the programme will be helped by the input of

stereochemical information provided this makes the Debye curve closer to a

straight line than the Wilson plot. The success (or otherwise) of the

structure determination depends to a large extent upon the E's which are

produced here and a little care and intelligent thought in their calculation

is always handsomely repaid.

NORMAL includes a facility for applying a separate scale factor to

reflexions in different index groups. The groups used depend upon the space

group symmetry and are assigned automatically by the programme. See

'preparation of data for NORMAL' for more details of the index groups used.

The scale factors are computed to make the average value of E**2 equal to unity

for each group separately. From practical experience, the use of this facility

is always recommended unless there is good reason for not using it. Very

often, the expected value of the intensity of a reflexion, (i), is a function

of index, e.g. whether h, k or l is even or odd, as well as of sin(theta),

hence this facility. In addition, the convergence procedure used to find a

starting set of reflexions for the tangent formula is greatly assisted by

having the same average E**2 in all major reflexion groups.

The lineprinter output of NORMAL includes intensity distributions of

reflexions in principal zones as well as the distribution for general

reflexions. This is not only a check on the scaling of the E's but also a

valuable aid to space group determination if the symmetry is not completely

determined by systematic absences. For normal to produce correct statistics,

all unique reflexions to a maximum value of sin(theta) should be supplied,

leaving out systematic absences.

It is always wise to check the distribution of average E**2 as a function

of sin(theta). If this distribution indicates that the E's at high angles are

generally too strong, i.e. average E**2 much greater than unity, this can give

trouble during phase determination in MULTAN. It will be better to rescale

the E's using a lower temperature factor.

The file written by NORMAL for input to MULTAN contains much of the input

data including symmetry information and unit cell dimensions and contents.

The strongest E's are also written on the file, the exact number being chosen by

the user. In general, the number of reflexions used need not exceed 10 times

the number of atoms to be located and often 7 times the number of atoms is

quite sufficient. In addition, a number of the weakest E's are output

(usually 50, but can be changed by the user) for use in the PSI(ZERO) figure of

merit in MULTAN.

1 page 23

*************************

* Description of SIGMA2 *

*************************

The SIGMA2 section of MULTAN sets up the SIGMA2 phase relationships which

will be used later by the tangent formula. The data required to do this

consist of space group information and a number of strong reflexions either

input from cards or from the data file written by NORMAL. The number of

reflexions used need not exceed 10 times the number of atoms to be located in

the crystal structure and 7 times the number of atoms will normally be

sufficient. The reflexions may be input in any order.

All the data are listed immediately after input. It will be found that

each reflexion is referred to by a code number which is listed along with its

indices and, in much of the subsequent output, only the code number appears in

order to cut down the volume of paper generated by the programme. In addition,

the indices are 'standardised' so that all reflexions will lie in the same part

of the reciprocal lattice and it is the phase of the 'standardised' reflexion

which is eventually computed by the tangent formula. SIGMA2 then sets up all

phase relationships of the form

phi(-h) + phi(k) + phi(h-k) = q(h,k)

where q(h,k) is the estimated phase of the SIGMA2 relationship with a weight

ka(h,k) given by

ka(h,k) = 2 * s3 * s2**(-1.5) * E(h) * E(k) * E(h-k)

where sn = sum over j (z(j)**n)

the quantity q(h,k) will be zero unless groups of atoms of known orientation

or position have been input either to NORMAL or MULTAN. In this case, q(h,k) is

estimated as described by Main (1976, crystallographic computing techniques,

ed. F. R. Ahmed, pp. 97 - 105, munksgaard) and the value of ka(h,k) is also

modified.

In order to save storage space and computer time, only the strongest

relationships are retained and used by the tangent formula, the actual number

being specified by the user. Normally, the number of relationships used should

be about 8 times the number of reflexions. Fewer relationships than this may

result in failure to solve some structures, while more tends to use computer

time unnecessarily. Difficult structures will need more relationships than 8

times the number of reflexions. Users should note that this may require

expanding the arrays iee and iph throughout MULTAN where the relationships are

stored. The programming has been arranged to make such changes of array size

as easy as possible as explained in the section 'on changing array sizes'.

A SIGMA2 listing is usually unnecessary but, if it has been requested,

the programme outputs all the phase relationships it will eventually use in the

tangent formula. The relationships common to any one reflexion are listed

under the heading of that reflexion, in order of magnitude of ka(h,k), so each

relationship will appear three times in the listing, once under each of the

three contributing reflexions. All reflexions are referred to by their code

numbers, which are printed as negative if the Friedel opposite is to be used.

In addition, upon transforming a reflexion to its standard form, a phase shift

may occur and this is output in units of pi/12. Thus, the entry in the SIGMA2

listing of 75 -80 12 10.79 under the heading of reflexion code number 6

signifies the relationship

phi(6) = phi(75) - phi(80) + pi

with a value of ka(h,k) of 10.79. the reflexions are in their standardised

form and phi(6) means the phase of the reflexion with code number 6.

1 page 24

The PSI(ZERO) reflexions to be input to SIGMA2 are always present on the

data file written by NORMAL. If this file is not used for input to MULTAN, The

PSI(ZERO) reflexions may also be input from cards. These reflexions are used

for the PSI(ZERO) figure of merit and are optional, but the use of this figure

of merit is highly recommended. Each card should contain h,k,l and E values

for very weak or absent reflexions (not systematic absences, however). Any

number of reflexions may be input up to a maximum of the number of strong

reflexions previously input, but about 50 reflexions should give a sensitive

figure of merit. The SIGMA2 routine is then used to set up the right-hand-side

of Sayre's equation for these reflexions using only the high E's previously

input. This information is passed on to FASTAN where the figure of merit is

actually computed.

1 page 25

***************************

* Description of CONVERGE *

***************************

The CONVERGE section of the programme uses the space group information

previously input to determine which reflexions are structure seminvariants,

how many reflexions are needed to define the origin uniquely and what phase

restrictions are imposed by the crystal symmetry. This latter information is

transmitted to FASTAN and phase restrictions are automatically applied during

phase determination. The information on structure seminvariants and phase

restrictions is used to apply the SIGMA1 formula as described later. CONVERGE

also determines the 'best' reflexions for origin and enantiomorph definition

and finds a small number of other reflexions which appear to provide a good

starting point for phase determination.

The SIGMA1 formula in CONVERGE is obtained as a special case of the SIGMA2

relationship in which two of the reflexions are the same. After

'standardisation' of the indices, such a relationship may be written as

phi(i) +- phi(j) +- phi(j) - delta = 0

where delta is a phase shift caused by translational space group symmetry. If

phi(i) is the phase of a structure seminvariant and is restricted in value,

this relationship may be used to estimate it. For example, if the signs of the

two phi(j)'s are different, we clearly have the indication

phi(i) = delta

while if the signs are the same we have

phi(i) = -+ 2 * phi(j) + delta

the value of 2 * phi(j) will be known if phi(j) is restricted in value and once

more we have an indication of the value of phi(i). These phase indications for

phi(i) are added vectorially, each with a weight of E(j)**2 - 1.0, and the

resultant vector is projected on to the direction of the possible phase values

for phi(i). If the length of this projection is s1, the probability that it

indicates the correct phase of phi(i) is given by

p = 0.5 + 0.5 * tanh(e(i) * s1 / (2.0 * sqrt(n)))

where n is the number of atoms in the unit cell.

The indicated phase of each reflexion to which the SIGMA1 formula has been

applied is output together with its associated probability. The number of

reflexions contributing to the calculation of the probability is also output

in the columns headed 'nc'. Those reflexions whose probabilities exceed a

threshold set by the user are included in the starting point for the tangent

formula where they are given an initial weight of 2.0 * p - 1.0. Because of

this weighting, The threshold probability may be set as low as 0.90 or 0.85.

however, The SIGMA1 formula must always be used with care and if there are any

doubts at all about the scaling of the E's, this threshold should be set quite

high. A particularly bad situation for SIGMA1 is when the average intensity of

certain classes of reflexions is very much greater than the average intensity

elsewhere. Under these circumstances, a phase may be indicated with a

probability of unity and still be incorrect. In general, a probability value

will be more reliable the larger the number of contributing reflexions.

The next operation of the programme CONVERGE is to find the 'best'

reflexions to use for origin definition and to look for several other

reflexions which appear to provide a good starting point for phase

determination. The exact number and type of these additional reflexions is

set by the user. A good starting point will be one which uses the strongest phase

relationships and leads quickly to multiple indications for unknown phases,

thus increasing the reliability of phase determination during the initial

crucial stages. The algorithm used to find such a starting point is described

1 page 26

by Germain, Main & Woolfson (Acta Cryst., 1970, B26, 274) and so will not be

given in detail here.

The table of estimated alpha's is output together with the indices and

E-value of each reflexion. In addition, the reflexion type and phase

restrictions are given in the columns headed 'mk'. If a phase value is

unrestricted by space group symmetry, the value of mk is 1. Any other value

of mk denotes a restricted phase, one of whose possible values is 15 * (mk - 1)

degrees, the other being 15 * (mk - 1) + 180 degrees. All reflexions marked

with an asterisk (*) are structure seminvariants.

If a convergence mapping has been requested, this is printed out next.

Each reflexion is listed as it is eliminated in the convergence procedure with

its estimated alpha computed from the reflexions remaining. Also listed are

all the relationships contributing to this estimated alpha, printed, for

example, as 1 -97 12 1.88. If the reflexion just eliminated has a code

number of 277, this relationship is phi(277) = phi(1) - phi(97) + 12*(pi/12)

with a ka(h,k) value of 1.88. As each reflexion is eliminated, those remaining

are examined to ensure they contain all the necessary reflexions for origin

definition. If not, the reflexion just eliminated is reinstated and becomes

one of the reflexions used to define the origin.

If CONVERGE can not find all the reflexions required to define the origin

uniquely, the programme will print the message, 'origin can not be properly

defined' and then stop. Apart from an unforeseen programming error, the only

reason for this can be that the necessary reflexion types are either

completely absent or they enter into no phase relationships with other

reflexions. The only recovery is to add the missing classes of reflexions to

the data set and to set up more phase relationships.

After the origin defining reflexions have been found, the other reflexions

in the starting set (requested by parameters NSPEC, NGEN and NANY) are chosen.

As the phases of these are unknown, they are assigned a number of different

values as determined by the appropriate 'magic integer' sequence (Main, 1977,

Acta Cryst., A33, 750 - 757 and 1978, Acta Cryst., A34, 31 - 38), or given

the two values allowed by space group symmetry if the phase is restricted, thus

producing a multiple starting point for the tangent formula. The sets of

phases assigned to the starting reflexions are evenly distributed over the

whole possible range of phase sets so that at least one should be close to the

correct set. The number of starting sets is given by (NSETS)*(2**NS)*(EF)

where NSETS is the number of phase sets produced by the magic integers, NS is

the number of reflexions whose phases take on just two values and EF is an

enantiomorph factor which will have one of the values 1/2, 1, 2 or 4. The

value of NSETS as a function of NGEN, the number of general phases to be

permuted in the starting set, is given in the following table along with the

r.m.s. Error of the 'best' set of starting phases:

NGEN 0 1 2 3 4 5 6 7 8 9 10

NSETS 1 4 12 20 32 50 80 128 206 332 536

RMS error 0 26 29 37 42 45 47 48 49 50 50

The value of EF depends upon how the programme decides to fix the origin and

enantiomorph for a particular space group and set of data, but it will normally

be 1/2 or 1. The values of 2 and 4 occur when general reflexions are used for

origin definition where special reflexions could have been used, as indicated

in the following table:

1 page 27

EF = 1/2 if an unknown phase is used to define the enantiomorph - the range of

permutation is restricted to 1/2 the phase circle.

= 1 if the enantiomorph can be defined by restricting the origin-defining

phases to fixed values, or

if the enantiomorph is fixed by the space group.

= 2 if the enantiomorph is defined by restricting the origin-defining phases

to two sets of values (one phase must take on two values).

= 4 if the enantiomorph is defined by restricting the origin-defining phases

to four sets of values (two phases each take on two values).

the reflexions which are eliminated last in the convergence procedure will

be the most important for phase determination and will appear at the end of

the convergence map. It will be seen that, starting from the end of the

convergence map and working upwards, each phase may be computed from the

relationships listed with it, since these relationships use only the starting

set reflexions and phases already determined. The programme therefore

remembers the order in which reflexions appear in the convergence map and

determines their phases in reverse order by the tangent formula in FASTAN. It

follows that any reflexion near the bottom of the convergence map with a zero

estimated alpha and no contributing relationships represents a weak link during

the first few steps of phase determination. The additional reflexions in the

starting set, whose phases are permuted, are chosen to minimise the number of

such weak links.

A good starting point for the tangent formula is one which not only leads

quickly to multiple phase indications, but one which makes full use of all

starting set reflexions within the first few steps of phase determination.

This is especially true of the origin defining reflexions and if they are not

all used during the determination of the first 25 unknown phases a warning

message is printed, 'MULTAN has difficulty finding good origin defining

reflexions - please look critically at the convergence results'. The usual

cause of this is that a whole class of reflexions is systematically weak and so

is completely eliminated well before the end of convergence. If one of these

reflexions is required for origin definition, the warning message will be

printed. If no action is taken by the user, MULTAN will continue with the

tangent formula and may well determine the structure successfully. However,

if a solution is not found, a better starting point may be found by renormalising

the F's so that = 1 for each class of reflexions separately, thus

enhancing the effect of the systematically weak reflexions.

The final lineprinter output from this section of the programme is a

complete listing of all reflexions chosen to start phase determination - the

SIGMA1 reflexions, origin definition and those reflexions whose phases are

permuted. As far as possible, the programme chooses reflexions with restricted

phases for origin definition. However, this criterion may be relaxed if

suitable reflexions can not be found and general reflexions may be chosen

instead. Under these circumstances, the origin defining reflexions will be

used to define the enantiomorph also and sometimes one or even two of them will

be given two possible phase values to start the phase determination. This

information is also printed out with the origin defining reflexions.

The definitions of the permuted phases in terms of the magic integer

sequence used are also printed out. Each general phase is defined by

phi(i) = m(i) * x

where m(i) is the magic integer and x is a variable which takes on different

values at equal intervals in the range 0 to 2*pi radians. The interval of x

is chosen to make the r.m.s. Difference in the phases from one set to the next

equal to the r.m.s. error in the 'best' set of phases.

1 page 28

*************************

* Description of FASTAN *

*************************

The tangent formula section of the programme generates each starting set

of phases by assigning phase values given by a 'magic integer' sequence to

those reflexions chosen by CONVERGE. By means of the tangent formula, FASTAN

then develops them into a complete set of phases using the phase relationships

output by SIGMA2. Starting phases may be input from cards if so desired.

together with each set of phases are output a number of figures of merit to

enable the programme (or the user) to decide which phases should be used in

the computation of E-maps.

Phases are determined and refined using a weighted tangent formula

(Germain, Main & Woolfson, 1971, Acta Cryst., A27, 368)

sum over k (w(k)*w(h-k)*ka(h,k)*sin(phi(k)+phi(h-k))) t(h)

tan(phi(h)) = ----------------------------------------------------- = ----

sum over k (w(k)*w(h-k)*ka(h,k)*cos(phi(k)+phi(h-k))) b(h)

where w(h) is the weight associated with the phase phi(h). Each weight is

computed from

w(h) = amin1(1.0, 0.2*aa(h))

where aa(h)**2 = t(h)**2 + b(h)**2

when groups of atoms of known orientation or position have been input, the

tangent formula is modified as described by main (1976, Crystallographic

computing techniques, ed. F. R. Ahmed, pp. 97 - 105, Munksgaard). The weights

of reflexions whose phases have been determined by the SIGMA1 formula are

initially set at 2.0 * p - 1.0, where p is the probability of the SIGMA1

determination. The weights of all other reflexions in the starting set are

unity while reflexions whose phases are undetermined are given a weight of

zero. The programme determines initial phases values for the bottom 60 phases

from the convergence map and refines them before going on to determine phases

for the complete data set. These are determined in the order indicated by

CONVERGE and also refined in the same order. All SIGMA1 phases are kept

constant and contribute to the phase determination with their initial weight

until they are determined by the tangent formula with a larger weight.

Thereafter they take on the value and weight indicated by the tangent formula.

the other reflexions in the starting set are kept constant until the tangent

formula refinement has essentially converged; they are then allowed to refine

to their final values, which takes about two further cycles.

Since the tangent formula is weighted, all reflexions are included the

whole of the time and there is no rejection criterion for a phase other than

the weight being calculated as very small or zero. The form of weighting

ensures that poorly determined phases have little effect in the determination

of other phases, while the fact that all phases are included in the phase

determination process leads to very efficient propagation of phase knowledge

throughout the data set.

Provision is made in FASTAN for the input of starting phases from cards.

This may be used to input one's own starting reflexions if those chosen by the

programme are unsatisfactory in some way. If this facility is used in a

recycling procedure in which the starting phases are given by a previously

obtained partial structure, It is advisable to make the reflexion markers

negative. This will fix the phases at their input values and ensures the new

phases determined conform to the input information. All phases are then

allowed to refine for the last two or three cycles of tangent formula

refinement. If the input phases are allowed to refine immediately, they may

change their values completely and so bear no relation to those input.

1 page 29

********************

* Figures of merit *

********************

For each set of phases determined, three figures of merit are computed

and output with the tangent formula results with the titles 'ABS FOM', 'PSI ZERO'

and 'RESID'. They are used by the programme to decide in which order to

compute the E-maps.

The 'absolute figure of merit', ABS FOM, is a measure of internal

consistency among the SIGMA2 relationships and is calculated from

ABS FOM = sum over h (aa(h) - ar(h)) / sum over h (ae(h) - ar(h))

where aa(h) is defined in the previous section, ae(h) is the estimated value of

aa(h) calculated during the convergence procedure and ar(h) is the value of

aa(h) expected from random phases, i.e.

ar(h)**2 = sum over k (ka(h,k)**2)

ABS FOM is clearly zero for random phases and unity if the sum of all the aa(h)

is equal to its expectation value. If the space group contains translational

symmetry elements, the correct set of phases should give one of the highest

values of ABS FOM. Because the tangent formula tends to maximise the internal

consistency of the phase relationships, this is the least reliable of the

figures of merit. Values of ABS FOM above unity are common. Sometimes the

correct set of phases has given ABS FOM as low as 0.7, but this is unusual.

The next figure of merit, PSI(ZERO), was first used by Cochran & Douglas

(Proc. Roy. Soc., 1957, A243, 281) and is defined as

PSI(ZERO) = sum over h (PSI(h)) / PSI(rand)

where PSI(h) = magnitude of sum over k (E(k) * E(h-k))

and PSI(rand) = sum over h (sqrt(sum over k ((magnitude of E(k)*E(h-k))**2))

The terms in the summation for PSI(h) are those for which phases are known and

the values of h in the first summation are those for which the corresponding

E-value is either very small or zero, hence the PSI(ZERO) reflexions input to

SIGMA2. Since PSI(h) is essentially the right hand side of sayre's equation

for small E(h), it should have a low value for the correct phases. The value

of PSI(ZERO) should therefore be as small as possible. Practical experience

shows this to be a powerful discriminator against incorrect sets of phases -

it is very sensitive to molecular position and is completely independent of the

tangent formula.

The third figure of merit, resid, is merely an ordinary crystallographic

residual for the equations

E(h) = s *

where < > means 'average value of' and the scale factor s is computed to

minimise resid. The reliability of this figure of merit increases with the

ratio of phase relationships to reflexions. Clearly, resid should be a minimum

for the correct set of phases.

After all sets of phases have been generated, MULTAN outputs a summary of

the figures of merit including a 'combined figure of merit', c, calculated from

af - af(min) ps(max) - ps r(max) - r

c = w1 * --------------- + w2 * --------------- + w3 * -------------

af(max)-af(min) ps(max)-ps(min) r(max)-r(min)

where af is ABS FOM, ps is PSI(ZERO) and r is resid. The weights w1, w2 and w3

given to each term have default values of unity, but may also be input as data

to FASTAN. Clearly, c has a maximum value of w1 + w2 + w3, a minimum of zero

and is a maximum for the 'best' set of phases. The combined figure of merit

has been found to be a more reliable indicator of the correct set of phases

than either of the other three figures of merit separately. Therefore, it is

1 page 30

used by EXFFT to decide on the order of computation of E-maps.

It should be noted here that when the space group contains no translational

symmetry elements, the SIGMA2 phase relationships are perfectly satisfied when

the phases are a linear function of the indices. These phases will necessarily

produce the highest possible value of ABS FOM and will be almost invariably

wrong. Therefore in space groups such as P1 or C2, the PSI(ZERO) figure of

merit will be much more reliable than ABS FOM and may be given a higher weight

in th

e calculation of the combined figure of merit.

1 page 31

****************************************

* Description of the fourier procedure *

****************************************

The remaining programmes in the MULTAN system, EXFFT and SEARCH, are

normally run consecutively to calculate and interpret E-maps and will be

described together in this section.

Using default values in the data, EXFFT will always produce the E-map with

the largest combined figure of merit which has so far not been computed.

Alternatively, the particular set of phases which is to be used in the E-map

calculation may be specified. The E-map is not normally output, but is

immediately passed on to the peak SEARCH programme SEARCH.

SEARCH looks for the highest peaks in the map and lists their coordinates

in order of peak height. The number of peaks listed is normally about 22 per

cent more than the number of atoms to be located unless the actual number is

specified by the user. SEARCH also looks for bonded sequences of peaks among

those located, taking a default value of 1.95A as the maximum bonding distance

allowed. These sequences are sorted into molecular fragments and the fragment

number is printed along with the peak coordinates. Peaks which are not within

bonding distance of any other are given a fragment number of zero. If two

fragments containing at least five peaks each are found to be within 2.8A of

each other, SEARCH assumes they are joined by a weak peak not yet found. It

then automatically increases the number of peaks to 50 per cent more than the

number of atoms and this increased number of peaks is used throughout the rest

of the programme. This automatic increase in the number of peaks is only given

if the default number of peaks has been used. Interpeak distances and angles

are calculated and then tabulated in the next section of lineprinter output.

For each fragment containing more than 8 peaks, SEARCH projects the peaks

onto the least squares plane and plots their positions on the lineprinter. Up

to two more orthogonal projections can be output if requested or if the

programme decides it is desirable. These are on the plane orthogonal to the

least and most squares planes and on the most squares plane, in that order.

If the default option is used (NPROJ = 0) SEARCH will always give the first

projection and will output the second if the fragment is cylindrical or

globular. The criterion used to output the second projection is that the sum

of the squares of the distances of all the peaks from the plane, sumdsq,

should be less than twice the value of SUMDSQ for the least squares plane. All three

projections can be output for any fragment by setting NPROJ = 3.

The simple stereochemical criteria of maximum and minimum bond lengths

and angles are applied to each fragment in an attempt to identify spurious peaks.

In order to satisfy these criteria, different peaks may be assumed to be

spurious and so a number of interpretations of the fragment are sometimes

possible. The programme searches for different interpretations in this way,

but the SEARCH is not exhaustive. Up to three interpretations which are

thought to be significantly different can be output.

If the user has input the bond sequence of the molecule, this is compared

with the bond sequence of each interpretation and corresponding atoms are

identified as far as possible. The comparison routines used are described by

Main & Hull (1978) Acta Cryst., in press. They are completely general and are

capable of matching any two molecules no matter what their connectivities. In

this version, the input molecule plays a passive role and is used only as a

check on the molecule found in the map. SEARCH is being developed further to

enable the input molecule to be used actively in the E-map interpretation.

1 page 32

It may sometimes happen that the peaks corresponding to one or two atoms

are quite small and so are not plotted. This could have the effect of

splitting the molecule into two disconnected fragments making it almost

unrecognisable in the way it is presented by SEARCH. The user should be aware

of this and, if he suspects it is happening, should request a larger number of

peaks to be plotted. This also has the effect, of course, of plotting more

spurious peaks as well as the genuine ones sought for. As a last resort, the

complete map can be output directly from EXFFT. This should be rarely

necessary, however, and is usually reserved for cases where the data are poor

and the atoms are not properly resolved.

1 page 33

***************************************

* Check-list for difficult structures *

***************************************

If MULTAN has failed to solve the structure and the data have been checked

for errors, the following points may suggest a possible course of further

action.

(1) Make sure that all available knowledge of the stereochemistry of the

structure has been used in the calculation of normalised structure factors.

The larger the atomic groups input to NORMAL for the calculation of molecular

scattering factors the better. If the Debye curve is far from being straight,

use a K-curve to normalise the structure factors.

(2) After normalisation, check the distribution of average E**2 with

sin(theta). If the E's at high angles are too large (average E**2 > 1.0) it

may be necessary to renormalise using a smaller temperature factor.

(3) The number of strong reflexions chosen for phase determination

should normally be about 7 or 8 times the number of atoms to be located.

Changing the number of reflexions (either a greater or a smaller number will

do) often makes the programme behave differently and, with luck, it will solve

the structure. The number of reflexions required per atom tends to be more

for lower symmetry space groups than for higher symmetry.

(4) The number of unique phase relationships used should NORMALly

be more than about 8 times the number of reflexions. Fewer than this may result in

failure to solve some structures. For specially difficult problems, use as

many relationships as possible. MULTAN has been written to make increases in

array sizes as easy as possible if this proves to be necessary.

(5) If there are too few phase relationships per reflexion and there is

room for more relationships in the computer, increase the number of reflexions.

this will result in more relationships being set up and will increase the

number of relationships per reflexion. If there are already too many

relationships for the computer to hold, decrease the number of reflexions to

the point where most of the relationships set up can be held in store. This

will maximise the number of relationships per reflexion.

(6) Look critically at the phases determined by the SIGMA1 formula, if

any. Normally the threshold probability for the acceptance of these phases

can be set quite low, but if there are any doubts about the scaling of the E's,

this threshold should be high. The SIGMA1 formula can not easily be applied

when the average intensity of certain groups of reflexions is very much greater

than the average intensity of the remainder. In this situation, a phase may

be indicated with a probability of unity and still be incorrect.

(7) Make sure the starting point for the tangent formula is a good one.

all the reflexions in the starting set should be used during the first few

stages of phase determination. This can be checked by looking at the bottom

of the convergence map. If a starting set reflexion (apart from SIGMA1's) is not

used at all during the early stages of phase determination, a better starting

point can sometimes be obtained by including at least one other reflexion whose

phase depends upon that of the 'late starter'. However, the remedy for a poor

starting point will depend upon the particular situation and general rules can

not be given.

1 page 34

(8) If there are 'gaps' near the bottom of the convergence map, i.e.

reflexions eliminated with a zero estimated alpha and no SIGMA2 contributors,

it may be necessary to increase the number of reflexions in the starting set.

even without gaps, an extra starting set reflexion may help, but only at the

expense of increased computing time.

(9) If the convergence map is very 'thin' near the bottom, i.e. many

phases are determined by only one or two relationships each at the very start

of phase determination, this may be a cause of failure to solve the structure.

the remedy for this is to increase the number of reflexions and use all the

relationships the programme sets up.

(10) Check to see if all the reflexions at the bottom of the convergence

map have something obvious in common, i.e. do they all have the l index even,

or h divisible by 3 or h+k even, etc. If so, then make sure that the average

value of E**2 is equal to unity for all such reflexion groups separately, by

renormalisation.

(11) Be sure to use the PSI(ZERO) figure of merit especially when the

space group contains no translational symmetry. This figure of merit is

automatically used if NORMAL has supplied the input to MULTAN, but the

PSI(ZERO) reflexions can be omitted if MULTAN is used in isolation.

(12) For a difficult structure, the combined figure of merit may not

indicate the 'correct' set of phases very reliably. Be sure to examine E-maps

further down the ranking order than is usually necessary.

(13) When certain atoms are missing from an E-map, the molecule quickly

becomes unrecognisable. It may happen that these key atoms are present as

rather weak peaks and so are not plotted by SEARCH. by plotting more peaks

(using the parameter NPC in the data for SEARCH), parts of the E-map may

become more easily interpretable.

(14) A common incorrect solution is a correctly oriented molecular

fragment misplaced with respect to the symmetry elements. Such fragments can

be used to help MULTAN determine the structure by including them in the NGP

atomic groups in the data for NORMAL and rerunning the programme system in the

usual way.

(15) When everything else has failed, the authors are usually willing

either to discuss a particular structure or to attempt a solution of the

structure themselves. As MULTAN continues to develop, the version of the

programme available to the authors may be more powerful than that available to

the general user.

End of MULTAN write-up