Readme
<Readme>
Readme for xeasy format 2D peak list handling and 2D VNMR data conversions.
consider_correl
Label xeasy format 2D correlation assignments file with names in place of xeasy atom
numbers and deviations from the shifts given in the xeasy format atoms list (*.prot).
Usage: consider_correl xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o consider_correl -O consider_correl.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks), and writes a tab delimited file of 7 columns to standard output.
The seven columns are peak number, dimension 0 shift in ppm, deviation from shifts
file shift for the dimension 0 assignment, dimension 1 shift in ppm, deviation from
shifts file shift for the dimension 1 assignment, peak intensity measurement, and
assignment. Only those peaks in the original file for which assignments are given
are listed in the output. The assignments are given with each residue number
preceding its amino acid residue name. Because all correlations are presumed to
be intra-residue, the residue number and name is only given once. If inter-residue
correlations are found in the file an error message is sent to standard error, and
the program terminates. See label_correl_devs for a similar routine that gives
deviations from the average shift for an assignment in the peaks file. For best
viewing, the tab delimited text file should be read in a spread sheet application.
Example lines from labeled file:
Assignments with Deviations from Shift List Entries
peak id. Dim 0 (ppm) Dev. 0 Dim 1 (ppm) Dev. 1 Amplitude Assignment
3 8.097 0.002 4.584 -0.005 1.02e+08 17TRP:HN-HA
7 8.018 -0.022 4.538 0 6.29e+07 2SER:HN-HA
14 8.233 0.019 4.372 0.021 8.985e+06 18SER:HN-HA
17 7.922 -0.014 4.377 0.035 8.829e+06 12ALA:HN-HA
21 8.217 -0.006 4.343 0.005 1.303e+08 10SER:HN-HA
consider_nOe
Usage: consider_nOe xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o consider_nOe -O consider_nOe.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks), and writes a tab delimited file of 7 columns to standard output.
The seven columns are peak number, dimension 0 shift in ppm, deviation from shifts
file shift for the dimension 0 assignment, dimension 1 shift in ppm, deviation from
shifts file shift for the dimension 1 assignment, peak intensity measurement, and
assignment. Only those peaks in the original file for which assignments are given
are listed in the output. The assignments are given with each residue number
preceding its amino acid residue name. To allow for inter-residue correlations,
each assignment lists two residue number and name pairs. See label_nOe_devs for a
similar routine that gives deviations from the average shift for an assignment in
the peaks file. For best viewing, the tab delimited text file should be read in a
spread sheet application.
Example lines from labeled file:
Assignments with Deviations from Shift List Entries
peak id. Dim 0 (ppm) Dev. 0 Dim 1 (ppm) Dev. 1 Amplitude Assignment
112 8.224 0.01 4.584 -0.005 7.054e+06 18SER:HN-17TRP:HA
113 8.21 -0.013 4.584 0.024 6.939e+06 10SER:HN-9PHE:HA
114 8.104 0.002 4.581 -0.01 2.003e+06 19PHE:HN-19PHE:HA
117 8.01 -0.066 4.538 0.279 1.536e+06 22SER:HN-22SER:HA
120 8.025 0.027 4.536 0.277 1.693e+06 23ARG:HN-22SER:HA
examine
Analyze an assigned xeasy format 2D peak list and generate a list of average shifts with
RMS deviations for each of the resonances.
Usage: examine xeasyfile.seq xeasyfile.prot assignfile.peaks > outfile
Installation: Compile source with cc -o examine -O examine.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks) and writes a tab delimited file with 3 columns for resonance name,
average shift, and RMS deviation to standard output. Resonance names are given
with the residue number preceding the amino acid residue name. For resonances
for which data are not available in the .peaks file, a line with only the resonance
name is printed. For best viewing, the tab delimited text file should be read in
a spread sheet application.
Example lines from file:
atom avg. shift RMS dev.
1VAL:N
1VAL:HN
1VAL:CA
1VAL:HA
1VAL:CB
1VAL:HB 1.35233 0.00124722
1VAL:QG1 0.695 0
1VAL:QG2 0.7415 0.0045
Perl script to extract VNMR pks columns in the given order to a tab delimited file.
usage: extract_vnmr.pl col1 col2 col3 ... < infile > outfile
Reads from standard input and extracts columns from the first line of each entry
in a VNMR pks format file (type generated by the VNMR command, ll2d). Writes on
standard output a tab delimited file consisting of the column entries in the order
listed. In order, the columns read are peak number, F1 shift in ppm, F2 shift in
ppm, amplitude, and volume.
Example of use: Generate a four column file of peak number, direct dimension shift
in ppm, indirect dimension shift in ppm, and peak volume:
extract_vnmr.pl 1 3 2 5 < varianpeaks.pks > peakcolumns.txt
label_correl
Label xeasy format 2D correlation assignments file with names in place of xeasy atom
numbers.
Usage: label_correl xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o label_correl -O label_correl.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks) from standard input and writes a tab delimited file of 5 columns
to standard output. The five columns are peak number, dimension 0 shift in ppm,
dimension 1 shift in ppm, peak intensity measurement, and assignment. Only those
peaks in the original file for which assignments are given are listed in the
output. The assignments are given with the residue number preceding the amino
acid residue name. Because all correlations are presumed to be intra-residue,
the residue number and name is only given once. If inter-residue correlations
are found in the file an error message is sent to standard error, and the
program terminates. For best viewing, the tab delimited text file should be read
in a spread sheet application.
Example lines from labeled file:
peak id. Dim 0 (ppm) Dim 1 (ppm) Amplitude Assignment
3 8.489 4.747 0.003936 2SER:HN-HB2
10 8.476 4.508 0.006785 2SER:HN-HB3
24 8.12 4.993 0.01321 22SER:HN-HB3
75 8.02 4.622 0.0567 9PHE:HN-HA
label_correl_devs
Label xeasy format 2D correlation assignments file with names in place of xeasy atom
numbers and deviations from the average shifts for each assigned resonance in the file.
Usage: label_correl_devs xeasyfile.seq xeasyfile.prot assignfile.peaks > outfile
Installation: Compile source with cc -o label_correl_devs -O label_correl_devs.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks), and writes a tab delimited file of 7 columns to standard output.
The seven columns are peak number, dimension 0 shift in ppm, deviation from average
shift for the dimension 0 assignment, dimension 1 shift in ppm, deviation from
average shift for the dimension 1 assignment, peak intensity measurement, and
assignment. Only those peaks in the original file for which assignments are given
are listed in the output. The assignments are given with each residue number
preceding its amino acid residue name. Because all correlations are presumed to
be intra-residue, the residue number and name is only given once. If inter-residue
correlations are found in the file an error message is sent to standard error, and
the program terminates. See consider_correl for a similar routine that gives
deviations from shifts given in the shifts file. For best viewing, the tab
delimited text file should be read in a spread sheet application.
Example lines from labeled file:
Assignments with Deviations from Average Shift of Each Assignment
peak id. Dim 0 (ppm) Dev. 0 Dim 1 (ppm) Dev. 1 Amplitude Assignment
3 8.097 0.0015 4.584 0.0035 1.02e+08 17TRP:HN-HA
7 8.018 0.0005 4.538 -0.0054 6.29e+07 2SER:HN-HA
14 8.233 -0.008 4.372 -0.00166667 8.985e+06 18SER:HN-HA
17 7.922 -0.0075 4.377 0.00525 8.829e+06 12ALA:HN-HA
21 8.217 0.0015 4.343 0.000333333 1.303e+08 10SER:HN-HA
29 8.037 0.0005 4.312 0.00225 1.133e+08 14SER:HN-HA
32 7.764 0.002 4.308 -0.0045 8.896e+07 20ALA:HN-HA
label_nOe
Label xeasy format 2D nOe assignments file with names in place of xeasy atom numbers.
Usage: label_nOe xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o label_nOe -O label_nOe.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks) from standard input and writes a tab delimited file of 5 columns
to standard output. The five columns are peak number, dimension 0 shift in ppm,
dimension 1 shift in ppm, peak intensity measurement, and assignment. Only those
peaks in the original file for which assignments are given are listed in the
output. The assignments are given with each residue number preceding its amino
acid residue name. To allow for inter-residue correlations, each assignment
lists two residue number and name pairs. For best viewing, the tab delimited text
file should be read in a spread sheet application.
Example lines from labeled file:
peak id. Dim 0 (ppm) Dim 1 (ppm) Amplitude Assignment
15 8.031 4.656 0.001687 9PHE:HN-9PHE:HA
19 8.085 4.638 0.001396 10SER:HN-9PHE:HA
24 8.208 4.533 0.02944 14SER:HN-14SER:HB3
31 7.644 4.532 0.002276 15ILE:HN-14SER:HB3
label_nOe_devs
Usage: label_nOe_devs xeasyfile.seq xeasyfile.prot assignfile.peaks > outfile
Installation: Compile source with cc -o label_nOe_devs -O label_nOe_devs.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks), and writes a tab delimited file of 7 columns to standard output.
The seven columns are peak number, dimension 0 shift in ppm, deviation from average
shift for the dimension 0 assignment, dimension 1 shift in ppm, deviation from
average shift for the dimension 1 assignment, peak intensity measurement, and
assignment. Only those peaks in the original file for which assignments are given
are listed in the output. The assignments are given with each residue number
preceding its amino acid residue name. To allow for inter-residue correlations,
each assignment lists two residue number and name pairs. See consider_nOe for a
similar routine that gives deviations from shifts given in the shifts file. For
best viewing, the tab delimited text file should be read in a spread sheet
application.
Example lines from labeled file:
Assignments with Deviations from Average Shift of Each Assignment
peak id. Dim 0 (ppm) Dev. 0 Dim 1 (ppm) Dev. 1 Amplitude Assignment
112 8.224 0.00693333 4.584 -0.0072 7.054e+06 18SER:HN-17TRP:HA
113 8.21 -0.00484615 4.584 0.005 6.939e+06 10SER:HN-9PHE:HA
114 8.104 0.00483333 4.581 -0.00333333 2.003e+06 19PHE:HN-19PHE:HA
117 8.01 0.00257143 4.538 -0.00233333 1.536e+06 22SER:HN-22SER:HA
120 8.025 0.0152308 4.536 -0.00433333 1.693e+06 23ARG:HN-22SER:HA
122 8.019 0.0115714 4.456 -0.000571429 6.724e+06 22SER:HN-21ARG:HA
123 7.995 -0.0156364 4.456 -0.000571429 2.669e+06 21ARG:HN-21ARG:HA
possibilities
Examine an xeasy format 2D assignments file in conjunction with an xeasy format
sequence file and an xeasy format atoms file to generate a list of assignment possibilities
for unassigned shifts.
Usage: possibilities xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o possibilities -O possibilities.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks) from standard input and writes a tab delimited file of 6 columns
to standard output. The six columns are peak number, dimension 0 shift in ppm,
dimension 1 shift in ppm, peak intensity measurement, possible assignments for
dimension 0 shift, and possible assignments for dimension 1 shift. Only those
peaks in the original file for which at least one shift is not assigned are listed
in the output. The assignments are given with each residue number preceding its
amino acid residue name, and each possibility lists a residue number and name.
For best viewing, the tab delimited text file should be read in a spread sheet
application. Use caution in interpreting the results of this procedure.
Possible assignments are determined only by a shift being within the error
range of a shift listed in the xeasy shifts file (*.prot) without any other
considerations. One application of this is to attempt to assign parts of a larger
sequence on the basis of assignments of NMR spectra of fragments of that sequence.
Example lines from output:
peak id. Dim 0 (ppm) Dim 1 (ppm) Amplitude Potential1 Potential2
1 8.112 4.62 9.225e+06 9PHE:HN
2 8.079 4.623 1.151e+07 9PHE:HN
22SER:HN
4 8.001 4.577 7.184e+06 21ARG:HN
23ARG:HN
5 8.112 4.541 8.714e+06 9PHE:HN 6SER:HA
14SER:HA
toxeasy
Read first four columns in a tab delimited file and write an xeasy format
2D peaks file.
Usage: toxeasy < infile > outfile.peaks
Installation: Compile source with cc -o toxeasy -O toxeasy.c
Reads from standard input the first four columns of each line of a file as peak
number, dimension 0 shift in ppm, dimension 1 shift in ppm, and peak intensity
measurement. Writes on standard output a 2D xeasy format file (*.peaks) with the
color set to ? and the assignments and error set to 0.
varian2dtoazara
Create correct Azara format 2D .par and .spc files from a 2D Varian (VNMR)
curpar and datdir/phasefile pair.
Usage: varian2dtoazara phasefilepath outfilebase [sfrq]
Installation: Select proper make file and compile source with make
Reads parameters from phasefilepath/curpar and writes an azara parameters file
to outfilebase.par. Reads the 2D phasefile from phasefilepath/datdir/phasefile
and writes an azara format 2D spectrum file to outfilebase.spc. The VNMR
parameter, sfrq, is used as the frequency for the directly detected dimension.
If a third parameter is given, that parameter is used as the name of the VNMR
parameter for the indirect dimension. In the absence of a third parameter, the
VNMR parameter, dfrq, is taken as the frequency of the indirect dimension.
write_peaks
Write peak list from an xeasy format 2D assignments file with names in place of
xeasy atom numbers.
Usage: write_peaks xeasyfile.seq xeasyfile.prot < assignfile.peaks > outfile
Installation: Compile source with cc -o write_peaks -O write_peaks.c
Reads the xeasy sequence file (*.seq) for amino acid residue names and the xeasy
shifts file (*.prot) for the shift name and number. Reads the assigned 2D peaks
file (*.peaks) from standard input and writes a tab delimited file of 5 columns
to standard output. The five columns are peak number, dimension 0 shift in ppm,
dimension 1 shift in ppm, peak intensity measurement, and assignment. All
peaks in the original file are listed in the output with blanks when assignments
are missing. To allow for inter-residue correlations, every assignment lists two
residue number and name pairs. For partial assignments, the unassigned portion
is indicated by question marks. Assignments are given with the residue number
preceding the amino acid residue name. For best viewing, the tab delimited text
file should be read in a spread sheet application. This program can give very
long files.
Example lines from labeled file:
peak id. Dim 0 (ppm) Dim 1 (ppm) Amplitude Assignment
1 5.129 7.784 -210300
2 4.338 8.231 295600
3 4.366 8.223 -237400 10SER:HA-10SER:HN
4 4.354 8.22 -334200 18SER:HA-18SER:HN
5 4.325 8.224 247500
<Usage>
Entering any command without parameters will print out a description of usage.
<Installation>
Compile each C program in the utilities folder with cc -o name -O name.c
The perl script requires perl
The code for varian2dtoazara is in the varian2dtoazara folder.
Use the file Makefile.mac to make the program on Mac OS X.
Use the file Makefile.sol to make the program on Solaris 7 with Sun Workshop 5.
<Versions>
This is the first version, without any change history.
<Copyright>
Bruce D. Ray 2005
<License>
This software is distributed under the current CCPN license (see www.ccpn.ac.uk)