Correlation by lines in list

This software takes tables as input and computes the Pearson correlation coefficient between all rows in a list against all the other rows.

Manual :

The software was written in GO and compiled on linux 64 bits with 8 threads.

1- install Perl free programming language and GNU parallel.

2- unzip the software

3- copy your n tables in csv (TAB delimitated by default) files in the “data” directory.

The tables must have at least 3 rows.

- first line of the table : columns names

- first column : rows names

- separator : TAB

- decimal separator : '.'

- the software accept 'NA' and 'NaN' in the table

4- Edit the /rowlist/rowlist.txt file and write one rowname per line.

5- Edit the parallel configuration file to set the max number of CPUs = max number of tables processed in parallel.

the executable file uses 8 threads per table. So if you set nb_cpu=4 the sofware will use 4x8=32 threads.

6- execute the software by the command : perl corr_by_lines-0.8.pl

7- processed file is in the “result” directory.

Caution : If you process a very large table, the table will be too large to be opened in a spreadsheet. To open the table in R, don't forget to skip the first line using : read.csv(file="file.csv", skip=1, header=F, sep = '\t')

8- you can use the command : perl parallel_format_table-0.1.pl to obtain a correlation table with means and SD.

9- formatted table is in the table directory