Select lines by Regex in table Julia

This software selects rows in CSV tables (TAB separator) whose first column matches an entry in the reference lists. Lists can contain strings or regexes. String search is the default: a row is selected if its first cell exactly matches a string in the list. The first line (table header) can be searched like any other row or always kept using a command line argument (-h). Any number of lists and tables can be processed. To remove lines, use Remove Lines.
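The default behaviour described above can be illustrated with a minimal sketch. It is not the code of main.jl: the function name select_exact and the sample rows are hypothetical, and the table and list are held in memory instead of being read from files.

```julia
# Hypothetical helper: keep the rows whose first TAB-separated cell
# is exactly equal to one of the reference strings.
function select_exact(rows::Vector{String}, refs::Vector{String})
    wanted = Set(refs)  # set membership avoids scanning the list for every row
    return [row for row in rows if String(first(split(row, '\t'))) in wanted]
end

rows = ["gene1\t10\t20", "gene2\t5\t7", "gene10\t1\t2"]
refs = ["gene1", "gene3"]

selected = select_exact(rows, refs)
println(selected)
```

Only the "gene1" row is kept here: "gene10" is not selected because the whole cell must equal the list entry, and "gene3" matches nothing in the table.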

Manual :

1- if not already installed, install the free Julia programming language

2- unzip the software

3- copy your CSV data tables (TAB separator) into the “table” directory.

4- copy your reference lists into the “lists” directory, one regex or string per line.

5- command line arguments :

Arguments must be separated by a space : "-hr" will not work, use "-h -r" instead

h or -h : keep the header

r or -r : regex mode. Rows whose first cell matches a regex in the list are selected

To start the software :

julia main.jl # process the header as a regular row and search cells matching exactly a string in the list

julia main.jl h or julia main.jl -h # keep the header and search cells matching exactly a string in the list

julia -t 10 main.jl # parallel processing with 10 threads. One table is processed by one thread

julia main.jl r or julia main.jl -r # do not keep the header and search cells matching a regex in the list

julia main.jl r h or julia main.jl -r -h # keep the header and search cells matching a regex in the list

Tables are processed in parallel if multithreading is enabled in Julia, either with the -t argument or by adding export JULIA_NUM_THREADS=4 to .bashrc

6- processed files are in the “results” directory.
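The flag handling and the one-table-per-thread processing described in step 5 could be sketched as follows. This is only an illustration under stated assumptions: parse_flags and select_rows are hypothetical names, the tables are in-memory vectors of lines instead of files, and the real main.jl may be organised differently.

```julia
using Base.Threads

# Hypothetical flag handling: "h"/"-h" keeps the header, "r"/"-r" enables regex mode.
# A combined flag such as "-hr" is not recognised, as stated in step 5.
function parse_flags(args::Vector{String})
    keep_header = ("h" in args) || ("-h" in args)
    regex_mode  = ("r" in args) || ("-r" in args)
    return (keep_header = keep_header, regex_mode = regex_mode)
end

# Hypothetical per-table work: keep the header if requested, then keep every
# row whose first TAB-separated cell satisfies the `matches` predicate.
function select_rows(rows::Vector{String}, matches::Function, keep_header::Bool)
    out = String[]
    for (i, row) in enumerate(rows)
        cell = String(first(split(row, '\t')))
        if (i == 1 && keep_header) || matches(cell)
            push!(out, row)
        end
    end
    return out
end

flags = parse_flags(["-h"])   # a script would pass ARGS here
refs  = ["gene1"]
tables = [["id\tvalue", "gene1\t10", "gene2\t5"],
          ["id\tvalue", "gene3\t1", "gene1\t8"]]
results = Vector{Vector{String}}(undef, length(tables))

# One table per loop iteration; iterations are spread over the threads made
# available with -t or JULIA_NUM_THREADS (a single thread otherwise).
@threads for i in eachindex(tables)
    results[i] = select_rows(tables[i], cell -> cell in refs, flags.keep_header)
end

println(flags)
println(results)
```

Each iteration writes to its own slot of results, so the threads never modify the same element and no lock is needed.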


FAQ :

I want to match the cell names exactly -> enter the exact cell names in the reference list and use the default setting : julia main.jl

I want to match the cells containing a string -> enter the string in the reference list and use the regex setting : julia main.jl -r
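The difference between the two answers above can be seen in a short sketch. It assumes that regex mode behaves like Julia's occursin (a match anywhere in the cell), which is consistent with the FAQ but not confirmed by the source; the cell names are made up.

```julia
cells = ["gene1", "gene10", "my_gene1", "gene2"]
entry = "gene1"   # one line of a reference list

# Default mode: the whole cell must equal the list entry.
exact = [c for c in cells if c == entry]

# Regex mode (assumed to behave like occursin): the entry may match anywhere in the cell.
containing = [c for c in cells if occursin(Regex(entry), c)]

# Anchors restrict a regex to the start (^) and end ($) of the cell.
anchored = [c for c in cells if occursin(r"^gene1$", c)]

println(exact)
println(containing)
println(anchored)
```

In regex mode, characters such as "." or "+" in a list entry have their regex meaning, so entries meant to be taken literally may need escaping.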