Select lines by Regex in table Julia
This software select rows in a CSV tables (TAB separator) where the first column corresponds to the list in reference files. Lists can contain strings or Regex. String search is done by default, a row is selected if the first cell match exactly the string. The first line (table header) can be included in the search like any other row or selected using a command line argument . Unlimited number of lists and tables can be processed. To remove line, use Remove Lines
Manual :
1- if not already installed, install Julia free programming language
2- unzip the software
3- copy your data csv (TAB separator) table in the “table” directory.
4- copy your reference lists in the “lists” directory, one regex or string per line.
5- command line arguments :
Arguments must be separated by a space : "-hr" will not work, use "-h -r" instead
h or -h : keep the header
r or -r : regex mode. Cells matching a regex in list are selected
To start the software :
julia main.jl # process the header as regular row and search cells matching exactly a string in the list
julia main.jl h or julia main.jl -h # keep the header and search cells matching exactly a string in the list
julia -t 10 main.jl # parallel processing with 10 threads. One table is processed by one thread
julia main.jl r or julia main.jl -r # do not keep the header and search cells matching a regex in the list
julia main.jl r h or julia main.jl -r -h # keep the header and search cells matching a regex in the list
Tables are processed in parallel if multithread is enabled in Julia using -t argument or export JULIA_NUM_THREADS=4 in .bashrc
6- processed files are in the “results” directory.
FAQ :
I want to match exaclty the cells names -> enter the exact cell names in the reference list and use the default setting : julia main.jl
I want to match the cells containing a string -> enter the string in the reference list and use the regex setting : julia main.jl -r