created by LeeTL1220
on 2016-01-14
Notice: these workflows are out of date and should no longer be used. See the WDL scripts instead.
These workflows are not officially supported, but are used extensively, internally, for our evaluations.
These instructions assume some familiarity with running Queue.
Download Queue scala scripts
Download the following files to the same directory. This step only needs to be done once. - CaseSampleHBExomePipeline.scala
[http://www.broadinstitute.org/~lichtens/gatk4cnv/CaseSampleHBExomePipeline.scala] - CreatePoNPipeline.scala
[http://www.broadinstitute.org/~lichtens/gatk4cnv/CreatePoNPipeline.scala] - recapseg-hb-eval.jar
(Queue) [http://www.broadinstitute.org/~lichtens/gatk4cnv/recapseg-hb-eval.jar] - gatk-protected.jar
[http://www.broadinstitute.org/~lichtens/gatk4cnv/gatk-protected.jar]
Please note: These locations are temporary and will be updated in the future.
Test that invoking the help documentation works
java -jar recapseg-hb-eval.jar -S CreatePoNPipeline.scala --help
java -jar recapseg-hb-eval.jar -S CaseSampleHBExomePipeline.scala --help
Example case sample run with ACNV
Parameters to the recapseg-hb-eval.jar may need to be adjusted depending on your execution environment
```
. /broad/tools/scripts/useuse use .hdfview-2.9 use Java-1.8 use .r-3.1.3-gatk-only
CASESAMPLES=tumorbamfiles.txt CONTROLSAMPLES=normalbamfiles.txt
OUTDIR=/home/username/evals/out_case/
GATK4PJAR=gatk-protected.jar
HDFLOC=/broad/software/free/Linux/redhat6x8664/pkgs/hdfview2.9/HDFView/lib/linux/
REF=/seq/references/Homosapiensassembly19/v1/Homosapiensassembly19.fasta
TARGETS=/home/lichtens/mytargetlist.bed
PON=/home/lichtens/evals/outpon/createpon/mybloodnormals.pon
MEM=8
SNPLIST=mylistofcommonhetsites.interval_list
PD=250 OTHEROPTS=" -jobResReq virtualfree=${MEM}G -keepDups -rawcov -jobQueue gsa -pd ${PD} -noWt " OTHEROPTSACNV=" -acnv -snp ${SNPLIST} -icontrol ${CONTROLSAMPLES} "
Do not modify below this line
java -jar recapseg-hb-eval.jar -S CaseSampleHBExomePipeline.scala -mem ${MEM} -pon ${PON} -i ${CASESAMPLES} -o ${OUTDIR} -gatk4pjar ${HBJAR} -r ${REF} -L ${TARGETS} -qsub -run -logDir ${OUTDIR} -hvl ${HDFLOC} -Z ${Z} ${OTHEROPTS} ${OTHEROPTSACNV}
```
Example case sample run without ACNV
Parameters to the recapseg-hb-eval.jar may need to be adjusted depending on your execution environment
```
. /broad/tools/scripts/useuse use .hdfview-2.9 use Java-1.8 use .r-3.1.3-gatk-only
INPUTBAMS=casebam_list.txt
OUTDIR=/home/username/evals/out_case/
GATK4PJAR=gatk-protected.jar
HDFLOC=/broad/software/free/Linux/redhat6x8664/pkgs/hdfview2.9/HDFView/lib/linux/
REF=/seq/references/Homosapiensassembly19/v1/Homosapiensassembly19.fasta
TARGETS=/home/lichtens/mytargetlist.bed
PON=/home/lichtens/evals/outpon/createpon/mybloodnormals.pon
MEM=8
PD=250 OTHEROPTS=" -jobResReq virtualfree=${MEM}G -keepDups -rawcov -jobQueue gsa -pd ${PD} -noWt "
Do not modify below this line
java -jar recapseg-hb-eval.jar -S CaseSampleHBExomePipeline.scala -mem ${MEM} -pon ${PON} -i ${INPUTBAMS} -o ${OUTDIR} -hbJar ${GATK4PJAR} -r ${REF} -L ${TARGETS} -qsub -run -logDir ${OUTDIR} -hvl ${HDFLOC} ${OTHEROPTS}
```
Example create PoN run
Parameters to the Queue jar (recapseg-hb-eval.jar) may need to be adjusted depending on your execution environment
```
. /broad/tools/scripts/useuse use .hdfview-2.9 use Java-1.8 use .r-3.1.3-gatk-only
INPUTBAMS=bloodnormalsbamlist.txt
OUTDIR=/home/lichtens/evals/out_pon/
GATK4PJAR=gatk-protected.jar
HDFLOC=/broad/software/free/Linux/redhat6x8664/pkgs/hdfview2.9/HDFView/lib/linux/
REF=/seq/references/Homosapiensassembly19/v1/Homosapiensassembly19.fasta
PONFILENAME=myblood_normals.pon
TARGETS=/home/lichtens/mytargetlist.bed
MEM=6
MEM_PON=14
CORES=4
OTHEROPTS=" -jobResReq 'virtualfree=${MEM}G' -keepDups -rawcov -jobQueue gsa -pd 250 -sparkMaster 'local[${CORES}]' "
Do not modify below this line
java -jar recapseg-hb-eval.jar -S CreatePoNPipeline.scala -pon ${PON} -i ${IN} -o ${OUTDIR} -gatk4pjar ${HBJAR} -r ${REF} -L ${TARGETS} -qsub -run -logDir ${OUTDIR} -hvl ${HDFLOC} -mem ${MEM} -mempon ${MEMPON} ${OTHER_OPTS}
```
Updated on 2017-12-29
From LeeTL1220 on 2016-01-20
To run the jobs on a local machine, remove “-qsub” and “-jobQueue gsa”. Add “ -maxConcurrentRun 10 -jobRunner ParallelShell “, where 10 is the number of processes to use.
From Geraldine_VdAuwera on 2017-12-29
Notice: these workflows are out of date and should no longer be used. See the WDL scripts instead.