X-Git-Url: https://git.donarmstrong.com/?p=rsem.git;a=blobdiff_plain;f=rsem-control-fdr;h=eb8f3454b05579568877917f0c41972a94018f6c;hp=86cfb936d6e4ff91acd7f3a3c4ead43ecf6a826c;hb=92b24279a3ecc72946e7e7c23149ad0d181f373a;hpb=80182d3120cde6e3831fc3ea99af91f20dade08b diff --git a/rsem-control-fdr b/rsem-control-fdr index 86cfb93..eb8f345 100755 --- a/rsem-control-fdr +++ b/rsem-control-fdr @@ -1,4 +1,4 @@ -#!/usr/bin/perl +#!/usr/bin/env perl use Getopt::Long; use Pod::Usage; @@ -74,7 +74,7 @@ rsem-control-fdr [options] input_file fdr_rate output_file =item B -This should be the result file generated by 'rsem-run-ebseq', which contains all genes/transcripts and their associated statistics. +This should be the main result file generated by 'rsem-run-ebseq', which contains all genes/transcripts and their associated statistics. =item B @@ -82,7 +82,7 @@ The desire false discovery rate (FDR). =item B -This file is a subset of the 'input_file'. It only contains the genes/transcripts called as differentially expressed (DE). When more than 2 conditions exist, DE is defined as not all conditions are equally expressed. +This file is a subset of the 'input_file'. It only contains the genes/transcripts called as differentially expressed (DE). When more than 2 conditions exist, DE is defined as not all conditions are equally expressed. Because statistical significance does not necessarily mean biological significance, users should also refer to the fold changes to decide which genes/transcripts are biologically significant. When more than two conditions exist, this file will not contain fold change information and users need to calculate it from 'input_file.condmeans' by themselves. =back @@ -96,7 +96,7 @@ Use hard threshold method to control FDR. If this option is set, only those gene =item B<--soft-threshold> -Use soft threshold method to control FDR. If this option is set, this program will try to report as many genes/transcripts as possible, as long as their average PPDE >= 1 - fdr_rate. (Default: off) +Use soft threshold method to control FDR. If this option is set, this program will try to report as many genes/transcripts as possible, as long as their average PPDE >= 1 - fdr_rate. This option is equivalent to use EBSeq's 'crit_fun' for FDR control. (Default: off) =item B<-h/--help> @@ -114,8 +114,4 @@ We assume that we have 'GeneMat.results' as input. We want to control FDR at 0.0 rsem-control-fdr GeneMat.results 0.05 GeneMat.de.txt -If we instead want to use soft threshold method to obtain more called genes/transcripts, we can use: - - rsem-control-fdr --soft-threshold GeneMat.results 0.05 GeneMat.de.txt - =cut