]>
git.donarmstrong.com Git - samtools.git/log
Heng Li [Wed, 27 Oct 2010 04:14:43 +0000 (04:14 +0000)]
* samtools-0.1.8-20 (r778)
* speed up pileup, although I do not know how much is the improvement
Heng Li [Tue, 26 Oct 2010 21:26:04 +0000 (21:26 +0000)]
* samtools-0.1.8-19 (r777)
* integrate mpileup features to pileup: min_baseQ, capQ, prob_realn, paired-only and biased prior
Heng Li [Tue, 26 Oct 2010 19:27:46 +0000 (19:27 +0000)]
remove local realignment (probabilistic realignment is still there)
On behalf of John Marshall [Thu, 21 Oct 2010 10:52:38 +0000 (10:52 +0000)]
Add the relevant filename or region to error messages, and cause a failure
exit status where appropriate. Based on a patch provided by Marcel Martin.
Heng Li [Tue, 19 Oct 2010 23:44:31 +0000 (23:44 +0000)]
* Minor code changes. No real effect.
* change quality to 30 in toy.sam
Heng Li [Tue, 19 Oct 2010 03:40:13 +0000 (03:40 +0000)]
added another toy example
Heng Li [Thu, 14 Oct 2010 03:32:12 +0000 (03:32 +0000)]
improve the LD statistics
Heng Li [Wed, 13 Oct 2010 03:49:26 +0000 (03:49 +0000)]
* a minor fix to the -L option
* add ldstats to vcfutils.pl
Heng Li [Tue, 12 Oct 2010 19:51:57 +0000 (19:51 +0000)]
a minor change
Heng Li [Tue, 12 Oct 2010 19:49:06 +0000 (19:49 +0000)]
forget to add the key file
Heng Li [Tue, 12 Oct 2010 19:48:46 +0000 (19:48 +0000)]
* vcfutils.pl: fixed a typo in help message
* added APIs: bcf_append_info() and bcf_cpy()
* calculate adjacent LD
Heng Li [Mon, 11 Oct 2010 15:06:40 +0000 (15:06 +0000)]
added filter for samtools/bcftools genetated VCFs
Heng Li [Tue, 5 Oct 2010 18:05:18 +0000 (18:05 +0000)]
* removed a comment line in kaln.c
* vcfutils.pl fillac works when GT is not the first field
Petr Danecek [Tue, 5 Oct 2010 12:59:36 +0000 (12:59 +0000)]
Convert VCF output of "bcftools view -bgcv" to a valid VCF file
Heng Li [Sun, 3 Oct 2010 02:51:03 +0000 (02:51 +0000)]
* samtools-0.1.8-18 (r763)
* added bcftools manual page
* minor fix to mpileup and view command lines
Heng Li [Sun, 3 Oct 2010 01:46:25 +0000 (01:46 +0000)]
* vcfutils.pl qstats: calculate marginal ts/tv
* allow to call genotypes at variant sites
Heng Li [Fri, 1 Oct 2010 04:29:55 +0000 (04:29 +0000)]
I am changing the gap open probability back to 0.001. It seems that
being conservative here is a good thing...
Heng Li [Fri, 1 Oct 2010 04:11:27 +0000 (04:11 +0000)]
* samtools-0.1.8-17 (r760)
* the default gap open penalty is too small (a typo)
* added comments on hmm_realn
* Java implementation
Heng Li [Thu, 30 Sep 2010 14:12:54 +0000 (14:12 +0000)]
mark samtools-0.1.8-16 (r759)
Heng Li [Thu, 30 Sep 2010 14:12:02 +0000 (14:12 +0000)]
round to the nearest integer
Heng Li [Tue, 28 Sep 2010 21:16:43 +0000 (21:16 +0000)]
I was trying to accelerate ka_prob_glocal() as this will be the
bottleneck. After an hour, the only gain is to change division to
multiplication. OK. I will stop.
Heng Li [Tue, 28 Sep 2010 20:57:49 +0000 (20:57 +0000)]
this is interesting. multiplication is much faster than division, at least on my Mac
Heng Li [Tue, 28 Sep 2010 20:19:13 +0000 (20:19 +0000)]
minor changes
Heng Li [Tue, 28 Sep 2010 19:44:16 +0000 (19:44 +0000)]
prob_realn() seems working!
Heng Li [Tue, 28 Sep 2010 16:48:23 +0000 (16:48 +0000)]
minor
Heng Li [Tue, 28 Sep 2010 16:47:41 +0000 (16:47 +0000)]
Convert phredQ to probabilities
Heng Li [Tue, 28 Sep 2010 16:32:08 +0000 (16:32 +0000)]
Implement the glocal HMM; discard the extention HMM
Heng Li [Tue, 28 Sep 2010 04:06:11 +0000 (04:06 +0000)]
improve numerical stability
Heng Li [Tue, 28 Sep 2010 03:27:54 +0000 (03:27 +0000)]
more comments
Heng Li [Tue, 28 Sep 2010 03:17:16 +0000 (03:17 +0000)]
fixed a bug in banded DP
Heng Li [Tue, 28 Sep 2010 03:05:12 +0000 (03:05 +0000)]
* fixed that weird issue.
* the banded version is NOT working
Heng Li [Tue, 28 Sep 2010 02:57:05 +0000 (02:57 +0000)]
More comments. This version seems working, but something is a little weird...
Heng Li [Mon, 27 Sep 2010 21:21:40 +0000 (21:21 +0000)]
A little code cleanup. Now the forward and backback algorithms give
nearly identical P(x), which means both are close to the correct
forms. However, I have only tested on toy examples. Minor errors in
the implementation may not be obvious.
Heng Li [Mon, 27 Sep 2010 20:55:15 +0000 (20:55 +0000)]
...
On behalf of John Marshall [Mon, 27 Sep 2010 12:19:06 +0000 (12:19 +0000)]
Abort if merge -h's INH.SAM cannot be opened, just as we abort
if any of the IN#.BAM input files cannot be opened.
Also propagate any error indication returned by bam_merge_core()
to samtools merge's exit status.
On behalf of John Marshall [Fri, 24 Sep 2010 15:08:24 +0000 (15:08 +0000)]
Use bam_validate1() to detect garbage records in the event of a corrupt
BAI index file that causes a bam_seek() to an invalid position. At most
one record (namely, the bam_iter_read terminator) is tested per bam_fetch()
call, so the cost is insignificant in the normal case.
On behalf of John Marshall [Fri, 24 Sep 2010 15:00:19 +0000 (15:00 +0000)]
Add bam_validate1().
Heng Li [Wed, 22 Sep 2010 16:07:50 +0000 (16:07 +0000)]
* samtools-0.1.8-15 (r379)
* allow to change capQ parameter in calmd
On behalf of John Marshall [Wed, 22 Sep 2010 15:15:33 +0000 (15:15 +0000)]
When bam_read1() returns an error (return value <= -2), propagate that error
to bam_iter_read()'s own return value. Similarly, also propagate it up to
bam_fetch()'s return value. Previously bam_fetch() always returned 0, and
callers ignored its return value anyway. With this change, 0 continues to
indicate success, while <= -2 (which can be written as < 0, as -1 is never
returned) indicates corrupted input.
bam_iter_read() ought also to propagate errors returned by bam_seek().
main_samview() can now print an error message and fail when bam_fetch()
detects that a .bai index file is corrupted or otherwise does not correspond
to the .bam file it is being used with.
On behalf of John Marshall [Wed, 22 Sep 2010 14:47:42 +0000 (14:47 +0000)]
0 is a successful return value from bam_read1(). (In practice, it never
returns 0 anyway; but all the other callers treat 0 as successful.)
Heng Li [Mon, 20 Sep 2010 21:43:08 +0000 (21:43 +0000)]
* merge files region-by-region. work on small examples but more tests are needed.
Heng Li [Mon, 20 Sep 2010 20:56:24 +0000 (20:56 +0000)]
improve qstats by checking the alleles as well
Heng Li [Fri, 17 Sep 2010 22:12:13 +0000 (22:12 +0000)]
convert UCSC SNP SQL dump to VCF
Heng Li [Fri, 17 Sep 2010 17:02:11 +0000 (17:02 +0000)]
hapmap2vcf convertor
Heng Li [Fri, 17 Sep 2010 14:11:37 +0000 (14:11 +0000)]
* added comments
* VCF->BCF is not possible without knowing the sequence dictionary before hand...
Heng Li [Fri, 17 Sep 2010 13:15:53 +0000 (13:15 +0000)]
* put n_smpl to "bcf1_t" to simplify API a little
Heng Li [Fri, 17 Sep 2010 01:36:01 +0000 (01:36 +0000)]
fixed a bug in indexing
Heng Li [Thu, 16 Sep 2010 20:54:48 +0000 (20:54 +0000)]
* fixed a bug in capQ
* valgrind identifies a use of uninitialised value, but I have not fixed it.
Heng Li [Thu, 16 Sep 2010 19:03:59 +0000 (19:03 +0000)]
* fixed a bug in razip: -c will delete the input file
* copy tabix/bgzip to here
Heng Li [Thu, 16 Sep 2010 17:45:49 +0000 (17:45 +0000)]
* samtools-0.1.8-14 (r727)
* allow to change the capQ parameter at the command line
Heng Li [Thu, 16 Sep 2010 17:38:43 +0000 (17:38 +0000)]
* added varFilter to vcfutils.pl
* reimplement realn(). now it performs a local alignment
* added cap_mapQ() to cap mapping quality when there are many substitutions
Heng Li [Wed, 15 Sep 2010 04:18:31 +0000 (04:18 +0000)]
* convert BCF to QCALL input
Heng Li [Wed, 15 Sep 2010 02:41:50 +0000 (02:41 +0000)]
dynamic band width in realignment
Heng Li [Wed, 15 Sep 2010 02:05:32 +0000 (02:05 +0000)]
fixed a bug in realignment
Heng Li [Wed, 15 Sep 2010 00:54:09 +0000 (00:54 +0000)]
fixed a minor issue
Heng Li [Tue, 14 Sep 2010 23:25:10 +0000 (23:25 +0000)]
fixed a bug in realignment
Heng Li [Tue, 14 Sep 2010 23:18:24 +0000 (23:18 +0000)]
minor changes. It is BUGGY now!
Heng Li [Tue, 14 Sep 2010 20:32:33 +0000 (20:32 +0000)]
* aggressive gapped aligner is implemented in calmd.
* distinguish gap_open and gap_end_open in banded alignment
* make tview accepts alignment with heading and tailing D
On behalf of John Marshall [Tue, 14 Sep 2010 13:04:28 +0000 (13:04 +0000)]
Add svn:ignore properties for generated files that don't appear in "make all".
On behalf of John Marshall [Mon, 13 Sep 2010 12:37:53 +0000 (12:37 +0000)]
Add svn:ignore properties listing the generated files.
(Except for *.o, which we'll assume is in global-ignores.)
Heng Li [Wed, 8 Sep 2010 16:53:55 +0000 (16:53 +0000)]
* samtools-0.1.8-13 (r715)
* fixed a bug in identifying SM across files
* bcftools: estimate heterozygosity
* bcftools: allow to skip sites without reference bases
Heng Li [Fri, 3 Sep 2010 21:19:12 +0000 (21:19 +0000)]
quite a lot changes to the contrast caller, but I still feel something is missing...
Heng Li [Fri, 3 Sep 2010 04:30:48 +0000 (04:30 +0000)]
* changed 3.434 to 4.343 (typo!)
* fixed a bug in the contrast caller
* calculate heterozygosity
Heng Li [Thu, 2 Sep 2010 03:24:47 +0000 (03:24 +0000)]
SNP calling from the GL field
Heng Li [Wed, 1 Sep 2010 22:52:30 +0000 (22:52 +0000)]
fixed another problem
Heng Li [Wed, 1 Sep 2010 22:31:17 +0000 (22:31 +0000)]
* fixed bugs in parsing VCF
* parser now works with GT/GQ/DP/PL/GL
Heng Li [Wed, 1 Sep 2010 19:28:29 +0000 (19:28 +0000)]
Do not compile _BCF_QUAD by default
Heng Li [Wed, 1 Sep 2010 19:21:41 +0000 (19:21 +0000)]
Write the correct ALT and PL in the SNP calling mode.
Heng Li [Wed, 1 Sep 2010 16:50:33 +0000 (16:50 +0000)]
more commands for my own uses
Heng Li [Wed, 1 Sep 2010 13:26:10 +0000 (13:26 +0000)]
Utilities for processing VCF
Heng Li [Tue, 31 Aug 2010 20:44:57 +0000 (20:44 +0000)]
preliminary contrast variant caller
Heng Li [Tue, 31 Aug 2010 16:28:39 +0000 (16:28 +0000)]
z' and z'' can be calculated
Heng Li [Tue, 31 Aug 2010 14:20:57 +0000 (14:20 +0000)]
* rename vcfout.c as call1.c
* prepare to add two-sample comparison
Heng Li [Tue, 24 Aug 2010 19:28:16 +0000 (19:28 +0000)]
fixed a bug in calculating the t statistics
Heng Li [Tue, 24 Aug 2010 18:05:50 +0000 (18:05 +0000)]
* samtools-0.1.8-13 (r698)
* perform one-tailed t-test for baseQ, mapQ and endDist
Heng Li [Tue, 24 Aug 2010 16:30:13 +0000 (16:30 +0000)]
added regularized incomplete beta function
Heng Li [Mon, 23 Aug 2010 21:36:17 +0000 (21:36 +0000)]
change the default correlation coefficient
Heng Li [Mon, 23 Aug 2010 18:46:52 +0000 (18:46 +0000)]
print QUAL as floating numbers
Heng Li [Mon, 23 Aug 2010 18:06:07 +0000 (18:06 +0000)]
* samtools-0.1.8-12 (r692)
* group data by samples in "mpileup -g"
Heng Li [Mon, 23 Aug 2010 14:58:53 +0000 (14:58 +0000)]
remove VCF output in mpileup
Heng Li [Mon, 23 Aug 2010 14:48:20 +0000 (14:48 +0000)]
* use the revised MAQ error model for mpileup
* prepare to remove the independent model from mpileup
Heng Li [Fri, 20 Aug 2010 19:46:40 +0000 (19:46 +0000)]
added revised MAQ error model
Heng Li [Wed, 18 Aug 2010 13:55:20 +0000 (13:55 +0000)]
allow to read the prior from the error output. EM iteration is working.
Heng Li [Tue, 17 Aug 2010 16:12:20 +0000 (16:12 +0000)]
* write a little more VCF header
* concatenate BCFs
Heng Li [Tue, 17 Aug 2010 00:53:16 +0000 (00:53 +0000)]
use float for QUAL
Heng Li [Sat, 14 Aug 2010 04:11:13 +0000 (04:11 +0000)]
faster for large sample size (in principle)
Heng Li [Sat, 14 Aug 2010 03:28:31 +0000 (03:28 +0000)]
* a numerically stable method to calculate z_{jk}
* currently slower than the old method but will be important for large sample size
* in principle, we can speed up for large n, but have not tried
Heng Li [Thu, 12 Aug 2010 01:58:31 +0000 (01:58 +0000)]
fixed an issue in parsing integer
Heng Li [Mon, 9 Aug 2010 17:05:07 +0000 (17:05 +0000)]
do not print refname if file is converted from VCF
Heng Li [Mon, 9 Aug 2010 16:59:47 +0000 (16:59 +0000)]
* parse PL
* fixed a bug in parsing VCF
Heng Li [Mon, 9 Aug 2010 16:49:23 +0000 (16:49 +0000)]
* fixed a bug in kstrtok@kstring.c
* preliminary VCF parser (not parse everything for now)
* improved view interface
Heng Li [Mon, 9 Aug 2010 14:43:13 +0000 (14:43 +0000)]
* improved kstring (added kstrtok)
* removed the limit on the format string length in bcftools
* use kstrtok to parse format which fixed a bug in the old code
Heng Li [Mon, 9 Aug 2010 05:12:05 +0000 (05:12 +0000)]
help messages
Heng Li [Mon, 9 Aug 2010 04:01:52 +0000 (04:01 +0000)]
perform single-tail test for ED4
Heng Li [Mon, 9 Aug 2010 03:48:35 +0000 (03:48 +0000)]
* test depth, end distance and HWE
Heng Li [Sun, 8 Aug 2010 06:04:15 +0000 (06:04 +0000)]
reimplement incomplete gamma functions. no copy-paste
Heng Li [Sat, 7 Aug 2010 02:42:54 +0000 (02:42 +0000)]
* bcftools: add HWE (no testing for now)
* record end dist in a 2x2 table, not avg, std any more
Heng Li [Fri, 6 Aug 2010 21:30:16 +0000 (21:30 +0000)]
* Special functions: log(gamma()), erfc(), P(a,x) (incomplete gamma)
* Not using Numerical Recipe due to licensing issues
Heng Li [Fri, 6 Aug 2010 03:46:53 +0000 (03:46 +0000)]
Fisher's exact test
Heng Li [Fri, 6 Aug 2010 01:48:33 +0000 (01:48 +0000)]
* samtools-0.1.8-11 (r672)
* collect more stats for allele balance test in bcftools (not yet)