]> git.donarmstrong.com Git - samtools.git/log
samtools.git
13 years ago * samtools-0.1.8-19 (r777)
Heng Li [Tue, 26 Oct 2010 21:26:04 +0000 (21:26 +0000)]
 * samtools-0.1.8-19 (r777)
 * integrate mpileup features to pileup: min_baseQ, capQ, prob_realn, paired-only and biased prior

13 years agoremove local realignment (probabilistic realignment is still there)
Heng Li [Tue, 26 Oct 2010 19:27:46 +0000 (19:27 +0000)]
remove local realignment (probabilistic realignment is still there)

13 years agoAdd the relevant filename or region to error messages, and cause a failure
On behalf of John Marshall [Thu, 21 Oct 2010 10:52:38 +0000 (10:52 +0000)]
Add the relevant filename or region to error messages, and cause a failure
exit status where appropriate.  Based on a patch provided by Marcel Martin.

13 years ago * Minor code changes. No real effect.
Heng Li [Tue, 19 Oct 2010 23:44:31 +0000 (23:44 +0000)]
 * Minor code changes. No real effect.
 * change quality to 30 in toy.sam

13 years agoadded another toy example
Heng Li [Tue, 19 Oct 2010 03:40:13 +0000 (03:40 +0000)]
added another toy example

13 years agoimprove the LD statistics
Heng Li [Thu, 14 Oct 2010 03:32:12 +0000 (03:32 +0000)]
improve the LD statistics

13 years ago * a minor fix to the -L option
Heng Li [Wed, 13 Oct 2010 03:49:26 +0000 (03:49 +0000)]
 * a minor fix to the -L option
 * add ldstats to vcfutils.pl

13 years agoa minor change
Heng Li [Tue, 12 Oct 2010 19:51:57 +0000 (19:51 +0000)]
a minor change

13 years agoforget to add the key file
Heng Li [Tue, 12 Oct 2010 19:49:06 +0000 (19:49 +0000)]
forget to add the key file

13 years ago * vcfutils.pl: fixed a typo in help message
Heng Li [Tue, 12 Oct 2010 19:48:46 +0000 (19:48 +0000)]
 * vcfutils.pl: fixed a typo in help message
 * added APIs: bcf_append_info() and bcf_cpy()
 * calculate adjacent LD

13 years agoadded filter for samtools/bcftools genetated VCFs
Heng Li [Mon, 11 Oct 2010 15:06:40 +0000 (15:06 +0000)]
added filter for samtools/bcftools genetated VCFs

13 years ago * removed a comment line in kaln.c
Heng Li [Tue, 5 Oct 2010 18:05:18 +0000 (18:05 +0000)]
 * removed a comment line in kaln.c
 * vcfutils.pl fillac works when GT is not the first field

13 years agoConvert VCF output of "bcftools view -bgcv" to a valid VCF file
Petr Danecek [Tue, 5 Oct 2010 12:59:36 +0000 (12:59 +0000)]
Convert VCF output of "bcftools view -bgcv" to a valid VCF file

13 years ago * samtools-0.1.8-18 (r763)
Heng Li [Sun, 3 Oct 2010 02:51:03 +0000 (02:51 +0000)]
 * samtools-0.1.8-18 (r763)
 * added bcftools manual page
 * minor fix to mpileup and view command lines

13 years ago * vcfutils.pl qstats: calculate marginal ts/tv
Heng Li [Sun, 3 Oct 2010 01:46:25 +0000 (01:46 +0000)]
 * vcfutils.pl qstats: calculate marginal ts/tv
 * allow to call genotypes at variant sites

13 years agoI am changing the gap open probability back to 0.001. It seems that
Heng Li [Fri, 1 Oct 2010 04:29:55 +0000 (04:29 +0000)]
I am changing the gap open probability back to 0.001. It seems that
being conservative here is a good thing...

13 years ago * samtools-0.1.8-17 (r760)
Heng Li [Fri, 1 Oct 2010 04:11:27 +0000 (04:11 +0000)]
 * samtools-0.1.8-17 (r760)
 * the default gap open penalty is too small (a typo)
 * added comments on hmm_realn
 * Java implementation

13 years agomark samtools-0.1.8-16 (r759)
Heng Li [Thu, 30 Sep 2010 14:12:54 +0000 (14:12 +0000)]
mark samtools-0.1.8-16 (r759)

13 years agoround to the nearest integer
Heng Li [Thu, 30 Sep 2010 14:12:02 +0000 (14:12 +0000)]
round to the nearest integer

13 years agoI was trying to accelerate ka_prob_glocal() as this will be the
Heng Li [Tue, 28 Sep 2010 21:16:43 +0000 (21:16 +0000)]
I was trying to accelerate ka_prob_glocal() as this will be the
bottleneck. After an hour, the only gain is to change division to
multiplication. OK. I will stop.

13 years agothis is interesting. multiplication is much faster than division, at least on my Mac
Heng Li [Tue, 28 Sep 2010 20:57:49 +0000 (20:57 +0000)]
this is interesting. multiplication is much faster than division, at least on my Mac

13 years agominor changes
Heng Li [Tue, 28 Sep 2010 20:19:13 +0000 (20:19 +0000)]
minor changes

13 years agoprob_realn() seems working!
Heng Li [Tue, 28 Sep 2010 19:44:16 +0000 (19:44 +0000)]
prob_realn() seems working!

13 years agominor
Heng Li [Tue, 28 Sep 2010 16:48:23 +0000 (16:48 +0000)]
minor

13 years agoConvert phredQ to probabilities
Heng Li [Tue, 28 Sep 2010 16:47:41 +0000 (16:47 +0000)]
Convert phredQ to probabilities

13 years agoImplement the glocal HMM; discard the extention HMM
Heng Li [Tue, 28 Sep 2010 16:32:08 +0000 (16:32 +0000)]
Implement the glocal HMM; discard the extention HMM

13 years agoimprove numerical stability
Heng Li [Tue, 28 Sep 2010 04:06:11 +0000 (04:06 +0000)]
improve numerical stability

13 years agomore comments
Heng Li [Tue, 28 Sep 2010 03:27:54 +0000 (03:27 +0000)]
more comments

13 years agofixed a bug in banded DP
Heng Li [Tue, 28 Sep 2010 03:17:16 +0000 (03:17 +0000)]
fixed a bug in banded DP

13 years ago * fixed that weird issue.
Heng Li [Tue, 28 Sep 2010 03:05:12 +0000 (03:05 +0000)]
 * fixed that weird issue.
 * the banded version is NOT working

13 years agoMore comments. This version seems working, but something is a little weird...
Heng Li [Tue, 28 Sep 2010 02:57:05 +0000 (02:57 +0000)]
More comments. This version seems working, but something is a little weird...

13 years agoA little code cleanup. Now the forward and backback algorithms give
Heng Li [Mon, 27 Sep 2010 21:21:40 +0000 (21:21 +0000)]
A little code cleanup. Now the forward and backback algorithms give
nearly identical P(x), which means both are close to the correct
forms. However, I have only tested on toy examples. Minor errors in
the implementation may not be obvious.

13 years ago...
Heng Li [Mon, 27 Sep 2010 20:55:15 +0000 (20:55 +0000)]
...

13 years agoAbort if merge -h's INH.SAM cannot be opened, just as we abort
On behalf of John Marshall [Mon, 27 Sep 2010 12:19:06 +0000 (12:19 +0000)]
Abort if merge -h's INH.SAM cannot be opened, just as we abort
if any of the IN#.BAM input files cannot be opened.

Also propagate any error indication returned by bam_merge_core()
to samtools merge's exit status.

13 years agoUse bam_validate1() to detect garbage records in the event of a corrupt
On behalf of John Marshall [Fri, 24 Sep 2010 15:08:24 +0000 (15:08 +0000)]
Use bam_validate1() to detect garbage records in the event of a corrupt
BAI index file that causes a bam_seek() to an invalid position.  At most
one record (namely, the bam_iter_read terminator) is tested per bam_fetch()
call, so the cost is insignificant in the normal case.

13 years agoAdd bam_validate1().
On behalf of John Marshall [Fri, 24 Sep 2010 15:00:19 +0000 (15:00 +0000)]
Add bam_validate1().

13 years ago * samtools-0.1.8-15 (r379)
Heng Li [Wed, 22 Sep 2010 16:07:50 +0000 (16:07 +0000)]
 * samtools-0.1.8-15 (r379)
 * allow to change capQ parameter in calmd

13 years agoWhen bam_read1() returns an error (return value <= -2), propagate that error
On behalf of John Marshall [Wed, 22 Sep 2010 15:15:33 +0000 (15:15 +0000)]
When bam_read1() returns an error (return value <= -2), propagate that error
to bam_iter_read()'s own return value.  Similarly, also propagate it up to
bam_fetch()'s return value.  Previously bam_fetch() always returned 0, and
callers ignored its return value anyway.  With this change, 0 continues to
indicate success, while <= -2 (which can be written as < 0, as -1 is never
returned) indicates corrupted input.

bam_iter_read() ought also to propagate errors returned by bam_seek().

main_samview() can now print an error message and fail when bam_fetch()
detects that a .bai index file is corrupted or otherwise does not correspond
to the .bam file it is being used with.

13 years ago0 is a successful return value from bam_read1(). (In practice, it never
On behalf of John Marshall [Wed, 22 Sep 2010 14:47:42 +0000 (14:47 +0000)]
0 is a successful return value from bam_read1().  (In practice, it never
returns 0 anyway; but all the other callers treat 0 as successful.)

13 years ago * merge files region-by-region. work on small examples but more tests are needed.
Heng Li [Mon, 20 Sep 2010 21:43:08 +0000 (21:43 +0000)]
 * merge files region-by-region. work on small examples but more tests are needed.

13 years agoimprove qstats by checking the alleles as well
Heng Li [Mon, 20 Sep 2010 20:56:24 +0000 (20:56 +0000)]
improve qstats by checking the alleles as well

13 years agoconvert UCSC SNP SQL dump to VCF
Heng Li [Fri, 17 Sep 2010 22:12:13 +0000 (22:12 +0000)]
convert UCSC SNP SQL dump to VCF

13 years agohapmap2vcf convertor
Heng Li [Fri, 17 Sep 2010 17:02:11 +0000 (17:02 +0000)]
hapmap2vcf convertor

13 years ago * added comments
Heng Li [Fri, 17 Sep 2010 14:11:37 +0000 (14:11 +0000)]
 * added comments
 * VCF->BCF is not possible without knowing the sequence dictionary before hand...

13 years ago * put n_smpl to "bcf1_t" to simplify API a little
Heng Li [Fri, 17 Sep 2010 13:15:53 +0000 (13:15 +0000)]
 * put n_smpl to "bcf1_t" to simplify API a little

13 years agofixed a bug in indexing
Heng Li [Fri, 17 Sep 2010 01:36:01 +0000 (01:36 +0000)]
fixed a bug in indexing

13 years ago * fixed a bug in capQ
Heng Li [Thu, 16 Sep 2010 20:54:48 +0000 (20:54 +0000)]
 * fixed a bug in capQ
 * valgrind identifies a use of uninitialised value, but I have not fixed it.

13 years ago * fixed a bug in razip: -c will delete the input file
Heng Li [Thu, 16 Sep 2010 19:03:59 +0000 (19:03 +0000)]
 * fixed a bug in razip: -c will delete the input file
 * copy tabix/bgzip to here

13 years ago * samtools-0.1.8-14 (r727)
Heng Li [Thu, 16 Sep 2010 17:45:49 +0000 (17:45 +0000)]
 * samtools-0.1.8-14 (r727)
 * allow to change the capQ parameter at the command line

13 years ago * added varFilter to vcfutils.pl
Heng Li [Thu, 16 Sep 2010 17:38:43 +0000 (17:38 +0000)]
 * added varFilter to vcfutils.pl
 * reimplement realn(). now it performs a local alignment
 * added cap_mapQ() to cap mapping quality when there are many substitutions

13 years ago * convert BCF to QCALL input
Heng Li [Wed, 15 Sep 2010 04:18:31 +0000 (04:18 +0000)]
 * convert BCF to QCALL input

13 years agodynamic band width in realignment
Heng Li [Wed, 15 Sep 2010 02:41:50 +0000 (02:41 +0000)]
dynamic band width in realignment

13 years agofixed a bug in realignment
Heng Li [Wed, 15 Sep 2010 02:05:32 +0000 (02:05 +0000)]
fixed a bug in realignment

13 years agofixed a minor issue
Heng Li [Wed, 15 Sep 2010 00:54:09 +0000 (00:54 +0000)]
fixed a minor issue

13 years agofixed a bug in realignment
Heng Li [Tue, 14 Sep 2010 23:25:10 +0000 (23:25 +0000)]
fixed a bug in realignment

13 years agominor changes. It is BUGGY now!
Heng Li [Tue, 14 Sep 2010 23:18:24 +0000 (23:18 +0000)]
minor changes. It is BUGGY now!

13 years ago * aggressive gapped aligner is implemented in calmd.
Heng Li [Tue, 14 Sep 2010 20:32:33 +0000 (20:32 +0000)]
 * aggressive gapped aligner is implemented in calmd.
 * distinguish gap_open and gap_end_open in banded alignment
 * make tview accepts alignment with heading and tailing D

13 years agoAdd svn:ignore properties for generated files that don't appear in "make all".
On behalf of John Marshall [Tue, 14 Sep 2010 13:04:28 +0000 (13:04 +0000)]
Add svn:ignore properties for generated files that don't appear in "make all".

13 years agoAdd svn:ignore properties listing the generated files.
On behalf of John Marshall [Mon, 13 Sep 2010 12:37:53 +0000 (12:37 +0000)]
Add svn:ignore properties listing the generated files.
(Except for *.o, which we'll assume is in global-ignores.)

13 years ago * samtools-0.1.8-13 (r715)
Heng Li [Wed, 8 Sep 2010 16:53:55 +0000 (16:53 +0000)]
 * samtools-0.1.8-13 (r715)
 * fixed a bug in identifying SM across files
 * bcftools: estimate heterozygosity
 * bcftools: allow to skip sites without reference bases

13 years agoquite a lot changes to the contrast caller, but I still feel something is missing...
Heng Li [Fri, 3 Sep 2010 21:19:12 +0000 (21:19 +0000)]
quite a lot changes to the contrast caller, but I still feel something is missing...

13 years ago * changed 3.434 to 4.343 (typo!)
Heng Li [Fri, 3 Sep 2010 04:30:48 +0000 (04:30 +0000)]
 * changed 3.434 to 4.343 (typo!)
 * fixed a bug in the contrast caller
 * calculate heterozygosity

13 years agoSNP calling from the GL field
Heng Li [Thu, 2 Sep 2010 03:24:47 +0000 (03:24 +0000)]
SNP calling from the GL field

13 years agofixed another problem
Heng Li [Wed, 1 Sep 2010 22:52:30 +0000 (22:52 +0000)]
fixed another problem

13 years ago * fixed bugs in parsing VCF
Heng Li [Wed, 1 Sep 2010 22:31:17 +0000 (22:31 +0000)]
 * fixed bugs in parsing VCF
 * parser now works with GT/GQ/DP/PL/GL

13 years agoDo not compile _BCF_QUAD by default
Heng Li [Wed, 1 Sep 2010 19:28:29 +0000 (19:28 +0000)]
Do not compile _BCF_QUAD by default

13 years agoWrite the correct ALT and PL in the SNP calling mode.
Heng Li [Wed, 1 Sep 2010 19:21:41 +0000 (19:21 +0000)]
Write the correct ALT and PL in the SNP calling mode.

13 years agomore commands for my own uses
Heng Li [Wed, 1 Sep 2010 16:50:33 +0000 (16:50 +0000)]
more commands for my own uses

13 years agoUtilities for processing VCF
Heng Li [Wed, 1 Sep 2010 13:26:10 +0000 (13:26 +0000)]
Utilities for processing VCF

13 years agopreliminary contrast variant caller
Heng Li [Tue, 31 Aug 2010 20:44:57 +0000 (20:44 +0000)]
preliminary contrast variant caller

13 years agoz' and z'' can be calculated
Heng Li [Tue, 31 Aug 2010 16:28:39 +0000 (16:28 +0000)]
z' and z'' can be calculated

13 years ago * rename vcfout.c as call1.c
Heng Li [Tue, 31 Aug 2010 14:20:57 +0000 (14:20 +0000)]
 * rename vcfout.c as call1.c
 * prepare to add two-sample comparison

13 years agofixed a bug in calculating the t statistics
Heng Li [Tue, 24 Aug 2010 19:28:16 +0000 (19:28 +0000)]
fixed a bug in calculating the t statistics

13 years ago * samtools-0.1.8-13 (r698)
Heng Li [Tue, 24 Aug 2010 18:05:50 +0000 (18:05 +0000)]
 * samtools-0.1.8-13 (r698)
 * perform one-tailed t-test for baseQ, mapQ and endDist

13 years agoadded regularized incomplete beta function
Heng Li [Tue, 24 Aug 2010 16:30:13 +0000 (16:30 +0000)]
added regularized incomplete beta function

13 years agochange the default correlation coefficient
Heng Li [Mon, 23 Aug 2010 21:36:17 +0000 (21:36 +0000)]
change the default correlation coefficient

13 years agoprint QUAL as floating numbers
Heng Li [Mon, 23 Aug 2010 18:46:52 +0000 (18:46 +0000)]
print QUAL as floating numbers

13 years ago * samtools-0.1.8-12 (r692)
Heng Li [Mon, 23 Aug 2010 18:06:07 +0000 (18:06 +0000)]
 * samtools-0.1.8-12 (r692)
 * group data by samples in "mpileup -g"

13 years agoremove VCF output in mpileup
Heng Li [Mon, 23 Aug 2010 14:58:53 +0000 (14:58 +0000)]
remove VCF output in mpileup

13 years ago * use the revised MAQ error model for mpileup
Heng Li [Mon, 23 Aug 2010 14:48:20 +0000 (14:48 +0000)]
 * use the revised MAQ error model for mpileup
 * prepare to remove the independent model from mpileup

13 years agoadded revised MAQ error model
Heng Li [Fri, 20 Aug 2010 19:46:40 +0000 (19:46 +0000)]
added revised MAQ error model

13 years agoallow to read the prior from the error output. EM iteration is working.
Heng Li [Wed, 18 Aug 2010 13:55:20 +0000 (13:55 +0000)]
allow to read the prior from the error output. EM iteration is working.

13 years ago * write a little more VCF header
Heng Li [Tue, 17 Aug 2010 16:12:20 +0000 (16:12 +0000)]
 * write a little more VCF header
 * concatenate BCFs

13 years agouse float for QUAL
Heng Li [Tue, 17 Aug 2010 00:53:16 +0000 (00:53 +0000)]
use float for QUAL

13 years agofaster for large sample size (in principle)
Heng Li [Sat, 14 Aug 2010 04:11:13 +0000 (04:11 +0000)]
faster for large sample size (in principle)

13 years ago * a numerically stable method to calculate z_{jk}
Heng Li [Sat, 14 Aug 2010 03:28:31 +0000 (03:28 +0000)]
 * a numerically stable method to calculate z_{jk}
 * currently slower than the old method but will be important for large sample size
 * in principle, we can speed up for large n, but have not tried

13 years agofixed an issue in parsing integer
Heng Li [Thu, 12 Aug 2010 01:58:31 +0000 (01:58 +0000)]
fixed an issue in parsing integer

13 years agodo not print refname if file is converted from VCF
Heng Li [Mon, 9 Aug 2010 17:05:07 +0000 (17:05 +0000)]
do not print refname if file is converted from VCF

13 years ago * parse PL
Heng Li [Mon, 9 Aug 2010 16:59:47 +0000 (16:59 +0000)]
 * parse PL
 * fixed a bug in parsing VCF

13 years ago * fixed a bug in kstrtok@kstring.c
Heng Li [Mon, 9 Aug 2010 16:49:23 +0000 (16:49 +0000)]
 * fixed a bug in kstrtok@kstring.c
 * preliminary VCF parser (not parse everything for now)
 * improved view interface

13 years ago * improved kstring (added kstrtok)
Heng Li [Mon, 9 Aug 2010 14:43:13 +0000 (14:43 +0000)]
 * improved kstring (added kstrtok)
 * removed the limit on the format string length in bcftools
 * use kstrtok to parse format which fixed a bug in the old code

13 years agohelp messages
Heng Li [Mon, 9 Aug 2010 05:12:05 +0000 (05:12 +0000)]
help messages

13 years agoperform single-tail test for ED4
Heng Li [Mon, 9 Aug 2010 04:01:52 +0000 (04:01 +0000)]
perform single-tail test for ED4

13 years ago * test depth, end distance and HWE
Heng Li [Mon, 9 Aug 2010 03:48:35 +0000 (03:48 +0000)]
 * test depth, end distance and HWE

13 years agoreimplement incomplete gamma functions. no copy-paste
Heng Li [Sun, 8 Aug 2010 06:04:15 +0000 (06:04 +0000)]
reimplement incomplete gamma functions. no copy-paste

13 years ago * bcftools: add HWE (no testing for now)
Heng Li [Sat, 7 Aug 2010 02:42:54 +0000 (02:42 +0000)]
 * bcftools: add HWE (no testing for now)
 * record end dist in a 2x2 table, not avg, std any more

13 years ago * Special functions: log(gamma()), erfc(), P(a,x) (incomplete gamma)
Heng Li [Fri, 6 Aug 2010 21:30:16 +0000 (21:30 +0000)]
 * Special functions: log(gamma()), erfc(), P(a,x) (incomplete gamma)
 * Not using Numerical Recipe due to licensing issues

13 years agoFisher's exact test
Heng Li [Fri, 6 Aug 2010 03:46:53 +0000 (03:46 +0000)]
Fisher's exact test

13 years ago * samtools-0.1.8-11 (r672)
Heng Li [Fri, 6 Aug 2010 01:48:33 +0000 (01:48 +0000)]
 * samtools-0.1.8-11 (r672)
 * collect more stats for allele balance test in bcftools (not yet)

13 years ago * the code base is stablized again.
Heng Li [Thu, 5 Aug 2010 20:17:58 +0000 (20:17 +0000)]
 * the code base is stablized again.
 * I will delay the vcf parser, which is quite complicated but with little value for now