Merging depth output from multiple .bam files can be difficult since Samtools only outputs depth counts for coordinates with non-zero coverage. If you want to merge depth output from these .bam files you first need to fill in the base pair positions of no coverage with zero values so the depth output for all .bam files is the same length. Then using a simple UNIX cat you can merge multiple .bam file depth output into one file for comparison and analysis.
Here is a simple Python script to zero fill Samtools depth output:
And below is an example of how to use it:
A repository of programs, scripts, and tips essential to
genetic epidemiology, statistical genetics, and bioinformatics
Welcome to the Genome Toolbox! I am glad you navigated to the blog and hope you find the contents useful and insightful for your genomic needs. If you find any of the entries particularly helpful, be sure to click the +1 button on the bottom of the post and share with your colleagues. Your input is encouraged, so if you have comments or are aware of more efficient tools not included in a post, I would love to hear from you. Enjoy your time browsing through the Toolbox.
Showing posts with label depth. Show all posts
Showing posts with label depth. Show all posts
Thursday, February 20, 2014
Thursday, December 12, 2013
Generate Coverage Plot in R from Depth Data
Here is a simple way to take coverage data (coverage.depth) and make a plot in R to visualize it. All you need is a column with base pair coordinates (V1) and a column with respective depth (V2) and you are all set to plug it in the below R code.
Sum Overlapping Base Pairs of Features from Chromosomal BED File
I had a .bed file of genomic features on a chromosome that I wanted to figure out the extent of overlap of the features to investigate commonly covered genes as well as positions where features were likely to form. I wanted to generate a plot similar to a coverage depth plot from next-generation sequencing reads. I am sure more efficient methods exist, but here is some Python code that takes in a .bed file of features (features.bed) and creates an output file (features.depth) with the feature overlap "depth" every 5,000 base pairs across the areas which contain features in your chromosomal .bed file.
Subscribe to:
Posts (Atom)