Genome Toolbox: Generate Random Genomic Positions

Sunday, June 1, 2014

Generate Random Genomic Positions

Generating random genomic positions or coordinates can be useful in comparing characteristics of a set of genomic loci to that what would be expected from permutations of the underlying genomic distribution. Below is a Python script to aid in selecting random genomic positions. The script chooses a chromosome based on probabilities assigned by chromosome length and then chooses a chromosomal position from a uniform distribution of the chromosome's length. An added gap checking statement is included to ensure the chosen position lies within the accessible genome. You can choose the number of positions you want, the number of permutations to conduct, the size of the genomic positions, and the genomic build of interest. A UNIX shell script is included as a wrapper to automatically download needed chromosomal gap and cytoband files as well as run the Python script. Useage for the UNIX script can be seen by typing ./make_random.sh from the command line after giving the script executable privileges. An example command would be ./make_random 100 10 1000 hg19. This command would make 10 .bed files each with 100 random 1Kb genomic regions from the hg19 genome build. Below are the make_random.sh and make_random.py scripts.

3 comments:

UnknownJune 9, 2014 at 11:37 AM
Hi,

Thank you for the function but what about chrX and chrY?
I see that all the BED files do not include any locations from these chromosomes.

Thanks!
ReplyDelete
Replies
UnknownSeptember 19, 2014 at 5:57 PM
I tried this script and this is the error I got:

Genome Build: hg19
Fetching gap and length files...
Download complete.
Traceback (most recent call last):
File "make_random.py", line 72, in ?
def sort_coords(coords,cols=itemgetter(1,2)):
TypeError: itemgetter expected 1 arguments, got 2
Program completed.

Any idea why? Thanks.
ReplyDelete
Replies

Add comment

Pages

Sunday, June 1, 2014

Generate Random Genomic Positions

3 comments: