lociLocations.pl - A program to retrieve chromosomal locations by name
lociLocations.pl [--help] [--locusColumn=columnName] [--path=path of database] [--locilistPath=locilistPath] [--retrieveLoci=loci names] [--anonymous] [--calculateGaps] [--aliases=aliasesPath] [--source=name]
This program takes a list of markers and retrieves the physical and cytogenetic locations from a database. If called with the option --calculateGaps it calculates a histogram of gaps for a given marker set. This allows to check for the saturation of the genome by the given marker set. The loci to be retrieved either from STDIN or from a file specified by the --locilist option. This input is expected to be a tab separated file (see options --locus, --anonymous).
Program output is printed to STDOUT and is in PropertyList format. A dictionary is produced holding as keys the loci names. The values are dictionaries again containing physical (key p) and cytongenetic (key band) information. You can redirect to a file and may directly use it with the --labelMap option of coloredChromosomes.pl.
The default source for retrieval is ENSEMBL as of package version 1.4.2. You need to install the ENSEMBL API as described in http://www.ensembl.org/Docs/linked_docs/ensembl_tutorial.pdf . The source argument of lociLocation is intepreted as follows: --source host:user:db and defaults to 'ensembldb.ensembl.org:anonymous:homo_sapiens_core_29_35b'. Since database versions change frequently and old versions become unavailable you should use the most recent numbers from http://www.ensembl.org/Homo_sapiens. If, for example the web page reads 'Current Release 29.35b' than the database name should be 'homo_sapiens_core_29_35b'. This is clearly a shortcoming in the ENSEMBL database structure where a reference to the current version is lacking. Another problem is that only gene symbols are allowed right now. There is no mechanism to retrieve microsatellite markers. If either of these things do bother you please send an email to helpdesk@ensembl.org (as did I). Hopefully this gives momentum to required code changes at EMBL.
coloredChromosomes.pl(1) PropertyList(3)