* This data was published in Kaplan et al, Nature 2008.
The above data is in a tab delimited file (.chv) where each row consists of the following columns:
Note that the genomic coordinates are 1-based.
We supply a perl script named nucleo08_chv2chr.pl that converts the .chv file to a more standard format:
In order to convert a file to the more standard format, first unzip it, then run the script on it:
The predictions are in the same tab delimited file format as the data in the "Nucleosome Measurements" table above.
The "Gene Location" files are in tab delimited format with 4 columns: Chromosome; Gene Id; Gene start; Gene end.
The "Orthogroup Mapping" files are in tab delimited format with 2 columns: Orthogroup Id; Gene Id.
We thank Ilan Wapinski and Aviv Regev for kindly supplying the orthogroup mapping files, with the corresponding genome sequence and gene location files.