We have analyzed DNA sequences of known genes from 16 yeast chromosomes (Saccharomyces cerevisiae) in terms of oligonucleotides. We have noticed that the relative abundances of oligonucleotide usage in the genome follow a long-tail Lévy-like distribution. We have observed that long genes often use strongly over-represented and under-represented nucleotides, whereas it was not the case for the short genes (shorter than 300 nucleotides) under consideration. If selection on the extremely over-represented/ under-represented oligonucleotides was strong, long genes would be more affected by spontaneous mutations than short ones.
|Long-Tail Feature of Dna Words Over- And Under-Representation in Coding Sequences||2014-07-29|