PCSS Repository

No title

Creator:

Resource type:

Contributor:

Cebrat S., Kowalczuk M., Mackiewicz P., Nowicka A., Mackiewicz D., Dudkiewicz M.

Abstract:

A protein sequence is encoded by triplets of nucleotides, each triplet corresponding to one amino acid in the protein. The triplet code of coding sequences carries some informational redundancy: some triplets are more probable than others. Analogous redundancies appear in all natural languages. Because characters in plain text occur with unequal frequencies, entire words can be predicted from their context. This is a typical problem in cryptanalysis, and it is why plain text is compressed before encryption, to reduce the redundancy of the language. Nucleotides are the natural units in which to discuss redundancy in the coding sequences of natural genomes. Mutation pressure and selection pressure are the main factors responsible for the observed redundancies. The nucleotide frequency in DNA therefore seems to be the natural information weight. We show that the probability that a nucleotide remains unmutated provides another, more efficient information weight. It has smaller redundancy, although it is correlated with the nucleotide frequency. We have determined the probability that a nucleotide remains unmutated for the particular case of the Borrelia burgdorferi genome. To examine the usefulness of the new frequencies, we applied them to the problem of packing bit strings into a channel of given capacity. In a computer experiment, we generated all possible oligomers consisting of k nucleotides and showed that if the number of bits of information carried by the oligomers does not exceed a given threshold value, equal to that calculated for the genes of the Borrelia burgdorferi genome, then the distribution of the generated oligomers resembles the one used by these genes.
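
The experiment described in the abstract can be illustrated with a minimal sketch. The abstract does not give the formula used for the information carried by an oligomer, so the code below assumes the Shannon self-information, -log2(p), summed over independent positions. The probabilities in `WEIGHTS` are hypothetical placeholders, not values from the paper, and the function names are illustrative only.

```python
import math
from itertools import product

# Hypothetical per-nucleotide probabilities (placeholders, NOT values from the paper).
# They could stand for nucleotide frequencies or for probabilities of staying unmutated.
WEIGHTS = {"A": 0.35, "T": 0.35, "G": 0.15, "C": 0.15}


def bits(oligomer, weights=WEIGHTS):
    """Information in bits, assuming -log2(p) per nucleotide and independent positions."""
    return sum(-math.log2(weights[n]) for n in oligomer)


def oligomers_within(k, threshold, weights=WEIGHTS):
    """Enumerate all 4**k oligomers and keep those carrying at most `threshold` bits."""
    return [
        "".join(p)
        for p in product("ACGT", repeat=k)
        if bits(p, weights) <= threshold
    ]


def redundancy(weights=WEIGHTS):
    """Redundancy 1 - H/H_max, where H_max = log2(4) = 2 bits per nucleotide."""
    entropy = -sum(p * math.log2(p) for p in weights.values())
    return 1 - entropy / 2
```

Under these assumptions, lowering the threshold keeps only oligomers built from the more probable nucleotides, which is one way to picture how a capacity limit could shape the oligomer distribution.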

Place of publication:

Publisher:

Date submitted:

Format:

application/pdf

Resource identifier:

oai:lib.psnc.pl:626

Language:

Subject and keywords:

protein sequence, Borrelia burgdorferi genome, oligomers, codon

Collections to which the object is assigned:

Last modified:

18 Aug 2014

Date added:

2 Jun 2014

Number of object content views:

185

All available versions of this object:

https://lib.psnc.pl/publication/800

Volume 13(1) 2007

Edition name	Date
Information Weights of Nucleotides in DNA Sequences	18 Aug 2014
