1

Тема: Ищем белок по нотам мелодии "Петушок погромче пой"

Формы, механизмы, энергия наномира. Сообщение 101 202

Online service PROTEIN PICOTECHNOLOGY

Заказ 2018-09-04-001:

Ищем белок по нотам мелодии "Петушок погромче пой"

Слушать песню (на латвийском): https://yadi.sk/d/JXOoo26P04Gdsw

Ноты: EGFEEGFE

Аминокислотная последовательность: tqkttqkt

Один из белков, содержащих такую последовательность:

https://www.ncbi.nlm.nih.gov/protein/OG … WPG4F2N01R

ATP-dependent DNA helicase [Geobacteraceae bacterium GWC2_53_11]

Геном, содержащий, вероятно, программную спираль, музыка сборки которой легла в основу мелодии "Петушок погромче пой": https://www.ncbi.nlm.nih.gov/nuccore/1085230279

CDS             complement(1370..2773)
                     /protein_id="OGU13475.1"
                     /translation="MTQFQETETLELKKSTSELKEAVISIVAMLNKHQRGEVLFGVRN
                     DGAVAGQQVSDKTLRDISKAIADHIEPKIYPVVEHVAFDSKQCVKVRVEGKEHPYYAY
                     GRAYIRVGDEDRQLSARELEKLILEKNRDRLRWDTEVCTKATVDDIGPGKLKSFLKLS
                     GLKFDTIPNTLEKLNLVRDGKLLNAAVLCFARKPEKFFPNARLRCAVFGTLDTTFTIS
                     MQDFYGDVFYLIKKAEEYILEHINIGMRLDGMLRVDVPEIDPEAFREAIINAFCHRDY
                     REYDSVNIAIFKDRVEIRSPGLLYGGLTIASIRKKMVSSRRNELLAELFQRSHRIEKW
                     GRGIKMILEREPATEFEEIGTVQFIATFKRKNVEAVFSDSVGEKVGEKVGETSEKTTQ
                     KTTQKTTQKTTQKTTQKTTQKILELIAGNPKISRVELAAAIGITADGIKYQLDKMRKE
                     GMLKRIGPDKGGYWEIV"

                                                                                         t caaacaatct
     1381 cccaataccc gcccttatcc ggcccgatac gctttaacat cccttctttc ctcatcttgt
     1441 ccaactgata tttgataccg tctgcagtaa ttccaattgc tgcggctagc tcgacacggc
     1501 tgattttggg atttccggca atcagttcca gaattttctg ggtagttttc tgggtagttt
     1561 tctgggtagt tttctgggta gttttctggg tagttttctg ggtagttttc tccgaagttt
     1621 caccgacctt ttcaccgacc ttttcaccga cactgtcgct gaaaacagcc tcaacattct
     1681 tccgcttaaa agtggcaata aactgaaccg tgccgatttc ctcaaactcc gtcgccggtt
     1741 cccgttccag aatcatcttg atgccacggc cccatttctc gatacgatgg ctgcgctgga
     1801 acaactccgc cagcaattca ttacgccggg aagaaaccat tttcttcctg atagacgcaa
     1861 tggtcagccc accgtagagc aggccgggac tgcggatttc cacccgatcc ttgaaaatgg
     1921 cgatattgac cgaatcgtat tcccggtaat cccgatggca aaatgcgttg atgatcgcct
     1981 cacgaaacgc ctccgggtca atttcgggca catccacccg cagcatcccg tccaggcgca
     2041 taccgatgtt gatatgttcg agaatgtatt cctcggcttt tttgatcaga taaaagacat
     2101 cgccatagaa atcctgcata ctgatggtaa atgtggtatc caacgtcccg aacacggcgc
     2161 aacgcagccg ggcgttagga aagaactttt ccggtttgcg ggcaaaacag agcacggcgg
     2221 cgttcagcag tttgccgtct ctgaccagat tcagtttttc gagggtgttg gggatggtgt
     2281 cgaacttgag accactaagt ttcagaaagg atttcagctt gccagggccg atgtcgtcaa
     2341 cggtcgcctt ggtgcatacc tcggtatccc agcggagccg gtcacggttt ttctccagaa
     2401 tcagcttttc cagttcgcgg gcacttaact gccggtcttc atcgccaacc cggatatatg
     2461 ctcggccata ggcatagtag gggtgctcct tgccttcgac ccgtaccttg acgcattgtt
     2521 tggaatcaaa cgcaacgtgt tcgacaacgg ggtagatttt gggttcgatg tggtcggcaa
     2581 ttgccttgga aatatcgcgg agggttttat cggaaacctg ctgaccggca acagcgccgt
     2641 cgtttctgac gccgaacagt acttcgcccc gctggtgctt gttcagcata gccacaatcg
     2701 agatcaccgc ctccttcagt tcggaggtgg atttcttcaa ctcaagggtt tccgtttctt
     2761 gaaattgggt cat

Нуклеотидная последовательность в стандарте fasta:

>OGU13475
atgacccaatttcaagaaacggaaacccttgagttgaagaaatccacctccgaactgaag
gaggcggtgatctcgattgtggctatgctgaacaagcaccagcggggcgaagtactgttc
ggcgtcagaaacgacggcgctgttgccggtcagcaggtttccgataaaaccctccgcgat
atttccaaggcaattgccgaccacatcgaacccaaaatctaccccgttgtcgaacacgtt
gcgtttgattccaaacaatgcgtcaaggtacgggtcgaaggcaaggagcacccctactat
gcctatggccgagcatatatccgggttggcgatgaagaccggcagttaagtgcccgcgaa
ctggaaaagctgattctggagaaaaaccgtgaccggctccgctgggataccgaggtatgc
accaaggcgaccgttgacgacatcggccctggcaagctgaaatcctttctgaaacttagt
ggtctcaagttcgacaccatccccaacaccctcgaaaaactgaatctggtcagagacggc
aaactgctgaacgccgccgtgctctgttttgcccgcaaaccggaaaagttctttcctaac
gcccggctgcgttgcgccgtgttcgggacgttggataccacatttaccatcagtatgcag
gatttctatggcgatgtcttttatctgatcaaaaaagccgaggaatacattctcgaacat
atcaacatcggtatgcgcctggacgggatgctgcgggtggatgtgcccgaaattgacccg
gaggcgtttcgtgaggcgatcatcaacgcattttgccatcgggattaccgggaatacgat
tcggtcaatatcgccattttcaaggatcgggtggaaatccgcagtcccggcctgctctac
ggtgggctgaccattgcgtctatcaggaagaaaatggtttcttcccggcgtaatgaattg
ctggcggagttgttccagcgcagccatcgtatcgagaaatggggccgtggcatcaagatg
attctggaacgggaaccggcgacggagtttgaggaaatcggcacggttcagtttattgcc
acttttaagcggaagaatgttgaggctgttttcagcgacagtgtcggtgaaaaggtcggt
gaaaaggtcggtgaaacttcggagaaaactacccagaaaactacccagaaaactacccag
aaaactacccagaaaactacccagaaaactacccagaaaattctggaactgattgccgga
aatcccaaaatcagccgtgtcgagctagccgcagcaattggaattactgcagacggtatc
aaatatcagttggacaagatgaggaaagaagggatgttaaagcgtatcgggccggataag
ggcgggtattgggagattgtttga

https://getfile.dokpub.com/yandex/get/https://yadi.sk/i/rh-1HkZfUMvu9Q

https://getfile.dokpub.com/yandex/get/https://yadi.sk/i/at18JXiy63WDLw
Вторичная структура полностью: https://yadi.sk/i/Jq0A-RV9xILnPg

Под циклический фрагмент музыки "Петушок погромче пой" строится 6.5 периодов программной 5577-спирали.

Обратите внимание на то, что первый аминокислотный остаток отличается по химическому составу, но не по высоте ноты!

2

Re: Ищем белок по нотам мелодии "Петушок погромче пой"

Online service PROTEIN PICOTECHNOLOGY

Заказ 2018-09-04-002:

Ещё один белок, содержащий фрагмент, который строится под циклическую мелодию из песни "Петушок погромче пой".

https://www.ncbi.nlm.nih.gov/protein/XP … WPG4F2N01R

https://www.ncbi.nlm.nih.gov/nuccore/942128036

PREDICTED: Latimeria chalumnae otogelin (OTOG), mRNA

Файл в стандарте fasta:

>XM_014485269
atgaggtacggattacctttaaagaacaaatctagccttgttctcttgtctcatagcaag
aacaacagtgaaatactcaagaaatgggatactgcttcccaaggaaaacagggattagaa
acaactataactactttttcatctattgaaacacctcctgtggtgagaaaccttccagct
ttagtagaacaatctcctgatacttgctcagtaccatgtcttaatggtggcgaatgtaca
tatttggaccagtgcaactgcagccaatttaatgcacaaggatcaaggtgccaaacagtt
cccaacactggatatgaaagagacatgatctgtcggacatggggacagtacaattttgaa
acatttgatggactttactttaactttcctggaacatgtatgtatacacttgtgaaagac
tgtgatcaatctgaacccaccttcttagtacaggcttttccaggtcctggcgatgtccat
ttcaaaatttgtgctttggcaatgccatcacaccttgcagtggtggatgatttattctat
attcaagcctttgaggggctttttacctactggcagatggttcacaatgatccacaatgt
catttttcttcatcttcctgtcaacgttctgttagtttattcttttcgtctgcggaagaa
ataaggttaaaaggatctcaagtcacttgtaatggaatgagcatcacactgccatatact
tcccgcaaaataagggtagaaagaatagcagagcatgtattggttacacaccatcatata
tttattctggcttgggatggcaactcaggagtctatataaagatgaatccagagtatatt
ggtaagacatgtgggctatgtggaaacttcactggggacctacaggatgacctcttaacc
agcagcggtcttttaacagaagatattgcaacgtttgctaacagctggatggaaccatat
ccaaatgaatcttgtacatatgtaccaactaactatccatcaccttgctctacacaaagc
caagaaactctcaaggtaatatatactctatgtaatagactcctggaatcaccgttcata
aaatgccagtaccatgtgagcccctaccctttcctagcaagttgtacaaatgacctttgc
atgtctggtcctgaggatgtaattttgtgtcgagcactgaatgaatatgccagagcttgt
gcccaggcaggttatcctcttcatggctggagaagaaaaatccctcagtgtgcggtgggg
tgtgaaggagagcttatctatagggagtgtattagctgttgtccaagtttatgtcaccac
cacatgaaacagtgtttcgacagtgaactccagtgtcttgatgggtgttactgtcctgaa
gagttgatttatgagaatggaacctgcattaaaccctctgaatgcccatgtatgtttcat
gggatttcgtatgcaacaggctctgtggtacaagaagaatgcaacaactgcacttgtatg
ggaggaatatggaactgcacagacaacatttgtccagcggaatgctcagtaacaggggac
attcatttcatgacttttgatgggaagccatatacattccaggcaacgtgccagtacatt
ctggcaaagagtcgcagtagtggcaaatttactgttattcttcaaaacacagcttgtaga
gggaacttggatggctcctgcatccaatcagtgagtgttgttattgatcaagatcccaag
aagcaagtgacattgacacatacgggagaggtgtttgtctgtaatcagtacaaggtcaac
ctaccctacattgatggtttatttgaaattagacaactttcttcattgtttgtacaagta
aaaacaaagtttgggctgcaaatacaatatgacaaaggtgggcttcggctttaccttgta
gtggacggccaatggaaagatgacactgtggggctctgtggaaccttcaatgggaatatc
caagatgattttctatctcctgcaggagtgcctgaaagtattcctgaaatatttgccaac
tcctggaaaatttcatcagcttgtggttcaagcaagatcattccactgccagatccatgt
gatatacatcttcaggcagctccatatgcttcagaatcatgtagcgtcctcacaaaggat
atttttgctccatgtcaccaatacttgagcccaatttccttccatgagcagtgcaaatca
gatacctgtaaatgtggacagatatgtatgtgtgctgcacttgctcactatgctcgacag
tgcagaaaacatggagtaattatcgactttagaagaaatgttactgattgtgagctgagt
gcccttgtttctttatgggagaagtttactcacctggagctgcaggcattccaacttcag
gcctatggtactagttgcccaaatagtcaaacctattttaactgtagtgattcaatggaa
aatgatgagatgagcagaggaattgcttatgaaaagacctgtgagaatcagcttctcaac
atgaccttctctggagaagtcccctgcatttctgggtgtatttgtccaccaggttttatg
aaacatggtgatgagtgctttaaacctgacaactgtccctgcttctggaagggaaaggaa
tactttcctgaagacaaggtgacctctccctgctatacctgcatctgtcaacagggatca
tttcagtgcaccttccaaccctgtccgtcaacatgcacagcatacggagaccgtcattac
atgacttttgacggacttttgtttgatttcattggagcttgtaaagtttacttggttaag
agtaattcggatcctggtctatcggttattgctgaaaacatgaattgttataacactgga
attatttgcagaaagtttctacaaattaatgttggacgatcttttttagtatttgatgac
gacactggaattccaaacccatctagcttaatagacaaacagcaaaacgtacagatatgg
caagcaggattcttcaccattgttaattttcccaatgaaaatgttacaatcttgtgggat
cgaagaacaacagtgcatatccaagttggaccaaactggaagggtgagctattgggattg
tgtggaaattttgaccttaaaacaacaaatgaaatgcggactccagacaacattgagaca
gggaattcccaggagtttgggaatagctggtcagcagtagagtgtgtaaacagccctgat
atccagaatccatgcaacctgagtccattaagggaaccatatgccaagaaaaagtgtggg
atcttactgagtgaagtgtttgaagcctgccacccggtggttgatgttacctggtttcat
tcaaattgcctaacagacacatgtggctgtacccgtgggggtgattgtgagtgtttatgt
accagtgtgtcagcatacgcacaccgctgttgtcagcatgggattactgttgactggaga
tcccctactacttgcccttttgactgtgaatacttcaacaaatttctaggaaagggccct
tataaactggctgcttacttggatggtaaagcttttatggcagccaatctatctggtggt
attgttttcccagtaaagggagatgtctccattccaggaataagttttatgttgactcca
ggactgtataaacctaaagcatacgatactagcttgatttcctttgaagcagcagaaaga
cccaatttctttctacttttgggatccaatggttccctgttcctttcaaagtgggaagag
aatgaagagtttcaaacttcatctacatttatcttacacaaggatacctggattacagga
tacagttcttttgaatcatttaagaagcctggattcttcattcactacatgatatcctct
gtccatctgatgaaacatgaacacactgaaggatttcgacaatcttctcactttaggctc
attgaatctaattttagagctccagcacgctccacctgtgagtggagatatgacagctgt
gcaaccccatgtttcaagacctgccgagatcccacaggagaaaactgtgaaaccttacca
aaaatagaaggatgttttcctaaatgtccttcctacatggtactggacgaggttactatg
agatgcgtgtattttgaggactgcattaagccagcagttgttgttctaccttcacctcta
agtattacaacaataccacctgagcctgcctggaatctctctgttccagtactgtcaagt
acacactttgtttcagctactacaattccaagttcagccagtacatttaccacagttcta
aagctgaatactacaacacctcatgaagtttcaactgtatctatatcagaaaaaacactg
gcaacatctccagtaaaagttacagcctattcagtcaatatcacagctgctattcttcca
actacatcaccatatttgtctaaaactaccccttctataacagagtcaatgttgtcacca
tcagtagcaacaactgctcctaaagaaccagatacaactgctaaaacatctgcaaaggct
acttctagagaatttattaaaacaagtataccatacaaaactggaattacagaagtgagt
agtactgtgccatctcttgttaccacatctgtaactgacctatcacaagtaacttcaagc
aaaatggccataactgctgctgcaaccacagtgctgttcaagagagttacatcatcagca
attcctgaaacaataactacaccagagactggatcaaggatatttcccacaaaggaagtg
ccattaacaacacaaaaaactacacaaaaaccatatgtaattacagcatcaaattataca
gtacctacatcagtgcaagaaaaaaacctttctgctacaacatatccatcaaccacacat
atccctctccagtctactcagttaacatcaagtgtggtatatcccattttgacagaaatg
acttcatcatctttgacaatgtcacaagagcaaactaagtcattagaaacatcaactact
aaatatttatcaccaattgcaaatgcaacagagattacacttacgccttttacatggaaa
tcaacacctttaaatgttactgtatactcttcagttatgacaacccaaccatccactgct
tcctcatacccaacaacaaagcttaagacaacactaacacccataataacatcggaatca
acagttagaacagaatcagtaacagaaaaaacagagtactcaatgaaaaccgtaaaaaca
agcccagctgaaacattttcaacagaatttgtaaaaacaatttcagcttctaaagagtat
ccaagaactgtaccatatccaacaccagaacaaacaatttcattaaaaataacacccata
ggtgacactacctcaacagaaacacagtcagcgaccacaagatctcattatacaatggaa
aaacttttggaaacttcagcagtctctctcaaaacattaccaggtgcttggagtactaaa
gaaacctcaactgttttctatcctgagacgtcatcaattaaaaaaactttaaccaccaca
gcataccaaaccttaccagagacatcatcaatcactgttacatacccagtagctaaattc
agtacaataaggtctacagcaaaactgaccaccacatatccactatctgttggttcaact
actctttcagtctataatgttacattcagcactagatctcctacagaaattacagcagtg
tcatttacatcagctacaacagagacactgacttcttatccgcatacttctaaagaacat
actacagttttactttccacaggccagacaaggtttaaagaaacttcaccaccaaccatc
acccccttacttatatcacctatcacaaaaactacaactccaaaaacatcaaccgtaatt
tcattatcaccagagttaaatgtgactgcaggtgtgacatcaaatgagactgcagctaca
cagacatcagtagcaaaagccactataagttcaactacatctgctgtttcattatcacct
gcaatagctacatacacaaccctaagcaaagctacaagtgaaaaagtctcaactccatct
gcttctcttttaacatcagcctacagttcaactagctattctccagctataactagcagt
gtttctttcactacttcaagcccaatagtcaccacagaaaagctctctacaacaacaaaa
acctttactagcactgtgctttcaacaactgcaaccagccatgtcgaagcacatgttcct
gagactacaaaagaattaccattttcagagttttcaactgtgacactcgctccttcagtg
ccaagctcattacttcctactaagtactggactacagtgcccaaagaaacctctcgaaat
atgactgtacctaccacaaagaatctaacatctataagtattactcaacctgttcaggtt
tcaacatcaactgaagttacacttgaagctatgacaggtgcaacatttatgtcaactagt
tcctacataaaaaatgtgacctctcaatcaacccaaacttgcacagttccatttcctgaa
tatcctgatccttgcattgaatatttctgtgtgaatggccacttgattaaacacaataaa
actcaaacctgtccctataatgcaactgcacctacttgtggactaatgggacttgcactg
gaggttaatggagataagtgttgtccaaaatgggaatgcccatgtcaatgcaccatttat
agtgatctaagattgatgacctttgatggaagcaacttgggtatctttaaagcagcttca
tatattattactcacgtacagcaggaaatcataaccatcaaaacacaaaactgccagaac
ttaatgggggactacaggaactcaatcaagctatgcctggcaaccctcaacataacccat
ccaactcttctttttagcatcgatcgcctgaaaagaaaactatttgtgaattcaacatct
gtcagctctaaatacagaaaaaatggatttatgattatggatactggtcatatgtatttg
atcaaaacaccagctggaataaagataaaatggtttcacaacaccggaatgatgacaata
gagaccaatacctccagtaaattaacaaccatgggattgtgtggttactgtgatagaaac
caaacaaatgactttacactactgaatggtacagttcttagtgaaggggagaacccaaca
accttcatggacagctggcaagtgccaaatactttacgttatgtgggagaggatcggcgg
cgtgacgttaactgttcaaccagtgactgctctgagtgtttcagcatgctttataaccaa
accttcagcagctgccatccctttgtttctcctgaggtgttttgtgaaatctggatacaa
gatgaagaatatgtgaaaaatccttgtatagcactggctgcatatgtagcaacttgtcaa
aagtttaatatctgcataaaatggaggacttcagactactgctccttccattgccctcaa
gatttaaggtatcaagcttgtttgccagtgtgcaatgcaaaaacctgccagaactatgat
ttagagcatctggatgttcaagaatgctcaggactaagtgaaggctgtgtctgtccagaa
gggactgttcttcatcgttcctattctgatctctgtatcccagaggaaaagtgtgcctgc
actgacagtttcggaaccccacgagctccaggggagacatggaccacgtcaccagatagc
tgttgtacatacacgtgcataggcaatgataccattattccagtggaacataactgctct
gatgttatagaaccagaatgcaggcatggtgaagaagtggtaataatcctgtctgatgac
aaaagctgctgtcctcaaaaaacatgtatttgcaaccagacaatttgtgaaagttatatt
cctcagtgtagcgatgaagaaaaacttattcttaactataaggcagactcttgctgtcca
gaatatatttgtgtgtgtgctgcttgctctgacaatattccaaaatgtcaagatggagaa
attcttattgtggatggtaataccacagatagatgctgtcctggataccagtgcgtttgt
gaaacgcatcgctgtcctgagtacaactgttctttaggaaaagcaataatcgaagcatgg
agtccagagcaatgttgcccatataaaacatgtgtgtgtgcatgtgagacaattcctaaa
cccaattgcaaactgggagaagcaatacaaatagatgaagaattccaaaatgaccctgaa
aatcaatgtggttgcattaaatacaaatgtgtaaagaaccaggtgtgtgtagatgaagag
agaggtgtcttgcatccaggacagactattgtggaacatactcctaatggcttttgccat
attacccattgtactagccagactgatccacaaaccaactattaccagattagtgttaaa
agaataaactgtactgcaacgtgtcaatcaaaccaaaaatatgagcctcctgttgattta
acaacttgctgcggaatctgtaaaaatgtgacatgtctccacactatagaaaatggtaca
gttaaagcttatgagcctggcacctcttggatatcaaattgtgttagatatggatgcaca
gacacaatgttaggacctgttctaactgcatctccaattagctgtcctccctttaatgaa
actgaatgcctgaagatagggggaactattgtgcccttcatggatggatgctgtaagagc
tgtaaagaagatgcaaagttctgtcagaaagtgaccgtaaggatgaaaattcgaaagaat
gactgtaaaagtaatataccaattaacattgtttcatgtgatgggaaatgtccttctgcc
agtatttataattacaatattaacacatatgccaggttctgcaagtgctgcagggaaact
gggctgcaaacacgtactgtacagctattctgtagtggcaactcaacttgggtcagttat
tccatccaggagcccacagattgttcttgccagtggtcctag

Вторичная структура:

https://getfile.dokpub.com/yandex/get/https://yadi.sk/i/fVNdwa9JXKFmEg

https://getfile.dokpub.com/yandex/get/https://yadi.sk/i/GLPcON4dAZDC2g

Кушелев: Любопытная структура...

Здесь последовательность TTQKTTQK не образует периодической вторичной структуры.