Amino acid dipeptide frequency for Cylindrospermopsis raciborskii virus RM-2018a

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
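The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the Proteome-pI implementation: the function name `dipeptide_frequencies`, the use of one-letter codes, the toy sequences, and the choice to normalise by the total number of counted dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is mapped to X (Xaa).
        cleaned = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 21 x 21 = 441 pairs, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, not taken from this proteome.
toy_proteins = ["MKTAYIAKQR", "MAAAK"]
freqs = dipeptide_frequencies(toy_proteins)
print(len(freqs))             # 441
print(round(freqs["AA"], 3))  # 153.846 (2 of the 13 pairs)
```

Under a uniform model each of the 400 standard dipeptides would sit at 1000/400 = 2.5‰, which is the baseline the text compares against.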

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
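The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and the exact threshold the page uses for its red and blue colouring is not stated; this sketch simply compares each value with 2.5‰.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation.

    Assumption: "over" corresponds to red cells and "under" to blue cells.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Three values copied from the table (per mille).
sample = {"AlaAla": 5.337, "CysCys": 0.152, "LeuGlu": 7.663}
for name, value in sample.items():
    print(name, per_mille_to_fraction(value), classify(value))
```

For instance, AlaAla at 5.337‰ corresponds to a fraction of about 0.0053, roughly twice the uniform expectation, whereas CysCys at 0.152‰ is far below it.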

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue; each line gives dipeptide: frequency ± error (‰).
Ala
AlaAla: 5.337 ± 0.625
AlaCys: 0.648 ± 0.206
AlaAsp: 3.355 ± 0.415
AlaGlu: 3.851 ± 0.776
AlaPhe: 3.05 ± 0.277
AlaGly: 5.376 ± 0.478
AlaHis: 0.762 ± 0.233
AlaIle: 6.1 ± 0.542
AlaLys: 5.376 ± 0.844
AlaLeu: 6.9 ± 0.643
AlaMet: 1.487 ± 0.273
AlaAsn: 4.613 ± 0.618
AlaPro: 2.707 ± 0.559
AlaGln: 3.393 ± 0.453
AlaArg: 2.745 ± 0.357
AlaSer: 4.804 ± 0.496
AlaThr: 5.452 ± 0.655
AlaVal: 4.079 ± 0.364
AlaTrp: 0.686 ± 0.169
AlaTyr: 2.707 ± 0.325
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.419 ± 0.156
CysCys: 0.152 ± 0.076
CysAsp: 0.381 ± 0.121
CysGlu: 0.191 ± 0.093
CysPhe: 0.305 ± 0.106
CysGly: 0.801 ± 0.245
CysHis: 0.152 ± 0.08
CysIle: 0.343 ± 0.143
CysLys: 0.457 ± 0.181
CysLeu: 0.991 ± 0.22
CysMet: 0.152 ± 0.087
CysAsn: 0.419 ± 0.147
CysPro: 0.877 ± 0.302
CysGln: 0.457 ± 0.151
CysArg: 0.648 ± 0.196
CysSer: 0.572 ± 0.151
CysThr: 0.61 ± 0.208
CysVal: 0.724 ± 0.227
CysTrp: 0.191 ± 0.086
CysTyr: 0.381 ± 0.138
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.736 ± 0.332
AspCys: 0.724 ± 0.178
AspAsp: 2.249 ± 0.336
AspGlu: 2.745 ± 0.429
AspPhe: 2.402 ± 0.313
AspGly: 3.05 ± 0.261
AspHis: 0.305 ± 0.091
AspIle: 3.431 ± 0.351
AspLys: 2.821 ± 0.36
AspLeu: 5.414 ± 0.559
AspMet: 0.915 ± 0.189
AspAsn: 2.249 ± 0.291
AspPro: 2.211 ± 0.253
AspGln: 1.563 ± 0.261
AspArg: 1.83 ± 0.248
AspSer: 3.546 ± 0.369
AspThr: 2.974 ± 0.403
AspVal: 2.745 ± 0.287
AspTrp: 1.296 ± 0.251
AspTyr: 2.059 ± 0.278
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.079 ± 0.519
GluCys: 0.686 ± 0.186
GluAsp: 3.355 ± 0.449
GluGlu: 4.232 ± 0.485
GluPhe: 2.059 ± 0.283
GluGly: 3.736 ± 0.483
GluHis: 0.801 ± 0.201
GluIle: 5.109 ± 0.454
GluLys: 3.774 ± 0.53
GluLeu: 5.986 ± 0.592
GluMet: 1.067 ± 0.268
GluAsn: 3.507 ± 0.418
GluPro: 1.982 ± 0.283
GluGln: 3.088 ± 0.425
GluArg: 3.126 ± 0.38
GluSer: 4.308 ± 0.451
GluThr: 3.66 ± 0.312
GluVal: 4.956 ± 0.469
GluTrp: 1.258 ± 0.232
GluTyr: 2.287 ± 0.417
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.164 ± 0.393
PheCys: 0.381 ± 0.144
PheAsp: 2.364 ± 0.293
PheGlu: 2.592 ± 0.33
PhePhe: 1.563 ± 0.232
PheGly: 3.05 ± 0.354
PheHis: 0.457 ± 0.096
PheIle: 1.906 ± 0.219
PheLys: 2.631 ± 0.366
PheLeu: 3.431 ± 0.373
PheMet: 0.724 ± 0.175
PheAsn: 2.669 ± 0.412
PhePro: 2.097 ± 0.322
PheGln: 1.258 ± 0.21
PheArg: 2.287 ± 0.332
PheSer: 3.126 ± 0.372
PheThr: 3.164 ± 0.46
PheVal: 2.173 ± 0.255
PheTrp: 0.419 ± 0.127
PheTyr: 1.22 ± 0.253
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.032 ± 0.683
GlyCys: 0.457 ± 0.138
GlyAsp: 2.897 ± 0.298
GlyGlu: 3.66 ± 0.397
GlyPhe: 4.003 ± 0.428
GlyGly: 4.651 ± 0.842
GlyHis: 0.572 ± 0.211
GlyIle: 5.604 ± 0.6
GlyLys: 3.965 ± 0.584
GlyLeu: 6.519 ± 0.945
GlyMet: 0.762 ± 0.15
GlyAsn: 4.156 ± 0.53
GlyPro: 0.343 ± 0.147
GlyGln: 2.021 ± 0.254
GlyArg: 2.554 ± 0.475
GlySer: 5.109 ± 0.462
GlyThr: 4.651 ± 0.686
GlyVal: 4.918 ± 0.527
GlyTrp: 1.067 ± 0.23
GlyTyr: 3.241 ± 0.584
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.496 ± 0.122
HisCys: 0.229 ± 0.132
HisAsp: 0.572 ± 0.16
HisGlu: 0.762 ± 0.168
HisPhe: 0.762 ± 0.138
HisGly: 0.572 ± 0.133
HisHis: 0.229 ± 0.108
HisIle: 0.953 ± 0.215
HisLys: 0.991 ± 0.32
HisLeu: 1.525 ± 0.279
HisMet: 0.038 ± 0.043
HisAsn: 0.496 ± 0.127
HisPro: 0.496 ± 0.142
HisGln: 0.572 ± 0.181
HisArg: 0.724 ± 0.154
HisSer: 0.801 ± 0.155
HisThr: 0.61 ± 0.19
HisVal: 0.762 ± 0.178
HisTrp: 0.343 ± 0.124
HisTyr: 0.496 ± 0.173
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.681 ± 0.52
IleCys: 0.953 ± 0.258
IleAsp: 3.698 ± 0.356
IleGlu: 4.651 ± 0.521
IlePhe: 2.097 ± 0.283
IleGly: 4.537 ± 0.613
IleHis: 0.801 ± 0.153
IleIle: 3.507 ± 0.39
IleLys: 4.88 ± 0.521
IleLeu: 4.956 ± 0.507
IleMet: 1.029 ± 0.161
IleAsn: 5.032 ± 0.555
IlePro: 4.194 ± 0.48
IleGln: 2.821 ± 0.35
IleArg: 3.012 ± 0.405
IleSer: 5.071 ± 0.447
IleThr: 5.032 ± 0.506
IleVal: 3.546 ± 0.41
IleTrp: 0.724 ± 0.134
IleTyr: 2.249 ± 0.42
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.842 ± 0.8
LysCys: 0.572 ± 0.169
LysAsp: 3.241 ± 0.451
LysGlu: 4.156 ± 0.515
LysPhe: 2.402 ± 0.28
LysGly: 3.012 ± 0.631
LysHis: 1.067 ± 0.158
LysIle: 4.194 ± 0.418
LysLys: 4.308 ± 0.546
LysLeu: 5.871 ± 0.575
LysMet: 1.067 ± 0.262
LysAsn: 3.164 ± 0.574
LysPro: 3.126 ± 0.386
LysGln: 3.317 ± 0.666
LysArg: 3.202 ± 0.424
LysSer: 4.117 ± 0.456
LysThr: 4.575 ± 0.56
LysVal: 3.812 ± 0.451
LysTrp: 0.991 ± 0.257
LysTyr: 2.249 ± 0.271
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.1 ± 0.466
LeuCys: 0.686 ± 0.213
LeuAsp: 4.651 ± 0.437
LeuGlu: 7.663 ± 0.713
LeuPhe: 3.012 ± 0.328
LeuGly: 5.986 ± 0.829
LeuHis: 1.411 ± 0.241
LeuIle: 5.604 ± 0.649
LeuLys: 4.956 ± 0.509
LeuLeu: 6.824 ± 0.601
LeuMet: 1.677 ± 0.278
LeuAsn: 6.329 ± 0.467
LeuPro: 4.804 ± 0.426
LeuGln: 3.736 ± 0.412
LeuArg: 3.507 ± 0.379
LeuSer: 7.587 ± 0.432
LeuThr: 6.367 ± 0.653
LeuVal: 5.528 ± 0.596
LeuTrp: 0.839 ± 0.207
LeuTyr: 2.211 ± 0.48
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.372 ± 0.185
MetCys: 0.191 ± 0.08
MetAsp: 1.029 ± 0.204
MetGlu: 1.067 ± 0.214
MetPhe: 0.953 ± 0.2
MetGly: 0.991 ± 0.289
MetHis: 0.152 ± 0.084
MetIle: 0.877 ± 0.144
MetLys: 1.029 ± 0.278
MetLeu: 1.372 ± 0.257
MetMet: 0.381 ± 0.138
MetAsn: 0.801 ± 0.173
MetPro: 1.411 ± 0.247
MetGln: 0.991 ± 0.18
MetArg: 0.534 ± 0.138
MetSer: 1.411 ± 0.237
MetThr: 0.991 ± 0.184
MetVal: 0.762 ± 0.216
MetTrp: 0.076 ± 0.052
MetTyr: 0.191 ± 0.077
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.537 ± 0.455
AsnCys: 0.419 ± 0.179
AsnAsp: 2.173 ± 0.337
AsnGlu: 2.936 ± 0.351
AsnPhe: 2.897 ± 0.449
AsnGly: 4.156 ± 0.62
AsnHis: 0.724 ± 0.237
AsnIle: 3.736 ± 0.512
AsnLys: 3.698 ± 0.373
AsnLeu: 6.824 ± 0.597
AsnMet: 1.144 ± 0.179
AsnAsn: 4.27 ± 0.399
AsnPro: 2.821 ± 0.358
AsnGln: 2.402 ± 0.297
AsnArg: 2.592 ± 0.344
AsnSer: 3.889 ± 0.44
AsnThr: 5.49 ± 0.653
AsnVal: 2.974 ± 0.346
AsnTrp: 1.029 ± 0.242
AsnTyr: 2.859 ± 0.328
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.669 ± 0.314
ProCys: 0.343 ± 0.129
ProAsp: 2.287 ± 0.251
ProGlu: 3.279 ± 0.452
ProPhe: 1.563 ± 0.246
ProGly: 3.241 ± 0.346
ProHis: 0.496 ± 0.163
ProIle: 2.897 ± 0.451
ProLys: 2.249 ± 0.269
ProLeu: 3.241 ± 0.411
ProMet: 0.839 ± 0.215
ProAsn: 2.897 ± 0.494
ProPro: 1.487 ± 0.229
ProGln: 1.563 ± 0.259
ProArg: 1.563 ± 0.249
ProSer: 3.774 ± 0.364
ProThr: 2.859 ± 0.505
ProVal: 3.774 ± 0.553
ProTrp: 0.572 ± 0.131
ProTyr: 1.182 ± 0.234
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.736 ± 0.66
GlnCys: 0.191 ± 0.085
GlnAsp: 1.944 ± 0.306
GlnGlu: 3.927 ± 0.412
GlnPhe: 1.754 ± 0.272
GlnGly: 2.249 ± 0.277
GlnHis: 0.534 ± 0.144
GlnIle: 3.698 ± 0.388
GlnLys: 2.745 ± 0.521
GlnLeu: 4.003 ± 0.35
GlnMet: 0.534 ± 0.173
GlnAsn: 2.402 ± 0.357
GlnPro: 1.106 ± 0.215
GlnGln: 2.44 ± 0.473
GlnArg: 2.021 ± 0.373
GlnSer: 2.364 ± 0.293
GlnThr: 2.211 ± 0.358
GlnVal: 2.592 ± 0.34
GlnTrp: 0.496 ± 0.161
GlnTyr: 1.372 ± 0.213
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.783 ± 0.272
ArgCys: 0.534 ± 0.174
ArgAsp: 2.059 ± 0.306
ArgGlu: 3.279 ± 0.413
ArgPhe: 1.792 ± 0.206
ArgGly: 2.326 ± 0.31
ArgHis: 0.801 ± 0.205
ArgIle: 4.003 ± 0.547
ArgLys: 3.317 ± 0.572
ArgLeu: 3.66 ± 0.369
ArgMet: 0.801 ± 0.195
ArgAsn: 2.211 ± 0.266
ArgPro: 1.792 ± 0.402
ArgGln: 2.135 ± 0.375
ArgArg: 2.402 ± 0.224
ArgSer: 2.859 ± 0.255
ArgThr: 2.783 ± 0.307
ArgVal: 2.554 ± 0.334
ArgTrp: 0.686 ± 0.154
ArgTyr: 2.021 ± 0.366
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.651 ± 0.419
SerCys: 0.686 ± 0.191
SerAsp: 3.088 ± 0.331
SerGlu: 3.05 ± 0.395
SerPhe: 3.317 ± 0.313
SerGly: 5.566 ± 0.587
SerHis: 0.534 ± 0.163
SerIle: 5.642 ± 0.45
SerLys: 4.727 ± 0.565
SerLeu: 6.062 ± 0.364
SerMet: 0.915 ± 0.168
SerAsn: 4.88 ± 0.484
SerPro: 3.241 ± 0.392
SerGln: 3.126 ± 0.377
SerArg: 4.117 ± 0.457
SerSer: 6.024 ± 0.83
SerThr: 4.575 ± 0.706
SerVal: 5.147 ± 0.514
SerTrp: 0.915 ± 0.194
SerTyr: 2.135 ± 0.525
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.367 ± 0.697
ThrCys: 0.381 ± 0.124
ThrAsp: 2.783 ± 0.384
ThrGlu: 4.308 ± 0.391
ThrPhe: 2.821 ± 0.408
ThrGly: 4.689 ± 0.614
ThrHis: 1.067 ± 0.273
ThrIle: 4.346 ± 0.645
ThrLys: 3.927 ± 0.452
ThrLeu: 5.719 ± 0.53
ThrMet: 0.801 ± 0.191
ThrAsn: 3.774 ± 0.502
ThrPro: 3.546 ± 0.663
ThrGln: 3.241 ± 0.359
ThrArg: 2.897 ± 0.358
ThrSer: 5.147 ± 0.636
ThrThr: 6.214 ± 1.22
ThrVal: 5.147 ± 0.827
ThrTrp: 0.457 ± 0.137
ThrTyr: 2.364 ± 0.303
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.689 ± 0.382
ValCys: 0.381 ± 0.162
ValAsp: 3.012 ± 0.307
ValGlu: 4.156 ± 0.432
ValPhe: 2.326 ± 0.251
ValGly: 4.461 ± 0.684
ValHis: 0.648 ± 0.181
ValIle: 4.079 ± 0.449
ValLys: 4.422 ± 0.522
ValLeu: 5.49 ± 0.531
ValMet: 0.877 ± 0.136
ValAsn: 4.079 ± 0.386
ValPro: 2.897 ± 0.349
ValGln: 1.716 ± 0.227
ValArg: 2.897 ± 0.268
ValSer: 4.499 ± 0.499
ValThr: 5.109 ± 0.835
ValVal: 4.461 ± 0.424
ValTrp: 0.915 ± 0.277
ValTyr: 2.897 ± 0.48
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.801 ± 0.215
TrpCys: 0.076 ± 0.058
TrpAsp: 0.953 ± 0.216
TrpGlu: 1.029 ± 0.264
TrpPhe: 0.648 ± 0.139
TrpGly: 0.877 ± 0.18
TrpHis: 0.496 ± 0.138
TrpIle: 0.801 ± 0.181
TrpLys: 0.572 ± 0.182
TrpLeu: 1.258 ± 0.29
TrpMet: 0.496 ± 0.169
TrpAsn: 0.801 ± 0.208
TrpPro: 0.191 ± 0.099
TrpGln: 0.686 ± 0.146
TrpArg: 0.572 ± 0.182
TrpSer: 0.991 ± 0.247
TrpThr: 0.762 ± 0.168
TrpVal: 1.029 ± 0.264
TrpTrp: 0.343 ± 0.143
TrpTyr: 0.686 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.936 ± 0.337
TyrCys: 0.534 ± 0.164
TyrAsp: 2.173 ± 0.307
TyrGlu: 1.525 ± 0.251
TyrPhe: 0.953 ± 0.188
TyrGly: 2.859 ± 0.43
TyrHis: 0.419 ± 0.171
TyrIle: 1.639 ± 0.341
TyrLys: 2.44 ± 0.344
TyrLeu: 3.431 ± 0.492
TyrMet: 0.839 ± 0.261
TyrAsn: 2.821 ± 0.41
TyrPro: 1.449 ± 0.265
TyrGln: 1.792 ± 0.251
TyrArg: 1.563 ± 0.254
TyrSer: 2.44 ± 0.306
TyrThr: 1.906 ± 0.402
TyrVal: 2.326 ± 0.506
TyrTrp: 0.762 ± 0.245
TyrTyr: 2.173 ± 0.472
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (26231 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
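Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic on each resample. The sketch below illustrates the idea for a single dipeptide. It is an assumption-laden example rather than the Proteome-pI code: the function names are hypothetical, the toy sequences are invented, and reporting the standard deviation across resamples as the error is an assumption, since the page does not say which spread measure it uses.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from this proteome.
toy_proteins = ["MKTAYIAKQR", "MAAAK", "MLLSAAG", "MKKLLPT"]
point = dipeptide_per_mille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.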

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski