Amino acid dipepetide frequency for Warrego virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.864AlaAla: 3.864 ± 1.149
0.805AlaCys: 0.805 ± 0.326
2.737AlaAsp: 2.737 ± 0.514
2.576AlaGlu: 2.576 ± 0.854
2.254AlaPhe: 2.254 ± 0.862
3.542AlaGly: 3.542 ± 0.807
1.127AlaHis: 1.127 ± 0.449
4.991AlaIle: 4.991 ± 0.951
4.025AlaLys: 4.025 ± 0.937
5.474AlaLeu: 5.474 ± 1.058
2.576AlaMet: 2.576 ± 0.636
1.932AlaAsn: 1.932 ± 0.386
1.932AlaPro: 1.932 ± 0.603
3.059AlaGln: 3.059 ± 0.766
4.025AlaArg: 4.025 ± 0.938
3.703AlaSer: 3.703 ± 0.816
5.152AlaThr: 5.152 ± 0.903
4.347AlaVal: 4.347 ± 0.648
1.288AlaTrp: 1.288 ± 0.384
2.254AlaTyr: 2.254 ± 0.612
0.0AlaXaa: 0.0 ± 0.0
Cys
1.449CysAla: 1.449 ± 0.698
0.161CysCys: 0.161 ± 0.123
0.966CysAsp: 0.966 ± 0.313
0.483CysGlu: 0.483 ± 0.34
0.966CysPhe: 0.966 ± 0.505
0.805CysGly: 0.805 ± 0.328
0.161CysHis: 0.161 ± 0.155
0.161CysIle: 0.161 ± 0.164
0.805CysLys: 0.805 ± 0.374
0.805CysLeu: 0.805 ± 0.438
0.322CysMet: 0.322 ± 0.166
0.483CysAsn: 0.483 ± 0.343
0.161CysPro: 0.161 ± 0.162
0.322CysGln: 0.322 ± 0.239
0.644CysArg: 0.644 ± 0.329
0.161CysSer: 0.161 ± 0.162
0.161CysThr: 0.161 ± 0.164
0.322CysVal: 0.322 ± 0.204
0.0CysTrp: 0.0 ± 0.0
1.288CysTyr: 1.288 ± 0.468
0.0CysXaa: 0.0 ± 0.0
Asp
3.381AspAla: 3.381 ± 0.632
0.644AspCys: 0.644 ± 0.332
4.83AspAsp: 4.83 ± 0.974
5.474AspGlu: 5.474 ± 0.832
3.059AspPhe: 3.059 ± 0.529
4.186AspGly: 4.186 ± 0.38
0.644AspHis: 0.644 ± 0.299
3.22AspIle: 3.22 ± 0.534
2.254AspLys: 2.254 ± 0.593
5.635AspLeu: 5.635 ± 1.019
1.61AspMet: 1.61 ± 0.417
2.576AspAsn: 2.576 ± 0.794
3.542AspPro: 3.542 ± 0.976
0.322AspGln: 0.322 ± 0.236
4.186AspArg: 4.186 ± 0.92
2.576AspSer: 2.576 ± 0.606
4.669AspThr: 4.669 ± 0.861
4.186AspVal: 4.186 ± 0.619
0.161AspTrp: 0.161 ± 0.145
2.093AspTyr: 2.093 ± 0.355
0.0AspXaa: 0.0 ± 0.0
Glu
4.025GluAla: 4.025 ± 0.769
0.644GluCys: 0.644 ± 0.421
4.991GluAsp: 4.991 ± 0.929
8.211GluGlu: 8.211 ± 1.411
2.898GluPhe: 2.898 ± 0.797
3.059GluGly: 3.059 ± 0.828
1.127GluHis: 1.127 ± 0.399
7.406GluIle: 7.406 ± 1.115
7.728GluLys: 7.728 ± 2.047
4.669GluLeu: 4.669 ± 0.938
2.576GluMet: 2.576 ± 0.813
3.059GluAsn: 3.059 ± 1.225
2.415GluPro: 2.415 ± 0.463
3.22GluGln: 3.22 ± 0.513
4.508GluArg: 4.508 ± 0.691
3.059GluSer: 3.059 ± 0.519
3.22GluThr: 3.22 ± 0.64
5.957GluVal: 5.957 ± 0.791
1.127GluTrp: 1.127 ± 0.56
2.898GluTyr: 2.898 ± 0.508
0.0GluXaa: 0.0 ± 0.0
Phe
2.898PheAla: 2.898 ± 0.565
0.483PheCys: 0.483 ± 0.233
1.61PheAsp: 1.61 ± 0.384
3.22PheGlu: 3.22 ± 0.642
1.771PhePhe: 1.771 ± 0.459
2.254PheGly: 2.254 ± 0.55
1.127PheHis: 1.127 ± 0.398
3.542PheIle: 3.542 ± 0.874
2.898PheLys: 2.898 ± 0.688
3.22PheLeu: 3.22 ± 0.493
0.644PheMet: 0.644 ± 0.26
1.61PheAsn: 1.61 ± 0.57
2.093PhePro: 2.093 ± 0.738
1.449PheGln: 1.449 ± 0.509
2.254PheArg: 2.254 ± 0.626
2.093PheSer: 2.093 ± 0.447
2.576PheThr: 2.576 ± 0.413
2.415PheVal: 2.415 ± 0.923
0.161PheTrp: 0.161 ± 0.134
2.576PheTyr: 2.576 ± 0.542
0.0PheXaa: 0.0 ± 0.0
Gly
3.703GlyAla: 3.703 ± 0.431
0.644GlyCys: 0.644 ± 0.354
4.025GlyAsp: 4.025 ± 0.932
3.059GlyGlu: 3.059 ± 0.881
2.093GlyPhe: 2.093 ± 0.5
2.093GlyGly: 2.093 ± 0.45
1.771GlyHis: 1.771 ± 0.667
3.703GlyIle: 3.703 ± 0.781
3.542GlyLys: 3.542 ± 0.947
3.864GlyLeu: 3.864 ± 0.984
2.576GlyMet: 2.576 ± 0.758
3.22GlyAsn: 3.22 ± 0.789
1.449GlyPro: 1.449 ± 0.601
2.093GlyGln: 2.093 ± 0.634
3.22GlyArg: 3.22 ± 0.846
3.059GlySer: 3.059 ± 0.673
1.932GlyThr: 1.932 ± 0.684
4.991GlyVal: 4.991 ± 0.911
0.805GlyTrp: 0.805 ± 0.405
2.576GlyTyr: 2.576 ± 0.786
0.0GlyXaa: 0.0 ± 0.0
His
0.483HisAla: 0.483 ± 0.277
0.322HisCys: 0.322 ± 0.21
1.449HisAsp: 1.449 ± 0.352
1.449HisGlu: 1.449 ± 0.32
1.127HisPhe: 1.127 ± 0.373
0.966HisGly: 0.966 ± 0.288
0.483HisHis: 0.483 ± 0.215
1.449HisIle: 1.449 ± 0.52
0.966HisLys: 0.966 ± 0.422
3.22HisLeu: 3.22 ± 0.627
0.966HisMet: 0.966 ± 0.355
0.966HisAsn: 0.966 ± 0.257
0.805HisPro: 0.805 ± 0.36
0.644HisGln: 0.644 ± 0.267
1.288HisArg: 1.288 ± 0.422
0.966HisSer: 0.966 ± 0.242
1.127HisThr: 1.127 ± 0.356
1.771HisVal: 1.771 ± 0.474
0.161HisTrp: 0.161 ± 0.162
0.322HisTyr: 0.322 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
4.669IleAla: 4.669 ± 0.481
0.644IleCys: 0.644 ± 0.248
5.474IleAsp: 5.474 ± 0.761
5.796IleGlu: 5.796 ± 0.7
2.415IlePhe: 2.415 ± 0.538
3.864IleGly: 3.864 ± 0.654
1.449IleHis: 1.449 ± 0.561
4.347IleIle: 4.347 ± 0.614
6.762IleLys: 6.762 ± 1.409
4.83IleLeu: 4.83 ± 0.925
3.381IleMet: 3.381 ± 0.698
4.025IleAsn: 4.025 ± 0.789
1.932IlePro: 1.932 ± 0.458
2.898IleGln: 2.898 ± 0.584
3.703IleArg: 3.703 ± 1.007
3.703IleSer: 3.703 ± 0.703
3.381IleThr: 3.381 ± 0.383
5.474IleVal: 5.474 ± 0.909
0.966IleTrp: 0.966 ± 0.457
3.381IleTyr: 3.381 ± 0.921
0.0IleXaa: 0.0 ± 0.0
Lys
3.864LysAla: 3.864 ± 0.644
0.483LysCys: 0.483 ± 0.284
2.576LysAsp: 2.576 ± 0.377
7.889LysGlu: 7.889 ± 2.829
2.576LysPhe: 2.576 ± 0.609
4.186LysGly: 4.186 ± 1.547
2.093LysHis: 2.093 ± 0.48
5.796LysIle: 5.796 ± 1.37
6.601LysLys: 6.601 ± 1.572
4.83LysLeu: 4.83 ± 1.022
2.254LysMet: 2.254 ± 0.916
1.771LysAsn: 1.771 ± 0.305
1.288LysPro: 1.288 ± 0.438
3.22LysGln: 3.22 ± 0.571
5.152LysArg: 5.152 ± 1.169
2.898LysSer: 2.898 ± 0.555
2.093LysThr: 2.093 ± 0.31
3.542LysVal: 3.542 ± 0.64
0.483LysTrp: 0.483 ± 0.205
2.576LysTyr: 2.576 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
5.635LeuAla: 5.635 ± 0.994
0.644LeuCys: 0.644 ± 0.327
2.576LeuAsp: 2.576 ± 0.632
5.796LeuGlu: 5.796 ± 1.208
3.703LeuPhe: 3.703 ± 0.317
3.542LeuGly: 3.542 ± 0.656
1.288LeuHis: 1.288 ± 0.565
4.991LeuIle: 4.991 ± 0.818
4.347LeuLys: 4.347 ± 0.683
6.923LeuLeu: 6.923 ± 1.181
1.932LeuMet: 1.932 ± 0.396
4.508LeuAsn: 4.508 ± 0.619
3.542LeuPro: 3.542 ± 0.554
3.22LeuGln: 3.22 ± 0.716
7.084LeuArg: 7.084 ± 1.151
6.923LeuSer: 6.923 ± 1.42
4.186LeuThr: 4.186 ± 0.544
5.474LeuVal: 5.474 ± 0.661
1.127LeuTrp: 1.127 ± 0.377
3.22LeuTyr: 3.22 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
1.932MetAla: 1.932 ± 0.886
0.483MetCys: 0.483 ± 0.244
2.254MetAsp: 2.254 ± 0.828
2.254MetGlu: 2.254 ± 0.425
1.449MetPhe: 1.449 ± 0.348
1.932MetGly: 1.932 ± 0.603
0.644MetHis: 0.644 ± 0.269
2.093MetIle: 2.093 ± 0.475
2.093MetLys: 2.093 ± 0.599
5.152MetLeu: 5.152 ± 0.901
2.737MetMet: 2.737 ± 0.445
1.932MetAsn: 1.932 ± 0.598
0.966MetPro: 0.966 ± 0.429
1.127MetGln: 1.127 ± 0.389
4.186MetArg: 4.186 ± 0.512
2.576MetSer: 2.576 ± 0.599
1.771MetThr: 1.771 ± 0.371
1.932MetVal: 1.932 ± 0.582
0.483MetTrp: 0.483 ± 0.284
1.127MetTyr: 1.127 ± 0.379
0.0MetXaa: 0.0 ± 0.0
Asn
3.703AsnAla: 3.703 ± 1.066
0.483AsnCys: 0.483 ± 0.209
3.22AsnAsp: 3.22 ± 0.824
4.669AsnGlu: 4.669 ± 0.833
2.415AsnPhe: 2.415 ± 0.694
3.22AsnGly: 3.22 ± 0.797
0.805AsnHis: 0.805 ± 0.179
4.508AsnIle: 4.508 ± 0.779
1.61AsnLys: 1.61 ± 0.366
1.771AsnLeu: 1.771 ± 0.442
2.254AsnMet: 2.254 ± 0.476
1.449AsnAsn: 1.449 ± 0.278
1.771AsnPro: 1.771 ± 0.445
1.61AsnGln: 1.61 ± 0.514
1.449AsnArg: 1.449 ± 0.344
2.415AsnSer: 2.415 ± 0.579
1.61AsnThr: 1.61 ± 0.417
4.025AsnVal: 4.025 ± 0.778
0.805AsnTrp: 0.805 ± 0.516
1.932AsnTyr: 1.932 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
1.932ProAla: 1.932 ± 0.708
0.161ProCys: 0.161 ± 0.123
2.898ProAsp: 2.898 ± 0.722
2.093ProGlu: 2.093 ± 0.574
1.449ProPhe: 1.449 ± 0.4
1.449ProGly: 1.449 ± 0.363
0.966ProHis: 0.966 ± 0.27
3.22ProIle: 3.22 ± 0.59
1.61ProLys: 1.61 ± 0.537
3.059ProLeu: 3.059 ± 0.69
1.127ProMet: 1.127 ± 0.477
3.059ProAsn: 3.059 ± 0.755
2.093ProPro: 2.093 ± 0.571
1.127ProGln: 1.127 ± 0.298
2.737ProArg: 2.737 ± 0.473
1.449ProSer: 1.449 ± 0.446
2.093ProThr: 2.093 ± 0.796
2.093ProVal: 2.093 ± 0.453
0.322ProTrp: 0.322 ± 0.311
1.61ProTyr: 1.61 ± 0.521
0.0ProXaa: 0.0 ± 0.0
Gln
2.898GlnAla: 2.898 ± 0.634
0.161GlnCys: 0.161 ± 0.123
2.093GlnAsp: 2.093 ± 0.626
2.737GlnGlu: 2.737 ± 0.396
1.449GlnPhe: 1.449 ± 0.418
3.059GlnGly: 3.059 ± 0.57
0.322GlnHis: 0.322 ± 0.189
2.898GlnIle: 2.898 ± 0.329
2.254GlnLys: 2.254 ± 0.937
2.093GlnLeu: 2.093 ± 0.951
1.61GlnMet: 1.61 ± 0.368
1.449GlnAsn: 1.449 ± 0.479
1.61GlnPro: 1.61 ± 0.774
0.966GlnGln: 0.966 ± 0.673
3.542GlnArg: 3.542 ± 0.545
1.61GlnSer: 1.61 ± 0.546
2.254GlnThr: 2.254 ± 0.765
2.576GlnVal: 2.576 ± 0.612
0.161GlnTrp: 0.161 ± 0.155
1.449GlnTyr: 1.449 ± 0.428
0.0GlnXaa: 0.0 ± 0.0
Arg
3.864ArgAla: 3.864 ± 0.919
1.288ArgCys: 1.288 ± 0.39
3.059ArgAsp: 3.059 ± 0.524
3.381ArgGlu: 3.381 ± 1.235
2.737ArgPhe: 2.737 ± 0.706
3.381ArgGly: 3.381 ± 0.577
0.644ArgHis: 0.644 ± 0.193
6.118ArgIle: 6.118 ± 0.778
3.381ArgLys: 3.381 ± 0.47
5.635ArgLeu: 5.635 ± 0.894
2.737ArgMet: 2.737 ± 0.731
3.864ArgAsn: 3.864 ± 0.829
1.449ArgPro: 1.449 ± 0.361
3.542ArgGln: 3.542 ± 0.714
4.508ArgArg: 4.508 ± 0.555
3.864ArgSer: 3.864 ± 0.683
3.22ArgThr: 3.22 ± 0.663
5.635ArgVal: 5.635 ± 1.08
1.127ArgTrp: 1.127 ± 0.412
3.059ArgTyr: 3.059 ± 0.408
0.0ArgXaa: 0.0 ± 0.0
Ser
3.381SerAla: 3.381 ± 0.737
0.644SerCys: 0.644 ± 0.223
4.025SerAsp: 4.025 ± 0.502
5.313SerGlu: 5.313 ± 0.696
1.771SerPhe: 1.771 ± 0.273
3.22SerGly: 3.22 ± 0.514
1.932SerHis: 1.932 ± 0.508
2.576SerIle: 2.576 ± 0.548
4.186SerLys: 4.186 ± 0.945
3.864SerLeu: 3.864 ± 0.451
3.381SerMet: 3.381 ± 0.47
1.61SerAsn: 1.61 ± 0.421
2.093SerPro: 2.093 ± 0.385
1.288SerGln: 1.288 ± 0.631
2.898SerArg: 2.898 ± 0.548
3.22SerSer: 3.22 ± 0.989
4.025SerThr: 4.025 ± 0.469
4.186SerVal: 4.186 ± 0.565
0.805SerTrp: 0.805 ± 0.285
1.61SerTyr: 1.61 ± 0.446
0.0SerXaa: 0.0 ± 0.0
Thr
3.059ThrAla: 3.059 ± 0.856
0.483ThrCys: 0.483 ± 0.272
2.737ThrAsp: 2.737 ± 0.563
4.991ThrGlu: 4.991 ± 0.546
1.288ThrPhe: 1.288 ± 0.376
1.932ThrGly: 1.932 ± 0.623
1.288ThrHis: 1.288 ± 0.626
4.186ThrIle: 4.186 ± 0.769
3.381ThrLys: 3.381 ± 0.679
5.635ThrLeu: 5.635 ± 1.076
2.093ThrMet: 2.093 ± 0.644
2.737ThrAsn: 2.737 ± 0.411
2.737ThrPro: 2.737 ± 0.57
2.093ThrGln: 2.093 ± 0.357
3.864ThrArg: 3.864 ± 0.544
3.381ThrSer: 3.381 ± 0.475
2.737ThrThr: 2.737 ± 0.638
2.737ThrVal: 2.737 ± 0.379
0.483ThrTrp: 0.483 ± 0.383
2.415ThrTyr: 2.415 ± 0.609
0.0ThrXaa: 0.0 ± 0.0
Val
3.381ValAla: 3.381 ± 1.045
0.644ValCys: 0.644 ± 0.332
4.508ValAsp: 4.508 ± 0.539
4.186ValGlu: 4.186 ± 0.827
3.542ValPhe: 3.542 ± 0.788
3.381ValGly: 3.381 ± 0.508
2.093ValHis: 2.093 ± 0.698
4.186ValIle: 4.186 ± 0.565
4.186ValLys: 4.186 ± 0.816
6.118ValLeu: 6.118 ± 1.007
2.254ValMet: 2.254 ± 0.631
2.737ValAsn: 2.737 ± 0.576
3.542ValPro: 3.542 ± 0.59
3.059ValGln: 3.059 ± 0.625
4.83ValArg: 4.83 ± 0.56
5.635ValSer: 5.635 ± 0.953
3.703ValThr: 3.703 ± 0.969
3.059ValVal: 3.059 ± 0.633
0.644ValTrp: 0.644 ± 0.416
2.576ValTyr: 2.576 ± 0.452
0.0ValXaa: 0.0 ± 0.0
Trp
0.644TrpAla: 0.644 ± 0.26
0.0TrpCys: 0.0 ± 0.0
0.966TrpAsp: 0.966 ± 0.339
0.805TrpGlu: 0.805 ± 0.346
0.644TrpPhe: 0.644 ± 0.193
0.644TrpGly: 0.644 ± 0.238
0.322TrpHis: 0.322 ± 0.194
1.127TrpIle: 1.127 ± 0.405
1.288TrpLys: 1.288 ± 0.374
0.966TrpLeu: 0.966 ± 0.371
0.322TrpMet: 0.322 ± 0.213
1.288TrpAsn: 1.288 ± 0.462
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.483TrpArg: 0.483 ± 0.258
0.483TrpSer: 0.483 ± 0.236
0.161TrpThr: 0.161 ± 0.134
1.127TrpVal: 1.127 ± 0.362
0.161TrpTrp: 0.161 ± 0.155
0.161TrpTyr: 0.161 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.898TyrAla: 2.898 ± 0.752
0.805TyrCys: 0.805 ± 0.282
2.254TyrAsp: 2.254 ± 0.688
2.415TyrGlu: 2.415 ± 0.551
1.288TyrPhe: 1.288 ± 0.496
3.703TyrGly: 3.703 ± 0.67
0.644TyrHis: 0.644 ± 0.234
2.576TyrIle: 2.576 ± 0.613
2.254TyrLys: 2.254 ± 0.502
3.059TyrLeu: 3.059 ± 0.802
1.61TyrMet: 1.61 ± 0.469
1.449TyrAsn: 1.449 ± 0.385
1.127TyrPro: 1.127 ± 0.402
1.932TyrGln: 1.932 ± 0.472
1.932TyrArg: 1.932 ± 0.568
2.254TyrSer: 2.254 ± 0.661
4.186TyrThr: 4.186 ± 0.757
2.415TyrVal: 2.415 ± 0.614
0.322TyrTrp: 0.322 ± 0.2
1.127TyrTyr: 1.127 ± 0.538
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (6212 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski