Amino acid dipepetide frequency for Metallosphaera turreted icosahedral virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.694AlaAla: 0.694 ± 0.474
0.694AlaCys: 0.694 ± 0.431
1.388AlaAsp: 1.388 ± 0.671
2.082AlaGlu: 2.082 ± 0.683
3.47AlaPhe: 3.47 ± 1.024
6.246AlaGly: 6.246 ± 2.052
0.347AlaHis: 0.347 ± 0.361
6.246AlaIle: 6.246 ± 1.841
2.082AlaLys: 2.082 ± 1.007
6.94AlaLeu: 6.94 ± 1.716
1.735AlaMet: 1.735 ± 0.531
2.082AlaAsn: 2.082 ± 0.848
3.817AlaPro: 3.817 ± 1.005
2.429AlaGln: 2.429 ± 0.837
3.817AlaArg: 3.817 ± 1.253
5.205AlaSer: 5.205 ± 1.802
4.164AlaThr: 4.164 ± 1.585
5.552AlaVal: 5.552 ± 1.312
1.041AlaTrp: 1.041 ± 0.625
2.082AlaTyr: 2.082 ± 0.783
0.0AlaXaa: 0.0 ± 0.0
Cys
0.694CysAla: 0.694 ± 0.443
0.347CysCys: 0.347 ± 0.392
0.347CysAsp: 0.347 ± 0.317
0.347CysGlu: 0.347 ± 0.308
0.694CysPhe: 0.694 ± 0.467
1.735CysGly: 1.735 ± 0.685
0.0CysHis: 0.0 ± 0.0
0.347CysIle: 0.347 ± 0.379
0.347CysLys: 0.347 ± 0.391
0.347CysLeu: 0.347 ± 0.333
0.0CysMet: 0.0 ± 0.0
0.347CysAsn: 0.347 ± 0.35
1.388CysPro: 1.388 ± 0.644
1.735CysGln: 1.735 ± 0.644
0.347CysArg: 0.347 ± 0.365
1.041CysSer: 1.041 ± 0.652
0.347CysThr: 0.347 ± 0.317
1.735CysVal: 1.735 ± 0.9
0.694CysTrp: 0.694 ± 0.497
0.347CysTyr: 0.347 ± 0.392
0.0CysXaa: 0.0 ± 0.0
Asp
2.429AspAla: 2.429 ± 1.562
1.388AspCys: 1.388 ± 0.829
3.817AspAsp: 3.817 ± 2.057
2.082AspGlu: 2.082 ± 0.96
2.429AspPhe: 2.429 ± 1.193
2.776AspGly: 2.776 ± 1.279
1.388AspHis: 1.388 ± 0.652
2.082AspIle: 2.082 ± 0.756
1.041AspLys: 1.041 ± 0.805
4.511AspLeu: 4.511 ± 1.157
2.429AspMet: 2.429 ± 0.737
0.347AspAsn: 0.347 ± 0.299
2.429AspPro: 2.429 ± 0.949
1.388AspGln: 1.388 ± 0.648
1.735AspArg: 1.735 ± 1.011
1.735AspSer: 1.735 ± 0.709
1.041AspThr: 1.041 ± 0.675
5.899AspVal: 5.899 ± 1.886
0.0AspTrp: 0.0 ± 0.0
2.082AspTyr: 2.082 ± 0.813
0.0AspXaa: 0.0 ± 0.0
Glu
4.511GluAla: 4.511 ± 1.424
0.694GluCys: 0.694 ± 0.497
1.388GluAsp: 1.388 ± 0.716
7.287GluGlu: 7.287 ± 2.92
2.429GluPhe: 2.429 ± 0.885
4.164GluGly: 4.164 ± 1.182
0.694GluHis: 0.694 ± 0.49
2.082GluIle: 2.082 ± 0.865
2.082GluLys: 2.082 ± 0.997
5.205GluLeu: 5.205 ± 1.393
3.123GluMet: 3.123 ± 1.346
1.388GluAsn: 1.388 ± 0.48
1.735GluPro: 1.735 ± 0.694
2.082GluGln: 2.082 ± 0.914
0.0GluArg: 0.0 ± 0.0
3.123GluSer: 3.123 ± 0.839
2.776GluThr: 2.776 ± 1.301
6.246GluVal: 6.246 ± 1.668
1.041GluTrp: 1.041 ± 0.518
3.123GluTyr: 3.123 ± 0.8
0.0GluXaa: 0.0 ± 0.0
Phe
1.388PheAla: 1.388 ± 0.811
1.388PheCys: 1.388 ± 0.852
0.694PheAsp: 0.694 ± 0.609
1.735PheGlu: 1.735 ± 0.685
1.388PhePhe: 1.388 ± 0.825
0.694PheGly: 0.694 ± 0.449
0.694PheHis: 0.694 ± 0.497
2.429PheIle: 2.429 ± 0.777
0.0PheLys: 0.0 ± 0.0
4.164PheLeu: 4.164 ± 1.235
0.0PheMet: 0.0 ± 0.0
0.347PheAsn: 0.347 ± 0.322
0.694PhePro: 0.694 ± 0.418
0.694PheGln: 0.694 ± 0.489
0.347PheArg: 0.347 ± 0.303
3.817PheSer: 3.817 ± 0.665
4.164PheThr: 4.164 ± 1.29
2.776PheVal: 2.776 ± 0.808
0.0PheTrp: 0.0 ± 0.0
0.347PheTyr: 0.347 ± 0.349
0.0PheXaa: 0.0 ± 0.0
Gly
7.287GlyAla: 7.287 ± 2.201
1.735GlyCys: 1.735 ± 0.737
1.735GlyAsp: 1.735 ± 0.799
3.47GlyGlu: 3.47 ± 1.423
1.388GlyPhe: 1.388 ± 0.552
17.696GlyGly: 17.696 ± 9.991
0.347GlyHis: 0.347 ± 0.308
3.817GlyIle: 3.817 ± 0.968
5.899GlyLys: 5.899 ± 1.65
4.164GlyLeu: 4.164 ± 1.418
0.694GlyMet: 0.694 ± 0.42
5.205GlyAsn: 5.205 ± 1.654
3.817GlyPro: 3.817 ± 1.515
5.899GlyGln: 5.899 ± 1.693
2.429GlyArg: 2.429 ± 0.732
6.246GlySer: 6.246 ± 2.347
11.103GlyThr: 11.103 ± 3.344
9.022GlyVal: 9.022 ± 1.966
0.0GlyTrp: 0.0 ± 0.0
5.205GlyTyr: 5.205 ± 1.436
0.0GlyXaa: 0.0 ± 0.0
His
1.041HisAla: 1.041 ± 0.563
0.0HisCys: 0.0 ± 0.0
1.041HisAsp: 1.041 ± 0.692
1.041HisGlu: 1.041 ± 0.492
0.0HisPhe: 0.0 ± 0.0
0.694HisGly: 0.694 ± 0.441
0.694HisHis: 0.694 ± 0.487
0.347HisIle: 0.347 ± 0.361
0.347HisLys: 0.347 ± 0.33
0.347HisLeu: 0.347 ± 0.349
1.041HisMet: 1.041 ± 0.589
0.0HisAsn: 0.0 ± 0.0
0.694HisPro: 0.694 ± 0.698
0.347HisGln: 0.347 ± 0.379
0.0HisArg: 0.0 ± 0.0
1.041HisSer: 1.041 ± 0.557
0.347HisThr: 0.347 ± 0.365
0.694HisVal: 0.694 ± 0.568
0.694HisTrp: 0.694 ± 0.433
0.347HisTyr: 0.347 ± 0.308
0.0HisXaa: 0.0 ± 0.0
Ile
3.47IleAla: 3.47 ± 1.197
0.694IleCys: 0.694 ± 0.467
3.47IleAsp: 3.47 ± 1.394
3.817IleGlu: 3.817 ± 0.987
1.735IlePhe: 1.735 ± 0.987
2.429IleGly: 2.429 ± 1.059
0.694IleHis: 0.694 ± 0.451
3.817IleIle: 3.817 ± 1.113
2.082IleLys: 2.082 ± 1.199
5.899IleLeu: 5.899 ± 1.284
1.388IleMet: 1.388 ± 0.772
1.388IleAsn: 1.388 ± 0.651
3.47IlePro: 3.47 ± 1.25
4.858IleGln: 4.858 ± 0.841
3.47IleArg: 3.47 ± 1.125
2.776IleSer: 2.776 ± 0.747
3.123IleThr: 3.123 ± 1.078
3.817IleVal: 3.817 ± 1.531
0.0IleTrp: 0.0 ± 0.0
3.123IleTyr: 3.123 ± 0.965
0.0IleXaa: 0.0 ± 0.0
Lys
1.735LysAla: 1.735 ± 0.966
0.347LysCys: 0.347 ± 0.365
2.776LysAsp: 2.776 ± 1.037
4.858LysGlu: 4.858 ± 1.621
0.694LysPhe: 0.694 ± 0.698
3.47LysGly: 3.47 ± 0.902
0.694LysHis: 0.694 ± 0.47
2.776LysIle: 2.776 ± 0.838
4.858LysLys: 4.858 ± 1.811
3.47LysLeu: 3.47 ± 1.23
1.041LysMet: 1.041 ± 0.637
0.694LysAsn: 0.694 ± 0.506
1.388LysPro: 1.388 ± 0.571
0.347LysGln: 0.347 ± 0.35
3.817LysArg: 3.817 ± 1.149
3.817LysSer: 3.817 ± 1.116
3.123LysThr: 3.123 ± 1.109
4.858LysVal: 4.858 ± 1.416
0.694LysTrp: 0.694 ± 0.451
1.388LysTyr: 1.388 ± 0.643
0.0LysXaa: 0.0 ± 0.0
Leu
4.164LeuAla: 4.164 ± 1.093
0.0LeuCys: 0.0 ± 0.0
3.123LeuAsp: 3.123 ± 1.27
5.552LeuGlu: 5.552 ± 1.471
0.694LeuPhe: 0.694 ± 0.478
7.287LeuGly: 7.287 ± 2.093
0.694LeuHis: 0.694 ± 0.609
4.511LeuIle: 4.511 ± 1.045
3.817LeuLys: 3.817 ± 1.479
5.552LeuLeu: 5.552 ± 1.381
2.429LeuMet: 2.429 ± 0.989
3.47LeuAsn: 3.47 ± 1.131
3.123LeuPro: 3.123 ± 0.768
2.776LeuGln: 2.776 ± 0.79
7.287LeuArg: 7.287 ± 2.085
8.328LeuSer: 8.328 ± 1.477
5.552LeuThr: 5.552 ± 1.378
7.634LeuVal: 7.634 ± 1.407
1.041LeuTrp: 1.041 ± 0.665
1.041LeuTyr: 1.041 ± 0.546
0.0LeuXaa: 0.0 ± 0.0
Met
1.388MetAla: 1.388 ± 0.716
0.0MetCys: 0.0 ± 0.0
2.082MetAsp: 2.082 ± 0.825
1.735MetGlu: 1.735 ± 1.021
0.694MetPhe: 0.694 ± 0.454
1.735MetGly: 1.735 ± 0.96
0.0MetHis: 0.0 ± 0.0
2.429MetIle: 2.429 ± 0.993
2.429MetLys: 2.429 ± 0.875
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
1.041MetAsn: 1.041 ± 0.501
1.041MetPro: 1.041 ± 0.603
0.694MetGln: 0.694 ± 0.429
2.082MetArg: 2.082 ± 0.888
4.164MetSer: 4.164 ± 1.155
1.388MetThr: 1.388 ± 0.815
2.082MetVal: 2.082 ± 0.986
0.347MetTrp: 0.347 ± 0.349
1.041MetTyr: 1.041 ± 0.597
0.0MetXaa: 0.0 ± 0.0
Asn
1.388AsnAla: 1.388 ± 0.598
0.347AsnCys: 0.347 ± 0.317
1.735AsnAsp: 1.735 ± 0.851
0.694AsnGlu: 0.694 ± 0.47
0.0AsnPhe: 0.0 ± 0.0
4.164AsnGly: 4.164 ± 1.311
0.0AsnHis: 0.0 ± 0.0
1.735AsnIle: 1.735 ± 0.757
0.694AsnLys: 0.694 ± 0.683
1.041AsnLeu: 1.041 ± 0.582
1.041AsnMet: 1.041 ± 0.551
0.347AsnAsn: 0.347 ± 0.317
3.47AsnPro: 3.47 ± 0.899
2.082AsnGln: 2.082 ± 0.731
1.388AsnArg: 1.388 ± 0.697
2.429AsnSer: 2.429 ± 0.655
2.082AsnThr: 2.082 ± 0.638
3.123AsnVal: 3.123 ± 0.868
1.735AsnTrp: 1.735 ± 0.973
1.041AsnTyr: 1.041 ± 0.62
0.0AsnXaa: 0.0 ± 0.0
Pro
2.082ProAla: 2.082 ± 0.652
0.694ProCys: 0.694 ± 0.454
1.388ProAsp: 1.388 ± 0.588
3.47ProGlu: 3.47 ± 1.286
1.388ProPhe: 1.388 ± 0.476
3.47ProGly: 3.47 ± 1.24
0.0ProHis: 0.0 ± 0.0
3.123ProIle: 3.123 ± 1.079
2.429ProLys: 2.429 ± 1.137
4.511ProLeu: 4.511 ± 0.951
1.735ProMet: 1.735 ± 1.035
1.041ProAsn: 1.041 ± 0.454
4.858ProPro: 4.858 ± 2.171
3.123ProGln: 3.123 ± 1.061
1.388ProArg: 1.388 ± 0.836
6.593ProSer: 6.593 ± 2.183
7.981ProThr: 7.981 ± 2.937
3.817ProVal: 3.817 ± 1.231
1.388ProTrp: 1.388 ± 0.584
2.429ProTyr: 2.429 ± 0.794
0.0ProXaa: 0.0 ± 0.0
Gln
2.429GlnAla: 2.429 ± 1.012
0.694GlnCys: 0.694 ± 0.454
2.429GlnAsp: 2.429 ± 0.779
4.164GlnGlu: 4.164 ± 1.219
1.041GlnPhe: 1.041 ± 0.427
7.981GlnGly: 7.981 ± 2.294
0.0GlnHis: 0.0 ± 0.0
2.429GlnIle: 2.429 ± 0.855
3.47GlnLys: 3.47 ± 1.256
3.123GlnLeu: 3.123 ± 0.991
1.041GlnMet: 1.041 ± 0.714
1.735GlnAsn: 1.735 ± 0.98
1.388GlnPro: 1.388 ± 0.713
2.429GlnGln: 2.429 ± 1.019
1.388GlnArg: 1.388 ± 0.656
3.123GlnSer: 3.123 ± 0.891
4.858GlnThr: 4.858 ± 1.8
3.817GlnVal: 3.817 ± 0.851
0.694GlnTrp: 0.694 ± 0.479
2.776GlnTyr: 2.776 ± 1.131
0.0GlnXaa: 0.0 ± 0.0
Arg
2.429ArgAla: 2.429 ± 0.968
0.347ArgCys: 0.347 ± 0.391
3.47ArgAsp: 3.47 ± 1.039
2.429ArgGlu: 2.429 ± 1.469
0.694ArgPhe: 0.694 ± 0.478
3.123ArgGly: 3.123 ± 1.156
1.388ArgHis: 1.388 ± 0.967
2.429ArgIle: 2.429 ± 1.006
2.776ArgLys: 2.776 ± 1.186
3.47ArgLeu: 3.47 ± 1.002
2.082ArgMet: 2.082 ± 0.751
0.694ArgAsn: 0.694 ± 0.436
1.735ArgPro: 1.735 ± 0.826
2.429ArgGln: 2.429 ± 1.032
3.123ArgArg: 3.123 ± 1.188
1.388ArgSer: 1.388 ± 0.786
2.429ArgThr: 2.429 ± 1.267
3.817ArgVal: 3.817 ± 1.55
0.694ArgTrp: 0.694 ± 0.513
1.041ArgTyr: 1.041 ± 0.832
0.0ArgXaa: 0.0 ± 0.0
Ser
6.94SerAla: 6.94 ± 2.412
1.041SerCys: 1.041 ± 0.58
1.388SerAsp: 1.388 ± 0.526
3.47SerGlu: 3.47 ± 0.856
1.388SerPhe: 1.388 ± 0.771
9.368SerGly: 9.368 ± 2.452
1.041SerHis: 1.041 ± 0.527
3.47SerIle: 3.47 ± 1.193
3.123SerLys: 3.123 ± 0.99
6.593SerLeu: 6.593 ± 1.793
2.429SerMet: 2.429 ± 0.863
4.164SerAsn: 4.164 ± 1.628
5.552SerPro: 5.552 ± 1.47
5.205SerGln: 5.205 ± 1.7
2.082SerArg: 2.082 ± 0.9
6.246SerSer: 6.246 ± 2.003
7.981SerThr: 7.981 ± 3.883
4.164SerVal: 4.164 ± 1.021
0.347SerTrp: 0.347 ± 0.317
2.082SerTyr: 2.082 ± 0.752
0.0SerXaa: 0.0 ± 0.0
Thr
6.246ThrAla: 6.246 ± 2.013
1.388ThrCys: 1.388 ± 0.754
1.735ThrAsp: 1.735 ± 0.777
1.735ThrGlu: 1.735 ± 0.751
2.776ThrPhe: 2.776 ± 1.118
7.981ThrGly: 7.981 ± 3.152
0.347ThrHis: 0.347 ± 0.328
4.511ThrIle: 4.511 ± 1.497
2.429ThrLys: 2.429 ± 0.569
4.858ThrLeu: 4.858 ± 1.78
1.388ThrMet: 1.388 ± 0.621
1.735ThrAsn: 1.735 ± 0.994
7.981ThrPro: 7.981 ± 1.824
5.205ThrGln: 5.205 ± 2.049
2.429ThrArg: 2.429 ± 1.031
8.675ThrSer: 8.675 ± 3.148
11.45ThrThr: 11.45 ± 4.823
4.858ThrVal: 4.858 ± 1.425
2.082ThrTrp: 2.082 ± 0.762
2.429ThrTyr: 2.429 ± 0.829
0.0ThrXaa: 0.0 ± 0.0
Val
7.634ValAla: 7.634 ± 2.103
1.041ValCys: 1.041 ± 0.553
6.94ValAsp: 6.94 ± 1.501
3.817ValGlu: 3.817 ± 1.261
3.123ValPhe: 3.123 ± 1.122
5.899ValGly: 5.899 ± 1.183
1.388ValHis: 1.388 ± 0.645
5.899ValIle: 5.899 ± 1.305
3.47ValLys: 3.47 ± 0.923
7.634ValLeu: 7.634 ± 1.857
1.388ValMet: 1.388 ± 0.591
2.429ValAsn: 2.429 ± 0.909
5.552ValPro: 5.552 ± 1.154
4.164ValGln: 4.164 ± 1.446
1.735ValArg: 1.735 ± 0.806
5.552ValSer: 5.552 ± 1.382
4.858ValThr: 4.858 ± 1.475
5.899ValVal: 5.899 ± 1.676
1.388ValTrp: 1.388 ± 0.702
6.94ValTyr: 6.94 ± 2.202
0.0ValXaa: 0.0 ± 0.0
Trp
0.347TrpAla: 0.347 ± 0.303
0.0TrpCys: 0.0 ± 0.0
0.347TrpAsp: 0.347 ± 0.391
1.041TrpGlu: 1.041 ± 0.548
0.0TrpPhe: 0.0 ± 0.0
2.429TrpGly: 2.429 ± 0.893
0.0TrpHis: 0.0 ± 0.0
0.347TrpIle: 0.347 ± 0.431
1.041TrpLys: 1.041 ± 0.504
1.388TrpLeu: 1.388 ± 0.774
0.347TrpMet: 0.347 ± 0.349
1.041TrpAsn: 1.041 ± 0.584
0.347TrpPro: 0.347 ± 0.395
0.347TrpGln: 0.347 ± 0.303
0.347TrpArg: 0.347 ± 0.379
0.694TrpSer: 0.694 ± 0.521
0.694TrpThr: 0.694 ± 0.482
1.388TrpVal: 1.388 ± 0.62
0.347TrpTrp: 0.347 ± 0.392
1.735TrpTyr: 1.735 ± 0.942
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.817TyrAla: 3.817 ± 1.189
0.347TyrCys: 0.347 ± 0.379
1.735TyrAsp: 1.735 ± 0.881
0.694TyrGlu: 0.694 ± 0.492
1.735TyrPhe: 1.735 ± 0.678
3.817TyrGly: 3.817 ± 1.585
0.347TyrHis: 0.347 ± 0.317
0.694TyrIle: 0.694 ± 0.529
2.429TyrLys: 2.429 ± 1.193
4.858TyrLeu: 4.858 ± 1.341
0.694TyrMet: 0.694 ± 0.404
1.041TyrAsn: 1.041 ± 0.491
2.429TyrPro: 2.429 ± 0.853
2.776TyrGln: 2.776 ± 1.025
2.776TyrArg: 2.776 ± 1.31
2.082TyrSer: 2.082 ± 0.655
2.776TyrThr: 2.776 ± 1.069
5.552TyrVal: 5.552 ± 1.592
0.0TyrTrp: 0.0 ± 0.0
1.735TyrTyr: 1.735 ± 1.003
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 21 proteins (2883 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski