Amino acid dipepetide frequency for Thermoproteus tenax spherical virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.726AlaAla: 10.726 ± 1.564
0.975AlaCys: 0.975 ± 0.573
2.113AlaAsp: 2.113 ± 0.652
5.363AlaGlu: 5.363 ± 1.131
4.713AlaPhe: 4.713 ± 0.891
5.851AlaGly: 5.851 ± 1.021
0.813AlaHis: 0.813 ± 0.386
7.964AlaIle: 7.964 ± 1.386
2.925AlaLys: 2.925 ± 0.61
14.627AlaLeu: 14.627 ± 2.433
3.738AlaMet: 3.738 ± 0.87
2.763AlaAsn: 2.763 ± 0.885
3.738AlaPro: 3.738 ± 0.776
1.3AlaGln: 1.3 ± 0.485
2.275AlaArg: 2.275 ± 0.739
5.038AlaSer: 5.038 ± 0.828
4.063AlaThr: 4.063 ± 0.837
10.889AlaVal: 10.889 ± 1.121
0.975AlaTrp: 0.975 ± 0.304
5.851AlaTyr: 5.851 ± 1.067
0.0AlaXaa: 0.0 ± 0.0
Cys
0.813CysAla: 0.813 ± 0.372
0.163CysCys: 0.163 ± 0.175
1.138CysAsp: 1.138 ± 0.414
0.325CysGlu: 0.325 ± 0.242
0.325CysPhe: 0.325 ± 0.278
1.138CysGly: 1.138 ± 0.527
0.0CysHis: 0.0 ± 0.0
0.813CysIle: 0.813 ± 0.387
0.488CysLys: 0.488 ± 0.402
0.65CysLeu: 0.65 ± 0.282
0.163CysMet: 0.163 ± 0.183
0.488CysAsn: 0.488 ± 0.269
0.813CysPro: 0.813 ± 0.493
0.488CysGln: 0.488 ± 0.355
0.488CysArg: 0.488 ± 0.27
0.488CysSer: 0.488 ± 0.339
0.975CysThr: 0.975 ± 0.306
1.95CysVal: 1.95 ± 0.612
0.0CysTrp: 0.0 ± 0.0
0.488CysTyr: 0.488 ± 0.274
0.0CysXaa: 0.0 ± 0.0
Asp
2.6AspAla: 2.6 ± 0.752
0.325AspCys: 0.325 ± 0.217
2.275AspAsp: 2.275 ± 0.783
1.138AspGlu: 1.138 ± 0.391
0.813AspPhe: 0.813 ± 0.429
3.738AspGly: 3.738 ± 1.027
0.813AspHis: 0.813 ± 0.359
2.438AspIle: 2.438 ± 0.649
1.625AspLys: 1.625 ± 0.607
3.575AspLeu: 3.575 ± 0.669
1.138AspMet: 1.138 ± 0.408
1.138AspAsn: 1.138 ± 0.359
1.788AspPro: 1.788 ± 0.477
0.65AspGln: 0.65 ± 0.302
1.788AspArg: 1.788 ± 0.568
1.95AspSer: 1.95 ± 0.459
2.113AspThr: 2.113 ± 0.593
2.275AspVal: 2.275 ± 0.51
0.488AspTrp: 0.488 ± 0.24
2.113AspTyr: 2.113 ± 0.466
0.0AspXaa: 0.0 ± 0.0
Glu
5.038GluAla: 5.038 ± 0.911
0.975GluCys: 0.975 ± 0.437
1.625GluAsp: 1.625 ± 0.545
1.788GluGlu: 1.788 ± 0.637
1.625GluPhe: 1.625 ± 0.489
3.088GluGly: 3.088 ± 0.754
0.813GluHis: 0.813 ± 0.38
3.088GluIle: 3.088 ± 0.83
2.6GluLys: 2.6 ± 0.932
3.413GluLeu: 3.413 ± 0.836
1.138GluMet: 1.138 ± 0.487
0.975GluAsn: 0.975 ± 0.452
0.975GluPro: 0.975 ± 0.439
0.975GluGln: 0.975 ± 0.364
1.625GluArg: 1.625 ± 0.466
1.138GluSer: 1.138 ± 0.337
1.3GluThr: 1.3 ± 0.47
2.925GluVal: 2.925 ± 0.793
0.975GluTrp: 0.975 ± 0.375
2.763GluTyr: 2.763 ± 0.821
0.0GluXaa: 0.0 ± 0.0
Phe
2.925PheAla: 2.925 ± 0.676
0.163PheCys: 0.163 ± 0.189
1.3PheAsp: 1.3 ± 0.436
0.65PheGlu: 0.65 ± 0.303
1.625PhePhe: 1.625 ± 0.455
3.575PheGly: 3.575 ± 0.711
0.163PheHis: 0.163 ± 0.143
2.438PheIle: 2.438 ± 0.604
1.788PheLys: 1.788 ± 0.559
2.925PheLeu: 2.925 ± 0.691
1.3PheMet: 1.3 ± 0.378
1.625PheAsn: 1.625 ± 0.487
1.138PhePro: 1.138 ± 0.388
0.813PheGln: 0.813 ± 0.315
1.3PheArg: 1.3 ± 0.432
2.6PheSer: 2.6 ± 0.655
2.763PheThr: 2.763 ± 0.707
3.413PheVal: 3.413 ± 0.582
0.325PheTrp: 0.325 ± 0.203
2.6PheTyr: 2.6 ± 0.785
0.0PheXaa: 0.0 ± 0.0
Gly
5.688GlyAla: 5.688 ± 1.131
0.325GlyCys: 0.325 ± 0.235
3.088GlyAsp: 3.088 ± 0.804
1.625GlyGlu: 1.625 ± 0.525
2.113GlyPhe: 2.113 ± 0.669
4.876GlyGly: 4.876 ± 1.0
1.463GlyHis: 1.463 ± 0.362
5.038GlyIle: 5.038 ± 0.847
4.063GlyLys: 4.063 ± 0.846
5.851GlyLeu: 5.851 ± 0.794
2.6GlyMet: 2.6 ± 0.673
3.25GlyAsn: 3.25 ± 0.682
3.738GlyPro: 3.738 ± 0.691
1.95GlyGln: 1.95 ± 0.809
2.925GlyArg: 2.925 ± 0.678
4.551GlySer: 4.551 ± 0.96
5.038GlyThr: 5.038 ± 1.299
6.176GlyVal: 6.176 ± 0.917
1.788GlyTrp: 1.788 ± 0.479
6.013GlyTyr: 6.013 ± 1.055
0.163GlyXaa: 0.163 ± 0.167
His
1.463HisAla: 1.463 ± 0.412
0.0HisCys: 0.0 ± 0.0
0.65HisAsp: 0.65 ± 0.295
0.65HisGlu: 0.65 ± 0.346
0.163HisPhe: 0.163 ± 0.164
0.975HisGly: 0.975 ± 0.344
0.163HisHis: 0.163 ± 0.143
1.625HisIle: 1.625 ± 0.389
0.975HisLys: 0.975 ± 0.325
1.3HisLeu: 1.3 ± 0.497
0.65HisMet: 0.65 ± 0.299
0.0HisAsn: 0.0 ± 0.0
0.813HisPro: 0.813 ± 0.344
0.163HisGln: 0.163 ± 0.145
0.813HisArg: 0.813 ± 0.347
1.138HisSer: 1.138 ± 0.341
0.325HisThr: 0.325 ± 0.198
1.95HisVal: 1.95 ± 0.481
0.0HisTrp: 0.0 ± 0.0
0.975HisTyr: 0.975 ± 0.435
0.0HisXaa: 0.0 ± 0.0
Ile
7.801IleAla: 7.801 ± 1.575
1.138IleCys: 1.138 ± 0.327
2.275IleAsp: 2.275 ± 0.57
3.413IleGlu: 3.413 ± 0.751
2.275IlePhe: 2.275 ± 0.736
5.526IleGly: 5.526 ± 1.357
2.113IleHis: 2.113 ± 0.575
7.314IleIle: 7.314 ± 1.483
3.901IleLys: 3.901 ± 0.786
6.501IleLeu: 6.501 ± 0.796
1.95IleMet: 1.95 ± 0.541
2.275IleAsn: 2.275 ± 0.64
5.363IlePro: 5.363 ± 0.764
2.113IleGln: 2.113 ± 0.711
2.925IleArg: 2.925 ± 0.786
5.201IleSer: 5.201 ± 0.809
4.388IleThr: 4.388 ± 0.821
7.314IleVal: 7.314 ± 0.818
1.3IleTrp: 1.3 ± 0.525
5.688IleTyr: 5.688 ± 0.777
0.0IleXaa: 0.0 ± 0.0
Lys
4.063LysAla: 4.063 ± 0.935
1.3LysCys: 1.3 ± 0.419
1.3LysAsp: 1.3 ± 0.502
3.413LysGlu: 3.413 ± 0.942
1.3LysPhe: 1.3 ± 0.469
3.088LysGly: 3.088 ± 0.544
0.325LysHis: 0.325 ± 0.211
3.088LysIle: 3.088 ± 0.711
1.3LysLys: 1.3 ± 0.502
3.088LysLeu: 3.088 ± 0.687
1.788LysMet: 1.788 ± 0.444
0.65LysAsn: 0.65 ± 0.309
2.763LysPro: 2.763 ± 0.787
0.163LysGln: 0.163 ± 0.141
2.438LysArg: 2.438 ± 0.566
2.113LysSer: 2.113 ± 0.656
2.6LysThr: 2.6 ± 0.588
2.925LysVal: 2.925 ± 0.765
0.65LysTrp: 0.65 ± 0.306
2.925LysTyr: 2.925 ± 0.815
0.0LysXaa: 0.0 ± 0.0
Leu
14.139LeuAla: 14.139 ± 2.219
0.975LeuCys: 0.975 ± 0.452
3.088LeuAsp: 3.088 ± 1.051
3.413LeuGlu: 3.413 ± 1.063
3.088LeuPhe: 3.088 ± 0.673
6.338LeuGly: 6.338 ± 1.412
1.3LeuHis: 1.3 ± 0.437
8.939LeuIle: 8.939 ± 1.466
3.413LeuLys: 3.413 ± 0.559
6.826LeuLeu: 6.826 ± 1.322
2.275LeuMet: 2.275 ± 0.568
2.438LeuAsn: 2.438 ± 0.79
5.201LeuPro: 5.201 ± 0.874
0.813LeuGln: 0.813 ± 0.318
3.738LeuArg: 3.738 ± 1.124
6.826LeuSer: 6.826 ± 1.091
6.501LeuThr: 6.501 ± 0.926
7.476LeuVal: 7.476 ± 0.961
1.788LeuTrp: 1.788 ± 0.497
6.501LeuTyr: 6.501 ± 0.973
0.325LeuXaa: 0.325 ± 0.215
Met
4.063MetAla: 4.063 ± 0.863
0.163MetCys: 0.163 ± 0.162
1.3MetAsp: 1.3 ± 0.454
1.463MetGlu: 1.463 ± 0.468
1.138MetPhe: 1.138 ± 0.466
1.788MetGly: 1.788 ± 0.483
0.163MetHis: 0.163 ± 0.156
1.788MetIle: 1.788 ± 0.512
0.65MetLys: 0.65 ± 0.311
2.438MetLeu: 2.438 ± 0.702
0.65MetMet: 0.65 ± 0.373
0.813MetAsn: 0.813 ± 0.451
2.925MetPro: 2.925 ± 0.657
0.813MetGln: 0.813 ± 0.389
2.113MetArg: 2.113 ± 0.509
1.138MetSer: 1.138 ± 0.372
2.6MetThr: 2.6 ± 0.505
2.438MetVal: 2.438 ± 0.489
0.325MetTrp: 0.325 ± 0.221
1.138MetTyr: 1.138 ± 0.342
0.325MetXaa: 0.325 ± 0.333
Asn
3.25AsnAla: 3.25 ± 0.636
0.325AsnCys: 0.325 ± 0.204
0.65AsnAsp: 0.65 ± 0.284
0.488AsnGlu: 0.488 ± 0.329
0.975AsnPhe: 0.975 ± 0.368
2.763AsnGly: 2.763 ± 0.794
0.325AsnHis: 0.325 ± 0.244
2.113AsnIle: 2.113 ± 0.489
1.95AsnLys: 1.95 ± 0.56
1.788AsnLeu: 1.788 ± 0.455
2.6AsnMet: 2.6 ± 0.663
1.788AsnAsn: 1.788 ± 0.474
2.113AsnPro: 2.113 ± 0.705
1.138AsnGln: 1.138 ± 0.449
0.163AsnArg: 0.163 ± 0.151
2.113AsnSer: 2.113 ± 0.702
3.088AsnThr: 3.088 ± 0.798
2.925AsnVal: 2.925 ± 0.75
0.65AsnTrp: 0.65 ± 0.431
2.113AsnTyr: 2.113 ± 0.777
0.0AsnXaa: 0.0 ± 0.0
Pro
4.063ProAla: 4.063 ± 0.751
0.325ProCys: 0.325 ± 0.245
2.6ProAsp: 2.6 ± 0.777
1.463ProGlu: 1.463 ± 0.435
1.463ProPhe: 1.463 ± 0.675
2.6ProGly: 2.6 ± 0.756
0.813ProHis: 0.813 ± 0.309
5.526ProIle: 5.526 ± 0.956
2.6ProLys: 2.6 ± 0.63
5.038ProLeu: 5.038 ± 0.928
1.95ProMet: 1.95 ± 0.514
1.625ProAsn: 1.625 ± 0.689
1.788ProPro: 1.788 ± 0.62
1.95ProGln: 1.95 ± 0.606
1.95ProArg: 1.95 ± 0.61
3.575ProSer: 3.575 ± 0.74
4.063ProThr: 4.063 ± 0.838
4.063ProVal: 4.063 ± 0.796
0.975ProTrp: 0.975 ± 0.355
2.438ProTyr: 2.438 ± 0.637
0.0ProXaa: 0.0 ± 0.0
Gln
1.95GlnAla: 1.95 ± 0.628
0.488GlnCys: 0.488 ± 0.296
0.163GlnAsp: 0.163 ± 0.151
0.488GlnGlu: 0.488 ± 0.28
1.95GlnPhe: 1.95 ± 0.685
0.65GlnGly: 0.65 ± 0.299
0.325GlnHis: 0.325 ± 0.228
3.088GlnIle: 3.088 ± 0.581
0.488GlnLys: 0.488 ± 0.252
1.95GlnLeu: 1.95 ± 0.533
0.325GlnMet: 0.325 ± 0.307
0.488GlnAsn: 0.488 ± 0.295
1.3GlnPro: 1.3 ± 0.519
0.975GlnGln: 0.975 ± 0.532
0.975GlnArg: 0.975 ± 0.412
1.3GlnSer: 1.3 ± 0.564
1.788GlnThr: 1.788 ± 0.575
1.788GlnVal: 1.788 ± 0.477
0.163GlnTrp: 0.163 ± 0.156
1.3GlnTyr: 1.3 ± 0.555
0.0GlnXaa: 0.0 ± 0.0
Arg
3.575ArgAla: 3.575 ± 0.682
0.325ArgCys: 0.325 ± 0.24
1.3ArgAsp: 1.3 ± 0.448
2.438ArgGlu: 2.438 ± 0.722
1.3ArgPhe: 1.3 ± 0.371
1.95ArgGly: 1.95 ± 0.63
1.625ArgHis: 1.625 ± 0.504
2.925ArgIle: 2.925 ± 0.732
2.113ArgLys: 2.113 ± 0.718
5.038ArgLeu: 5.038 ± 1.246
0.488ArgMet: 0.488 ± 0.276
1.463ArgAsn: 1.463 ± 0.471
1.138ArgPro: 1.138 ± 0.42
1.138ArgGln: 1.138 ± 0.411
2.763ArgArg: 2.763 ± 0.93
1.625ArgSer: 1.625 ± 0.6
2.113ArgThr: 2.113 ± 0.481
3.088ArgVal: 3.088 ± 0.712
0.975ArgTrp: 0.975 ± 0.328
1.625ArgTyr: 1.625 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
5.038SerAla: 5.038 ± 0.913
0.65SerCys: 0.65 ± 0.296
2.113SerAsp: 2.113 ± 0.432
1.625SerGlu: 1.625 ± 0.472
2.438SerPhe: 2.438 ± 0.509
7.314SerGly: 7.314 ± 1.637
0.325SerHis: 0.325 ± 0.23
5.526SerIle: 5.526 ± 1.074
1.463SerLys: 1.463 ± 0.417
4.226SerLeu: 4.226 ± 0.614
1.463SerMet: 1.463 ± 0.331
0.975SerAsn: 0.975 ± 0.382
3.413SerPro: 3.413 ± 0.835
1.788SerGln: 1.788 ± 0.47
2.438SerArg: 2.438 ± 0.639
3.25SerSer: 3.25 ± 0.668
3.738SerThr: 3.738 ± 0.844
4.713SerVal: 4.713 ± 1.021
0.975SerTrp: 0.975 ± 0.349
3.575SerTyr: 3.575 ± 0.823
0.0SerXaa: 0.0 ± 0.0
Thr
6.338ThrAla: 6.338 ± 1.088
1.3ThrCys: 1.3 ± 0.562
1.3ThrAsp: 1.3 ± 0.414
1.625ThrGlu: 1.625 ± 0.672
2.275ThrPhe: 2.275 ± 0.582
4.063ThrGly: 4.063 ± 1.287
0.488ThrHis: 0.488 ± 0.274
6.338ThrIle: 6.338 ± 1.354
2.275ThrLys: 2.275 ± 0.655
6.826ThrLeu: 6.826 ± 0.865
1.3ThrMet: 1.3 ± 0.372
2.763ThrAsn: 2.763 ± 0.826
2.6ThrPro: 2.6 ± 0.619
1.138ThrGln: 1.138 ± 0.38
2.275ThrArg: 2.275 ± 0.536
2.438ThrSer: 2.438 ± 0.719
8.126ThrThr: 8.126 ± 2.057
7.476ThrVal: 7.476 ± 1.292
1.625ThrTrp: 1.625 ± 0.6
4.388ThrTyr: 4.388 ± 1.319
0.163ThrXaa: 0.163 ± 0.133
Val
7.476ValAla: 7.476 ± 1.163
1.788ValCys: 1.788 ± 0.562
3.25ValAsp: 3.25 ± 0.742
4.063ValGlu: 4.063 ± 1.049
3.575ValPhe: 3.575 ± 0.748
7.964ValGly: 7.964 ± 0.988
2.275ValHis: 2.275 ± 0.6
3.901ValIle: 3.901 ± 0.991
3.575ValLys: 3.575 ± 0.697
9.589ValLeu: 9.589 ± 1.23
2.438ValMet: 2.438 ± 0.622
4.551ValAsn: 4.551 ± 1.034
4.063ValPro: 4.063 ± 0.827
2.113ValGln: 2.113 ± 0.564
2.763ValArg: 2.763 ± 0.709
6.176ValSer: 6.176 ± 1.172
6.013ValThr: 6.013 ± 1.053
14.464ValVal: 14.464 ± 1.687
0.975ValTrp: 0.975 ± 0.348
5.688ValTyr: 5.688 ± 0.981
0.0ValXaa: 0.0 ± 0.0
Trp
0.975TrpAla: 0.975 ± 0.31
0.163TrpCys: 0.163 ± 0.175
0.325TrpAsp: 0.325 ± 0.203
1.138TrpGlu: 1.138 ± 0.443
0.488TrpPhe: 0.488 ± 0.239
1.3TrpGly: 1.3 ± 0.405
0.163TrpHis: 0.163 ± 0.156
1.138TrpIle: 1.138 ± 0.406
0.813TrpLys: 0.813 ± 0.331
3.575TrpLeu: 3.575 ± 0.846
0.325TrpMet: 0.325 ± 0.209
0.325TrpAsn: 0.325 ± 0.235
1.463TrpPro: 1.463 ± 0.447
0.0TrpGln: 0.0 ± 0.0
0.325TrpArg: 0.325 ± 0.2
1.625TrpSer: 1.625 ± 0.468
0.325TrpThr: 0.325 ± 0.219
1.625TrpVal: 1.625 ± 0.611
0.813TrpTrp: 0.813 ± 0.404
0.163TrpTyr: 0.163 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.713TyrAla: 4.713 ± 0.817
0.325TyrCys: 0.325 ± 0.216
2.763TyrAsp: 2.763 ± 0.458
2.113TyrGlu: 2.113 ± 0.606
1.788TyrPhe: 1.788 ± 0.525
3.738TyrGly: 3.738 ± 0.84
0.325TyrHis: 0.325 ± 0.212
5.526TyrIle: 5.526 ± 1.071
2.113TyrLys: 2.113 ± 0.7
6.176TyrLeu: 6.176 ± 0.817
1.788TyrMet: 1.788 ± 0.459
2.763TyrAsn: 2.763 ± 0.798
3.738TyrPro: 3.738 ± 1.167
1.625TyrGln: 1.625 ± 0.487
2.925TyrArg: 2.925 ± 0.637
2.763TyrSer: 2.763 ± 1.024
5.201TyrThr: 5.201 ± 1.232
6.663TyrVal: 6.663 ± 1.264
1.3TyrTrp: 1.3 ± 0.448
4.876TyrTyr: 4.876 ± 0.974
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.163XaaAsp: 0.163 ± 0.165
0.163XaaGlu: 0.163 ± 0.167
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.163XaaIle: 0.163 ± 0.167
0.163XaaLys: 0.163 ± 0.133
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.163XaaAsn: 0.163 ± 0.133
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.163XaaVal: 0.163 ± 0.167
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 38 proteins (6154 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski