Amino acid dipeptide frequency for Rhinolophus simulator polyomavirus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
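As an illustration, the calculation can be sketched in Python. This is a minimal sketch and not the code used by Proteome-pI: the function name `dipeptide_permille` and the toy sequences are hypothetical, sequences use one-letter codes (the table uses three-letter codes), and it assumes that overlapping pairs are counted within each protein and never across protein boundaries.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are taken with an overlapping window of width 2
    inside each protein; no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:  # no sequence is long enough to contain a dipeptide
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical, not taken from the proteome described here).
toy = ["MKKLA", "AKKL"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The two toy sequences contain 4 + 3 = 7 dipeptides in total, so the frequencies sum to 1000‰.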

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
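The expected value and the red/blue marking can be sketched as follows. The exact rule used for colouring the table is not stated here, so the plain comparison against the uniform expectation below is an assumption, and the names `EXPECTED_PERMILLE` and `classify` are hypothetical. The four example values are copied from the Ala row of the table.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PERMILLE = 1000.0 / (20 * 20)  # 2.5 per mille, i.e. 0.25%


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value_permille > expected:
        return "over"    # would be marked in red
    if value_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# A few values from the Ala row of the table (per mille).
table_excerpt = {"AlaVal": 8.762, "AlaAla": 6.572, "AlaPhe": 0.0, "AlaMet": 0.548}
labels = {pair: classify(value) for pair, value in table_excerpt.items()}
print(labels)
```

Under this rule AlaVal and AlaAla count as overrepresented, while AlaPhe and AlaMet count as underrepresented.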

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
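A short sketch of the unit conversion, using the AlaVal value from the table as the example. The helper names are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (1 per mille = 0.1%)."""
    return value_permille * 0.1


ala_val = 8.762  # AlaVal frequency from the table, in per mille
fraction = permille_to_fraction(ala_val)
percent = permille_to_percent(ala_val)
print(f"{fraction:.6f} {percent:.4f}%")
```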

For more information, see the sequence space article on Wikipedia.

Each row below corresponds to the first amino acid of the dipeptide; within a row, entries follow the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa for the second amino acid. Each entry is given as dipeptide: frequency ± error (‰).

Ala
AlaAla: 6.572 ± 1.953
AlaCys: 1.095 ± 0.847
AlaAsp: 2.191 ± 0.577
AlaGlu: 3.286 ± 0.861
AlaPhe: 0.0 ± 0.0
AlaGly: 1.643 ± 0.479
AlaHis: 1.095 ± 0.809
AlaIle: 3.834 ± 1.592
AlaLys: 2.191 ± 0.809
AlaLeu: 7.667 ± 2.357
AlaMet: 0.548 ± 0.477
AlaAsn: 1.095 ± 0.674
AlaPro: 0.548 ± 0.485
AlaGln: 2.191 ± 0.793
AlaArg: 5.476 ± 1.914
AlaSer: 3.286 ± 1.334
AlaThr: 3.286 ± 1.02
AlaVal: 8.762 ± 1.364
AlaTrp: 2.738 ± 0.931
AlaTyr: 4.381 ± 1.274
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.095 ± 0.697
CysCys: 0.548 ± 0.719
CysAsp: 0.548 ± 0.349
CysGlu: 0.0 ± 0.0
CysPhe: 1.095 ± 1.438
CysGly: 1.095 ± 1.438
CysHis: 0.0 ± 0.0
CysIle: 2.191 ± 1.34
CysLys: 2.738 ± 1.264
CysLeu: 2.191 ± 1.348
CysMet: 0.0 ± 0.0
CysAsn: 1.095 ± 0.404
CysPro: 1.095 ± 0.404
CysGln: 0.548 ± 0.349
CysArg: 0.0 ± 0.0
CysSer: 3.834 ± 1.026
CysThr: 1.095 ± 0.697
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 2.738 ± 2.009
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.738 ± 1.136
AspCys: 0.548 ± 0.719
AspAsp: 3.286 ± 0.879
AspGlu: 5.476 ± 1.831
AspPhe: 2.738 ± 1.314
AspGly: 3.286 ± 1.189
AspHis: 0.548 ± 0.349
AspIle: 2.738 ± 1.468
AspLys: 3.286 ± 0.879
AspLeu: 2.191 ± 1.001
AspMet: 1.643 ± 0.822
AspAsn: 2.191 ± 0.804
AspPro: 3.834 ± 2.067
AspGln: 0.548 ± 0.349
AspArg: 2.191 ± 0.793
AspSer: 2.738 ± 0.936
AspThr: 2.191 ± 0.793
AspVal: 2.738 ± 1.115
AspTrp: 2.738 ± 1.55
AspTyr: 1.643 ± 0.579
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.381 ± 1.973
GluCys: 2.738 ± 1.264
GluAsp: 3.286 ± 0.98
GluGlu: 7.119 ± 1.232
GluPhe: 2.738 ± 1.314
GluGly: 4.381 ± 2.105
GluHis: 1.643 ± 0.78
GluIle: 2.738 ± 0.815
GluLys: 6.572 ± 1.893
GluLeu: 8.215 ± 1.334
GluMet: 1.095 ± 0.674
GluAsn: 5.476 ± 2.069
GluPro: 1.095 ± 0.404
GluGln: 2.738 ± 0.64
GluArg: 3.286 ± 0.696
GluSer: 2.191 ± 0.897
GluThr: 4.381 ± 1.272
GluVal: 2.738 ± 1.569
GluTrp: 0.548 ± 0.349
GluTyr: 1.643 ± 0.927
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.381 ± 0.761
PheCys: 2.191 ± 0.671
PheAsp: 0.0 ± 0.0
PheGlu: 3.834 ± 2.44
PhePhe: 0.548 ± 0.485
PheGly: 3.286 ± 2.138
PheHis: 2.191 ± 0.637
PheIle: 0.548 ± 0.738
PheLys: 2.191 ± 1.395
PheLeu: 6.024 ± 1.601
PheMet: 1.095 ± 0.628
PheAsn: 1.643 ± 0.68
PhePro: 3.286 ± 1.229
PheGln: 0.548 ± 0.477
PheArg: 0.548 ± 0.349
PheSer: 1.643 ± 0.76
PheThr: 1.643 ± 0.822
PheVal: 0.548 ± 0.738
PheTrp: 0.548 ± 0.719
PheTyr: 2.738 ± 1.115
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.929 ± 1.916
GlyCys: 1.095 ± 0.674
GlyAsp: 3.834 ± 0.503
GlyGlu: 3.834 ± 1.611
GlyPhe: 2.738 ± 0.892
GlyGly: 7.119 ± 1.02
GlyHis: 2.191 ± 1.052
GlyIle: 3.286 ± 1.158
GlyLys: 4.381 ± 1.936
GlyLeu: 5.476 ± 1.172
GlyMet: 1.643 ± 1.68
GlyAsn: 2.738 ± 0.857
GlyPro: 6.572 ± 1.446
GlyGln: 3.834 ± 1.102
GlyArg: 0.0 ± 0.0
GlySer: 3.834 ± 1.688
GlyThr: 1.095 ± 0.969
GlyVal: 4.381 ± 1.239
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.738 ± 0.821
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 1.643 ± 0.797
HisAsp: 2.191 ± 0.897
HisGlu: 1.643 ± 0.698
HisPhe: 2.738 ± 0.815
HisGly: 0.548 ± 0.738
HisHis: 0.548 ± 0.349
HisIle: 0.0 ± 0.0
HisLys: 0.548 ± 0.719
HisLeu: 2.191 ± 0.793
HisMet: 1.095 ± 0.969
HisAsn: 1.095 ± 0.847
HisPro: 2.738 ± 0.86
HisGln: 1.095 ± 0.809
HisArg: 2.191 ± 0.897
HisSer: 1.643 ± 0.78
HisThr: 1.643 ± 0.804
HisVal: 1.095 ± 0.697
HisTrp: 0.548 ± 0.719
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.024 ± 1.529
IleCys: 1.095 ± 0.697
IleAsp: 2.191 ± 1.395
IleGlu: 4.929 ± 1.468
IlePhe: 2.738 ± 1.132
IleGly: 0.548 ± 0.485
IleHis: 2.191 ± 1.449
IleIle: 1.643 ± 0.698
IleLys: 1.095 ± 0.697
IleLeu: 3.286 ± 1.289
IleMet: 0.548 ± 0.349
IleAsn: 2.738 ± 0.825
IlePro: 0.548 ± 0.485
IleGln: 0.0 ± 0.0
IleArg: 1.095 ± 0.992
IleSer: 0.548 ± 0.477
IleThr: 4.381 ± 1.606
IleVal: 6.024 ± 2.012
IleTrp: 0.0 ± 0.0
IleTyr: 1.643 ± 0.797
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.834 ± 1.391
LysCys: 1.095 ± 0.674
LysAsp: 2.738 ± 0.825
LysGlu: 4.929 ± 1.062
LysPhe: 1.643 ± 0.68
LysGly: 3.834 ± 1.603
LysHis: 2.191 ± 1.395
LysIle: 3.834 ± 1.909
LysLys: 7.119 ± 1.639
LysLeu: 7.119 ± 0.922
LysMet: 2.191 ± 1.348
LysAsn: 2.191 ± 0.809
LysPro: 3.834 ± 1.53
LysGln: 2.191 ± 1.029
LysArg: 6.024 ± 0.807
LysSer: 2.738 ± 0.825
LysThr: 3.834 ± 1.431
LysVal: 1.643 ± 1.046
LysTrp: 0.548 ± 0.719
LysTyr: 1.643 ± 0.797
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.286 ± 1.559
LeuCys: 2.191 ± 0.671
LeuAsp: 6.024 ± 1.61
LeuGlu: 8.215 ± 2.634
LeuPhe: 4.929 ± 0.903
LeuGly: 3.834 ± 1.347
LeuHis: 2.738 ± 2.036
LeuIle: 4.381 ± 0.814
LeuLys: 1.643 ± 1.046
LeuLeu: 8.762 ± 2.369
LeuMet: 5.476 ± 1.213
LeuAsn: 8.762 ± 1.491
LeuPro: 7.119 ± 0.979
LeuGln: 6.024 ± 1.279
LeuArg: 2.191 ± 1.57
LeuSer: 4.929 ± 0.89
LeuThr: 3.286 ± 0.895
LeuVal: 6.024 ± 1.393
LeuTrp: 3.286 ± 1.453
LeuTyr: 5.476 ± 2.584
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.738 ± 0.931
MetCys: 1.095 ± 0.674
MetAsp: 1.643 ± 0.982
MetGlu: 2.738 ± 1.241
MetPhe: 1.095 ± 0.969
MetGly: 2.738 ± 0.567
MetHis: 1.095 ± 0.674
MetIle: 0.548 ± 0.738
MetLys: 3.834 ± 2.128
MetLeu: 4.929 ± 1.409
MetMet: 1.095 ± 0.674
MetAsn: 1.095 ± 0.404
MetPro: 0.548 ± 0.485
MetGln: 0.548 ± 0.349
MetArg: 0.548 ± 0.738
MetSer: 1.095 ± 0.809
MetThr: 3.286 ± 1.354
MetVal: 0.0 ± 0.0
MetTrp: 0.548 ± 0.485
MetTyr: 0.548 ± 0.738
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.095 ± 0.697
AsnCys: 1.095 ± 0.697
AsnAsp: 3.286 ± 1.22
AsnGlu: 1.643 ± 0.68
AsnPhe: 1.643 ± 0.579
AsnGly: 1.643 ± 1.376
AsnHis: 0.0 ± 0.0
AsnIle: 2.191 ± 0.671
AsnLys: 4.929 ± 2.034
AsnLeu: 5.476 ± 0.568
AsnMet: 1.095 ± 0.717
AsnAsn: 1.095 ± 0.969
AsnPro: 3.834 ± 1.734
AsnGln: 0.0 ± 0.0
AsnArg: 1.643 ± 0.76
AsnSer: 3.286 ± 0.696
AsnThr: 3.286 ± 0.696
AsnVal: 4.929 ± 0.484
AsnTrp: 0.548 ± 0.719
AsnTyr: 2.191 ± 0.793
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.381 ± 1.244
ProCys: 0.548 ± 0.349
ProAsp: 5.476 ± 1.651
ProGlu: 2.738 ± 0.8
ProPhe: 1.095 ± 0.697
ProGly: 4.929 ± 1.481
ProHis: 0.0 ± 0.0
ProIle: 1.643 ± 0.822
ProLys: 8.762 ± 1.106
ProLeu: 4.929 ± 2.135
ProMet: 3.834 ± 1.582
ProAsn: 0.0 ± 0.0
ProPro: 3.834 ± 1.07
ProGln: 1.095 ± 0.404
ProArg: 2.191 ± 1.809
ProSer: 6.572 ± 1.011
ProThr: 2.738 ± 1.374
ProVal: 2.191 ± 0.809
ProTrp: 0.0 ± 0.0
ProTyr: 1.095 ± 0.404
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.738 ± 0.931
GlnCys: 0.0 ± 0.0
GlnAsp: 0.548 ± 0.349
GlnGlu: 2.738 ± 1.264
GlnPhe: 2.738 ± 0.783
GlnGly: 2.738 ± 1.186
GlnHis: 1.095 ± 0.786
GlnIle: 4.381 ± 0.913
GlnLys: 2.191 ± 1.029
GlnLeu: 2.738 ± 1.538
GlnMet: 0.548 ± 0.485
GlnAsn: 1.643 ± 0.78
GlnPro: 2.191 ± 0.866
GlnGln: 1.095 ± 0.697
GlnArg: 1.095 ± 0.809
GlnSer: 1.643 ± 0.579
GlnThr: 2.191 ± 0.577
GlnVal: 2.738 ± 1.496
GlnTrp: 0.548 ± 0.349
GlnTyr: 0.548 ± 0.349
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.643 ± 0.78
ArgCys: 0.0 ± 0.0
ArgAsp: 1.095 ± 0.53
ArgGlu: 3.834 ± 1.399
ArgPhe: 1.643 ± 0.797
ArgGly: 1.643 ± 0.877
ArgHis: 1.095 ± 0.809
ArgIle: 1.095 ± 0.992
ArgLys: 1.643 ± 0.822
ArgLeu: 6.572 ± 2.092
ArgMet: 1.643 ± 0.804
ArgAsn: 3.286 ± 0.98
ArgPro: 2.738 ± 0.783
ArgGln: 4.381 ± 1.235
ArgArg: 3.834 ± 2.347
ArgSer: 0.548 ± 0.738
ArgThr: 1.643 ± 0.804
ArgVal: 5.476 ± 1.272
ArgTrp: 1.095 ± 0.809
ArgTyr: 2.738 ± 1.763
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.381 ± 1.106
SerCys: 1.095 ± 0.404
SerAsp: 2.191 ± 0.866
SerGlu: 3.834 ± 1.181
SerPhe: 3.834 ± 1.946
SerGly: 4.929 ± 0.796
SerHis: 1.643 ± 0.804
SerIle: 1.095 ± 0.953
SerLys: 3.834 ± 1.431
SerLeu: 5.476 ± 2.0
SerMet: 1.095 ± 0.717
SerAsn: 1.643 ± 0.863
SerPro: 1.643 ± 0.822
SerGln: 1.095 ± 0.697
SerArg: 6.024 ± 1.841
SerSer: 7.119 ± 1.382
SerThr: 3.286 ± 1.685
SerVal: 4.929 ± 1.273
SerTrp: 0.0 ± 0.0
SerTyr: 3.286 ± 0.792
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.738 ± 1.241
ThrCys: 1.095 ± 0.847
ThrAsp: 3.286 ± 1.28
ThrGlu: 2.191 ± 0.759
ThrPhe: 1.643 ± 0.78
ThrGly: 4.381 ± 1.553
ThrHis: 0.548 ± 0.349
ThrIle: 2.738 ± 1.538
ThrLys: 1.095 ± 0.697
ThrLeu: 5.476 ± 1.531
ThrMet: 2.191 ± 1.078
ThrAsn: 1.643 ± 0.822
ThrPro: 4.929 ± 1.229
ThrGln: 2.191 ± 0.866
ThrArg: 2.738 ± 0.64
ThrSer: 3.286 ± 1.966
ThrThr: 6.024 ± 1.65
ThrVal: 4.381 ± 0.975
ThrTrp: 1.095 ± 0.404
ThrTyr: 1.095 ± 0.404
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.191 ± 0.865
ValCys: 2.191 ± 1.348
ValAsp: 1.643 ± 0.579
ValGlu: 3.834 ± 1.283
ValPhe: 2.191 ± 1.057
ValGly: 4.929 ± 1.773
ValHis: 1.095 ± 0.724
ValIle: 3.834 ± 1.225
ValLys: 4.381 ± 1.538
ValLeu: 6.024 ± 1.246
ValMet: 2.191 ± 2.163
ValAsn: 4.929 ± 0.998
ValPro: 4.381 ± 0.687
ValGln: 1.643 ± 0.822
ValArg: 3.286 ± 1.813
ValSer: 5.476 ± 1.528
ValThr: 3.834 ± 1.154
ValVal: 5.476 ± 2.159
ValTrp: 1.095 ± 1.438
ValTyr: 1.643 ± 0.579
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.095 ± 0.809
TrpCys: 0.0 ± 0.0
TrpAsp: 2.738 ± 1.55
TrpGlu: 1.643 ± 0.68
TrpPhe: 1.095 ± 1.438
TrpGly: 2.738 ± 2.091
TrpHis: 0.548 ± 0.349
TrpIle: 0.0 ± 0.0
TrpLys: 0.548 ± 0.349
TrpLeu: 0.548 ± 0.477
TrpMet: 1.095 ± 0.809
TrpAsn: 0.548 ± 0.349
TrpPro: 0.548 ± 0.485
TrpGln: 1.095 ± 0.674
TrpArg: 1.095 ± 0.809
TrpSer: 0.548 ± 0.485
TrpThr: 0.0 ± 0.0
TrpVal: 1.095 ± 0.809
TrpTrp: 1.095 ± 0.674
TrpTyr: 0.548 ± 0.349
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.643 ± 0.76
TyrCys: 0.548 ± 0.719
TyrAsp: 1.095 ± 0.809
TyrGlu: 0.548 ± 0.349
TyrPhe: 0.548 ± 0.485
TyrGly: 5.476 ± 0.599
TyrHis: 2.738 ± 0.8
TyrIle: 0.548 ± 0.349
TyrLys: 2.191 ± 1.348
TyrLeu: 3.286 ± 1.052
TyrMet: 1.095 ± 0.739
TyrAsn: 0.0 ± 0.0
TyrPro: 2.738 ± 1.496
TyrGln: 3.286 ± 1.608
TyrArg: 2.191 ± 0.637
TyrSer: 5.476 ± 2.019
TyrThr: 1.643 ± 0.579
TyrVal: 1.095 ± 0.404
TyrTrp: 1.643 ± 0.78
TyrTyr: 2.191 ± 0.637
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1827 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
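Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates serves as the error. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names, the toy sequences, the use of the sample standard deviation as the error measure, and the counting of pairs within each protein are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_permille(proteins):
    """Per-mille frequency of each overlapping dipeptide (pairs stay within a protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return stdev(estimates)


# Toy input (hypothetical, one-letter codes).
toy = ["MKKLA", "AKKL", "GGSGG"]
print(bootstrap_error(toy, "KK"))
print(bootstrap_error(toy, "WW"))  # absent everywhere, so every estimate is 0
```

A dipeptide that occurs in none of the proteins has a frequency of 0 in every resample, which matches the 0.0 ± 0.0 entries in the table.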

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski