Amino acid dipeptide frequency for Rhinolophus simulator polyomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
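The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names, the toy sequences, and the choice to count dipeptides only within each protein (not across protein boundaries) are assumptions, as is the use of exactly 2.5‰ as the threshold between over- and underrepresented.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to X (Xaa).
        clean = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within one protein only.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy input, not taken from the proteome on this page.
toy = ["MAAK", "MKAA"]
freqs = dipeptide_per_mille(toy)
for dp, f in sorted(freqs.items()):
    print(dp, round(f, 3), classify(f))
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to a value of 0.0 in the table.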

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.197AlaAla: 6.197 ± 1.969
0.0AlaCys: 0.0 ± 0.0
2.817AlaAsp: 2.817 ± 1.507
7.324AlaGlu: 7.324 ± 2.397
3.38AlaPhe: 3.38 ± 1.418
1.69AlaGly: 1.69 ± 0.647
0.563AlaHis: 0.563 ± 0.78
5.07AlaIle: 5.07 ± 1.761
3.38AlaLys: 3.38 ± 0.952
9.014AlaLeu: 9.014 ± 2.334
0.0AlaMet: 0.0 ± 0.0
1.127AlaAsn: 1.127 ± 0.476
0.563AlaPro: 0.563 ± 0.491
1.69AlaGln: 1.69 ± 0.649
5.07AlaArg: 5.07 ± 2.881
2.817AlaSer: 2.817 ± 0.456
3.944AlaThr: 3.944 ± 2.462
3.38AlaVal: 3.38 ± 1.114
0.563AlaTrp: 0.563 ± 0.491
1.69AlaTyr: 1.69 ± 1.134
0.0AlaXaa: 0.0 ± 0.0
Cys
0.563CysAla: 0.563 ± 0.388
0.0CysCys: 0.0 ± 0.0
1.69CysAsp: 1.69 ± 1.165
0.0CysGlu: 0.0 ± 0.0
0.563CysPhe: 0.563 ± 0.388
0.563CysGly: 0.563 ± 0.388
1.127CysHis: 1.127 ± 0.423
1.127CysIle: 1.127 ± 0.591
3.38CysLys: 3.38 ± 1.257
2.817CysLeu: 2.817 ± 1.379
0.563CysMet: 0.563 ± 0.57
1.69CysAsn: 1.69 ± 0.58
0.563CysPro: 0.563 ± 0.491
0.563CysGln: 0.563 ± 0.388
1.127CysArg: 1.127 ± 0.777
2.254CysSer: 2.254 ± 0.897
0.563CysThr: 0.563 ± 0.388
1.127CysVal: 1.127 ± 0.591
1.69CysTrp: 1.69 ± 1.095
2.254CysTyr: 2.254 ± 1.069
0.0CysXaa: 0.0 ± 0.0
Asp
1.69AspAla: 1.69 ± 0.817
1.69AspCys: 1.69 ± 0.822
2.254AspAsp: 2.254 ± 1.553
1.127AspGlu: 1.127 ± 0.777
1.69AspPhe: 1.69 ± 0.649
3.944AspGly: 3.944 ± 0.978
2.254AspHis: 2.254 ± 0.67
3.38AspIle: 3.38 ± 1.21
2.817AspLys: 2.817 ± 1.379
5.634AspLeu: 5.634 ± 1.743
2.254AspMet: 2.254 ± 1.297
3.944AspAsn: 3.944 ± 1.63
4.507AspPro: 4.507 ± 0.923
2.254AspGln: 2.254 ± 0.535
0.563AspArg: 0.563 ± 0.53
1.69AspSer: 1.69 ± 0.647
1.127AspThr: 1.127 ± 0.767
1.69AspVal: 1.69 ± 1.165
1.69AspTrp: 1.69 ± 0.927
2.817AspTyr: 2.817 ± 0.456
0.0AspXaa: 0.0 ± 0.0
Glu
5.634GluAla: 5.634 ± 2.476
3.38GluCys: 3.38 ± 1.774
6.197GluAsp: 6.197 ± 1.273
11.831GluGlu: 11.831 ± 2.035
3.944GluPhe: 3.944 ± 2.719
2.817GluGly: 2.817 ± 0.624
1.127GluHis: 1.127 ± 0.423
1.69GluIle: 1.69 ± 0.689
2.817GluLys: 2.817 ± 1.495
10.141GluLeu: 10.141 ± 3.15
2.254GluMet: 2.254 ± 1.095
5.07GluAsn: 5.07 ± 1.265
1.127GluPro: 1.127 ± 0.423
3.944GluGln: 3.944 ± 1.285
2.254GluArg: 2.254 ± 0.818
2.254GluSer: 2.254 ± 0.701
1.69GluThr: 1.69 ± 0.689
3.944GluVal: 3.944 ± 2.254
1.69GluTrp: 1.69 ± 0.647
2.254GluTyr: 2.254 ± 0.746
0.0GluXaa: 0.0 ± 0.0
Phe
3.38PheAla: 3.38 ± 1.114
1.127PheCys: 1.127 ± 0.777
2.817PheAsp: 2.817 ± 1.942
4.507PheGlu: 4.507 ± 0.901
1.127PhePhe: 1.127 ± 0.423
2.254PheGly: 2.254 ± 1.448
1.127PheHis: 1.127 ± 0.777
2.254PheIle: 2.254 ± 0.67
1.127PheLys: 1.127 ± 1.14
8.451PheLeu: 8.451 ± 1.607
0.0PheMet: 0.0 ± 0.0
1.69PheAsn: 1.69 ± 0.647
6.761PhePro: 6.761 ± 0.809
0.0PheGln: 0.0 ± 0.0
0.563PheArg: 0.563 ± 0.491
3.38PheSer: 3.38 ± 0.952
2.817PheThr: 2.817 ± 0.492
1.127PheVal: 1.127 ± 0.768
1.127PheTrp: 1.127 ± 0.767
1.127PheTyr: 1.127 ± 0.767
0.0PheXaa: 0.0 ± 0.0
Gly
3.38GlyAla: 3.38 ± 1.301
0.563GlyCys: 0.563 ± 0.388
3.38GlyAsp: 3.38 ± 1.159
4.507GlyGlu: 4.507 ± 1.953
2.254GlyPhe: 2.254 ± 0.67
6.761GlyGly: 6.761 ± 1.37
0.563GlyHis: 0.563 ± 0.491
4.507GlyIle: 4.507 ± 0.4
3.944GlyLys: 3.944 ± 1.889
5.07GlyLeu: 5.07 ± 1.101
1.127GlyMet: 1.127 ± 0.982
2.254GlyAsn: 2.254 ± 1.069
5.634GlyPro: 5.634 ± 2.048
2.817GlyGln: 2.817 ± 0.456
1.127GlyArg: 1.127 ± 0.767
1.69GlySer: 1.69 ± 0.647
3.944GlyThr: 3.944 ± 2.03
7.324GlyVal: 7.324 ± 1.429
0.0GlyTrp: 0.0 ± 0.0
0.563GlyTyr: 0.563 ± 0.491
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.127HisGlu: 1.127 ± 0.777
2.254HisPhe: 2.254 ± 0.535
0.0HisGly: 0.0 ± 0.0
1.127HisHis: 1.127 ± 1.031
0.563HisIle: 0.563 ± 0.53
2.254HisLys: 2.254 ± 1.142
2.254HisLeu: 2.254 ± 1.536
0.563HisMet: 0.563 ± 0.57
1.127HisAsn: 1.127 ± 0.423
2.254HisPro: 2.254 ± 0.67
0.563HisGln: 0.563 ± 0.78
3.38HisArg: 3.38 ± 0.657
1.69HisSer: 1.69 ± 0.817
3.38HisThr: 3.38 ± 0.833
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.563HisTyr: 0.563 ± 0.388
0.0HisXaa: 0.0 ± 0.0
Ile
5.07IleAla: 5.07 ± 2.433
1.127IleCys: 1.127 ± 0.423
2.817IleAsp: 2.817 ± 1.368
2.254IleGlu: 2.254 ± 1.54
3.38IlePhe: 3.38 ± 1.861
2.817IleGly: 2.817 ± 1.166
0.0IleHis: 0.0 ± 0.0
1.127IleIle: 1.127 ± 0.724
0.563IleLys: 0.563 ± 0.388
4.507IleLeu: 4.507 ± 1.792
1.69IleMet: 1.69 ± 0.848
0.563IleAsn: 0.563 ± 0.491
2.817IlePro: 2.817 ± 1.021
4.507IleGln: 4.507 ± 0.87
0.563IleArg: 0.563 ± 0.491
5.07IleSer: 5.07 ± 1.305
3.38IleThr: 3.38 ± 0.688
1.69IleVal: 1.69 ± 0.647
0.563IleTrp: 0.563 ± 0.388
2.817IleTyr: 2.817 ± 0.971
0.0IleXaa: 0.0 ± 0.0
Lys
2.817LysAla: 2.817 ± 0.929
2.254LysCys: 2.254 ± 1.183
0.0LysAsp: 0.0 ± 0.0
4.507LysGlu: 4.507 ± 2.172
1.69LysPhe: 1.69 ± 0.58
2.817LysGly: 2.817 ± 0.929
0.563LysHis: 0.563 ± 0.388
3.944LysIle: 3.944 ± 1.918
10.141LysLys: 10.141 ± 2.295
7.887LysLeu: 7.887 ± 2.383
1.127LysMet: 1.127 ± 0.768
2.254LysAsn: 2.254 ± 1.184
1.127LysPro: 1.127 ± 0.591
2.817LysGln: 2.817 ± 1.238
5.07LysArg: 5.07 ± 1.52
3.38LysSer: 3.38 ± 1.108
4.507LysThr: 4.507 ± 1.623
2.817LysVal: 2.817 ± 1.379
0.0LysTrp: 0.0 ± 0.0
0.563LysTyr: 0.563 ± 0.388
0.0LysXaa: 0.0 ± 0.0
Leu
5.634LeuAla: 5.634 ± 2.914
3.38LeuCys: 3.38 ± 1.257
3.944LeuAsp: 3.944 ± 1.044
11.268LeuGlu: 11.268 ± 2.75
3.944LeuPhe: 3.944 ± 1.613
3.38LeuGly: 3.38 ± 1.299
3.38LeuHis: 3.38 ± 1.514
7.324LeuIle: 7.324 ± 1.357
5.634LeuLys: 5.634 ± 2.307
7.324LeuLeu: 7.324 ± 2.149
2.817LeuMet: 2.817 ± 1.063
6.197LeuAsn: 6.197 ± 1.308
7.324LeuPro: 7.324 ± 1.24
6.197LeuGln: 6.197 ± 1.925
5.07LeuArg: 5.07 ± 1.359
8.451LeuSer: 8.451 ± 1.691
4.507LeuThr: 4.507 ± 1.932
3.38LeuVal: 3.38 ± 0.815
0.563LeuTrp: 0.563 ± 0.388
6.197LeuTyr: 6.197 ± 2.107
0.0LeuXaa: 0.0 ± 0.0
Met
4.507MetAla: 4.507 ± 0.87
0.563MetCys: 0.563 ± 0.388
1.69MetAsp: 1.69 ± 0.58
1.69MetGlu: 1.69 ± 0.822
0.563MetPhe: 0.563 ± 0.388
1.127MetGly: 1.127 ± 0.707
1.127MetHis: 1.127 ± 0.591
0.563MetIle: 0.563 ± 0.78
1.69MetLys: 1.69 ± 1.499
2.817MetLeu: 2.817 ± 0.624
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
2.254MetPro: 2.254 ± 0.535
2.254MetGln: 2.254 ± 1.54
1.127MetArg: 1.127 ± 0.591
0.0MetSer: 0.0 ± 0.0
0.563MetThr: 0.563 ± 0.388
0.563MetVal: 0.563 ± 0.491
0.563MetTrp: 0.563 ± 0.491
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.69AsnAla: 1.69 ± 0.649
1.127AsnCys: 1.127 ± 0.423
1.69AsnAsp: 1.69 ± 1.473
2.254AsnGlu: 2.254 ± 1.297
1.69AsnPhe: 1.69 ± 0.822
2.254AsnGly: 2.254 ± 1.297
1.127AsnHis: 1.127 ± 0.982
1.69AsnIle: 1.69 ± 1.095
2.254AsnLys: 2.254 ± 1.553
4.507AsnLeu: 4.507 ± 1.134
2.254AsnMet: 2.254 ± 0.554
0.0AsnAsn: 0.0 ± 0.0
3.38AsnPro: 3.38 ± 0.585
0.563AsnGln: 0.563 ± 0.388
1.127AsnArg: 1.127 ± 0.777
2.817AsnSer: 2.817 ± 0.456
3.944AsnThr: 3.944 ± 1.63
3.38AsnVal: 3.38 ± 0.585
0.0AsnTrp: 0.0 ± 0.0
2.254AsnTyr: 2.254 ± 0.535
0.0AsnXaa: 0.0 ± 0.0
Pro
0.563ProAla: 0.563 ± 0.491
1.69ProCys: 1.69 ± 0.58
5.634ProAsp: 5.634 ± 1.199
3.38ProGlu: 3.38 ± 1.861
0.563ProPhe: 0.563 ± 0.388
6.197ProGly: 6.197 ± 1.582
0.0ProHis: 0.0 ± 0.0
2.254ProIle: 2.254 ± 1.158
5.07ProLys: 5.07 ± 1.381
7.324ProLeu: 7.324 ± 1.503
1.127ProMet: 1.127 ± 0.423
1.69ProAsn: 1.69 ± 0.647
6.197ProPro: 6.197 ± 1.273
4.507ProGln: 4.507 ± 2.144
3.38ProArg: 3.38 ± 2.426
2.817ProSer: 2.817 ± 0.456
1.69ProThr: 1.69 ± 0.647
2.254ProVal: 2.254 ± 1.297
0.0ProTrp: 0.0 ± 0.0
2.817ProTyr: 2.817 ± 0.87
0.0ProXaa: 0.0 ± 0.0
Gln
5.07GlnAla: 5.07 ± 1.588
0.563GlnCys: 0.563 ± 0.388
0.563GlnAsp: 0.563 ± 0.491
1.127GlnGlu: 1.127 ± 0.777
3.38GlnPhe: 3.38 ± 1.786
2.817GlnGly: 2.817 ± 1.121
0.563GlnHis: 0.563 ± 0.57
3.38GlnIle: 3.38 ± 0.585
1.127GlnLys: 1.127 ± 0.591
3.944GlnLeu: 3.944 ± 0.593
0.0GlnMet: 0.0 ± 0.0
2.254GlnAsn: 2.254 ± 0.746
1.127GlnPro: 1.127 ± 0.982
7.324GlnGln: 7.324 ± 2.312
6.197GlnArg: 6.197 ± 2.101
3.944GlnSer: 3.944 ± 1.789
2.254GlnThr: 2.254 ± 1.535
1.127GlnVal: 1.127 ± 0.724
0.563GlnTrp: 0.563 ± 0.57
2.254GlnTyr: 2.254 ± 0.752
0.0GlnXaa: 0.0 ± 0.0
Arg
2.817ArgAla: 2.817 ± 1.368
1.127ArgCys: 1.127 ± 0.591
3.944ArgAsp: 3.944 ± 1.084
2.817ArgGlu: 2.817 ± 0.758
2.817ArgPhe: 2.817 ± 0.996
3.944ArgGly: 3.944 ± 1.16
2.817ArgHis: 2.817 ± 0.996
0.563ArgIle: 0.563 ± 0.388
3.944ArgLys: 3.944 ± 1.332
2.817ArgLeu: 2.817 ± 0.971
1.69ArgMet: 1.69 ± 1.359
0.563ArgAsn: 0.563 ± 0.388
2.254ArgPro: 2.254 ± 1.489
1.69ArgGln: 1.69 ± 1.183
3.38ArgArg: 3.38 ± 1.388
0.563ArgSer: 0.563 ± 0.491
2.817ArgThr: 2.817 ± 1.371
4.507ArgVal: 4.507 ± 0.814
1.127ArgTrp: 1.127 ± 0.767
4.507ArgTyr: 4.507 ± 1.556
0.0ArgXaa: 0.0 ± 0.0
Ser
5.07SerAla: 5.07 ± 1.761
1.127SerCys: 1.127 ± 0.724
3.38SerAsp: 3.38 ± 0.657
4.507SerGlu: 4.507 ± 1.492
2.254SerPhe: 2.254 ± 0.979
4.507SerGly: 4.507 ± 1.448
1.69SerHis: 1.69 ± 1.134
2.817SerIle: 2.817 ± 0.919
0.0SerLys: 0.0 ± 0.0
6.197SerLeu: 6.197 ± 1.4
0.563SerMet: 0.563 ± 0.491
2.817SerAsn: 2.817 ± 0.777
1.69SerPro: 1.69 ± 1.165
2.254SerGln: 2.254 ± 1.142
4.507SerArg: 4.507 ± 1.312
2.817SerSer: 2.817 ± 0.736
2.817SerThr: 2.817 ± 0.839
6.761SerVal: 6.761 ± 1.343
2.254SerTrp: 2.254 ± 0.752
0.563SerTyr: 0.563 ± 0.491
0.0SerXaa: 0.0 ± 0.0
Thr
2.817ThrAla: 2.817 ± 0.945
1.127ThrCys: 1.127 ± 0.423
2.254ThrAsp: 2.254 ± 0.67
7.324ThrGlu: 7.324 ± 1.832
3.944ThrPhe: 3.944 ± 1.141
2.817ThrGly: 2.817 ± 0.823
1.127ThrHis: 1.127 ± 0.768
1.127ThrIle: 1.127 ± 0.982
1.127ThrLys: 1.127 ± 0.993
6.761ThrLeu: 6.761 ± 1.165
1.69ThrMet: 1.69 ± 0.925
1.127ThrAsn: 1.127 ± 0.982
5.07ThrPro: 5.07 ± 0.916
1.69ThrGln: 1.69 ± 0.647
1.127ThrArg: 1.127 ± 0.767
4.507ThrSer: 4.507 ± 0.535
5.634ThrThr: 5.634 ± 1.162
2.254ThrVal: 2.254 ± 0.849
1.127ThrTrp: 1.127 ± 0.767
2.254ThrTyr: 2.254 ± 0.907
0.0ThrXaa: 0.0 ± 0.0
Val
0.563ValAla: 0.563 ± 0.388
1.127ValCys: 1.127 ± 0.591
1.127ValAsp: 1.127 ± 0.777
1.127ValGlu: 1.127 ± 0.707
1.127ValPhe: 1.127 ± 0.476
3.944ValGly: 3.944 ± 2.972
1.127ValHis: 1.127 ± 0.767
2.254ValIle: 2.254 ± 0.979
3.944ValLys: 3.944 ± 1.187
4.507ValLeu: 4.507 ± 0.625
1.69ValMet: 1.69 ± 0.958
3.944ValAsn: 3.944 ± 0.593
2.254ValPro: 2.254 ± 1.177
3.944ValGln: 3.944 ± 1.084
3.38ValArg: 3.38 ± 1.21
4.507ValSer: 4.507 ± 0.814
5.634ValThr: 5.634 ± 1.701
2.254ValVal: 2.254 ± 0.849
0.0ValTrp: 0.0 ± 0.0
2.254ValTyr: 2.254 ± 0.517
0.0ValXaa: 0.0 ± 0.0
Trp
2.254TrpAla: 2.254 ± 1.535
0.563TrpCys: 0.563 ± 0.491
1.127TrpAsp: 1.127 ± 0.423
2.254TrpGlu: 2.254 ± 0.535
2.254TrpPhe: 2.254 ± 1.642
0.563TrpGly: 0.563 ± 0.57
0.0TrpHis: 0.0 ± 0.0
0.563TrpIle: 0.563 ± 0.57
1.69TrpLys: 1.69 ± 1.165
1.127TrpLeu: 1.127 ± 0.777
1.127TrpMet: 1.127 ± 0.767
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.127TrpSer: 1.127 ± 0.767
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.563TrpTrp: 0.563 ± 0.388
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.563TyrAla: 0.563 ± 0.388
1.69TyrCys: 1.69 ± 0.822
1.69TyrAsp: 1.69 ± 1.134
1.127TyrGlu: 1.127 ± 0.724
3.944TyrPhe: 3.944 ± 1.779
6.197TyrGly: 6.197 ± 2.057
1.69TyrHis: 1.69 ± 0.817
1.127TyrIle: 1.127 ± 0.767
3.38TyrLys: 3.38 ± 1.861
2.817TyrLeu: 2.817 ± 0.992
0.563TyrMet: 0.563 ± 0.388
1.69TyrAsn: 1.69 ± 0.649
2.254TyrPro: 2.254 ± 0.849
0.0TyrGln: 0.0 ± 0.0
1.69TyrArg: 1.69 ± 0.817
2.817TyrSer: 2.817 ± 1.371
2.254TyrThr: 2.254 ± 0.746
1.127TyrVal: 1.127 ± 0.767
1.127TyrTrp: 1.127 ± 0.777
3.38TyrTyr: 3.38 ± 1.338
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1776 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
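As an illustration of what a protein-level bootstrap might look like, the sketch below resamples whole proteins with replacement, recomputes the per mille frequency of one dipeptide in each replicate, and reports the standard deviation across replicates. The function name, the fixed seed, the use of the population standard deviation, and the toy sequences are all assumptions; the exact procedure used for this page is not described here.

```python
import random
import statistics
from collections import Counter


def bootstrap_sd(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency
    by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    # Count dipeptides once per protein so each replicate only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # One replicate: draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input, not taken from the proteome on this page.
toy = ["MAAKLL", "MKAAEE", "MLLEEK"]
print(bootstrap_sd(toy, "AA"))
```

With only a handful of proteins, as here, each replicate is drawn from very few units, so the resulting error estimates are themselves fairly coarse.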

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski