Amino acid dipeptide frequency for Phodopus sungorus papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 1/400, i.e. 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
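
The calculation described above can be sketched in Python as follows. This is a minimal illustration, not the database's own code: the function names (dipeptide_permille, classify) are hypothetical, and it assumes that dipeptides are counted as overlapping windows within each protein and that any residue outside the 20 standard ones is mapped to X (Xaa). The database may count and normalize differently.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are treated as X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; windows never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    for dipeptide, freq in sorted(dipeptide_permille(["MKLLA", "MKV"]).items()):
        print(dipeptide, round(freq, 3), classify(freq))
```

With such a classification, a value like 3.314‰ would count as overrepresented and 0.829‰ as underrepresented relative to the uniform 2.5‰ expectation.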

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.314AlaAla: 3.314 ± 0.992
2.071AlaCys: 2.071 ± 1.438
6.628AlaAsp: 6.628 ± 1.257
4.143AlaGlu: 4.143 ± 1.674
2.9AlaPhe: 2.9 ± 1.039
3.728AlaGly: 3.728 ± 0.727
1.243AlaHis: 1.243 ± 1.033
0.829AlaIle: 0.829 ± 0.433
3.314AlaLys: 3.314 ± 1.064
4.971AlaLeu: 4.971 ± 2.039
2.071AlaMet: 2.071 ± 0.809
2.486AlaAsn: 2.486 ± 0.907
2.9AlaPro: 2.9 ± 0.797
2.071AlaGln: 2.071 ± 1.175
4.143AlaArg: 4.143 ± 0.612
4.557AlaSer: 4.557 ± 1.408
4.557AlaThr: 4.557 ± 1.292
2.486AlaVal: 2.486 ± 0.723
0.0AlaTrp: 0.0 ± 0.0
2.9AlaTyr: 2.9 ± 1.351
0.0AlaXaa: 0.0 ± 0.0
Cys
1.243CysAla: 1.243 ± 0.565
0.829CysCys: 0.829 ± 0.604
0.414CysAsp: 0.414 ± 0.344
0.414CysGlu: 0.414 ± 0.302
0.829CysPhe: 0.829 ± 0.584
0.414CysGly: 0.414 ± 0.473
0.414CysHis: 0.414 ± 0.344
1.243CysIle: 1.243 ± 1.041
0.829CysLys: 0.829 ± 0.689
1.243CysLeu: 1.243 ± 0.685
0.414CysMet: 0.414 ± 0.473
1.243CysAsn: 1.243 ± 0.571
2.071CysPro: 2.071 ± 1.168
1.657CysGln: 1.657 ± 0.683
1.657CysArg: 1.657 ± 1.576
1.243CysSer: 1.243 ± 0.906
0.829CysThr: 0.829 ± 0.358
1.243CysVal: 1.243 ± 1.246
1.243CysTrp: 1.243 ± 0.376
0.414CysTyr: 0.414 ± 0.473
0.0CysXaa: 0.0 ± 0.0
Asp
4.143AspAla: 4.143 ± 0.719
1.243AspCys: 1.243 ± 0.553
2.071AspAsp: 2.071 ± 0.897
5.8AspGlu: 5.8 ± 1.635
2.9AspPhe: 2.9 ± 0.948
4.143AspGly: 4.143 ± 0.736
0.414AspHis: 0.414 ± 0.411
5.8AspIle: 5.8 ± 1.796
3.314AspLys: 3.314 ± 0.523
5.8AspLeu: 5.8 ± 1.1
0.829AspMet: 0.829 ± 0.358
2.9AspAsn: 2.9 ± 0.597
5.385AspPro: 5.385 ± 1.573
4.143AspGln: 4.143 ± 1.023
2.486AspArg: 2.486 ± 0.631
4.971AspSer: 4.971 ± 1.826
3.728AspThr: 3.728 ± 0.922
3.728AspVal: 3.728 ± 1.456
0.829AspTrp: 0.829 ± 0.358
2.486AspTyr: 2.486 ± 0.631
0.0AspXaa: 0.0 ± 0.0
Glu
3.728GluAla: 3.728 ± 1.751
0.414GluCys: 0.414 ± 0.302
4.557GluAsp: 4.557 ± 0.688
4.557GluGlu: 4.557 ± 1.805
2.486GluPhe: 2.486 ± 1.206
4.143GluGly: 4.143 ± 1.219
2.071GluHis: 2.071 ± 0.939
3.314GluIle: 3.314 ± 1.584
1.657GluLys: 1.657 ± 0.878
4.143GluLeu: 4.143 ± 0.867
0.829GluMet: 0.829 ± 0.604
2.486GluAsn: 2.486 ± 0.738
3.314GluPro: 3.314 ± 1.038
3.728GluGln: 3.728 ± 1.392
4.971GluArg: 4.971 ± 0.8
4.557GluSer: 4.557 ± 1.702
4.143GluThr: 4.143 ± 1.069
3.728GluVal: 3.728 ± 1.547
0.829GluTrp: 0.829 ± 0.402
1.243GluTyr: 1.243 ± 0.631
0.0GluXaa: 0.0 ± 0.0
Phe
2.9PheAla: 2.9 ± 0.746
0.829PheCys: 0.829 ± 0.505
3.314PheAsp: 3.314 ± 0.832
2.071PheGlu: 2.071 ± 0.438
2.486PhePhe: 2.486 ± 0.752
3.314PheGly: 3.314 ± 1.009
0.414PheHis: 0.414 ± 0.326
2.486PheIle: 2.486 ± 1.401
2.486PheLys: 2.486 ± 1.038
4.557PheLeu: 4.557 ± 1.353
1.243PheMet: 1.243 ± 0.681
0.414PheAsn: 0.414 ± 0.344
2.071PhePro: 2.071 ± 0.741
2.9PheGln: 2.9 ± 1.211
2.071PheArg: 2.071 ± 0.601
2.071PheSer: 2.071 ± 0.45
1.243PheThr: 1.243 ± 0.634
2.9PheVal: 2.9 ± 1.398
1.243PheTrp: 1.243 ± 0.634
1.243PheTyr: 1.243 ± 0.374
0.0PheXaa: 0.0 ± 0.0
Gly
4.971GlyAla: 4.971 ± 1.09
0.829GlyCys: 0.829 ± 0.358
4.143GlyAsp: 4.143 ± 1.023
4.557GlyGlu: 4.557 ± 1.552
1.657GlyPhe: 1.657 ± 0.715
6.214GlyGly: 6.214 ± 2.422
2.071GlyHis: 2.071 ± 0.33
2.486GlyIle: 2.486 ± 0.742
2.486GlyLys: 2.486 ± 0.822
5.385GlyLeu: 5.385 ± 1.485
1.243GlyMet: 1.243 ± 0.811
3.728GlyAsn: 3.728 ± 0.732
4.557GlyPro: 4.557 ± 1.698
2.9GlyGln: 2.9 ± 0.907
6.628GlyArg: 6.628 ± 2.134
4.143GlySer: 4.143 ± 1.744
7.042GlyThr: 7.042 ± 2.029
4.557GlyVal: 4.557 ± 0.847
0.414GlyTrp: 0.414 ± 0.302
1.243GlyTyr: 1.243 ± 0.506
0.0GlyXaa: 0.0 ± 0.0
His
0.829HisAla: 0.829 ± 0.614
0.829HisCys: 0.829 ± 0.604
0.829HisAsp: 0.829 ± 0.614
0.829HisGlu: 0.829 ± 0.441
0.829HisPhe: 0.829 ± 0.566
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.243HisIle: 1.243 ± 0.668
0.829HisLys: 0.829 ± 0.822
1.243HisLeu: 1.243 ± 0.571
0.0HisMet: 0.0 ± 0.307
0.0HisAsn: 0.0 ± 0.0
1.657HisPro: 1.657 ± 0.731
0.414HisGln: 0.414 ± 0.302
0.414HisArg: 0.414 ± 0.302
1.243HisSer: 1.243 ± 0.42
0.829HisThr: 0.829 ± 0.481
0.829HisVal: 0.829 ± 0.481
0.829HisTrp: 0.829 ± 0.358
0.414HisTyr: 0.414 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
4.557IleAla: 4.557 ± 0.915
0.829IleCys: 0.829 ± 0.689
3.728IleAsp: 3.728 ± 0.934
3.314IleGlu: 3.314 ± 1.561
2.071IlePhe: 2.071 ± 1.207
4.971IleGly: 4.971 ± 1.006
0.414IleHis: 0.414 ± 0.344
2.071IleIle: 2.071 ± 1.207
1.657IleLys: 1.657 ± 1.044
3.728IleLeu: 3.728 ± 0.915
0.0IleMet: 0.0 ± 0.0
1.657IleAsn: 1.657 ± 0.605
2.486IlePro: 2.486 ± 1.149
2.071IleGln: 2.071 ± 0.902
1.243IleArg: 1.243 ± 0.741
4.971IleSer: 4.971 ± 0.258
2.486IleThr: 2.486 ± 0.823
2.486IleVal: 2.486 ± 0.895
0.0IleTrp: 0.0 ± 0.0
2.071IleTyr: 2.071 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
2.071LysAla: 2.071 ± 0.698
1.657LysCys: 1.657 ± 0.913
2.486LysAsp: 2.486 ± 0.971
2.9LysGlu: 2.9 ± 0.79
1.243LysPhe: 1.243 ± 0.661
1.243LysGly: 1.243 ± 1.053
0.829LysHis: 0.829 ± 0.604
2.071LysIle: 2.071 ± 0.734
3.314LysLys: 3.314 ± 1.25
5.8LysLeu: 5.8 ± 1.553
0.829LysMet: 0.829 ± 0.641
1.243LysAsn: 1.243 ± 0.661
1.657LysPro: 1.657 ± 1.18
2.486LysGln: 2.486 ± 0.697
4.557LysArg: 4.557 ± 0.516
2.9LysSer: 2.9 ± 1.267
2.9LysThr: 2.9 ± 1.157
3.314LysVal: 3.314 ± 1.009
0.829LysTrp: 0.829 ± 0.689
1.657LysTyr: 1.657 ± 0.677
0.0LysXaa: 0.0 ± 0.0
Leu
5.385LeuAla: 5.385 ± 1.085
1.243LeuCys: 1.243 ± 1.041
6.214LeuAsp: 6.214 ± 0.901
3.728LeuGlu: 3.728 ± 0.829
5.8LeuPhe: 5.8 ± 1.68
9.942LeuGly: 9.942 ± 1.971
1.657LeuHis: 1.657 ± 0.725
2.9LeuIle: 2.9 ± 1.189
3.728LeuLys: 3.728 ± 1.199
10.771LeuLeu: 10.771 ± 3.321
1.243LeuMet: 1.243 ± 0.901
1.657LeuAsn: 1.657 ± 0.731
4.971LeuPro: 4.971 ± 1.609
4.143LeuGln: 4.143 ± 0.882
4.143LeuArg: 4.143 ± 1.007
5.8LeuSer: 5.8 ± 1.993
2.9LeuThr: 2.9 ± 0.473
6.628LeuVal: 6.628 ± 1.083
0.829LeuTrp: 0.829 ± 0.584
3.728LeuTyr: 3.728 ± 0.771
0.0LeuXaa: 0.0 ± 0.0
Met
0.414MetAla: 0.414 ± 0.302
0.414MetCys: 0.414 ± 0.344
1.243MetAsp: 1.243 ± 0.506
0.0MetGlu: 0.0 ± 0.0
1.243MetPhe: 1.243 ± 0.565
0.829MetGly: 0.829 ± 0.402
0.0MetHis: 0.0 ± 0.0
1.243MetIle: 1.243 ± 0.931
1.243MetLys: 1.243 ± 0.711
0.829MetLeu: 0.829 ± 0.505
0.0MetMet: 0.0 ± 0.0
1.657MetAsn: 1.657 ± 0.796
1.657MetPro: 1.657 ± 0.698
0.829MetGln: 0.829 ± 0.637
2.071MetArg: 2.071 ± 0.545
0.829MetSer: 0.829 ± 0.604
1.657MetThr: 1.657 ± 0.512
1.243MetVal: 1.243 ± 0.681
0.414MetTrp: 0.414 ± 0.344
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.486AsnAla: 2.486 ± 0.847
1.243AsnCys: 1.243 ± 0.506
2.071AsnAsp: 2.071 ± 0.393
2.071AsnGlu: 2.071 ± 0.548
2.071AsnPhe: 2.071 ± 0.972
2.486AsnGly: 2.486 ± 1.651
0.0AsnHis: 0.0 ± 0.0
1.243AsnIle: 1.243 ± 0.631
2.071AsnLys: 2.071 ± 1.06
2.071AsnLeu: 2.071 ± 0.902
0.829AsnMet: 0.829 ± 0.41
1.657AsnAsn: 1.657 ± 1.18
4.557AsnPro: 4.557 ± 0.859
1.243AsnGln: 1.243 ± 1.033
2.9AsnArg: 2.9 ± 1.073
2.9AsnSer: 2.9 ± 1.011
1.657AsnThr: 1.657 ± 0.948
4.143AsnVal: 4.143 ± 0.954
0.414AsnTrp: 0.414 ± 0.302
0.829AsnTyr: 0.829 ± 0.441
0.0AsnXaa: 0.0 ± 0.0
Pro
2.9ProAla: 2.9 ± 1.098
1.243ProCys: 1.243 ± 0.876
7.042ProAsp: 7.042 ± 2.089
4.557ProGlu: 4.557 ± 1.606
2.071ProPhe: 2.071 ± 0.33
2.071ProGly: 2.071 ± 1.632
0.414ProHis: 0.414 ± 0.302
4.143ProIle: 4.143 ± 0.928
2.9ProLys: 2.9 ± 0.917
6.214ProLeu: 6.214 ± 0.931
1.243ProMet: 1.243 ± 0.711
2.486ProAsn: 2.486 ± 0.876
4.143ProPro: 4.143 ± 1.234
2.486ProGln: 2.486 ± 1.585
3.314ProArg: 3.314 ± 1.409
7.871ProSer: 7.871 ± 1.439
3.728ProThr: 3.728 ± 1.762
2.9ProVal: 2.9 ± 1.203
0.414ProTrp: 0.414 ± 0.411
1.657ProTyr: 1.657 ± 1.273
0.0ProXaa: 0.0 ± 0.0
Gln
2.486GlnAla: 2.486 ± 0.372
0.414GlnCys: 0.414 ± 0.326
4.971GlnAsp: 4.971 ± 0.722
3.728GlnGlu: 3.728 ± 1.188
2.486GlnPhe: 2.486 ± 0.847
1.657GlnGly: 1.657 ± 0.573
0.0GlnHis: 0.0 ± 0.0
2.071GlnIle: 2.071 ± 0.82
0.829GlnLys: 0.829 ± 0.689
2.486GlnLeu: 2.486 ± 0.75
1.657GlnMet: 1.657 ± 0.484
3.314GlnAsn: 3.314 ± 1.612
4.143GlnPro: 4.143 ± 1.526
1.657GlnGln: 1.657 ± 0.882
3.314GlnArg: 3.314 ± 1.879
1.657GlnSer: 1.657 ± 0.93
2.9GlnThr: 2.9 ± 0.933
2.9GlnVal: 2.9 ± 0.834
1.243GlnTrp: 1.243 ± 0.579
1.657GlnTyr: 1.657 ± 0.599
0.0GlnXaa: 0.0 ± 0.0
Arg
3.314ArgAla: 3.314 ± 0.418
1.657ArgCys: 1.657 ± 0.747
2.486ArgAsp: 2.486 ± 1.295
4.143ArgGlu: 4.143 ± 0.78
3.314ArgPhe: 3.314 ± 0.418
5.385ArgGly: 5.385 ± 1.909
1.243ArgHis: 1.243 ± 0.634
2.071ArgIle: 2.071 ± 0.854
3.314ArgLys: 3.314 ± 0.396
7.457ArgLeu: 7.457 ± 1.298
0.414ArgMet: 0.414 ± 0.473
3.314ArgAsn: 3.314 ± 0.529
5.385ArgPro: 5.385 ± 1.657
2.9ArgGln: 2.9 ± 0.907
8.699ArgArg: 8.699 ± 2.704
4.143ArgSer: 4.143 ± 1.398
2.9ArgThr: 2.9 ± 0.831
6.214ArgVal: 6.214 ± 1.272
0.0ArgTrp: 0.0 ± 0.0
2.071ArgTyr: 2.071 ± 0.854
0.0ArgXaa: 0.0 ± 0.0
Ser
6.628SerAla: 6.628 ± 2.236
0.829SerCys: 0.829 ± 0.637
2.071SerAsp: 2.071 ± 0.714
4.971SerGlu: 4.971 ± 1.448
2.486SerPhe: 2.486 ± 1.119
9.528SerGly: 9.528 ± 1.843
0.414SerHis: 0.414 ± 0.411
2.071SerIle: 2.071 ± 1.072
3.728SerLys: 3.728 ± 1.147
9.114SerLeu: 9.114 ± 1.468
1.243SerMet: 1.243 ± 0.506
1.243SerAsn: 1.243 ± 0.681
4.143SerPro: 4.143 ± 1.446
2.9SerGln: 2.9 ± 0.592
4.557SerArg: 4.557 ± 1.698
5.8SerSer: 5.8 ± 1.037
6.214SerThr: 6.214 ± 1.951
2.9SerVal: 2.9 ± 0.978
0.829SerTrp: 0.829 ± 0.822
1.657SerTyr: 1.657 ± 0.256
0.0SerXaa: 0.0 ± 0.0
Thr
2.486ThrAla: 2.486 ± 0.992
1.657ThrCys: 1.657 ± 0.698
4.143ThrAsp: 4.143 ± 0.836
3.314ThrGlu: 3.314 ± 0.875
2.071ThrPhe: 2.071 ± 0.714
5.385ThrGly: 5.385 ± 2.145
1.243ThrHis: 1.243 ± 0.714
2.9ThrIle: 2.9 ± 1.098
1.243ThrLys: 1.243 ± 0.42
4.143ThrLeu: 4.143 ± 0.488
0.829ThrMet: 0.829 ± 0.505
2.486ThrAsn: 2.486 ± 0.833
3.314ThrPro: 3.314 ± 0.883
3.314ThrGln: 3.314 ± 0.968
4.143ThrArg: 4.143 ± 0.421
7.042ThrSer: 7.042 ± 2.587
4.143ThrThr: 4.143 ± 0.793
7.042ThrVal: 7.042 ± 1.508
0.829ThrTrp: 0.829 ± 0.822
0.414ThrTyr: 0.414 ± 0.326
0.0ThrXaa: 0.0 ± 0.0
Val
3.728ValAla: 3.728 ± 2.142
0.829ValCys: 0.829 ± 0.764
5.385ValAsp: 5.385 ± 1.215
3.728ValGlu: 3.728 ± 0.875
1.657ValPhe: 1.657 ± 0.833
4.143ValGly: 4.143 ± 1.578
0.829ValHis: 0.829 ± 0.653
3.728ValIle: 3.728 ± 1.28
3.728ValLys: 3.728 ± 0.891
3.728ValLeu: 3.728 ± 1.373
1.657ValMet: 1.657 ± 0.573
2.9ValAsn: 2.9 ± 1.167
3.728ValPro: 3.728 ± 1.144
2.071ValGln: 2.071 ± 0.807
4.971ValArg: 4.971 ± 0.986
5.385ValSer: 5.385 ± 2.274
5.8ValThr: 5.8 ± 1.624
4.971ValVal: 4.971 ± 0.82
1.243ValTrp: 1.243 ± 0.661
2.9ValTyr: 2.9 ± 1.1
0.0ValXaa: 0.0 ± 0.0
Trp
0.414TrpAla: 0.414 ± 0.302
0.0TrpCys: 0.0 ± 0.0
1.243TrpAsp: 1.243 ± 0.826
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.414TrpHis: 0.414 ± 0.411
0.414TrpIle: 0.414 ± 0.302
0.829TrpLys: 0.829 ± 0.505
0.829TrpLeu: 0.829 ± 0.358
0.414TrpMet: 0.414 ± 0.344
0.829TrpAsn: 0.829 ± 0.689
0.414TrpPro: 0.414 ± 0.344
0.829TrpGln: 0.829 ± 0.358
2.486TrpArg: 2.486 ± 1.107
0.414TrpSer: 0.414 ± 0.411
1.243TrpThr: 1.243 ± 0.826
1.243TrpVal: 1.243 ± 0.579
0.0TrpTrp: 0.0 ± 0.0
0.829TrpTyr: 0.829 ± 0.402
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.314TyrAla: 3.314 ± 0.968
0.829TyrCys: 0.829 ± 0.505
2.071TyrAsp: 2.071 ± 0.854
2.071TyrGlu: 2.071 ± 0.816
1.657TyrPhe: 1.657 ± 0.955
1.657TyrGly: 1.657 ± 0.256
0.414TyrHis: 0.414 ± 0.473
2.486TyrIle: 2.486 ± 1.026
2.486TyrLys: 2.486 ± 0.731
2.9TyrLeu: 2.9 ± 1.087
0.414TyrMet: 0.414 ± 0.302
1.243TyrAsn: 1.243 ± 0.374
1.243TyrPro: 1.243 ± 0.682
0.829TyrGln: 0.829 ± 0.358
1.657TyrArg: 1.657 ± 0.568
0.829TyrSer: 0.829 ± 0.441
1.243TyrThr: 1.243 ± 0.398
1.657TyrVal: 1.657 ± 0.683
0.414TyrTrp: 0.414 ± 0.411
1.657TyrTyr: 1.657 ± 1.168
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2415 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
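
A protein-level bootstrap of this kind could look roughly like the sketch below. It is only an illustration under stated assumptions: the source does not say which statistic is reported after the ± sign, so the standard deviation of the replicate estimates is used here as one common choice, and the function names (permille, bootstrap_error) and the fixed random seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, as many as in the proteome.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MKLLA", "MKVLL", "AALLG"]
    print(round(bootstrap_error(toy, "LL"), 3))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which matters for a small proteome such as this one with only 6 proteins.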

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski