Amino acid dipepetide frequency for Pan troglodytes polyomavirus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.797AlaAla: 3.797 ± 0.617
0.475AlaCys: 0.475 ± 0.515
2.373AlaAsp: 2.373 ± 0.756
2.848AlaGlu: 2.848 ± 1.028
1.424AlaPhe: 1.424 ± 0.434
1.898AlaGly: 1.898 ± 0.993
1.898AlaHis: 1.898 ± 0.751
3.797AlaIle: 3.797 ± 1.801
2.373AlaLys: 2.373 ± 0.705
3.322AlaLeu: 3.322 ± 1.706
0.0AlaMet: 0.0 ± 0.0
1.898AlaAsn: 1.898 ± 0.993
3.322AlaPro: 3.322 ± 1.424
1.424AlaGln: 1.424 ± 0.434
4.271AlaArg: 4.271 ± 2.301
3.322AlaSer: 3.322 ± 1.288
1.898AlaThr: 1.898 ± 1.39
3.797AlaVal: 3.797 ± 1.075
0.475AlaTrp: 0.475 ± 0.314
1.424AlaTyr: 1.424 ± 0.657
0.0AlaXaa: 0.0 ± 0.0
Cys
2.848CysAla: 2.848 ± 1.275
0.949CysCys: 0.949 ± 0.611
0.475CysAsp: 0.475 ± 0.314
0.475CysGlu: 0.475 ± 0.314
1.898CysPhe: 1.898 ± 0.99
1.424CysGly: 1.424 ± 0.629
0.475CysHis: 0.475 ± 0.515
1.424CysIle: 1.424 ± 0.565
1.898CysLys: 1.898 ± 0.864
2.373CysLeu: 2.373 ± 0.894
0.949CysMet: 0.949 ± 0.611
0.949CysAsn: 0.949 ± 0.627
1.898CysPro: 1.898 ± 1.225
0.475CysGln: 0.475 ± 0.314
0.949CysArg: 0.949 ± 0.611
1.898CysSer: 1.898 ± 0.85
0.475CysThr: 0.475 ± 0.314
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.424CysTyr: 1.424 ± 0.629
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
2.848AspAsp: 2.848 ± 1.131
6.645AspGlu: 6.645 ± 0.881
1.898AspPhe: 1.898 ± 1.255
4.746AspGly: 4.746 ± 1.389
0.475AspHis: 0.475 ± 0.314
3.322AspIle: 3.322 ± 1.288
3.322AspLys: 3.322 ± 1.287
5.221AspLeu: 5.221 ± 1.705
2.848AspMet: 2.848 ± 0.836
2.373AspAsn: 2.373 ± 0.579
5.221AspPro: 5.221 ± 0.867
3.322AspGln: 3.322 ± 1.182
0.949AspArg: 0.949 ± 0.497
3.322AspSer: 3.322 ± 1.026
1.424AspThr: 1.424 ± 0.636
1.424AspVal: 1.424 ± 0.941
2.373AspTrp: 2.373 ± 1.176
2.848AspTyr: 2.848 ± 0.823
0.0AspXaa: 0.0 ± 0.0
Glu
6.17GluAla: 6.17 ± 2.243
0.949GluCys: 0.949 ± 0.627
4.271GluAsp: 4.271 ± 0.683
8.068GluGlu: 8.068 ± 1.635
1.898GluPhe: 1.898 ± 0.922
3.797GluGly: 3.797 ± 1.43
0.0GluHis: 0.0 ± 0.0
2.848GluIle: 2.848 ± 1.572
7.119GluLys: 7.119 ± 2.328
5.221GluLeu: 5.221 ± 0.872
1.424GluMet: 1.424 ± 0.565
4.271GluAsn: 4.271 ± 1.483
0.949GluPro: 0.949 ± 0.878
1.898GluGln: 1.898 ± 0.697
1.424GluArg: 1.424 ± 0.634
2.848GluSer: 2.848 ± 1.342
0.949GluThr: 0.949 ± 0.98
2.848GluVal: 2.848 ± 1.197
0.475GluTrp: 0.475 ± 0.314
1.898GluTyr: 1.898 ± 0.679
0.0GluXaa: 0.0 ± 0.0
Phe
2.373PheAla: 2.373 ± 0.756
1.424PheCys: 1.424 ± 0.758
1.424PheAsp: 1.424 ± 0.565
3.322PheGlu: 3.322 ± 1.438
0.949PhePhe: 0.949 ± 0.425
0.949PheGly: 0.949 ± 0.878
2.373PheHis: 2.373 ± 0.695
2.848PheIle: 2.848 ± 1.409
3.322PheLys: 3.322 ± 1.438
4.746PheLeu: 4.746 ± 0.632
0.949PheMet: 0.949 ± 0.627
2.373PheAsn: 2.373 ± 0.623
2.848PhePro: 2.848 ± 0.763
2.373PheGln: 2.373 ± 1.099
0.475PheArg: 0.475 ± 0.314
4.746PheSer: 4.746 ± 0.632
3.797PheThr: 3.797 ± 1.185
0.949PheVal: 0.949 ± 0.627
0.0PheTrp: 0.0 ± 0.0
0.949PheTyr: 0.949 ± 0.497
0.0PheXaa: 0.0 ± 0.0
Gly
1.898GlyAla: 1.898 ± 0.899
0.475GlyCys: 0.475 ± 0.314
4.271GlyAsp: 4.271 ± 1.338
2.848GlyGlu: 2.848 ± 0.698
2.848GlyPhe: 2.848 ± 0.681
7.119GlyGly: 7.119 ± 1.369
0.949GlyHis: 0.949 ± 0.54
2.373GlyIle: 2.373 ± 0.929
1.424GlyLys: 1.424 ± 0.941
6.17GlyLeu: 6.17 ± 1.66
1.898GlyMet: 1.898 ± 0.751
3.322GlyAsn: 3.322 ± 0.904
4.746GlyPro: 4.746 ± 0.771
2.848GlyGln: 2.848 ± 0.823
0.949GlyArg: 0.949 ± 0.497
1.424GlySer: 1.424 ± 0.434
2.848GlyThr: 2.848 ± 0.643
3.797GlyVal: 3.797 ± 1.645
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
1.898HisAla: 1.898 ± 1.409
0.475HisCys: 0.475 ± 0.314
0.475HisAsp: 0.475 ± 0.515
0.475HisGlu: 0.475 ± 0.49
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.424HisLys: 1.424 ± 0.605
1.424HisLeu: 1.424 ± 1.038
3.322HisMet: 3.322 ± 1.07
1.424HisAsn: 1.424 ± 0.634
3.797HisPro: 3.797 ± 1.668
1.424HisGln: 1.424 ± 0.565
3.322HisArg: 3.322 ± 1.29
1.424HisSer: 1.424 ± 0.629
0.949HisThr: 0.949 ± 0.425
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.949HisTyr: 0.949 ± 0.627
0.0HisXaa: 0.0 ± 0.0
Ile
2.373IleAla: 2.373 ± 0.579
0.949IleCys: 0.949 ± 0.425
1.424IleAsp: 1.424 ± 0.605
3.797IleGlu: 3.797 ± 1.435
2.373IlePhe: 2.373 ± 1.372
1.898IleGly: 1.898 ± 0.822
0.475IleHis: 0.475 ± 0.314
3.322IleIle: 3.322 ± 1.437
3.797IleLys: 3.797 ± 0.927
7.594IleLeu: 7.594 ± 2.117
2.373IleMet: 2.373 ± 0.983
1.424IleAsn: 1.424 ± 0.936
1.898IlePro: 1.898 ± 0.751
2.373IleGln: 2.373 ± 0.729
0.475IleArg: 0.475 ± 0.314
4.271IleSer: 4.271 ± 1.399
2.848IleThr: 2.848 ± 1.197
5.221IleVal: 5.221 ± 1.446
1.424IleTrp: 1.424 ± 0.636
1.424IleTyr: 1.424 ± 0.671
0.0IleXaa: 0.0 ± 0.0
Lys
1.424LysAla: 1.424 ± 0.605
2.848LysCys: 2.848 ± 1.259
2.373LysAsp: 2.373 ± 0.787
5.695LysGlu: 5.695 ± 0.959
2.848LysPhe: 2.848 ± 0.907
4.746LysGly: 4.746 ± 1.418
4.271LysHis: 4.271 ± 0.912
2.373LysIle: 2.373 ± 1.151
4.746LysLys: 4.746 ± 1.008
3.322LysLeu: 3.322 ± 1.021
2.848LysMet: 2.848 ± 1.13
0.949LysAsn: 0.949 ± 0.627
3.797LysPro: 3.797 ± 0.842
1.424LysGln: 1.424 ± 0.605
6.17LysArg: 6.17 ± 1.748
2.373LysSer: 2.373 ± 0.532
3.797LysThr: 3.797 ± 1.593
3.322LysVal: 3.322 ± 1.021
0.0LysTrp: 0.0 ± 0.0
1.424LysTyr: 1.424 ± 0.671
0.0LysXaa: 0.0 ± 0.0
Leu
4.271LeuAla: 4.271 ± 2.36
1.898LeuCys: 1.898 ± 0.734
7.119LeuAsp: 7.119 ± 1.516
7.119LeuGlu: 7.119 ± 1.461
6.645LeuPhe: 6.645 ± 0.576
3.322LeuGly: 3.322 ± 1.904
1.898LeuHis: 1.898 ± 1.534
8.068LeuIle: 8.068 ± 1.893
2.848LeuLys: 2.848 ± 0.616
11.865LeuLeu: 11.865 ± 2.355
2.373LeuMet: 2.373 ± 0.623
7.594LeuAsn: 7.594 ± 1.575
8.068LeuPro: 8.068 ± 1.645
3.797LeuGln: 3.797 ± 1.593
5.221LeuArg: 5.221 ± 1.882
7.119LeuSer: 7.119 ± 1.213
6.17LeuThr: 6.17 ± 2.362
6.17LeuVal: 6.17 ± 0.304
2.373LeuTrp: 2.373 ± 1.039
3.797LeuTyr: 3.797 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
1.424MetAla: 1.424 ± 0.671
1.424MetCys: 1.424 ± 0.565
2.373MetAsp: 2.373 ± 0.894
0.475MetGlu: 0.475 ± 0.439
0.949MetPhe: 0.949 ± 0.425
1.898MetGly: 1.898 ± 0.891
0.949MetHis: 0.949 ± 0.767
0.949MetIle: 0.949 ± 0.767
4.746MetLys: 4.746 ± 1.965
4.271MetLeu: 4.271 ± 0.975
0.0MetMet: 0.0 ± 0.0
1.898MetAsn: 1.898 ± 0.554
1.424MetPro: 1.424 ± 0.825
1.424MetGln: 1.424 ± 0.565
4.271MetArg: 4.271 ± 1.497
1.898MetSer: 1.898 ± 0.597
0.949MetThr: 0.949 ± 0.425
0.0MetVal: 0.0 ± 0.0
0.475MetTrp: 0.475 ± 0.439
0.475MetTyr: 0.475 ± 0.49
0.0MetXaa: 0.0 ± 0.0
Asn
3.322AsnAla: 3.322 ± 1.268
1.424AsnCys: 1.424 ± 0.941
1.898AsnAsp: 1.898 ± 0.85
1.898AsnGlu: 1.898 ± 0.85
2.848AsnPhe: 2.848 ± 0.773
0.949AsnGly: 0.949 ± 0.54
0.0AsnHis: 0.0 ± 0.0
2.373AsnIle: 2.373 ± 0.756
3.322AsnLys: 3.322 ± 1.753
5.695AsnLeu: 5.695 ± 1.677
1.898AsnMet: 1.898 ± 0.534
3.322AsnAsn: 3.322 ± 0.904
2.373AsnPro: 2.373 ± 1.063
3.322AsnGln: 3.322 ± 0.793
2.373AsnArg: 2.373 ± 0.804
2.848AsnSer: 2.848 ± 0.348
2.373AsnThr: 2.373 ± 1.655
2.848AsnVal: 2.848 ± 0.794
0.475AsnTrp: 0.475 ± 0.314
3.322AsnTyr: 3.322 ± 0.541
0.0AsnXaa: 0.0 ± 0.0
Pro
2.848ProAla: 2.848 ± 1.649
2.848ProCys: 2.848 ± 0.836
6.645ProAsp: 6.645 ± 1.188
2.848ProGlu: 2.848 ± 1.011
1.424ProPhe: 1.424 ± 0.758
3.322ProGly: 3.322 ± 1.437
0.475ProHis: 0.475 ± 0.314
0.949ProIle: 0.949 ± 0.425
4.746ProLys: 4.746 ± 1.182
9.018ProLeu: 9.018 ± 2.503
0.475ProMet: 0.475 ± 0.439
1.898ProAsn: 1.898 ± 0.642
11.391ProPro: 11.391 ± 1.764
3.797ProGln: 3.797 ± 1.149
5.221ProArg: 5.221 ± 2.849
3.322ProSer: 3.322 ± 0.82
3.797ProThr: 3.797 ± 1.782
6.645ProVal: 6.645 ± 1.563
0.475ProTrp: 0.475 ± 0.49
0.475ProTyr: 0.475 ± 0.439
0.0ProXaa: 0.0 ± 0.0
Gln
2.373GlnAla: 2.373 ± 0.823
0.949GlnCys: 0.949 ± 0.627
2.373GlnAsp: 2.373 ± 0.919
0.0GlnGlu: 0.0 ± 0.0
4.746GlnPhe: 4.746 ± 1.534
1.424GlnGly: 1.424 ± 0.805
1.424GlnHis: 1.424 ± 1.038
5.221GlnIle: 5.221 ± 0.583
2.373GlnLys: 2.373 ± 0.823
5.695GlnLeu: 5.695 ± 1.305
1.898GlnMet: 1.898 ± 0.71
0.949GlnAsn: 0.949 ± 0.425
1.898GlnPro: 1.898 ± 0.619
3.322GlnGln: 3.322 ± 1.209
3.322GlnArg: 3.322 ± 0.843
2.373GlnSer: 2.373 ± 0.705
2.373GlnThr: 2.373 ± 0.695
2.848GlnVal: 2.848 ± 1.133
0.0GlnTrp: 0.0 ± 0.0
1.424GlnTyr: 1.424 ± 0.647
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
1.424ArgCys: 1.424 ± 1.038
5.221ArgAsp: 5.221 ± 0.736
2.848ArgGlu: 2.848 ± 0.831
3.322ArgPhe: 3.322 ± 0.508
2.373ArgGly: 2.373 ± 0.705
0.949ArgHis: 0.949 ± 0.567
3.322ArgIle: 3.322 ± 0.861
4.271ArgLys: 4.271 ± 1.179
3.322ArgLeu: 3.322 ± 0.981
0.949ArgMet: 0.949 ± 0.597
3.797ArgAsn: 3.797 ± 0.96
2.848ArgPro: 2.848 ± 2.301
4.271ArgGln: 4.271 ± 2.305
9.967ArgArg: 9.967 ± 3.295
3.322ArgSer: 3.322 ± 0.805
1.424ArgThr: 1.424 ± 0.805
2.848ArgVal: 2.848 ± 0.461
0.475ArgTrp: 0.475 ± 0.49
2.848ArgTyr: 2.848 ± 1.38
0.0ArgXaa: 0.0 ± 0.0
Ser
3.797SerAla: 3.797 ± 1.209
1.898SerCys: 1.898 ± 0.85
2.848SerAsp: 2.848 ± 0.616
2.373SerGlu: 2.373 ± 0.823
3.797SerPhe: 3.797 ± 1.364
4.746SerGly: 4.746 ± 1.617
0.475SerHis: 0.475 ± 0.314
2.848SerIle: 2.848 ± 1.361
1.898SerLys: 1.898 ± 0.85
14.713SerLeu: 14.713 ± 1.847
1.424SerMet: 1.424 ± 0.983
2.373SerAsn: 2.373 ± 1.138
4.746SerPro: 4.746 ± 1.064
4.271SerGln: 4.271 ± 1.059
2.373SerArg: 2.373 ± 0.623
8.068SerSer: 8.068 ± 2.443
5.221SerThr: 5.221 ± 1.485
2.373SerVal: 2.373 ± 1.36
0.0SerTrp: 0.0 ± 0.0
1.424SerTyr: 1.424 ± 0.936
0.0SerXaa: 0.0 ± 0.0
Thr
1.424ThrAla: 1.424 ± 0.936
1.424ThrCys: 1.424 ± 0.605
1.898ThrAsp: 1.898 ± 1.075
3.322ThrGlu: 3.322 ± 1.159
0.0ThrPhe: 0.0 ± 0.0
1.898ThrGly: 1.898 ± 0.718
0.475ThrHis: 0.475 ± 0.49
0.475ThrIle: 0.475 ± 0.439
1.898ThrLys: 1.898 ± 1.225
5.221ThrLeu: 5.221 ± 1.265
0.475ThrMet: 0.475 ± 0.314
2.848ThrAsn: 2.848 ± 1.525
5.695ThrPro: 5.695 ± 1.053
1.424ThrGln: 1.424 ± 0.605
2.373ThrArg: 2.373 ± 0.882
8.543ThrSer: 8.543 ± 1.384
3.322ThrThr: 3.322 ± 1.114
4.746ThrVal: 4.746 ± 2.241
0.475ThrTrp: 0.475 ± 0.49
1.424ThrTyr: 1.424 ± 0.918
0.0ThrXaa: 0.0 ± 0.0
Val
1.424ValAla: 1.424 ± 0.434
0.949ValCys: 0.949 ± 0.611
2.373ValAsp: 2.373 ± 0.426
2.848ValGlu: 2.848 ± 1.759
1.898ValPhe: 1.898 ± 0.534
1.424ValGly: 1.424 ± 0.854
2.848ValHis: 2.848 ± 0.461
2.848ValIle: 2.848 ± 0.794
2.848ValLys: 2.848 ± 1.209
6.645ValLeu: 6.645 ± 1.44
3.797ValMet: 3.797 ± 2.078
3.322ValAsn: 3.322 ± 1.127
2.848ValPro: 2.848 ± 0.952
0.949ValGln: 0.949 ± 0.98
2.848ValArg: 2.848 ± 0.681
5.695ValSer: 5.695 ± 1.345
3.797ValThr: 3.797 ± 1.714
3.322ValVal: 3.322 ± 1.706
0.0ValTrp: 0.0 ± 0.0
1.898ValTyr: 1.898 ± 0.597
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.475TrpCys: 0.475 ± 0.439
1.424TrpAsp: 1.424 ± 0.657
0.475TrpGlu: 0.475 ± 0.439
0.475TrpPhe: 0.475 ± 0.314
0.475TrpGly: 0.475 ± 0.515
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.949TrpLys: 0.949 ± 0.567
0.0TrpLeu: 0.0 ± 0.0
0.475TrpMet: 0.475 ± 0.49
0.475TrpAsn: 0.475 ± 0.314
0.0TrpPro: 0.0 ± 0.0
1.424TrpGln: 1.424 ± 0.565
0.475TrpArg: 0.475 ± 0.49
0.949TrpSer: 0.949 ± 0.98
0.0TrpThr: 0.0 ± 0.0
0.475TrpVal: 0.475 ± 0.49
0.949TrpTrp: 0.949 ± 0.567
1.424TrpTyr: 1.424 ± 0.758
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.373TyrAla: 2.373 ± 0.823
0.0TyrCys: 0.0 ± 0.0
0.475TyrAsp: 0.475 ± 0.49
0.949TyrGlu: 0.949 ± 0.425
0.949TyrPhe: 0.949 ± 0.878
3.797TyrGly: 3.797 ± 0.826
2.373TyrHis: 2.373 ± 0.882
2.373TyrIle: 2.373 ± 1.619
1.424TyrLys: 1.424 ± 0.657
1.898TyrLeu: 1.898 ± 1.012
1.424TyrMet: 1.424 ± 0.627
1.424TyrAsn: 1.424 ± 0.936
2.848TyrPro: 2.848 ± 0.836
1.424TyrGln: 1.424 ± 0.671
2.373TyrArg: 2.373 ± 1.325
1.898TyrSer: 1.898 ± 0.53
0.949TyrThr: 0.949 ± 0.627
0.949TyrVal: 0.949 ± 0.6
0.475TyrTrp: 0.475 ± 0.314
0.949TyrTyr: 0.949 ± 0.98
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2108 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski