Amino acid dipepetide frequency for Bornean orang-utan polyomavirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.986AlaAla: 3.986 ± 2.241
0.569AlaCys: 0.569 ± 0.586
1.708AlaAsp: 1.708 ± 0.685
3.417AlaGlu: 3.417 ± 1.059
1.708AlaPhe: 1.708 ± 0.685
1.708AlaGly: 1.708 ± 0.835
3.417AlaHis: 3.417 ± 1.386
3.417AlaIle: 3.417 ± 1.204
1.139AlaLys: 1.139 ± 0.787
7.973AlaLeu: 7.973 ± 1.74
0.569AlaMet: 0.569 ± 0.563
1.708AlaAsn: 1.708 ± 1.285
1.708AlaPro: 1.708 ± 0.92
0.0AlaGln: 0.0 ± 0.0
6.834AlaArg: 6.834 ± 2.46
4.556AlaSer: 4.556 ± 2.173
2.278AlaThr: 2.278 ± 1.086
6.264AlaVal: 6.264 ± 2.201
0.569AlaTrp: 0.569 ± 0.393
1.139AlaTyr: 1.139 ± 0.502
0.0AlaXaa: 0.0 ± 0.0
Cys
1.139CysAla: 1.139 ± 0.693
0.0CysCys: 0.0 ± 0.0
1.139CysAsp: 1.139 ± 0.502
0.0CysGlu: 0.0 ± 0.0
1.708CysPhe: 1.708 ± 1.49
0.569CysGly: 0.569 ± 0.73
0.569CysHis: 0.569 ± 0.586
1.139CysIle: 1.139 ± 1.46
3.417CysLys: 3.417 ± 1.277
2.847CysLeu: 2.847 ± 1.477
0.569CysMet: 0.569 ± 0.393
1.139CysAsn: 1.139 ± 0.865
3.417CysPro: 3.417 ± 0.77
0.569CysGln: 0.569 ± 0.393
0.569CysArg: 0.569 ± 0.586
2.278CysSer: 2.278 ± 1.574
0.569CysThr: 0.569 ± 0.393
0.0CysVal: 0.0 ± 0.0
0.569CysTrp: 0.569 ± 0.586
2.278CysTyr: 2.278 ± 1.386
0.0CysXaa: 0.0 ± 0.0
Asp
1.708AspAla: 1.708 ± 1.229
0.0AspCys: 0.0 ± 0.0
2.278AspAsp: 2.278 ± 0.624
5.125AspGlu: 5.125 ± 0.421
2.278AspPhe: 2.278 ± 1.162
6.264AspGly: 6.264 ± 2.103
0.569AspHis: 0.569 ± 0.393
5.125AspIle: 5.125 ± 1.521
3.417AspLys: 3.417 ± 0.972
4.556AspLeu: 4.556 ± 1.248
0.569AspMet: 0.569 ± 0.586
1.139AspAsn: 1.139 ± 0.787
4.556AspPro: 4.556 ± 1.422
2.278AspGln: 2.278 ± 1.583
2.278AspArg: 2.278 ± 0.893
3.417AspSer: 3.417 ± 2.035
0.0AspThr: 0.0 ± 0.0
2.278AspVal: 2.278 ± 1.574
2.847AspTrp: 2.847 ± 1.479
1.708AspTyr: 1.708 ± 0.693
0.0AspXaa: 0.0 ± 0.0
Glu
3.417GluAla: 3.417 ± 1.415
4.556GluCys: 4.556 ± 1.859
5.125GluAsp: 5.125 ± 1.311
7.403GluGlu: 7.403 ± 3.807
2.847GluPhe: 2.847 ± 1.967
4.556GluGly: 4.556 ± 1.055
1.708GluHis: 1.708 ± 0.671
1.139GluIle: 1.139 ± 0.705
5.695GluLys: 5.695 ± 0.828
5.125GluLeu: 5.125 ± 1.486
0.569GluMet: 0.569 ± 0.563
6.264GluAsn: 6.264 ± 0.987
1.708GluPro: 1.708 ± 0.685
0.0GluGln: 0.0 ± 0.0
1.139GluArg: 1.139 ± 0.787
6.834GluSer: 6.834 ± 2.021
2.847GluThr: 2.847 ± 1.077
5.125GluVal: 5.125 ± 1.188
0.0GluTrp: 0.0 ± 0.0
4.556GluTyr: 4.556 ± 1.012
0.0GluXaa: 0.0 ± 0.0
Phe
5.695PheAla: 5.695 ± 0.965
1.139PheCys: 1.139 ± 0.693
1.708PheAsp: 1.708 ± 0.867
3.417PheGlu: 3.417 ± 2.361
0.569PhePhe: 0.569 ± 0.563
3.986PheGly: 3.986 ± 1.129
0.569PheHis: 0.569 ± 0.586
0.569PheIle: 0.569 ± 0.586
3.417PheLys: 3.417 ± 1.188
5.695PheLeu: 5.695 ± 2.276
0.0PheMet: 0.0 ± 0.0
1.708PheAsn: 1.708 ± 1.18
3.417PhePro: 3.417 ± 0.77
2.847PheGln: 2.847 ± 1.077
0.569PheArg: 0.569 ± 0.393
2.847PheSer: 2.847 ± 1.521
2.847PheThr: 2.847 ± 1.493
0.569PheVal: 0.569 ± 0.393
0.569PheTrp: 0.569 ± 0.73
0.569PheTyr: 0.569 ± 0.586
0.0PheXaa: 0.0 ± 0.0
Gly
2.847GlyAla: 2.847 ± 1.671
1.139GlyCys: 1.139 ± 0.502
3.986GlyAsp: 3.986 ± 0.669
4.556GlyGlu: 4.556 ± 2.144
2.847GlyPhe: 2.847 ± 0.909
4.556GlyGly: 4.556 ± 0.743
3.417GlyHis: 3.417 ± 1.386
5.125GlyIle: 5.125 ± 1.054
3.986GlyLys: 3.986 ± 1.534
6.264GlyLeu: 6.264 ± 2.913
2.278GlyMet: 2.278 ± 1.543
1.708GlyAsn: 1.708 ± 0.671
5.125GlyPro: 5.125 ± 1.827
3.417GlyGln: 3.417 ± 1.505
1.708GlyArg: 1.708 ± 1.368
5.125GlySer: 5.125 ± 0.934
1.708GlyThr: 1.708 ± 0.534
4.556GlyVal: 4.556 ± 0.802
0.0GlyTrp: 0.0 ± 0.0
1.708GlyTyr: 1.708 ± 1.167
0.0GlyXaa: 0.0 ± 0.0
His
2.278HisAla: 2.278 ± 1.086
1.708HisCys: 1.708 ± 0.859
0.569HisAsp: 0.569 ± 0.393
1.139HisGlu: 1.139 ± 0.831
1.708HisPhe: 1.708 ± 0.693
0.0HisGly: 0.0 ± 0.0
0.569HisHis: 0.569 ± 0.393
0.569HisIle: 0.569 ± 0.73
1.139HisLys: 1.139 ± 0.502
2.278HisLeu: 2.278 ± 0.762
1.708HisMet: 1.708 ± 1.681
0.569HisAsn: 0.569 ± 0.393
1.708HisPro: 1.708 ± 0.859
4.556HisGln: 4.556 ± 2.88
1.708HisArg: 1.708 ± 0.859
0.0HisSer: 0.0 ± 0.0
1.708HisThr: 1.708 ± 0.693
0.569HisVal: 0.569 ± 0.393
0.569HisTrp: 0.569 ± 0.73
1.708HisTyr: 1.708 ± 0.693
0.0HisXaa: 0.0 ± 0.0
Ile
0.0IleAla: 0.0 ± 0.0
0.0IleCys: 0.0 ± 0.0
1.139IleAsp: 1.139 ± 0.502
5.695IleGlu: 5.695 ± 1.322
1.708IlePhe: 1.708 ± 1.18
2.278IleGly: 2.278 ± 1.225
1.139IleHis: 1.139 ± 0.831
1.139IleIle: 1.139 ± 0.787
1.139IleLys: 1.139 ± 0.693
5.125IleLeu: 5.125 ± 2.079
0.569IleMet: 0.569 ± 0.393
1.708IleAsn: 1.708 ± 0.859
3.986IlePro: 3.986 ± 0.669
0.0IleGln: 0.0 ± 0.0
3.417IleArg: 3.417 ± 1.229
3.417IleSer: 3.417 ± 2.263
3.417IleThr: 3.417 ± 1.024
4.556IleVal: 4.556 ± 1.215
0.569IleTrp: 0.569 ± 0.73
2.278IleTyr: 2.278 ± 1.142
0.0IleXaa: 0.0 ± 0.0
Lys
2.278LysAla: 2.278 ± 1.142
2.278LysCys: 2.278 ± 2.081
2.847LysAsp: 2.847 ± 1.493
3.986LysGlu: 3.986 ± 0.427
1.139LysPhe: 1.139 ± 0.693
3.417LysGly: 3.417 ± 1.402
3.417LysHis: 3.417 ± 1.718
3.417LysIle: 3.417 ± 0.945
5.125LysLys: 5.125 ± 1.833
7.403LysLeu: 7.403 ± 2.53
2.278LysMet: 2.278 ± 1.386
2.278LysAsn: 2.278 ± 1.003
3.417LysPro: 3.417 ± 0.972
1.139LysGln: 1.139 ± 0.693
4.556LysArg: 4.556 ± 1.504
2.847LysSer: 2.847 ± 1.135
5.695LysThr: 5.695 ± 2.27
1.139LysVal: 1.139 ± 0.865
0.0LysTrp: 0.0 ± 0.0
3.986LysTyr: 3.986 ± 1.153
0.0LysXaa: 0.0 ± 0.0
Leu
2.847LeuAla: 2.847 ± 0.493
2.847LeuCys: 2.847 ± 1.354
5.695LeuAsp: 5.695 ± 1.455
2.847LeuGlu: 2.847 ± 0.995
6.264LeuPhe: 6.264 ± 1.302
4.556LeuGly: 4.556 ± 1.18
1.139LeuHis: 1.139 ± 0.831
6.264LeuIle: 6.264 ± 1.221
4.556LeuLys: 4.556 ± 1.358
9.112LeuLeu: 9.112 ± 3.241
7.973LeuMet: 7.973 ± 1.596
8.542LeuAsn: 8.542 ± 1.329
5.695LeuPro: 5.695 ± 1.732
7.973LeuGln: 7.973 ± 1.757
3.417LeuArg: 3.417 ± 0.637
5.125LeuSer: 5.125 ± 2.179
5.695LeuThr: 5.695 ± 1.066
5.695LeuVal: 5.695 ± 1.27
1.139LeuTrp: 1.139 ± 0.693
3.417LeuTyr: 3.417 ± 1.381
0.0LeuXaa: 0.0 ± 0.0
Met
5.695MetAla: 5.695 ± 0.93
0.569MetCys: 0.569 ± 0.393
1.708MetAsp: 1.708 ± 0.671
2.278MetGlu: 2.278 ± 1.33
1.708MetPhe: 1.708 ± 0.685
2.847MetGly: 2.847 ± 1.501
1.708MetHis: 1.708 ± 0.867
0.0MetIle: 0.0 ± 0.0
1.708MetLys: 1.708 ± 0.671
2.278MetLeu: 2.278 ± 1.225
1.139MetMet: 1.139 ± 1.172
1.708MetAsn: 1.708 ± 0.685
1.139MetPro: 1.139 ± 1.172
0.569MetGln: 0.569 ± 0.586
0.0MetArg: 0.0 ± 0.0
1.139MetSer: 1.139 ± 0.693
1.139MetThr: 1.139 ± 0.593
0.569MetVal: 0.569 ± 0.586
0.569MetTrp: 0.569 ± 0.586
0.569MetTyr: 0.569 ± 0.586
0.0MetXaa: 0.0 ± 0.0
Asn
2.847AsnAla: 2.847 ± 1.511
1.708AsnCys: 1.708 ± 0.671
2.847AsnAsp: 2.847 ± 1.493
3.417AsnGlu: 3.417 ± 0.972
3.417AsnPhe: 3.417 ± 1.188
1.139AsnGly: 1.139 ± 0.502
1.139AsnHis: 1.139 ± 0.693
2.847AsnIle: 2.847 ± 0.831
2.847AsnLys: 2.847 ± 1.055
5.695AsnLeu: 5.695 ± 2.276
0.0AsnMet: 0.0 ± 0.0
0.569AsnAsn: 0.569 ± 0.586
2.847AsnPro: 2.847 ± 0.87
0.0AsnGln: 0.0 ± 0.0
1.708AsnArg: 1.708 ± 0.835
2.278AsnSer: 2.278 ± 0.624
3.417AsnThr: 3.417 ± 1.506
4.556AsnVal: 4.556 ± 1.289
0.569AsnTrp: 0.569 ± 0.393
1.139AsnTyr: 1.139 ± 0.502
0.0AsnXaa: 0.0 ± 0.0
Pro
1.708ProAla: 1.708 ± 0.693
0.569ProCys: 0.569 ± 0.393
6.834ProAsp: 6.834 ± 1.545
1.708ProGlu: 1.708 ± 0.859
1.708ProPhe: 1.708 ± 0.685
6.834ProGly: 6.834 ± 2.072
0.0ProHis: 0.0 ± 0.0
2.278ProIle: 2.278 ± 1.003
5.695ProLys: 5.695 ± 1.6
4.556ProLeu: 4.556 ± 1.165
1.139ProMet: 1.139 ± 1.172
1.139ProAsn: 1.139 ± 0.787
5.125ProPro: 5.125 ± 1.131
3.417ProGln: 3.417 ± 1.765
2.278ProArg: 2.278 ± 1.052
3.986ProSer: 3.986 ± 0.839
5.695ProThr: 5.695 ± 0.712
3.417ProVal: 3.417 ± 2.739
1.139ProTrp: 1.139 ± 0.831
1.139ProTyr: 1.139 ± 0.502
0.0ProXaa: 0.0 ± 0.0
Gln
3.417GlnAla: 3.417 ± 0.813
0.569GlnCys: 0.569 ± 0.73
3.417GlnAsp: 3.417 ± 0.637
4.556GlnGlu: 4.556 ± 1.974
2.847GlnPhe: 2.847 ± 1.493
1.708GlnGly: 1.708 ± 1.017
0.569GlnHis: 0.569 ± 0.73
2.847GlnIle: 2.847 ± 0.493
3.417GlnLys: 3.417 ± 1.366
2.278GlnLeu: 2.278 ± 1.661
1.139GlnMet: 1.139 ± 0.685
0.0GlnAsn: 0.0 ± 0.0
1.708GlnPro: 1.708 ± 1.017
1.139GlnGln: 1.139 ± 0.502
1.708GlnArg: 1.708 ± 0.693
1.139GlnSer: 1.139 ± 0.502
1.139GlnThr: 1.139 ± 0.502
3.986GlnVal: 3.986 ± 0.669
0.0GlnTrp: 0.0 ± 0.0
0.569GlnTyr: 0.569 ± 0.393
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
0.569ArgCys: 0.569 ± 0.393
2.847ArgAsp: 2.847 ± 0.995
3.417ArgGlu: 3.417 ± 1.059
2.278ArgPhe: 2.278 ± 1.142
2.847ArgGly: 2.847 ± 0.761
2.847ArgHis: 2.847 ± 1.479
1.708ArgIle: 1.708 ± 1.017
5.125ArgLys: 5.125 ± 1.466
6.834ArgLeu: 6.834 ± 1.631
1.708ArgMet: 1.708 ± 1.757
2.278ArgAsn: 2.278 ± 1.186
2.847ArgPro: 2.847 ± 0.761
2.847ArgGln: 2.847 ± 0.831
5.125ArgArg: 5.125 ± 2.079
2.278ArgSer: 2.278 ± 1.142
0.0ArgThr: 0.0 ± 0.0
2.278ArgVal: 2.278 ± 0.624
1.139ArgTrp: 1.139 ± 0.831
3.417ArgTyr: 3.417 ± 1.883
0.0ArgXaa: 0.0 ± 0.0
Ser
4.556SerAla: 4.556 ± 1.278
0.569SerCys: 0.569 ± 0.393
0.569SerAsp: 0.569 ± 0.393
5.695SerGlu: 5.695 ± 1.571
3.417SerPhe: 3.417 ± 1.835
5.125SerGly: 5.125 ± 2.237
1.139SerHis: 1.139 ± 0.831
0.569SerIle: 0.569 ± 0.563
3.986SerLys: 3.986 ± 1.153
6.834SerLeu: 6.834 ± 0.793
0.569SerMet: 0.569 ± 0.586
3.986SerAsn: 3.986 ± 0.427
2.847SerPro: 2.847 ± 0.995
1.139SerGln: 1.139 ± 0.787
8.542SerArg: 8.542 ± 2.237
3.986SerSer: 3.986 ± 0.427
3.986SerThr: 3.986 ± 1.888
2.278SerVal: 2.278 ± 1.003
1.139SerTrp: 1.139 ± 0.693
1.708SerTyr: 1.708 ± 0.92
0.0SerXaa: 0.0 ± 0.0
Thr
2.278ThrAla: 2.278 ± 0.952
2.278ThrCys: 2.278 ± 1.052
1.708ThrAsp: 1.708 ± 0.693
4.556ThrGlu: 4.556 ± 1.798
2.278ThrPhe: 2.278 ± 1.661
3.986ThrGly: 3.986 ± 2.123
0.569ThrHis: 0.569 ± 0.393
1.139ThrIle: 1.139 ± 0.593
0.569ThrLys: 0.569 ± 0.393
6.264ThrLeu: 6.264 ± 1.448
1.708ThrMet: 1.708 ± 0.835
2.278ThrAsn: 2.278 ± 0.999
3.417ThrPro: 3.417 ± 0.637
3.986ThrGln: 3.986 ± 1.615
1.139ThrArg: 1.139 ± 0.502
2.847ThrSer: 2.847 ± 0.933
2.847ThrThr: 2.847 ± 0.493
4.556ThrVal: 4.556 ± 1.14
0.569ThrTrp: 0.569 ± 0.393
1.708ThrTyr: 1.708 ± 0.92
0.0ThrXaa: 0.0 ± 0.0
Val
5.695ValAla: 5.695 ± 1.322
1.708ValCys: 1.708 ± 0.693
0.569ValAsp: 0.569 ± 0.563
5.125ValGlu: 5.125 ± 1.482
0.569ValPhe: 0.569 ± 0.393
4.556ValGly: 4.556 ± 2.939
0.569ValHis: 0.569 ± 0.393
3.417ValIle: 3.417 ± 1.204
2.847ValLys: 2.847 ± 1.493
5.695ValLeu: 5.695 ± 1.322
1.708ValMet: 1.708 ± 0.534
4.556ValAsn: 4.556 ± 1.028
2.278ValPro: 2.278 ± 0.999
1.139ValGln: 1.139 ± 1.172
1.139ValArg: 1.139 ± 1.172
5.125ValSer: 5.125 ± 1.127
3.986ValThr: 3.986 ± 1.982
2.847ValVal: 2.847 ± 1.354
3.417ValTrp: 3.417 ± 1.58
2.847ValTyr: 2.847 ± 0.761
0.0ValXaa: 0.0 ± 0.0
Trp
1.708TrpAla: 1.708 ± 0.693
0.569TrpCys: 0.569 ± 0.586
2.847TrpAsp: 2.847 ± 1.479
1.708TrpGlu: 1.708 ± 0.671
1.139TrpPhe: 1.139 ± 1.46
0.569TrpGly: 0.569 ± 0.73
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.569TrpLys: 0.569 ± 0.393
0.0TrpLeu: 0.0 ± 0.0
1.139TrpMet: 1.139 ± 0.831
0.569TrpAsn: 0.569 ± 0.393
0.569TrpPro: 0.569 ± 0.73
1.139TrpGln: 1.139 ± 0.693
1.139TrpArg: 1.139 ± 0.831
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.139TrpVal: 1.139 ± 0.831
0.569TrpTrp: 0.569 ± 0.393
1.139TrpTyr: 1.139 ± 0.787
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.569TyrAla: 0.569 ± 0.563
1.139TyrCys: 1.139 ± 0.693
2.847TyrAsp: 2.847 ± 0.993
0.569TyrGlu: 0.569 ± 0.393
0.569TyrPhe: 0.569 ± 0.586
5.125TyrGly: 5.125 ± 1.15
1.708TyrHis: 1.708 ± 0.859
0.569TyrIle: 0.569 ± 0.586
2.847TyrLys: 2.847 ± 1.565
3.986TyrLeu: 3.986 ± 1.402
1.139TyrMet: 1.139 ± 0.787
1.139TyrAsn: 1.139 ± 0.693
2.847TyrPro: 2.847 ± 0.761
0.0TyrGln: 0.0 ± 0.0
3.417TyrArg: 3.417 ± 1.381
3.417TyrSer: 3.417 ± 0.77
1.708TyrThr: 1.708 ± 0.685
3.417TyrVal: 3.417 ± 1.506
0.569TyrTrp: 0.569 ± 0.393
3.417TyrTyr: 3.417 ± 1.765
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1757 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski