Amino acid dipeptide frequency for Chimpanzee polyomavirus Bob

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
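The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, overlapping windows within each protein are assumed, and the normalization (dividing by the number of windows counted) is an assumption, since this page does not state how the values were normalized.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is pooled as "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping windows, scanned separately per protein,
    normalized by the total number of windows counted.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


def classify(permille):
    """Label a value relative to the uniform 2.5 per mille expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"
```

With this threshold, a value such as 13.858‰ (LeuLeu in the table) would be labeled "over" and 0.554‰ (AlaCys) "under". Whether the page colors cells using exactly this cutoff is an assumption.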

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.652 ± 1.664
AlaCys: 0.554 ± 0.586
AlaAsp: 3.326 ± 1.564
AlaGlu: 3.326 ± 1.005
AlaPhe: 3.88 ± 1.444
AlaGly: 2.217 ± 1.102
AlaHis: 2.772 ± 1.08
AlaIle: 4.989 ± 1.333
AlaLys: 3.88 ± 1.337
AlaLeu: 7.206 ± 2.128
AlaMet: 1.109 ± 0.747
AlaAsn: 1.663 ± 1.243
AlaPro: 1.663 ± 1.338
AlaGln: 2.772 ± 0.496
AlaArg: 3.326 ± 1.687
AlaSer: 6.652 ± 1.861
AlaThr: 0.0 ± 0.0
AlaVal: 6.652 ± 2.198
AlaTrp: 0.554 ± 0.414
AlaTyr: 1.109 ± 0.632
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.109 ± 0.439
CysCys: 0.0 ± 0.0
CysAsp: 0.554 ± 0.446
CysGlu: 0.554 ± 0.414
CysPhe: 2.217 ± 1.856
CysGly: 2.217 ± 1.002
CysHis: 0.0 ± 0.0
CysIle: 0.554 ± 0.657
CysLys: 2.772 ± 1.128
CysLeu: 2.772 ± 1.246
CysMet: 0.0 ± 0.0
CysAsn: 2.217 ± 1.264
CysPro: 1.109 ± 0.439
CysGln: 2.217 ± 1.178
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.554 ± 0.414
CysVal: 3.88 ± 0.704
CysTrp: 0.0 ± 0.0
CysTyr: 1.109 ± 0.632
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.109 ± 0.439
AspCys: 0.554 ± 0.657
AspAsp: 0.554 ± 0.414
AspGlu: 1.663 ± 0.728
AspPhe: 3.326 ± 1.924
AspGly: 4.435 ± 1.962
AspHis: 0.554 ± 0.414
AspIle: 4.989 ± 0.639
AspLys: 4.435 ± 1.323
AspLeu: 2.772 ± 1.495
AspMet: 3.326 ± 1.338
AspAsn: 1.663 ± 0.782
AspPro: 3.326 ± 1.532
AspGln: 2.217 ± 1.154
AspArg: 2.217 ± 1.168
AspSer: 2.217 ± 0.878
AspThr: 1.663 ± 0.782
AspVal: 1.663 ± 0.843
AspTrp: 2.217 ± 1.586
AspTyr: 1.109 ± 0.439
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.989 ± 2.373
GluCys: 1.109 ± 0.632
GluAsp: 2.217 ± 1.1
GluGlu: 6.098 ± 3.163
GluPhe: 2.772 ± 1.08
GluGly: 5.543 ± 0.578
GluHis: 1.109 ± 0.892
GluIle: 2.217 ± 0.779
GluLys: 8.315 ± 0.705
GluLeu: 4.989 ± 0.97
GluMet: 2.217 ± 1.264
GluAsn: 4.989 ± 1.333
GluPro: 1.109 ± 0.892
GluGln: 2.772 ± 1.494
GluArg: 1.109 ± 0.829
GluSer: 4.435 ± 0.32
GluThr: 2.217 ± 0.495
GluVal: 3.88 ± 1.372
GluTrp: 1.109 ± 0.892
GluTyr: 0.554 ± 0.414
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.772 ± 1.495
PheCys: 1.663 ± 0.843
PheAsp: 2.772 ± 0.877
PheGlu: 4.989 ± 2.586
PhePhe: 0.554 ± 0.414
PheGly: 5.543 ± 1.69
PheHis: 2.217 ± 0.891
PheIle: 2.217 ± 0.803
PheLys: 2.217 ± 0.662
PheLeu: 3.88 ± 1.511
PheMet: 0.0 ± 0.0
PheAsn: 1.109 ± 0.863
PhePro: 2.772 ± 1.431
PheGln: 3.326 ± 1.043
PheArg: 0.554 ± 0.414
PheSer: 8.315 ± 3.006
PheThr: 3.326 ± 1.211
PheVal: 2.217 ± 0.878
PheTrp: 0.0 ± 0.0
PheTyr: 0.554 ± 0.446
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.772 ± 1.42
GlyCys: 0.554 ± 0.414
GlyAsp: 3.88 ± 1.115
GlyGlu: 2.772 ± 0.978
GlyPhe: 1.109 ± 0.892
GlyGly: 7.206 ± 0.876
GlyHis: 0.554 ± 0.446
GlyIle: 6.652 ± 1.693
GlyLys: 3.326 ± 0.496
GlyLeu: 9.424 ± 0.926
GlyMet: 1.663 ± 0.552
GlyAsn: 1.109 ± 0.632
GlyPro: 3.88 ± 1.404
GlyGln: 2.217 ± 0.878
GlyArg: 2.217 ± 0.803
GlySer: 6.098 ± 1.394
GlyThr: 3.326 ± 1.659
GlyVal: 3.88 ± 1.115
GlyTrp: 1.109 ± 0.793
GlyTyr: 2.217 ± 0.971
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.109 ± 0.439
HisCys: 0.0 ± 0.0
HisAsp: 1.663 ± 0.682
HisGlu: 1.109 ± 0.632
HisPhe: 1.663 ± 1.078
HisGly: 0.0 ± 0.0
HisHis: 1.663 ± 0.782
HisIle: 0.0 ± 0.0
HisLys: 1.663 ± 0.728
HisLeu: 2.217 ± 0.971
HisMet: 1.663 ± 0.728
HisAsn: 0.0 ± 0.0
HisPro: 1.663 ± 0.552
HisGln: 1.109 ± 0.793
HisArg: 1.109 ± 0.632
HisSer: 0.554 ± 0.414
HisThr: 0.0 ± 0.0
HisVal: 2.217 ± 0.495
HisTrp: 0.554 ± 0.446
HisTyr: 0.554 ± 0.414
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.326 ± 0.505
IleCys: 1.109 ± 0.439
IleAsp: 3.326 ± 0.769
IleGlu: 9.978 ± 0.841
IlePhe: 2.217 ± 0.835
IleGly: 3.326 ± 1.247
IleHis: 1.109 ± 0.892
IleIle: 4.989 ± 1.433
IleLys: 1.663 ± 0.843
IleLeu: 4.435 ± 1.52
IleMet: 1.109 ± 0.632
IleAsn: 1.109 ± 0.439
IlePro: 3.326 ± 1.042
IleGln: 2.217 ± 0.971
IleArg: 2.772 ± 0.677
IleSer: 5.543 ± 1.498
IleThr: 1.663 ± 0.998
IleVal: 1.109 ± 0.612
IleTrp: 1.109 ± 0.632
IleTyr: 2.217 ± 0.662
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.098 ± 2.917
LysCys: 1.663 ± 0.782
LysAsp: 1.663 ± 0.782
LysGlu: 2.217 ± 1.195
LysPhe: 2.772 ± 1.431
LysGly: 3.88 ± 1.146
LysHis: 1.663 ± 0.843
LysIle: 4.435 ± 0.938
LysLys: 1.663 ± 0.782
LysLeu: 7.206 ± 0.802
LysMet: 2.772 ± 1.034
LysAsn: 1.109 ± 0.439
LysPro: 3.326 ± 0.96
LysGln: 2.217 ± 1.183
LysArg: 6.098 ± 1.831
LysSer: 3.88 ± 1.217
LysThr: 4.989 ± 1.954
LysVal: 2.772 ± 0.75
LysTrp: 1.109 ± 0.717
LysTyr: 2.772 ± 1.68
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.652 ± 2.091
LeuCys: 2.772 ± 0.677
LeuAsp: 7.761 ± 1.683
LeuGlu: 7.206 ± 0.587
LeuPhe: 9.978 ± 1.642
LeuGly: 5.543 ± 1.389
LeuHis: 1.663 ± 0.682
LeuIle: 7.206 ± 1.605
LeuLys: 3.326 ± 1.924
LeuLeu: 13.858 ± 3.046
LeuMet: 1.663 ± 0.896
LeuAsn: 4.989 ± 0.454
LeuPro: 4.989 ± 1.049
LeuGln: 5.543 ± 1.551
LeuArg: 3.326 ± 1.723
LeuSer: 4.435 ± 2.132
LeuThr: 6.652 ± 1.467
LeuVal: 6.098 ± 0.787
LeuTrp: 1.109 ± 0.632
LeuTyr: 2.217 ± 0.615
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.88 ± 0.704
MetCys: 1.109 ± 0.829
MetAsp: 2.217 ± 1.131
MetGlu: 0.0 ± 0.0
MetPhe: 2.217 ± 1.154
MetGly: 1.109 ± 0.612
MetHis: 0.0 ± 0.0
MetIle: 0.554 ± 0.586
MetLys: 1.663 ± 0.552
MetLeu: 5.543 ± 1.403
MetMet: 1.663 ± 0.552
MetAsn: 1.663 ± 0.728
MetPro: 2.772 ± 1.115
MetGln: 0.0 ± 0.0
MetArg: 0.554 ± 0.414
MetSer: 0.0 ± 0.0
MetThr: 3.326 ± 0.496
MetVal: 0.554 ± 0.446
MetTrp: 1.109 ± 0.717
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.326 ± 0.496
AsnCys: 2.217 ± 1.264
AsnAsp: 0.554 ± 0.446
AsnGlu: 1.663 ± 0.782
AsnPhe: 2.772 ± 1.537
AsnGly: 2.772 ± 1.494
AsnHis: 0.0 ± 0.0
AsnIle: 2.217 ± 1.1
AsnLys: 3.326 ± 1.924
AsnLeu: 6.098 ± 0.886
AsnMet: 1.663 ± 0.682
AsnAsn: 0.554 ± 0.446
AsnPro: 1.109 ± 0.439
AsnGln: 2.772 ± 0.877
AsnArg: 1.663 ± 1.242
AsnSer: 3.88 ± 0.373
AsnThr: 1.663 ± 0.782
AsnVal: 0.554 ± 0.414
AsnTrp: 1.663 ± 0.839
AsnTyr: 1.109 ± 0.439
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.326 ± 2.062
ProCys: 1.109 ± 0.829
ProAsp: 4.989 ± 1.049
ProGlu: 2.217 ± 0.495
ProPhe: 2.217 ± 0.803
ProGly: 3.326 ± 0.496
ProHis: 0.0 ± 0.0
ProIle: 2.772 ± 1.144
ProLys: 5.543 ± 2.748
ProLeu: 4.989 ± 0.987
ProMet: 1.109 ± 0.892
ProAsn: 1.663 ± 0.782
ProPro: 7.206 ± 0.582
ProGln: 2.772 ± 1.144
ProArg: 2.217 ± 1.276
ProSer: 4.989 ± 1.473
ProThr: 4.989 ± 2.486
ProVal: 2.217 ± 0.878
ProTrp: 0.0 ± 0.0
ProTyr: 0.554 ± 0.446
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.772 ± 1.393
GlnCys: 1.109 ± 0.632
GlnAsp: 0.0 ± 0.0
GlnGlu: 5.543 ± 0.775
GlnPhe: 2.772 ± 0.65
GlnGly: 2.217 ± 1.195
GlnHis: 1.109 ± 0.717
GlnIle: 2.217 ± 0.662
GlnLys: 4.435 ± 1.748
GlnLeu: 4.435 ± 1.3
GlnMet: 1.109 ± 0.439
GlnAsn: 1.109 ± 0.829
GlnPro: 2.217 ± 0.878
GlnGln: 0.554 ± 0.414
GlnArg: 4.435 ± 2.205
GlnSer: 1.109 ± 0.793
GlnThr: 4.435 ± 0.99
GlnVal: 1.109 ± 0.863
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.554 ± 0.446
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.772 ± 0.649
ArgCys: 1.109 ± 0.717
ArgAsp: 0.554 ± 0.414
ArgGlu: 2.772 ± 0.956
ArgPhe: 1.109 ± 0.829
ArgGly: 2.217 ± 0.662
ArgHis: 1.109 ± 0.793
ArgIle: 1.663 ± 0.782
ArgLys: 2.772 ± 0.671
ArgLeu: 4.435 ± 1.486
ArgMet: 1.109 ± 0.612
ArgAsn: 2.217 ± 0.662
ArgPro: 0.554 ± 0.414
ArgGln: 2.217 ± 0.891
ArgArg: 3.326 ± 1.78
ArgSer: 4.989 ± 1.702
ArgThr: 1.109 ± 1.173
ArgVal: 1.663 ± 0.766
ArgTrp: 1.109 ± 0.793
ArgTyr: 2.217 ± 1.784
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.88 ± 0.95
SerCys: 2.217 ± 0.779
SerAsp: 3.326 ± 1.685
SerGlu: 4.435 ± 1.037
SerPhe: 4.989 ± 1.613
SerGly: 5.543 ± 2.911
SerHis: 1.109 ± 0.829
SerIle: 2.772 ± 0.496
SerLys: 3.88 ± 1.511
SerLeu: 8.315 ± 1.158
SerMet: 1.109 ± 0.704
SerAsn: 7.206 ± 2.872
SerPro: 5.543 ± 0.578
SerGln: 4.989 ± 0.742
SerArg: 2.772 ± 0.671
SerSer: 6.652 ± 3.142
SerThr: 3.326 ± 0.721
SerVal: 2.772 ± 1.494
SerTrp: 0.0 ± 0.0
SerTyr: 2.772 ± 0.956
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.326 ± 1.151
ThrCys: 1.663 ± 1.112
ThrAsp: 1.663 ± 0.728
ThrGlu: 4.435 ± 1.199
ThrPhe: 1.109 ± 0.793
ThrGly: 3.326 ± 1.67
ThrHis: 1.109 ± 0.793
ThrIle: 2.772 ± 1.05
ThrLys: 1.663 ± 0.782
ThrLeu: 6.652 ± 0.941
ThrMet: 2.217 ± 0.803
ThrAsn: 1.109 ± 0.892
ThrPro: 7.761 ± 3.21
ThrGln: 0.554 ± 0.414
ThrArg: 0.0 ± 0.0
ThrSer: 3.326 ± 0.721
ThrThr: 5.543 ± 1.92
ThrVal: 6.098 ± 2.504
ThrTrp: 0.554 ± 0.657
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.217 ± 1.1
ValCys: 1.663 ± 1.418
ValAsp: 2.217 ± 0.835
ValGlu: 1.663 ± 0.766
ValPhe: 1.109 ± 0.593
ValGly: 2.772 ± 1.625
ValHis: 1.109 ± 0.439
ValIle: 3.326 ± 1.338
ValLys: 5.543 ± 1.135
ValLeu: 6.098 ± 1.096
ValMet: 2.772 ± 0.673
ValAsn: 5.543 ± 2.092
ValPro: 2.217 ± 1.784
ValGln: 0.554 ± 0.446
ValArg: 1.663 ± 0.452
ValSer: 7.761 ± 1.734
ValThr: 3.326 ± 1.45
ValVal: 1.109 ± 0.829
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 1.109 ± 0.793
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.109 ± 0.717
TrpPhe: 0.554 ± 0.657
TrpGly: 1.109 ± 0.717
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.109 ± 0.632
TrpLeu: 0.0 ± 0.0
TrpMet: 1.109 ± 0.793
TrpAsn: 0.554 ± 0.414
TrpPro: 0.0 ± 0.0
TrpGln: 1.109 ± 0.632
TrpArg: 1.109 ± 0.612
TrpSer: 0.554 ± 0.446
TrpThr: 1.109 ± 0.793
TrpVal: 1.109 ± 0.793
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.663 ± 0.728
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.217 ± 0.662
TyrCys: 0.554 ± 0.414
TyrAsp: 2.772 ± 0.75
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.663 ± 0.896
TyrGly: 1.663 ± 0.766
TyrHis: 1.663 ± 0.552
TyrIle: 1.109 ± 0.439
TyrLys: 1.663 ± 1.221
TyrLeu: 1.663 ± 0.728
TyrMet: 0.0 ± 0.0
TyrAsn: 0.0 ± 0.0
TyrPro: 1.663 ± 1.338
TyrGln: 1.109 ± 0.439
TyrArg: 0.554 ± 0.446
TyrSer: 2.217 ± 1.195
TyrThr: 1.663 ± 0.728
TyrVal: 1.109 ± 0.632
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1805 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
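A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation across replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Per-mille frequency of each observed dipeptide (overlapping windows)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap replicates.

    Assumption: each replicate draws len(sequences) whole proteins
    with replacement, so the resampling unit is the protein.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        sample = rng.choices(sequences, k=len(sequences))
        values.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(values)
```

A dipeptide that never occurs in any protein has a frequency of 0 in every replicate, so its estimated error is also 0, which matches the "0.0 ± 0.0" entries in the table. With only 5 proteins, the replicates differ considerably from one another, which is one plausible reason the errors are large relative to the values.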

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski