Amino acid dipepetide frequency for Common chimpanzee papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.0AlaAla: 5.0 ± 2.355
2.222AlaCys: 2.222 ± 0.923
1.667AlaAsp: 1.667 ± 0.789
2.778AlaGlu: 2.778 ± 0.604
2.222AlaPhe: 2.222 ± 1.529
3.333AlaGly: 3.333 ± 1.037
0.556AlaHis: 0.556 ± 0.439
4.444AlaIle: 4.444 ± 1.523
1.667AlaLys: 1.667 ± 1.055
1.667AlaLeu: 1.667 ± 0.991
1.667AlaMet: 1.667 ± 0.814
1.667AlaAsn: 1.667 ± 0.363
2.778AlaPro: 2.778 ± 0.711
3.333AlaGln: 3.333 ± 0.694
3.889AlaArg: 3.889 ± 0.936
7.222AlaSer: 7.222 ± 1.168
5.556AlaThr: 5.556 ± 1.906
3.889AlaVal: 3.889 ± 1.35
0.0AlaTrp: 0.0 ± 0.0
2.778AlaTyr: 2.778 ± 1.408
0.0AlaXaa: 0.0 ± 0.0
Cys
3.333CysAla: 3.333 ± 1.319
0.0CysCys: 0.0 ± 0.0
0.556CysAsp: 0.556 ± 0.468
0.556CysGlu: 0.556 ± 0.587
2.778CysPhe: 2.778 ± 1.7
1.111CysGly: 1.111 ± 0.751
0.556CysHis: 0.556 ± 0.587
1.111CysIle: 1.111 ± 0.508
3.889CysLys: 3.889 ± 1.555
2.222CysLeu: 2.222 ± 1.178
1.111CysMet: 1.111 ± 1.174
3.333CysAsn: 3.333 ± 1.782
1.667CysPro: 1.667 ± 0.661
1.111CysGln: 1.111 ± 0.508
0.556CysArg: 0.556 ± 0.541
1.111CysSer: 1.111 ± 0.508
2.222CysThr: 2.222 ± 0.976
2.222CysVal: 2.222 ± 1.226
1.111CysTrp: 1.111 ± 0.628
0.556CysTyr: 0.556 ± 0.587
0.0CysXaa: 0.0 ± 0.0
Asp
1.667AspAla: 1.667 ± 0.363
2.222AspCys: 2.222 ± 0.923
1.111AspAsp: 1.111 ± 0.465
1.667AspGlu: 1.667 ± 1.342
1.111AspPhe: 1.111 ± 0.508
1.667AspGly: 1.667 ± 0.917
0.0AspHis: 0.0 ± 0.0
6.111AspIle: 6.111 ± 2.839
1.111AspLys: 1.111 ± 0.465
2.778AspLeu: 2.778 ± 1.013
0.556AspMet: 0.556 ± 0.441
2.778AspAsn: 2.778 ± 0.939
3.333AspPro: 3.333 ± 1.2
0.556AspGln: 0.556 ± 0.587
0.556AspArg: 0.556 ± 0.441
4.444AspSer: 4.444 ± 1.177
5.556AspThr: 5.556 ± 1.379
2.778AspVal: 2.778 ± 1.139
0.0AspTrp: 0.0 ± 0.0
2.222AspTyr: 2.222 ± 0.828
0.0AspXaa: 0.0 ± 0.0
Glu
2.778GluAla: 2.778 ± 0.772
1.111GluCys: 1.111 ± 0.781
2.778GluAsp: 2.778 ± 0.912
3.889GluGlu: 3.889 ± 1.059
2.222GluPhe: 2.222 ± 0.92
0.556GluGly: 0.556 ± 0.439
1.111GluHis: 1.111 ± 0.465
1.667GluIle: 1.667 ± 0.832
3.889GluLys: 3.889 ± 1.376
5.556GluLeu: 5.556 ± 1.579
1.111GluMet: 1.111 ± 0.936
2.778GluAsn: 2.778 ± 1.318
1.667GluPro: 1.667 ± 0.832
5.0GluGln: 5.0 ± 1.539
0.0GluArg: 0.0 ± 0.0
1.667GluSer: 1.667 ± 1.032
1.667GluThr: 1.667 ± 1.055
3.889GluVal: 3.889 ± 1.394
1.111GluTrp: 1.111 ± 0.628
1.111GluTyr: 1.111 ± 0.628
0.0GluXaa: 0.0 ± 0.0
Phe
3.889PheAla: 3.889 ± 0.871
0.556PheCys: 0.556 ± 0.587
5.0PheAsp: 5.0 ± 1.228
2.778PheGlu: 2.778 ± 1.115
3.333PhePhe: 3.333 ± 0.974
2.778PheGly: 2.778 ± 0.69
0.556PheHis: 0.556 ± 0.587
3.333PheIle: 3.333 ± 1.331
1.111PheLys: 1.111 ± 0.508
1.667PheLeu: 1.667 ± 0.363
0.556PheMet: 0.556 ± 0.439
1.667PheAsn: 1.667 ± 1.323
1.667PhePro: 1.667 ± 0.626
1.667PheGln: 1.667 ± 0.917
1.667PheArg: 1.667 ± 0.363
2.222PheSer: 2.222 ± 1.107
1.111PheThr: 1.111 ± 0.877
3.333PheVal: 3.333 ± 1.918
0.556PheTrp: 0.556 ± 0.441
2.222PheTyr: 2.222 ± 1.108
0.0PheXaa: 0.0 ± 0.0
Gly
1.111GlyAla: 1.111 ± 0.465
0.556GlyCys: 0.556 ± 0.441
2.222GlyAsp: 2.222 ± 1.226
1.667GlyGlu: 1.667 ± 0.793
2.222GlyPhe: 2.222 ± 0.623
2.778GlyGly: 2.778 ± 2.193
2.778GlyHis: 2.778 ± 0.711
1.667GlyIle: 1.667 ± 0.363
5.0GlyLys: 5.0 ± 1.143
4.444GlyLeu: 4.444 ± 0.868
1.111GlyMet: 1.111 ± 0.936
2.222GlyAsn: 2.222 ± 0.812
5.0GlyPro: 5.0 ± 2.031
1.667GlyGln: 1.667 ± 0.793
3.333GlyArg: 3.333 ± 0.654
3.333GlySer: 3.333 ± 1.536
5.556GlyThr: 5.556 ± 1.13
2.778GlyVal: 2.778 ± 1.222
0.0GlyTrp: 0.0 ± 0.0
1.111GlyTyr: 1.111 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
0.556HisAla: 0.556 ± 0.441
2.778HisCys: 2.778 ± 1.633
0.0HisAsp: 0.0 ± 0.0
1.111HisGlu: 1.111 ± 0.513
2.222HisPhe: 2.222 ± 0.879
1.111HisGly: 1.111 ± 0.689
0.0HisHis: 0.0 ± 0.0
2.222HisIle: 2.222 ± 0.993
2.222HisLys: 2.222 ± 0.923
4.444HisLeu: 4.444 ± 2.031
0.0HisMet: 0.0 ± 0.0
2.778HisAsn: 2.778 ± 0.912
1.111HisPro: 1.111 ± 0.465
1.111HisGln: 1.111 ± 1.062
1.111HisArg: 1.111 ± 0.68
2.222HisSer: 2.222 ± 0.967
1.667HisThr: 1.667 ± 0.651
0.556HisVal: 0.556 ± 0.441
1.667HisTrp: 1.667 ± 0.871
0.556HisTyr: 0.556 ± 0.468
0.0HisXaa: 0.0 ± 0.0
Ile
0.556IleAla: 0.556 ± 1.048
1.667IleCys: 1.667 ± 0.828
0.556IleAsp: 0.556 ± 0.587
1.667IleGlu: 1.667 ± 0.879
3.889IlePhe: 3.889 ± 1.334
2.778IleGly: 2.778 ± 1.016
1.667IleHis: 1.667 ± 0.789
2.222IleIle: 2.222 ± 1.529
1.667IleLys: 1.667 ± 0.626
4.444IleLeu: 4.444 ± 2.604
0.556IleMet: 0.556 ± 0.468
1.111IleAsn: 1.111 ± 0.513
4.444IlePro: 4.444 ± 1.879
2.222IleGln: 2.222 ± 1.335
2.222IleArg: 2.222 ± 1.121
5.556IleSer: 5.556 ± 2.842
3.333IleThr: 3.333 ± 1.063
7.222IleVal: 7.222 ± 1.747
0.556IleTrp: 0.556 ± 0.468
2.778IleTyr: 2.778 ± 1.939
0.0IleXaa: 0.0 ± 0.0
Lys
3.333LysAla: 3.333 ± 0.654
1.111LysCys: 1.111 ± 0.936
1.667LysAsp: 1.667 ± 1.381
2.778LysGlu: 2.778 ± 1.301
2.222LysPhe: 2.222 ± 1.764
3.333LysGly: 3.333 ± 1.307
3.333LysHis: 3.333 ± 1.824
2.222LysIle: 2.222 ± 0.623
2.778LysLys: 2.778 ± 1.061
2.222LysLeu: 2.222 ± 0.828
0.0LysMet: 0.0 ± 0.427
0.556LysAsn: 0.556 ± 0.541
2.222LysPro: 2.222 ± 1.008
3.333LysGln: 3.333 ± 1.656
3.889LysArg: 3.889 ± 0.965
2.222LysSer: 2.222 ± 0.503
4.444LysThr: 4.444 ± 2.333
3.889LysVal: 3.889 ± 1.177
0.556LysTrp: 0.556 ± 0.439
3.333LysTyr: 3.333 ± 1.398
0.0LysXaa: 0.0 ± 0.0
Leu
2.778LeuAla: 2.778 ± 0.766
6.111LeuCys: 6.111 ± 3.172
3.333LeuAsp: 3.333 ± 0.765
5.0LeuGlu: 5.0 ± 1.806
5.0LeuPhe: 5.0 ± 1.568
5.0LeuGly: 5.0 ± 0.939
7.778LeuHis: 7.778 ± 2.733
2.222LeuIle: 2.222 ± 3.097
3.889LeuLys: 3.889 ± 1.601
11.111LeuLeu: 11.111 ± 9.33
1.667LeuMet: 1.667 ± 1.068
2.778LeuAsn: 2.778 ± 1.036
3.333LeuPro: 3.333 ± 1.047
8.333LeuGln: 8.333 ± 2.798
0.556LeuArg: 0.556 ± 0.441
3.889LeuSer: 3.889 ± 1.178
6.111LeuThr: 6.111 ± 3.888
6.667LeuVal: 6.667 ± 2.135
1.111LeuTrp: 1.111 ± 1.125
5.0LeuTyr: 5.0 ± 1.452
0.0LeuXaa: 0.0 ± 0.0
Met
1.667MetAla: 1.667 ± 0.793
0.0MetCys: 0.0 ± 0.0
1.667MetAsp: 1.667 ± 0.793
3.333MetGlu: 3.333 ± 1.886
0.556MetPhe: 0.556 ± 0.441
0.556MetGly: 0.556 ± 0.468
1.667MetHis: 1.667 ± 0.975
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
0.556MetLeu: 0.556 ± 0.541
1.111MetMet: 1.111 ± 0.878
0.556MetAsn: 0.556 ± 0.441
0.0MetPro: 0.0 ± 0.0
0.556MetGln: 0.556 ± 0.468
1.111MetArg: 1.111 ± 0.508
0.0MetSer: 0.0 ± 0.0
0.556MetThr: 0.556 ± 0.441
1.667MetVal: 1.667 ± 0.651
1.111MetTrp: 1.111 ± 0.508
0.556MetTyr: 0.556 ± 0.439
0.0MetXaa: 0.0 ± 0.0
Asn
2.222AsnAla: 2.222 ± 1.379
0.556AsnCys: 0.556 ± 0.587
0.556AsnAsp: 0.556 ± 0.541
2.778AsnGlu: 2.778 ± 1.107
1.111AsnPhe: 1.111 ± 0.628
2.222AsnGly: 2.222 ± 0.859
0.0AsnHis: 0.0 ± 0.0
3.889AsnIle: 3.889 ± 0.823
3.889AsnLys: 3.889 ± 2.544
2.778AsnLeu: 2.778 ± 1.036
0.556AsnMet: 0.556 ± 0.441
5.556AsnAsn: 5.556 ± 2.959
3.333AsnPro: 3.333 ± 1.394
0.556AsnGln: 0.556 ± 0.587
1.111AsnArg: 1.111 ± 0.882
5.0AsnSer: 5.0 ± 1.762
5.556AsnThr: 5.556 ± 0.945
1.667AsnVal: 1.667 ± 0.651
0.0AsnTrp: 0.0 ± 0.0
0.556AsnTyr: 0.556 ± 0.468
0.0AsnXaa: 0.0 ± 0.0
Pro
6.667ProAla: 6.667 ± 2.63
0.556ProCys: 0.556 ± 0.441
5.556ProAsp: 5.556 ± 2.059
0.0ProGlu: 0.0 ± 0.0
1.111ProPhe: 1.111 ± 0.716
1.667ProGly: 1.667 ± 0.363
0.556ProHis: 0.556 ± 0.439
2.222ProIle: 2.222 ± 0.624
2.222ProLys: 2.222 ± 0.696
8.889ProLeu: 8.889 ± 1.431
1.111ProMet: 1.111 ± 0.841
2.778ProAsn: 2.778 ± 0.939
8.333ProPro: 8.333 ± 1.938
1.667ProGln: 1.667 ± 0.711
1.667ProArg: 1.667 ± 1.316
6.111ProSer: 6.111 ± 1.532
5.556ProThr: 5.556 ± 1.181
5.556ProVal: 5.556 ± 2.374
0.556ProTrp: 0.556 ± 0.468
2.778ProTyr: 2.778 ± 1.618
0.0ProXaa: 0.0 ± 0.0
Gln
3.889GlnAla: 3.889 ± 2.665
1.667GlnCys: 1.667 ± 0.651
3.889GlnAsp: 3.889 ± 1.347
2.222GlnGlu: 2.222 ± 1.493
2.222GlnPhe: 2.222 ± 1.182
2.222GlnGly: 2.222 ± 0.923
1.667GlnHis: 1.667 ± 1.107
1.667GlnIle: 1.667 ± 0.902
1.111GlnLys: 1.111 ± 0.611
5.556GlnLeu: 5.556 ± 1.068
2.222GlnMet: 2.222 ± 1.016
0.556GlnAsn: 0.556 ± 0.439
2.778GlnPro: 2.778 ± 1.234
1.667GlnGln: 1.667 ± 1.376
3.333GlnArg: 3.333 ± 0.765
2.778GlnSer: 2.778 ± 1.017
4.444GlnThr: 4.444 ± 1.275
2.778GlnVal: 2.778 ± 1.312
0.556GlnTrp: 0.556 ± 0.468
2.778GlnTyr: 2.778 ± 1.091
0.0GlnXaa: 0.0 ± 0.0
Arg
2.778ArgAla: 2.778 ± 1.618
1.111ArgCys: 1.111 ± 1.174
0.0ArgAsp: 0.0 ± 0.0
0.0ArgGlu: 0.0 ± 0.0
1.111ArgPhe: 1.111 ± 0.716
2.778ArgGly: 2.778 ± 0.698
2.222ArgHis: 2.222 ± 1.255
2.222ArgIle: 2.222 ± 0.906
4.444ArgLys: 4.444 ± 2.007
5.0ArgLeu: 5.0 ± 0.939
0.0ArgMet: 0.0 ± 0.0
1.111ArgAsn: 1.111 ± 0.513
4.444ArgPro: 4.444 ± 2.205
1.667ArgGln: 1.667 ± 1.404
2.778ArgArg: 2.778 ± 1.811
1.111ArgSer: 1.111 ± 0.465
1.111ArgThr: 1.111 ± 0.465
2.222ArgVal: 2.222 ± 0.929
0.0ArgTrp: 0.0 ± 0.0
1.667ArgTyr: 1.667 ± 0.871
0.0ArgXaa: 0.0 ± 0.0
Ser
5.0SerAla: 5.0 ± 1.629
1.111SerCys: 1.111 ± 0.909
2.778SerAsp: 2.778 ± 1.218
4.444SerGlu: 4.444 ± 1.143
1.667SerPhe: 1.667 ± 0.635
4.444SerGly: 4.444 ± 1.999
0.556SerHis: 0.556 ± 0.468
5.556SerIle: 5.556 ± 2.833
1.667SerLys: 1.667 ± 0.363
6.667SerLeu: 6.667 ± 2.613
0.556SerMet: 0.556 ± 0.439
5.556SerAsn: 5.556 ± 2.26
5.0SerPro: 5.0 ± 1.09
3.333SerGln: 3.333 ± 1.106
2.778SerArg: 2.778 ± 1.218
10.0SerSer: 10.0 ± 1.458
11.667SerThr: 11.667 ± 1.921
5.0SerVal: 5.0 ± 1.694
0.556SerTrp: 0.556 ± 0.439
2.222SerTyr: 2.222 ± 1.303
0.0SerXaa: 0.0 ± 0.0
Thr
5.0ThrAla: 5.0 ± 1.219
2.778ThrCys: 2.778 ± 0.69
3.333ThrAsp: 3.333 ± 1.063
0.556ThrGlu: 0.556 ± 0.468
1.667ThrPhe: 1.667 ± 1.316
3.889ThrGly: 3.889 ± 0.876
1.667ThrHis: 1.667 ± 1.071
3.889ThrIle: 3.889 ± 0.745
1.667ThrLys: 1.667 ± 0.828
8.889ThrLeu: 8.889 ± 2.982
1.667ThrMet: 1.667 ± 0.732
1.667ThrAsn: 1.667 ± 1.057
6.667ThrPro: 6.667 ± 1.318
5.0ThrGln: 5.0 ± 2.296
2.222ThrArg: 2.222 ± 1.196
11.111ThrSer: 11.111 ± 2.387
11.667ThrThr: 11.667 ± 4.84
11.111ThrVal: 11.111 ± 2.543
0.556ThrTrp: 0.556 ± 0.468
2.778ThrTyr: 2.778 ± 1.347
0.0ThrXaa: 0.0 ± 0.0
Val
3.333ValAla: 3.333 ± 1.435
3.333ValCys: 3.333 ± 1.453
5.0ValAsp: 5.0 ± 1.015
5.556ValGlu: 5.556 ± 1.438
3.333ValPhe: 3.333 ± 1.124
4.444ValGly: 4.444 ± 1.935
1.667ValHis: 1.667 ± 1.113
2.222ValIle: 2.222 ± 0.92
3.333ValLys: 3.333 ± 1.358
6.667ValLeu: 6.667 ± 2.735
0.0ValMet: 0.0 ± 0.0
3.333ValAsn: 3.333 ± 1.379
5.556ValPro: 5.556 ± 1.171
6.111ValGln: 6.111 ± 1.208
2.222ValArg: 2.222 ± 0.906
7.778ValSer: 7.778 ± 1.393
6.667ValThr: 6.667 ± 2.102
3.333ValVal: 3.333 ± 1.098
1.111ValTrp: 1.111 ± 0.628
1.667ValTyr: 1.667 ± 0.728
0.0ValXaa: 0.0 ± 0.0
Trp
0.556TrpAla: 0.556 ± 0.441
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.111TrpGlu: 1.111 ± 0.508
0.0TrpPhe: 0.0 ± 0.0
2.222TrpGly: 2.222 ± 0.929
0.556TrpHis: 0.556 ± 0.468
0.0TrpIle: 0.0 ± 0.0
1.667TrpLys: 1.667 ± 1.055
2.222TrpLeu: 2.222 ± 1.26
0.0TrpMet: 0.0 ± 0.0
0.556TrpAsn: 0.556 ± 0.441
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.111TrpArg: 1.111 ± 0.628
0.0TrpSer: 0.0 ± 0.0
1.667TrpThr: 1.667 ± 1.055
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.222TyrAla: 2.222 ± 1.255
1.667TyrCys: 1.667 ± 0.635
0.556TyrAsp: 0.556 ± 0.468
1.667TyrGlu: 1.667 ± 0.879
1.667TyrPhe: 1.667 ± 0.793
1.667TyrGly: 1.667 ± 0.363
0.556TyrHis: 0.556 ± 0.441
2.222TyrIle: 2.222 ± 1.037
2.778TyrLys: 2.778 ± 0.989
3.889TyrLeu: 3.889 ± 1.866
1.111TyrMet: 1.111 ± 0.508
0.556TyrAsn: 0.556 ± 0.468
2.222TyrPro: 2.222 ± 1.075
1.111TyrGln: 1.111 ± 0.689
1.667TyrArg: 1.667 ± 0.635
2.778TyrSer: 2.778 ± 1.269
1.667TyrThr: 1.667 ± 0.363
6.111TyrVal: 6.111 ± 1.499
0.556TyrTrp: 0.556 ± 0.441
1.111TyrTyr: 1.111 ± 0.936
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1801 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski