Amino acid dipepetide frequency for Bovine papillomavirus type 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.117AlaAla: 7.117 ± 2.21
1.582AlaCys: 1.582 ± 0.733
3.163AlaAsp: 3.163 ± 0.628
3.954AlaGlu: 3.954 ± 0.926
0.791AlaPhe: 0.791 ± 0.671
3.954AlaGly: 3.954 ± 1.053
0.395AlaHis: 0.395 ± 0.335
3.954AlaIle: 3.954 ± 1.352
3.559AlaLys: 3.559 ± 1.17
4.745AlaLeu: 4.745 ± 2.144
0.395AlaMet: 0.395 ± 0.335
2.372AlaAsn: 2.372 ± 0.461
3.163AlaPro: 3.163 ± 0.851
3.163AlaGln: 3.163 ± 0.87
3.559AlaArg: 3.559 ± 0.713
3.163AlaSer: 3.163 ± 1.454
3.163AlaThr: 3.163 ± 0.795
4.745AlaVal: 4.745 ± 1.389
0.395AlaTrp: 0.395 ± 0.335
1.582AlaTyr: 1.582 ± 0.543
0.0AlaXaa: 0.0 ± 0.0
Cys
0.791CysAla: 0.791 ± 0.542
0.791CysCys: 0.791 ± 0.568
1.186CysAsp: 1.186 ± 0.655
0.791CysGlu: 0.791 ± 0.4
0.791CysPhe: 0.791 ± 0.4
0.791CysGly: 0.791 ± 0.591
1.186CysHis: 1.186 ± 0.734
1.186CysIle: 1.186 ± 0.571
1.186CysLys: 1.186 ± 0.913
3.954CysLeu: 3.954 ± 1.728
0.0CysMet: 0.0 ± 0.0
0.395CysAsn: 0.395 ± 0.335
2.372CysPro: 2.372 ± 0.849
0.395CysGln: 0.395 ± 0.43
1.582CysArg: 1.582 ± 0.448
4.35CysSer: 4.35 ± 1.24
1.582CysThr: 1.582 ± 0.554
1.186CysVal: 1.186 ± 1.289
0.395CysTrp: 0.395 ± 0.339
1.186CysTyr: 1.186 ± 0.925
0.0CysXaa: 0.0 ± 0.0
Asp
3.954AspAla: 3.954 ± 0.989
1.582AspCys: 1.582 ± 0.517
1.582AspAsp: 1.582 ± 0.799
3.954AspGlu: 3.954 ± 0.977
1.977AspPhe: 1.977 ± 0.785
4.35AspGly: 4.35 ± 0.928
1.186AspHis: 1.186 ± 0.579
3.163AspIle: 3.163 ± 1.016
3.559AspLys: 3.559 ± 0.959
6.722AspLeu: 6.722 ± 1.353
2.372AspMet: 2.372 ± 0.459
1.977AspAsn: 1.977 ± 1.039
3.559AspPro: 3.559 ± 0.867
1.582AspGln: 1.582 ± 0.566
3.954AspArg: 3.954 ± 0.848
5.14AspSer: 5.14 ± 1.25
3.954AspThr: 3.954 ± 0.908
5.14AspVal: 5.14 ± 2.591
1.186AspTrp: 1.186 ± 0.351
0.791AspTyr: 0.791 ± 0.508
0.0AspXaa: 0.0 ± 0.0
Glu
2.372GluAla: 2.372 ± 0.895
0.395GluCys: 0.395 ± 0.335
3.954GluAsp: 3.954 ± 1.179
4.745GluGlu: 4.745 ± 1.384
1.582GluPhe: 1.582 ± 0.658
3.559GluGly: 3.559 ± 1.121
1.582GluHis: 1.582 ± 0.884
1.582GluIle: 1.582 ± 0.884
2.372GluLys: 2.372 ± 0.723
4.745GluLeu: 4.745 ± 0.722
0.791GluMet: 0.791 ± 0.396
3.559GluAsn: 3.559 ± 0.892
3.163GluPro: 3.163 ± 1.443
2.768GluGln: 2.768 ± 1.585
3.559GluArg: 3.559 ± 0.944
2.768GluSer: 2.768 ± 0.865
5.14GluThr: 5.14 ± 1.896
2.372GluVal: 2.372 ± 0.918
0.0GluTrp: 0.0 ± 0.0
0.791GluTyr: 0.791 ± 0.679
0.0GluXaa: 0.0 ± 0.0
Phe
1.977PheAla: 1.977 ± 0.666
0.395PheCys: 0.395 ± 0.335
3.163PheAsp: 3.163 ± 1.055
3.163PheGlu: 3.163 ± 1.072
1.582PhePhe: 1.582 ± 0.733
1.977PheGly: 1.977 ± 0.667
1.977PheHis: 1.977 ± 0.836
1.977PheIle: 1.977 ± 0.845
1.186PheLys: 1.186 ± 0.739
3.954PheLeu: 3.954 ± 0.764
0.395PheMet: 0.395 ± 0.502
1.186PheAsn: 1.186 ± 0.661
1.186PhePro: 1.186 ± 0.556
1.186PheGln: 1.186 ± 0.616
2.372PheArg: 2.372 ± 0.451
2.372PheSer: 2.372 ± 1.121
2.372PheThr: 2.372 ± 0.63
3.163PheVal: 3.163 ± 1.068
1.977PheTrp: 1.977 ± 0.548
1.977PheTyr: 1.977 ± 0.452
0.0PheXaa: 0.0 ± 0.0
Gly
1.977GlyAla: 1.977 ± 0.776
2.768GlyCys: 2.768 ± 1.155
3.954GlyAsp: 3.954 ± 0.976
4.745GlyGlu: 4.745 ± 1.578
1.977GlyPhe: 1.977 ± 0.314
5.931GlyGly: 5.931 ± 1.238
1.186GlyHis: 1.186 ± 0.433
2.768GlyIle: 2.768 ± 1.053
2.768GlyLys: 2.768 ± 1.336
4.745GlyLeu: 4.745 ± 1.115
0.791GlyMet: 0.791 ± 0.508
3.954GlyAsn: 3.954 ± 1.526
5.931GlyPro: 5.931 ± 2.236
4.35GlyGln: 4.35 ± 0.95
7.513GlyArg: 7.513 ± 1.902
9.885GlySer: 9.885 ± 1.903
3.559GlyThr: 3.559 ± 0.771
4.745GlyVal: 4.745 ± 1.629
0.395GlyTrp: 0.395 ± 0.361
1.186GlyTyr: 1.186 ± 0.629
0.0GlyXaa: 0.0 ± 0.0
His
1.582HisAla: 1.582 ± 0.816
0.791HisCys: 0.791 ± 0.563
1.977HisAsp: 1.977 ± 0.932
1.186HisGlu: 1.186 ± 0.579
1.186HisPhe: 1.186 ± 0.381
1.582HisGly: 1.582 ± 0.543
0.791HisHis: 0.791 ± 0.359
0.791HisIle: 0.791 ± 0.563
1.582HisLys: 1.582 ± 0.719
0.791HisLeu: 0.791 ± 0.671
0.791HisMet: 0.791 ± 0.671
1.186HisAsn: 1.186 ± 0.651
1.582HisPro: 1.582 ± 0.557
0.0HisGln: 0.0 ± 0.0
3.163HisArg: 3.163 ± 2.078
1.582HisSer: 1.582 ± 0.465
0.395HisThr: 0.395 ± 0.321
1.582HisVal: 1.582 ± 0.591
0.0HisTrp: 0.0 ± 0.0
0.791HisTyr: 0.791 ± 0.671
0.0HisXaa: 0.0 ± 0.0
Ile
1.186IleAla: 1.186 ± 0.511
0.791IleCys: 0.791 ± 0.537
4.745IleAsp: 4.745 ± 1.233
4.35IleGlu: 4.35 ± 0.8
1.582IlePhe: 1.582 ± 1.022
3.163IleGly: 3.163 ± 1.275
0.0IleHis: 0.0 ± 0.0
1.186IleIle: 1.186 ± 0.964
0.791IleLys: 0.791 ± 0.4
5.536IleLeu: 5.536 ± 1.265
0.791IleMet: 0.791 ± 0.671
0.791IleAsn: 0.791 ± 0.408
3.954IlePro: 3.954 ± 1.403
2.372IleGln: 2.372 ± 1.217
0.395IleArg: 0.395 ± 0.335
3.163IleSer: 3.163 ± 1.132
5.14IleThr: 5.14 ± 1.164
0.791IleVal: 0.791 ± 0.396
0.0IleTrp: 0.0 ± 0.0
0.395IleTyr: 0.395 ± 0.339
0.0IleXaa: 0.0 ± 0.0
Lys
2.768LysAla: 2.768 ± 0.93
1.977LysCys: 1.977 ± 0.571
4.745LysAsp: 4.745 ± 1.304
1.186LysGlu: 1.186 ± 0.661
1.582LysPhe: 1.582 ± 1.024
1.977LysGly: 1.977 ± 0.82
0.395LysHis: 0.395 ± 0.335
1.582LysIle: 1.582 ± 0.517
3.559LysLys: 3.559 ± 0.789
2.372LysLeu: 2.372 ± 0.961
1.186LysMet: 1.186 ± 0.655
3.559LysAsn: 3.559 ± 1.481
2.768LysPro: 2.768 ± 0.621
1.977LysGln: 1.977 ± 1.039
4.35LysArg: 4.35 ± 1.182
4.35LysSer: 4.35 ± 1.478
1.977LysThr: 1.977 ± 0.422
3.559LysVal: 3.559 ± 0.603
0.791LysTrp: 0.791 ± 0.408
1.582LysTyr: 1.582 ± 0.745
0.0LysXaa: 0.0 ± 0.0
Leu
3.559LeuAla: 3.559 ± 1.292
1.186LeuCys: 1.186 ± 0.835
4.35LeuAsp: 4.35 ± 0.708
3.559LeuGlu: 3.559 ± 1.23
4.745LeuPhe: 4.745 ± 1.06
7.908LeuGly: 7.908 ± 1.201
3.163LeuHis: 3.163 ± 1.071
3.954LeuIle: 3.954 ± 0.629
7.908LeuLys: 7.908 ± 1.928
9.095LeuLeu: 9.095 ± 2.162
0.395LeuMet: 0.395 ± 0.335
2.372LeuAsn: 2.372 ± 0.691
5.14LeuPro: 5.14 ± 1.42
5.536LeuGln: 5.536 ± 1.232
5.536LeuArg: 5.536 ± 0.718
8.699LeuSer: 8.699 ± 1.411
3.559LeuThr: 3.559 ± 0.647
3.163LeuVal: 3.163 ± 0.867
1.186LeuTrp: 1.186 ± 0.381
2.372LeuTyr: 2.372 ± 1.065
0.0LeuXaa: 0.0 ± 0.0
Met
2.372MetAla: 2.372 ± 0.483
1.186MetCys: 1.186 ± 0.786
1.582MetAsp: 1.582 ± 0.591
0.0MetGlu: 0.0 ± 0.0
0.395MetPhe: 0.395 ± 0.335
0.395MetGly: 0.395 ± 0.339
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.395MetLys: 0.395 ± 0.416
1.582MetLeu: 1.582 ± 0.634
1.186MetMet: 1.186 ± 0.655
0.395MetAsn: 0.395 ± 0.339
0.791MetPro: 0.791 ± 0.507
1.582MetGln: 1.582 ± 0.65
0.395MetArg: 0.395 ± 0.335
0.791MetSer: 0.791 ± 0.671
1.582MetThr: 1.582 ± 0.554
1.186MetVal: 1.186 ± 0.739
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.977AsnAla: 1.977 ± 1.259
0.395AsnCys: 0.395 ± 0.335
1.977AsnAsp: 1.977 ± 1.009
1.582AsnGlu: 1.582 ± 0.961
1.186AsnPhe: 1.186 ± 0.786
3.954AsnGly: 3.954 ± 0.629
0.791AsnHis: 0.791 ± 0.537
2.768AsnIle: 2.768 ± 1.07
2.372AsnLys: 2.372 ± 1.091
3.163AsnLeu: 3.163 ± 1.273
0.791AsnMet: 0.791 ± 0.406
0.395AsnAsn: 0.395 ± 0.339
3.559AsnPro: 3.559 ± 1.683
2.372AsnGln: 2.372 ± 0.892
1.186AsnArg: 1.186 ± 0.57
5.536AsnSer: 5.536 ± 2.319
2.372AsnThr: 2.372 ± 0.849
1.582AsnVal: 1.582 ± 0.799
0.395AsnTrp: 0.395 ± 0.321
0.791AsnTyr: 0.791 ± 0.671
0.0AsnXaa: 0.0 ± 0.0
Pro
5.536ProAla: 5.536 ± 0.965
2.372ProCys: 2.372 ± 1.116
5.536ProAsp: 5.536 ± 0.965
2.372ProGlu: 2.372 ± 1.302
1.582ProPhe: 1.582 ± 0.584
5.536ProGly: 5.536 ± 2.011
1.186ProHis: 1.186 ± 0.762
1.582ProIle: 1.582 ± 0.591
2.768ProLys: 2.768 ± 0.93
5.536ProLeu: 5.536 ± 0.935
0.395ProMet: 0.395 ± 0.335
1.582ProAsn: 1.582 ± 0.799
6.327ProPro: 6.327 ± 1.905
2.372ProGln: 2.372 ± 0.828
2.372ProArg: 2.372 ± 1.131
5.931ProSer: 5.931 ± 2.169
2.768ProThr: 2.768 ± 1.082
4.745ProVal: 4.745 ± 1.319
0.791ProTrp: 0.791 ± 0.537
2.768ProTyr: 2.768 ± 0.991
0.0ProXaa: 0.0 ± 0.0
Gln
2.372GlnAla: 2.372 ± 1.16
0.791GlnCys: 0.791 ± 0.859
1.977GlnAsp: 1.977 ± 0.998
3.163GlnGlu: 3.163 ± 0.696
1.186GlnPhe: 1.186 ± 0.571
4.35GlnGly: 4.35 ± 0.835
1.186GlnHis: 1.186 ± 0.6
1.977GlnIle: 1.977 ± 1.397
1.186GlnLys: 1.186 ± 0.352
4.35GlnLeu: 4.35 ± 0.972
1.582GlnMet: 1.582 ± 0.244
3.163GlnAsn: 3.163 ± 0.699
2.372GlnPro: 2.372 ± 0.749
1.977GlnGln: 1.977 ± 1.039
4.745GlnArg: 4.745 ± 1.934
2.372GlnSer: 2.372 ± 0.641
2.768GlnThr: 2.768 ± 1.208
3.163GlnVal: 3.163 ± 0.958
0.791GlnTrp: 0.791 ± 0.671
1.977GlnTyr: 1.977 ± 1.071
0.0GlnXaa: 0.0 ± 0.0
Arg
6.327ArgAla: 6.327 ± 1.231
2.372ArgCys: 2.372 ± 1.307
2.372ArgAsp: 2.372 ± 1.2
1.186ArgGlu: 1.186 ± 0.661
3.954ArgPhe: 3.954 ± 1.398
7.908ArgGly: 7.908 ± 1.57
1.977ArgHis: 1.977 ± 0.807
2.372ArgIle: 2.372 ± 1.387
3.954ArgLys: 3.954 ± 0.904
5.931ArgLeu: 5.931 ± 0.66
0.791ArgMet: 0.791 ± 0.549
1.977ArgAsn: 1.977 ± 0.909
5.931ArgPro: 5.931 ± 1.235
2.372ArgGln: 2.372 ± 1.466
3.954ArgArg: 3.954 ± 0.847
2.768ArgSer: 2.768 ± 0.742
2.768ArgThr: 2.768 ± 1.071
3.954ArgVal: 3.954 ± 0.527
0.395ArgTrp: 0.395 ± 0.335
1.582ArgTyr: 1.582 ± 0.646
0.0ArgXaa: 0.0 ± 0.0
Ser
4.35SerAla: 4.35 ± 0.862
1.186SerCys: 1.186 ± 0.543
5.931SerAsp: 5.931 ± 0.962
3.954SerGlu: 3.954 ± 1.297
4.35SerPhe: 4.35 ± 1.501
7.908SerGly: 7.908 ± 1.966
1.977SerHis: 1.977 ± 0.82
2.372SerIle: 2.372 ± 0.451
2.372SerLys: 2.372 ± 0.749
5.14SerLeu: 5.14 ± 1.147
1.186SerMet: 1.186 ± 0.585
2.372SerAsn: 2.372 ± 1.575
4.35SerPro: 4.35 ± 1.014
4.35SerGln: 4.35 ± 0.948
8.304SerArg: 8.304 ± 1.984
9.885SerSer: 9.885 ± 2.94
7.117SerThr: 7.117 ± 1.584
7.117SerVal: 7.117 ± 0.963
0.395SerTrp: 0.395 ± 0.335
1.977SerTyr: 1.977 ± 0.396
0.0SerXaa: 0.0 ± 0.0
Thr
1.582ThrAla: 1.582 ± 0.633
2.372ThrCys: 2.372 ± 0.334
5.14ThrAsp: 5.14 ± 1.502
3.954ThrGlu: 3.954 ± 0.908
3.954ThrPhe: 3.954 ± 1.053
3.163ThrGly: 3.163 ± 0.482
1.582ThrHis: 1.582 ± 0.604
4.35ThrIle: 4.35 ± 1.82
1.186ThrLys: 1.186 ± 0.678
4.35ThrLeu: 4.35 ± 1.688
1.186ThrMet: 1.186 ± 0.954
2.372ThrAsn: 2.372 ± 0.843
3.559ThrPro: 3.559 ± 1.07
3.163ThrGln: 3.163 ± 0.7
1.582ThrArg: 1.582 ± 0.718
6.327ThrSer: 6.327 ± 1.317
4.745ThrThr: 4.745 ± 2.225
5.14ThrVal: 5.14 ± 1.119
1.186ThrTrp: 1.186 ± 0.679
3.559ThrTyr: 3.559 ± 1.148
0.0ThrXaa: 0.0 ± 0.0
Val
3.559ValAla: 3.559 ± 1.384
1.582ValCys: 1.582 ± 0.554
3.163ValAsp: 3.163 ± 1.844
1.977ValGlu: 1.977 ± 0.76
3.559ValPhe: 3.559 ± 0.856
3.954ValGly: 3.954 ± 1.205
2.372ValHis: 2.372 ± 0.652
2.372ValIle: 2.372 ± 0.609
3.163ValLys: 3.163 ± 0.535
4.745ValLeu: 4.745 ± 1.265
0.395ValMet: 0.395 ± 0.416
3.954ValAsn: 3.954 ± 0.774
3.163ValPro: 3.163 ± 0.702
3.559ValGln: 3.559 ± 0.829
4.35ValArg: 4.35 ± 1.896
4.35ValSer: 4.35 ± 0.465
6.722ValThr: 6.722 ± 1.349
0.791ValVal: 0.791 ± 0.643
1.186ValTrp: 1.186 ± 1.018
2.372ValTyr: 2.372 ± 0.543
0.0ValXaa: 0.0 ± 0.0
Trp
1.582TrpAla: 1.582 ± 0.448
0.0TrpCys: 0.0 ± 0.0
0.791TrpAsp: 0.791 ± 0.406
0.791TrpGlu: 0.791 ± 0.406
0.395TrpPhe: 0.395 ± 0.321
1.186TrpGly: 1.186 ± 0.352
0.0TrpHis: 0.0 ± 0.0
0.395TrpIle: 0.395 ± 0.335
1.186TrpLys: 1.186 ± 0.661
2.372TrpLeu: 2.372 ± 1.324
0.0TrpMet: 0.0 ± 0.0
0.395TrpAsn: 0.395 ± 0.339
0.395TrpPro: 0.395 ± 0.321
0.791TrpGln: 0.791 ± 0.4
0.0TrpArg: 0.0 ± 0.0
0.395TrpSer: 0.395 ± 0.361
1.186TrpThr: 1.186 ± 0.556
0.395TrpVal: 0.395 ± 0.335
0.0TrpTrp: 0.0 ± 0.0
0.791TrpTyr: 0.791 ± 0.396
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.977TyrAla: 1.977 ± 0.845
1.582TyrCys: 1.582 ± 0.894
0.395TyrAsp: 0.395 ± 0.321
1.186TyrGlu: 1.186 ± 0.433
1.582TyrPhe: 1.582 ± 0.543
1.582TyrGly: 1.582 ± 0.692
0.395TyrHis: 0.395 ± 0.339
0.791TyrIle: 0.791 ± 0.568
1.186TyrLys: 1.186 ± 0.351
3.559TyrLeu: 3.559 ± 1.11
0.0TyrMet: 0.0 ± 0.0
1.186TyrAsn: 1.186 ± 0.66
0.791TyrPro: 0.791 ± 0.679
1.582TyrGln: 1.582 ± 0.799
1.977TyrArg: 1.977 ± 0.835
2.768TyrSer: 2.768 ± 0.845
1.582TyrThr: 1.582 ± 0.584
2.768TyrVal: 2.768 ± 0.573
1.582TyrTrp: 1.582 ± 1.047
1.582TyrTyr: 1.582 ± 0.852
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2530 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski