Amino acid dipepetide frequency for Bos taurus papillomavirus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.468AlaAla: 4.468 ± 1.505
0.447AlaCys: 0.447 ± 0.541
4.468AlaAsp: 4.468 ± 1.566
6.256AlaGlu: 6.256 ± 0.635
2.681AlaPhe: 2.681 ± 1.066
3.128AlaGly: 3.128 ± 1.351
0.894AlaHis: 0.894 ± 0.422
1.787AlaIle: 1.787 ± 1.125
3.575AlaLys: 3.575 ± 1.288
5.362AlaLeu: 5.362 ± 2.51
0.447AlaMet: 0.447 ± 0.361
3.128AlaAsn: 3.128 ± 1.209
2.234AlaPro: 2.234 ± 0.995
1.34AlaGln: 1.34 ± 0.668
2.234AlaArg: 2.234 ± 1.679
4.468AlaSer: 4.468 ± 1.23
2.234AlaThr: 2.234 ± 1.395
2.234AlaVal: 2.234 ± 1.149
1.787AlaTrp: 1.787 ± 0.988
1.34AlaTyr: 1.34 ± 0.777
0.0AlaXaa: 0.0 ± 0.0
Cys
1.34CysAla: 1.34 ± 0.794
1.787CysCys: 1.787 ± 1.145
1.787CysAsp: 1.787 ± 0.988
1.787CysGlu: 1.787 ± 0.779
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.447CysHis: 0.447 ± 0.361
1.787CysIle: 1.787 ± 1.156
0.447CysLys: 0.447 ± 0.361
1.34CysLeu: 1.34 ± 0.7
0.894CysMet: 0.894 ± 1.082
0.447CysAsn: 0.447 ± 0.426
1.787CysPro: 1.787 ± 0.888
1.34CysGln: 1.34 ± 0.587
0.447CysArg: 0.447 ± 0.541
2.681CysSer: 2.681 ± 1.32
1.34CysThr: 1.34 ± 1.113
1.34CysVal: 1.34 ± 0.363
0.0CysTrp: 0.0 ± 0.0
0.894CysTyr: 0.894 ± 0.59
0.0CysXaa: 0.0 ± 0.0
Asp
1.787AspAla: 1.787 ± 1.052
0.894AspCys: 0.894 ± 0.722
5.362AspAsp: 5.362 ± 1.554
4.021AspGlu: 4.021 ± 0.824
3.128AspPhe: 3.128 ± 0.785
3.575AspGly: 3.575 ± 0.79
0.894AspHis: 0.894 ± 0.537
5.809AspIle: 5.809 ± 0.804
1.787AspLys: 1.787 ± 0.844
5.809AspLeu: 5.809 ± 1.681
0.894AspMet: 0.894 ± 0.422
3.575AspAsn: 3.575 ± 1.06
6.256AspPro: 6.256 ± 1.191
1.34AspGln: 1.34 ± 0.668
3.575AspArg: 3.575 ± 1.823
5.809AspSer: 5.809 ± 1.185
7.149AspThr: 7.149 ± 2.377
3.128AspVal: 3.128 ± 0.949
0.894AspTrp: 0.894 ± 0.458
1.34AspTyr: 1.34 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
4.021GluAla: 4.021 ± 1.652
2.234GluCys: 2.234 ± 0.583
4.915GluAsp: 4.915 ± 0.635
7.596GluGlu: 7.596 ± 2.87
3.128GluPhe: 3.128 ± 1.078
4.021GluGly: 4.021 ± 1.284
0.447GluHis: 0.447 ± 0.541
4.915GluIle: 4.915 ± 1.087
3.128GluLys: 3.128 ± 1.372
7.149GluLeu: 7.149 ± 1.135
2.234GluMet: 2.234 ± 0.733
3.575GluAsn: 3.575 ± 1.189
2.681GluPro: 2.681 ± 0.883
2.234GluGln: 2.234 ± 0.751
2.681GluArg: 2.681 ± 0.589
3.575GluSer: 3.575 ± 0.995
5.362GluThr: 5.362 ± 1.295
2.681GluVal: 2.681 ± 1.149
0.447GluTrp: 0.447 ± 0.413
1.787GluTyr: 1.787 ± 1.143
0.0GluXaa: 0.0 ± 0.0
Phe
2.681PheAla: 2.681 ± 0.522
0.894PheCys: 0.894 ± 0.59
1.787PheAsp: 1.787 ± 1.213
4.021PheGlu: 4.021 ± 1.356
3.575PhePhe: 3.575 ± 1.504
5.362PheGly: 5.362 ± 1.741
0.894PheHis: 0.894 ± 0.722
0.894PheIle: 0.894 ± 0.422
4.468PheLys: 4.468 ± 1.982
4.021PheLeu: 4.021 ± 1.039
0.894PheMet: 0.894 ± 0.722
2.234PheAsn: 2.234 ± 1.214
1.787PhePro: 1.787 ± 0.844
2.234PheGln: 2.234 ± 1.035
0.894PheArg: 0.894 ± 0.537
1.34PheSer: 1.34 ± 0.715
0.894PheThr: 0.894 ± 0.609
4.915PheVal: 4.915 ± 0.697
1.787PheTrp: 1.787 ± 0.844
1.34PheTyr: 1.34 ± 0.733
0.0PheXaa: 0.0 ± 0.0
Gly
1.34GlyAla: 1.34 ± 0.46
0.447GlyCys: 0.447 ± 0.413
5.362GlyAsp: 5.362 ± 0.887
7.149GlyGlu: 7.149 ± 1.372
1.787GlyPhe: 1.787 ± 0.733
3.575GlyGly: 3.575 ± 1.722
2.234GlyHis: 2.234 ± 1.371
3.575GlyIle: 3.575 ± 0.966
3.128GlyLys: 3.128 ± 0.664
5.362GlyLeu: 5.362 ± 1.743
0.0GlyMet: 0.0 ± 0.0
3.128GlyAsn: 3.128 ± 1.466
3.128GlyPro: 3.128 ± 2.236
1.787GlyGln: 1.787 ± 0.963
4.915GlyArg: 4.915 ± 2.268
6.256GlySer: 6.256 ± 1.381
4.915GlyThr: 4.915 ± 1.846
2.234GlyVal: 2.234 ± 1.093
0.447GlyTrp: 0.447 ± 0.361
1.34GlyTyr: 1.34 ± 1.182
0.0GlyXaa: 0.0 ± 0.0
His
0.447HisAla: 0.447 ± 0.413
1.34HisCys: 1.34 ± 0.593
0.447HisAsp: 0.447 ± 0.413
0.447HisGlu: 0.447 ± 0.394
0.447HisPhe: 0.447 ± 0.361
1.34HisGly: 1.34 ± 1.083
0.447HisHis: 0.447 ± 0.426
0.894HisIle: 0.894 ± 0.422
0.447HisLys: 0.447 ± 0.361
0.894HisLeu: 0.894 ± 0.537
0.447HisMet: 0.447 ± 0.361
0.447HisAsn: 0.447 ± 0.413
2.681HisPro: 2.681 ± 0.522
1.34HisGln: 1.34 ± 0.833
1.787HisArg: 1.787 ± 1.703
0.447HisSer: 0.447 ± 0.361
1.34HisThr: 1.34 ± 0.934
0.894HisVal: 0.894 ± 0.422
0.447HisTrp: 0.447 ± 0.426
1.34HisTyr: 1.34 ± 0.602
0.0HisXaa: 0.0 ± 0.0
Ile
3.575IleAla: 3.575 ± 1.498
0.894IleCys: 0.894 ± 0.642
4.468IleAsp: 4.468 ± 1.118
2.681IleGlu: 2.681 ± 1.145
3.128IlePhe: 3.128 ± 1.206
4.915IleGly: 4.915 ± 1.056
1.34IleHis: 1.34 ± 0.46
1.34IleIle: 1.34 ± 0.807
0.0IleLys: 0.0 ± 0.0
4.915IleLeu: 4.915 ± 1.499
0.447IleMet: 0.447 ± 0.426
4.021IleAsn: 4.021 ± 0.672
3.128IlePro: 3.128 ± 1.526
3.128IleGln: 3.128 ± 0.747
4.468IleArg: 4.468 ± 1.096
3.128IleSer: 3.128 ± 1.278
2.234IleThr: 2.234 ± 0.985
3.128IleVal: 3.128 ± 1.151
0.0IleTrp: 0.0 ± 0.0
0.894IleTyr: 0.894 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
2.234LysAla: 2.234 ± 1.149
1.34LysCys: 1.34 ± 0.668
1.787LysAsp: 1.787 ± 0.651
3.128LysGlu: 3.128 ± 1.411
2.681LysPhe: 2.681 ± 0.882
1.34LysGly: 1.34 ± 0.363
1.34LysHis: 1.34 ± 0.706
1.787LysIle: 1.787 ± 0.627
3.575LysLys: 3.575 ± 1.229
3.575LysLeu: 3.575 ± 1.829
0.447LysMet: 0.447 ± 0.361
3.575LysAsn: 3.575 ± 0.889
1.787LysPro: 1.787 ± 0.733
1.34LysGln: 1.34 ± 0.441
5.809LysArg: 5.809 ± 0.925
5.362LysSer: 5.362 ± 1.864
2.681LysThr: 2.681 ± 1.146
1.787LysVal: 1.787 ± 0.844
0.0LysTrp: 0.0 ± 0.0
3.128LysTyr: 3.128 ± 1.05
0.0LysXaa: 0.0 ± 0.0
Leu
4.915LeuAla: 4.915 ± 0.786
2.234LeuCys: 2.234 ± 1.589
5.809LeuAsp: 5.809 ± 0.907
6.256LeuGlu: 6.256 ± 1.518
4.468LeuPhe: 4.468 ± 1.7
4.915LeuGly: 4.915 ± 1.193
1.787LeuHis: 1.787 ± 0.833
4.021LeuIle: 4.021 ± 1.481
2.234LeuLys: 2.234 ± 0.718
9.83LeuLeu: 9.83 ± 2.352
0.447LeuMet: 0.447 ± 0.403
4.021LeuAsn: 4.021 ± 1.042
1.787LeuPro: 1.787 ± 1.293
5.362LeuGln: 5.362 ± 2.008
8.49LeuArg: 8.49 ± 2.684
4.468LeuSer: 4.468 ± 0.966
6.256LeuThr: 6.256 ± 1.31
5.362LeuVal: 5.362 ± 0.939
0.0LeuTrp: 0.0 ± 0.0
4.468LeuTyr: 4.468 ± 0.837
0.0LeuXaa: 0.0 ± 0.0
Met
1.34MetAla: 1.34 ± 0.934
1.34MetCys: 1.34 ± 0.587
0.447MetAsp: 0.447 ± 0.361
0.447MetGlu: 0.447 ± 0.426
0.894MetPhe: 0.894 ± 0.537
0.447MetGly: 0.447 ± 0.426
0.894MetHis: 0.894 ± 0.722
1.34MetIle: 1.34 ± 0.61
0.447MetLys: 0.447 ± 0.361
0.0MetLeu: 0.0 ± 0.0
0.894MetMet: 0.894 ± 0.625
2.234MetAsn: 2.234 ± 0.682
0.894MetPro: 0.894 ± 0.458
0.0MetGln: 0.0 ± 0.0
1.34MetArg: 1.34 ± 0.815
1.34MetSer: 1.34 ± 0.727
0.0MetThr: 0.0 ± 0.0
1.34MetVal: 1.34 ± 0.363
0.447MetTrp: 0.447 ± 0.541
0.894MetTyr: 0.894 ± 0.537
0.0MetXaa: 0.0 ± 0.0
Asn
3.128AsnAla: 3.128 ± 0.707
1.34AsnCys: 1.34 ± 1.083
1.787AsnAsp: 1.787 ± 0.629
0.894AsnGlu: 0.894 ± 0.582
2.681AsnPhe: 2.681 ± 0.988
2.234AsnGly: 2.234 ± 1.629
0.0AsnHis: 0.0 ± 0.0
3.575AsnIle: 3.575 ± 0.785
2.681AsnLys: 2.681 ± 1.149
3.575AsnLeu: 3.575 ± 1.308
0.447AsnMet: 0.447 ± 0.361
1.787AsnAsn: 1.787 ± 0.721
3.575AsnPro: 3.575 ± 1.398
1.787AsnGln: 1.787 ± 0.844
2.681AsnArg: 2.681 ± 1.335
2.681AsnSer: 2.681 ± 1.004
5.362AsnThr: 5.362 ± 1.469
1.787AsnVal: 1.787 ± 0.688
1.787AsnTrp: 1.787 ± 0.915
2.234AsnTyr: 2.234 ± 1.058
0.0AsnXaa: 0.0 ± 0.0
Pro
6.256ProAla: 6.256 ± 3.113
1.787ProCys: 1.787 ± 1.038
4.468ProAsp: 4.468 ± 1.824
5.362ProGlu: 5.362 ± 1.641
1.787ProPhe: 1.787 ± 0.47
0.447ProGly: 0.447 ± 0.394
0.447ProHis: 0.447 ± 0.394
1.787ProIle: 1.787 ± 0.92
3.128ProLys: 3.128 ± 0.99
4.915ProLeu: 4.915 ± 1.193
0.447ProMet: 0.447 ± 0.426
1.787ProAsn: 1.787 ± 0.629
4.915ProPro: 4.915 ± 1.263
1.34ProGln: 1.34 ± 0.658
3.575ProArg: 3.575 ± 0.761
1.787ProSer: 1.787 ± 0.89
3.575ProThr: 3.575 ± 1.63
5.809ProVal: 5.809 ± 1.375
0.0ProTrp: 0.0 ± 0.0
1.787ProTyr: 1.787 ± 1.038
0.0ProXaa: 0.0 ± 0.0
Gln
1.34GlnAla: 1.34 ± 1.277
0.447GlnCys: 0.447 ± 0.541
3.575GlnAsp: 3.575 ± 0.889
3.128GlnGlu: 3.128 ± 0.917
2.234GlnPhe: 2.234 ± 1.329
2.681GlnGly: 2.681 ± 0.672
0.447GlnHis: 0.447 ± 0.413
2.234GlnIle: 2.234 ± 0.491
1.787GlnLys: 1.787 ± 1.138
3.575GlnLeu: 3.575 ± 0.847
0.894GlnMet: 0.894 ± 0.422
3.128GlnAsn: 3.128 ± 1.073
1.787GlnPro: 1.787 ± 0.629
0.0GlnGln: 0.0 ± 0.0
2.234GlnArg: 2.234 ± 0.861
2.681GlnSer: 2.681 ± 0.47
2.681GlnThr: 2.681 ± 0.654
3.128GlnVal: 3.128 ± 1.05
0.894GlnTrp: 0.894 ± 0.722
0.447GlnTyr: 0.447 ± 0.413
0.0GlnXaa: 0.0 ± 0.0
Arg
2.234ArgAla: 2.234 ± 0.931
1.34ArgCys: 1.34 ± 1.07
3.128ArgAsp: 3.128 ± 1.787
2.234ArgGlu: 2.234 ± 0.868
1.34ArgPhe: 1.34 ± 0.77
5.809ArgGly: 5.809 ± 2.615
2.681ArgHis: 2.681 ± 1.042
3.575ArgIle: 3.575 ± 1.443
4.021ArgLys: 4.021 ± 1.194
9.383ArgLeu: 9.383 ± 2.467
1.787ArgMet: 1.787 ± 0.825
1.787ArgAsn: 1.787 ± 1.108
3.575ArgPro: 3.575 ± 1.722
2.681ArgGln: 2.681 ± 0.862
5.362ArgArg: 5.362 ± 2.034
4.021ArgSer: 4.021 ± 1.228
4.021ArgThr: 4.021 ± 1.195
6.702ArgVal: 6.702 ± 2.421
0.0ArgTrp: 0.0 ± 0.0
2.234ArgTyr: 2.234 ± 1.176
0.0ArgXaa: 0.0 ± 0.0
Ser
4.021SerAla: 4.021 ± 1.618
2.234SerCys: 2.234 ± 0.913
5.809SerAsp: 5.809 ± 1.522
3.128SerGlu: 3.128 ± 1.397
3.575SerPhe: 3.575 ± 0.844
7.149SerGly: 7.149 ± 1.322
0.894SerHis: 0.894 ± 0.59
3.575SerIle: 3.575 ± 0.916
3.575SerLys: 3.575 ± 1.593
4.915SerLeu: 4.915 ± 1.124
1.34SerMet: 1.34 ± 0.748
0.894SerAsn: 0.894 ± 0.422
3.575SerPro: 3.575 ± 0.906
3.128SerGln: 3.128 ± 0.626
5.362SerArg: 5.362 ± 1.866
4.915SerSer: 4.915 ± 1.7
4.021SerThr: 4.021 ± 0.72
4.021SerVal: 4.021 ± 1.079
1.34SerTrp: 1.34 ± 0.807
3.128SerTyr: 3.128 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
2.234ThrAla: 2.234 ± 1.523
0.447ThrCys: 0.447 ± 0.493
3.128ThrAsp: 3.128 ± 0.41
5.362ThrGlu: 5.362 ± 1.092
4.021ThrPhe: 4.021 ± 1.682
6.256ThrGly: 6.256 ± 1.619
0.894ThrHis: 0.894 ± 0.422
4.468ThrIle: 4.468 ± 1.698
2.234ThrLys: 2.234 ± 1.089
5.809ThrLeu: 5.809 ± 1.697
0.894ThrMet: 0.894 ± 0.422
3.128ThrAsn: 3.128 ± 0.911
5.809ThrPro: 5.809 ± 1.816
4.021ThrGln: 4.021 ± 0.879
4.021ThrArg: 4.021 ± 1.067
4.915ThrSer: 4.915 ± 0.561
2.234ThrThr: 2.234 ± 0.751
5.362ThrVal: 5.362 ± 0.935
0.447ThrTrp: 0.447 ± 0.541
1.34ThrTyr: 1.34 ± 0.753
0.0ThrXaa: 0.0 ± 0.0
Val
4.021ValAla: 4.021 ± 1.429
0.447ValCys: 0.447 ± 0.361
5.362ValAsp: 5.362 ± 1.623
3.128ValGlu: 3.128 ± 1.295
3.575ValPhe: 3.575 ± 0.661
2.681ValGly: 2.681 ± 0.522
0.894ValHis: 0.894 ± 0.46
3.128ValIle: 3.128 ± 0.469
3.575ValLys: 3.575 ± 1.157
1.787ValLeu: 1.787 ± 0.89
0.894ValMet: 0.894 ± 0.444
0.894ValAsn: 0.894 ± 0.481
3.575ValPro: 3.575 ± 1.4
3.128ValGln: 3.128 ± 0.792
4.021ValArg: 4.021 ± 0.773
8.043ValSer: 8.043 ± 2.387
6.256ValThr: 6.256 ± 1.79
4.915ValVal: 4.915 ± 1.087
1.34ValTrp: 1.34 ± 0.858
1.34ValTyr: 1.34 ± 0.858
0.0ValXaa: 0.0 ± 0.0
Trp
0.447TrpAla: 0.447 ± 0.361
0.0TrpCys: 0.0 ± 0.0
0.894TrpAsp: 0.894 ± 0.537
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.894TrpGly: 0.894 ± 0.458
0.0TrpHis: 0.0 ± 0.0
1.34TrpIle: 1.34 ± 1.083
2.234TrpLys: 2.234 ± 1.329
1.787TrpLeu: 1.787 ± 0.733
0.0TrpMet: 0.0 ± 0.0
0.447TrpAsn: 0.447 ± 0.413
0.0TrpPro: 0.0 ± 0.0
0.447TrpGln: 0.447 ± 0.413
2.234TrpArg: 2.234 ± 1.43
0.447TrpSer: 0.447 ± 0.426
0.447TrpThr: 0.447 ± 0.426
0.447TrpVal: 0.447 ± 0.413
0.0TrpTrp: 0.0 ± 0.0
0.447TrpTyr: 0.447 ± 0.361
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.681TyrAla: 2.681 ± 0.931
0.0TyrCys: 0.0 ± 0.0
2.234TyrAsp: 2.234 ± 0.839
1.787TyrGlu: 1.787 ± 1.235
1.787TyrPhe: 1.787 ± 0.571
1.34TyrGly: 1.34 ± 0.363
0.447TyrHis: 0.447 ± 0.413
0.0TyrIle: 0.0 ± 0.0
2.234TyrLys: 2.234 ± 0.718
3.128TyrLeu: 3.128 ± 0.827
1.787TyrMet: 1.787 ± 0.968
1.787TyrAsn: 1.787 ± 0.627
0.447TyrPro: 0.447 ± 0.394
1.34TyrGln: 1.34 ± 0.441
1.787TyrArg: 1.787 ± 0.667
2.234TyrSer: 2.234 ± 0.92
4.021TyrThr: 4.021 ± 1.165
2.234TyrVal: 2.234 ± 0.682
0.447TyrTrp: 0.447 ± 0.413
2.234TyrTyr: 2.234 ± 1.43
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2239 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski