Amino acid dipeptide frequency for Bos taurus papillomavirus 13

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
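As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a set of sequences, converts the counts to per mille, and compares each value with the uniform expectation of 2.5‰. It is only a sketch under stated assumptions: the function names `dipeptide_per_mille` and `classify` are hypothetical, the sequences are toy one-letter strings rather than real proteins, and normalising by the total number of pairs is an assumption, not necessarily the exact procedure used for the table.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: sequences use one-letter codes and frequencies are
    normalised by the total number of overlapping pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (not real viral proteins).
freqs = dipeptide_per_mille(["MAAL", "ALLA"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With only a handful of residues every observed pair is far above 2.5‰; the comparison becomes meaningful only for a whole proteome.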

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.821 ± 0.699 | 2.495 ± 1.236 | 3.326 ± 0.541 | 6.237 ± 1.497 | 2.911 ± 0.913 | 9.979 ± 2.088 | 0.0 ± 0.0 | 2.495 ± 0.848 | 4.158 ± 1.25 | 5.821 ± 2.472 | 1.247 ± 0.644 | 3.326 ± 0.696 | 3.326 ± 1.104 | 3.742 ± 1.472 | 5.821 ± 1.734 | 5.821 ± 1.295 | 4.574 ± 1.741 | 4.99 ± 0.862 | 0.416 ± 0.363 | 2.079 ± 1.022 | 0.0 ± 0.0 |
| Cys | 2.079 ± 1.096 | 1.247 ± 1.295 | 0.0 ± 0.0 | 1.247 ± 0.538 | 1.247 ± 0.711 | 1.663 ± 1.305 | 0.416 ± 0.521 | 0.416 ± 0.546 | 1.663 ± 1.067 | 2.495 ± 1.628 | 0.416 ± 0.546 | 0.416 ± 0.333 | 1.247 ± 0.658 | 0.416 ± 0.4 | 1.247 ± 0.95 | 3.742 ± 1.584 | 2.911 ± 1.357 | 0.416 ± 0.363 | 0.416 ± 0.333 | 1.247 ± 1.204 | 0.0 ± 0.0 |
| Asp | 4.574 ± 1.137 | 1.663 ± 0.96 | 2.495 ± 1.055 | 2.495 ± 1.215 | 4.574 ± 0.777 | 4.574 ± 1.452 | 1.663 ± 0.608 | 2.079 ± 0.777 | 3.326 ± 0.731 | 4.99 ± 1.596 | 0.832 ± 0.428 | 2.495 ± 0.844 | 2.079 ± 0.788 | 0.832 ± 0.453 | 2.911 ± 0.998 | 4.158 ± 1.624 | 4.158 ± 1.865 | 1.247 ± 0.696 | 0.416 ± 0.333 | 0.832 ± 0.429 | 0.0 ± 0.0 |
| Glu | 4.158 ± 0.865 | 0.832 ± 0.666 | 4.99 ± 1.313 | 7.9 ± 2.077 | 0.832 ± 0.801 | 4.158 ± 0.864 | 0.832 ± 0.647 | 2.911 ± 1.356 | 2.911 ± 0.8 | 4.574 ± 0.969 | 0.416 ± 0.4 | 3.742 ± 0.821 | 3.742 ± 1.836 | 2.911 ± 1.192 | 3.326 ± 1.175 | 2.911 ± 1.278 | 5.405 ± 0.901 | 1.247 ± 0.598 | 0.416 ± 0.333 | 1.663 ± 1.087 | 0.0 ± 0.0 |
| Phe | 2.495 ± 0.587 | 0.416 ± 0.546 | 1.663 ± 0.747 | 2.079 ± 0.967 | 2.495 ± 1.169 | 4.158 ± 0.863 | 0.832 ± 0.712 | 1.663 ± 0.621 | 2.911 ± 1.218 | 4.99 ± 2.51 | 0.832 ± 0.681 | 2.079 ± 1.182 | 1.247 ± 0.455 | 0.832 ± 0.666 | 3.326 ± 0.777 | 3.326 ± 1.377 | 2.079 ± 0.678 | 1.247 ± 0.644 | 1.247 ± 0.706 | 0.832 ± 0.727 | 0.0 ± 0.0 |
| Gly | 6.653 ± 0.962 | 2.079 ± 1.118 | 4.99 ± 1.323 | 3.326 ± 1.016 | 1.663 ± 0.857 | 5.821 ± 1.942 | 2.079 ± 0.831 | 2.495 ± 1.302 | 2.911 ± 2.016 | 7.9 ± 1.722 | 0.832 ± 0.453 | 2.495 ± 0.867 | 4.99 ± 2.451 | 2.911 ± 1.164 | 3.742 ± 0.702 | 9.148 ± 1.008 | 4.99 ± 1.012 | 4.158 ± 0.764 | 0.832 ± 0.801 | 0.832 ± 0.649 | 0.0 ± 0.0 |
| His | 1.663 ± 0.642 | 0.416 ± 0.4 | 0.832 ± 0.666 | 0.832 ± 0.404 | 2.495 ± 0.727 | 2.079 ± 1.248 | 0.0 ± 0.0 | 0.832 ± 0.453 | 1.247 ± 1.0 | 2.495 ± 0.816 | 0.416 ± 0.363 | 0.416 ± 0.363 | 2.495 ± 0.88 | 1.247 ± 1.261 | 2.079 ± 1.062 | 0.832 ± 0.384 | 0.416 ± 0.367 | 2.495 ± 1.276 | 0.0 ± 0.0 | 0.832 ± 0.384 | 0.0 ± 0.0 |
| Ile | 2.495 ± 1.004 | 0.416 ± 0.363 | 3.742 ± 1.502 | 3.326 ± 0.88 | 2.079 ± 0.903 | 2.911 ± 1.184 | 0.416 ± 0.333 | 2.079 ± 1.468 | 1.663 ± 0.787 | 4.574 ± 0.642 | 0.0 ± 0.0 | 0.832 ± 0.452 | 3.326 ± 2.092 | 0.832 ± 0.727 | 2.079 ± 1.229 | 1.663 ± 0.651 | 2.911 ± 0.704 | 0.416 ± 0.367 | 0.416 ± 0.363 | 0.832 ± 0.494 | 0.0 ± 0.0 |
| Lys | 3.742 ± 1.404 | 0.832 ± 0.453 | 1.663 ± 0.951 | 4.574 ± 1.908 | 0.832 ± 0.452 | 2.911 ± 0.635 | 2.495 ± 0.893 | 2.911 ± 1.372 | 5.405 ± 2.395 | 4.158 ± 1.048 | 0.416 ± 0.363 | 3.326 ± 1.133 | 0.832 ± 0.632 | 2.079 ± 0.511 | 5.405 ± 1.329 | 4.99 ± 2.422 | 2.911 ± 0.878 | 3.326 ± 1.368 | 0.0 ± 0.0 | 0.832 ± 0.613 | 0.0 ± 0.0 |
| Leu | 7.484 ± 1.482 | 3.326 ± 1.096 | 7.069 ± 1.373 | 4.99 ± 1.222 | 4.574 ± 2.273 | 6.653 ± 0.734 | 2.911 ± 0.911 | 3.326 ± 1.254 | 6.653 ± 1.629 | 13.721 ± 4.225 | 1.247 ± 0.581 | 1.663 ± 0.735 | 4.574 ± 1.263 | 5.405 ± 1.233 | 2.495 ± 0.517 | 6.653 ± 0.891 | 3.742 ± 1.27 | 4.158 ± 1.407 | 2.495 ± 1.236 | 4.574 ± 1.293 | 0.0 ± 0.0 |
| Met | 2.079 ± 0.745 | 0.0 ± 0.0 | 0.416 ± 0.546 | 1.247 ± 0.718 | 0.416 ± 0.363 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.416 ± 0.546 | 0.0 ± 0.0 | 1.247 ± 0.827 | 0.416 ± 0.363 | 0.832 ± 0.453 | 1.247 ± 0.653 | 1.663 ± 0.895 | 0.832 ± 0.384 | 0.832 ± 0.494 | 0.0 ± 0.0 | 1.247 ± 0.747 | 0.0 ± 0.0 | 0.416 ± 0.367 | 0.0 ± 0.0 |
| Asn | 4.158 ± 1.651 | 0.832 ± 0.384 | 0.832 ± 0.666 | 2.079 ± 1.076 | 0.0 ± 0.0 | 1.663 ± 0.803 | 1.247 ± 0.598 | 2.079 ± 0.694 | 0.832 ± 0.727 | 3.326 ± 1.291 | 0.0 ± 0.0 | 1.663 ± 1.037 | 1.663 ± 0.739 | 2.911 ± 1.012 | 1.663 ± 0.801 | 2.911 ± 0.997 | 2.911 ± 0.799 | 1.663 ± 0.669 | 2.079 ± 0.511 | 0.832 ± 0.404 | 0.0 ± 0.0 |
| Pro | 7.069 ± 1.026 | 2.079 ± 1.197 | 5.821 ± 2.509 | 2.079 ± 0.98 | 1.663 ± 1.228 | 2.495 ± 0.889 | 0.832 ± 0.801 | 0.832 ± 0.494 | 2.079 ± 0.402 | 7.484 ± 1.447 | 0.416 ± 0.487 | 2.079 ± 0.714 | 5.821 ± 2.769 | 0.832 ± 0.693 | 4.158 ± 2.068 | 4.574 ± 2.042 | 4.574 ± 1.875 | 4.574 ± 1.652 | 0.416 ± 0.4 | 1.663 ± 1.043 | 0.0 ± 0.0 |
| Gln | 4.158 ± 1.684 | 0.416 ± 0.333 | 1.663 ± 1.088 | 2.495 ± 1.668 | 1.247 ± 1.09 | 4.99 ± 0.952 | 0.0 ± 0.0 | 2.495 ± 0.557 | 1.247 ± 0.632 | 3.326 ± 1.066 | 0.832 ± 0.384 | 0.832 ± 0.452 | 3.742 ± 0.903 | 2.495 ± 1.337 | 1.247 ± 0.738 | 2.911 ± 1.578 | 3.326 ± 1.124 | 2.911 ± 0.861 | 0.832 ± 0.666 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Arg | 5.405 ± 0.624 | 2.911 ± 1.876 | 1.247 ± 1.201 | 1.247 ± 0.747 | 2.495 ± 1.055 | 4.158 ± 1.147 | 2.911 ± 1.223 | 0.416 ± 0.4 | 6.653 ± 1.583 | 3.742 ± 1.192 | 0.416 ± 0.5 | 2.495 ± 1.194 | 3.742 ± 1.24 | 1.663 ± 1.088 | 4.158 ± 0.877 | 2.495 ± 1.025 | 3.326 ± 1.454 | 5.405 ± 1.268 | 0.416 ± 0.618 | 4.158 ± 0.867 | 0.0 ± 0.0 |
| Ser | 4.99 ± 2.079 | 0.832 ± 0.703 | 3.326 ± 0.965 | 4.158 ± 0.902 | 2.911 ± 1.293 | 5.405 ± 1.15 | 2.495 ± 1.373 | 3.742 ± 1.378 | 4.158 ± 1.027 | 8.732 ± 1.755 | 2.079 ± 1.324 | 2.495 ± 0.856 | 5.405 ± 1.83 | 2.495 ± 0.771 | 3.326 ± 1.097 | 7.069 ± 2.328 | 6.653 ± 2.606 | 4.99 ± 1.301 | 0.832 ± 0.384 | 0.832 ± 0.666 | 0.0 ± 0.0 |
| Thr | 3.326 ± 1.303 | 1.663 ± 0.808 | 2.911 ± 1.372 | 4.99 ± 1.391 | 3.742 ± 1.011 | 6.237 ± 1.512 | 0.832 ± 0.551 | 2.495 ± 1.45 | 1.663 ± 1.077 | 3.742 ± 0.88 | 1.247 ± 0.598 | 2.911 ± 0.75 | 5.405 ± 1.336 | 1.663 ± 0.649 | 3.742 ± 1.388 | 4.99 ± 1.888 | 4.574 ± 0.78 | 6.653 ± 2.741 | 1.247 ± 0.814 | 2.495 ± 1.014 | 0.0 ± 0.0 |
| Val | 3.326 ± 1.046 | 1.247 ± 1.07 | 1.663 ± 0.57 | 2.911 ± 0.56 | 2.079 ± 1.324 | 3.742 ± 1.78 | 1.663 ± 0.857 | 1.663 ± 0.857 | 2.079 ± 0.776 | 4.99 ± 0.779 | 0.0 ± 0.0 | 0.416 ± 0.363 | 4.574 ± 1.533 | 4.574 ± 1.812 | 4.99 ± 0.84 | 4.158 ± 1.458 | 4.574 ± 1.174 | 2.079 ± 1.107 | 0.832 ± 0.453 | 4.158 ± 1.101 | 0.0 ± 0.0 |
| Trp | 0.832 ± 0.452 | 0.416 ± 0.546 | 1.663 ± 0.637 | 0.416 ± 0.363 | 1.663 ± 0.608 | 0.416 ± 0.333 | 0.416 ± 0.618 | 0.416 ± 0.333 | 1.247 ± 0.821 | 1.247 ± 0.706 | 0.0 ± 0.0 | 0.416 ± 0.363 | 0.0 ± 0.0 | 1.247 ± 0.455 | 0.416 ± 0.333 | 0.832 ± 0.494 | 1.663 ± 0.86 | 1.247 ± 0.711 | 0.0 ± 0.0 | 0.416 ± 0.4 | 0.0 ± 0.0 |
| Tyr | 2.079 ± 0.686 | 0.832 ± 0.712 | 2.495 ± 0.718 | 1.247 ± 0.387 | 0.832 ± 0.404 | 0.832 ± 0.693 | 2.079 ± 0.511 | 1.247 ± 0.417 | 0.832 ± 0.727 | 4.158 ± 1.968 | 0.832 ± 0.801 | 0.416 ± 0.546 | 2.079 ± 0.648 | 0.416 ± 0.333 | 2.495 ± 0.881 | 2.911 ± 1.534 | 0.832 ± 0.494 | 1.247 ± 0.705 | 1.663 ± 0.904 | 1.663 ± 0.742 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 8 proteins (2406 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
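Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for every resampled set, and the spread of those estimates is taken as the error. This is a minimal sketch, not the database's actual code. The names `pair_per_mille` and `bootstrap_error` are hypothetical, the input is a toy list of one-letter sequences, and using the sample standard deviation as the reported error is an assumption (the source only states 100 replicates at the protein level).

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the sample standard deviation of the
    replicate estimates; the seed only makes the sketch reproducible.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the set size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return statistics.stdev(estimates)


# Toy input (not real viral proteins).
toy = ["MAAL", "ALLA", "LLLL"]
print(pair_per_mille(toy, "AL"), "±", bootstrap_error(toy, "AL"))
```

Because whole proteins are the resampling unit, a proteome with only a few proteins can give large errors relative to the frequencies themselves.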

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski