Amino acid dipepetide frequency for Human papillomavirus type XS2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.224AlaAla: 5.224 ± 1.061
1.741AlaCys: 1.741 ± 0.651
3.047AlaAsp: 3.047 ± 1.682
3.483AlaGlu: 3.483 ± 1.255
3.047AlaPhe: 3.047 ± 0.885
5.66AlaGly: 5.66 ± 0.864
1.306AlaHis: 1.306 ± 0.35
2.177AlaIle: 2.177 ± 0.282
3.483AlaLys: 3.483 ± 1.548
3.918AlaLeu: 3.918 ± 0.833
0.871AlaMet: 0.871 ± 0.482
0.871AlaAsn: 0.871 ± 0.362
3.047AlaPro: 3.047 ± 1.261
3.483AlaGln: 3.483 ± 0.502
4.354AlaArg: 4.354 ± 0.497
4.354AlaSer: 4.354 ± 0.873
3.918AlaThr: 3.918 ± 0.959
3.483AlaVal: 3.483 ± 0.638
0.0AlaTrp: 0.0 ± 0.0
3.483AlaTyr: 3.483 ± 0.572
0.0AlaXaa: 0.0 ± 0.0
Cys
1.306CysAla: 1.306 ± 0.609
0.435CysCys: 0.435 ± 0.297
1.306CysAsp: 1.306 ± 0.614
2.177CysGlu: 2.177 ± 1.245
0.435CysPhe: 0.435 ± 0.434
0.871CysGly: 0.871 ± 0.639
0.435CysHis: 0.435 ± 0.476
1.306CysIle: 1.306 ± 0.538
1.741CysLys: 1.741 ± 1.044
1.306CysLeu: 1.306 ± 0.953
0.435CysMet: 0.435 ± 0.297
0.871CysAsn: 0.871 ± 0.881
2.177CysPro: 2.177 ± 0.901
3.047CysGln: 3.047 ± 0.961
0.871CysArg: 0.871 ± 0.522
0.871CysSer: 0.871 ± 0.511
3.483CysThr: 3.483 ± 1.046
1.741CysVal: 1.741 ± 0.651
1.741CysTrp: 1.741 ± 0.55
1.306CysTyr: 1.306 ± 0.609
0.0CysXaa: 0.0 ± 0.0
Asp
4.789AspAla: 4.789 ± 0.952
2.177AspCys: 2.177 ± 1.321
6.095AspAsp: 6.095 ± 0.999
3.483AspGlu: 3.483 ± 1.074
1.741AspPhe: 1.741 ± 0.603
4.354AspGly: 4.354 ± 1.34
0.435AspHis: 0.435 ± 0.297
3.047AspIle: 3.047 ± 1.337
1.306AspLys: 1.306 ± 0.609
2.612AspLeu: 2.612 ± 0.952
0.871AspMet: 0.871 ± 0.432
2.612AspAsn: 2.612 ± 0.862
3.483AspPro: 3.483 ± 0.797
1.306AspGln: 1.306 ± 0.822
2.612AspArg: 2.612 ± 0.805
4.789AspSer: 4.789 ± 1.3
9.578AspThr: 9.578 ± 1.258
7.401AspVal: 7.401 ± 1.407
1.741AspTrp: 1.741 ± 0.496
1.306AspTyr: 1.306 ± 0.802
0.0AspXaa: 0.0 ± 0.0
Glu
3.047GluAla: 3.047 ± 0.652
0.871GluCys: 0.871 ± 0.601
4.789GluAsp: 4.789 ± 0.958
8.707GluGlu: 8.707 ± 2.617
0.871GluPhe: 0.871 ± 0.44
3.047GluGly: 3.047 ± 1.022
0.435GluHis: 0.435 ± 0.297
2.177GluIle: 2.177 ± 0.424
1.741GluLys: 1.741 ± 0.948
4.354GluLeu: 4.354 ± 1.165
0.435GluMet: 0.435 ± 0.434
2.177GluAsn: 2.177 ± 0.748
5.224GluPro: 5.224 ± 1.137
3.047GluGln: 3.047 ± 0.896
2.612GluArg: 2.612 ± 0.74
3.047GluSer: 3.047 ± 1.245
2.612GluThr: 2.612 ± 0.814
3.918GluVal: 3.918 ± 1.44
0.871GluTrp: 0.871 ± 0.595
2.177GluTyr: 2.177 ± 1.001
0.0GluXaa: 0.0 ± 0.0
Phe
1.741PheAla: 1.741 ± 0.769
0.435PheCys: 0.435 ± 0.297
1.306PheAsp: 1.306 ± 0.624
1.306PheGlu: 1.306 ± 1.061
2.177PhePhe: 2.177 ± 0.748
2.177PheGly: 2.177 ± 0.618
0.0PheHis: 0.0 ± 0.0
3.047PheIle: 3.047 ± 1.257
3.047PheLys: 3.047 ± 0.961
5.66PheLeu: 5.66 ± 1.218
0.871PheMet: 0.871 ± 0.469
1.741PheAsn: 1.741 ± 1.044
0.871PhePro: 0.871 ± 0.362
1.306PheGln: 1.306 ± 0.518
2.177PheArg: 2.177 ± 0.72
2.612PheSer: 2.612 ± 0.699
2.177PheThr: 2.177 ± 0.444
1.741PheVal: 1.741 ± 0.615
1.306PheTrp: 1.306 ± 0.538
2.177PheTyr: 2.177 ± 0.444
0.0PheXaa: 0.0 ± 0.0
Gly
3.918GlyAla: 3.918 ± 0.829
1.741GlyCys: 1.741 ± 0.496
8.707GlyAsp: 8.707 ± 1.853
4.354GlyGlu: 4.354 ± 1.297
1.741GlyPhe: 1.741 ± 0.788
6.095GlyGly: 6.095 ± 1.262
3.047GlyHis: 3.047 ± 0.859
2.612GlyIle: 2.612 ± 0.904
2.177GlyLys: 2.177 ± 0.748
4.354GlyLeu: 4.354 ± 0.993
1.306GlyMet: 1.306 ± 0.659
1.306GlyAsn: 1.306 ± 0.591
4.354GlyPro: 4.354 ± 1.052
2.612GlyGln: 2.612 ± 0.765
3.918GlyArg: 3.918 ± 0.776
3.483GlySer: 3.483 ± 1.123
6.53GlyThr: 6.53 ± 3.685
5.66GlyVal: 5.66 ± 1.364
0.435GlyTrp: 0.435 ± 0.297
2.177GlyTyr: 2.177 ± 0.641
0.0GlyXaa: 0.0 ± 0.0
His
0.871HisAla: 0.871 ± 0.44
0.871HisCys: 0.871 ± 0.952
0.871HisAsp: 0.871 ± 0.385
0.871HisGlu: 0.871 ± 0.867
0.871HisPhe: 0.871 ± 0.362
2.612HisGly: 2.612 ± 0.747
0.0HisHis: 0.0 ± 0.0
1.306HisIle: 1.306 ± 0.401
0.871HisLys: 0.871 ± 0.639
1.306HisLeu: 1.306 ± 0.878
0.435HisMet: 0.435 ± 0.297
0.435HisAsn: 0.435 ± 0.361
1.306HisPro: 1.306 ± 0.802
0.435HisGln: 0.435 ± 0.434
2.612HisArg: 2.612 ± 0.927
0.435HisSer: 0.435 ± 0.297
1.306HisThr: 1.306 ± 0.442
0.871HisVal: 0.871 ± 0.867
0.871HisTrp: 0.871 ± 0.867
1.741HisTyr: 1.741 ± 0.615
0.0HisXaa: 0.0 ± 0.0
Ile
2.177IleAla: 2.177 ± 0.648
1.306IleCys: 1.306 ± 0.659
2.177IleAsp: 2.177 ± 1.015
2.612IleGlu: 2.612 ± 0.768
1.741IlePhe: 1.741 ± 0.795
2.612IleGly: 2.612 ± 0.779
1.306IleHis: 1.306 ± 1.301
2.177IleIle: 2.177 ± 0.933
1.741IleLys: 1.741 ± 0.717
3.483IleLeu: 3.483 ± 1.277
1.741IleMet: 1.741 ± 1.074
0.871IleAsn: 0.871 ± 0.385
4.789IlePro: 4.789 ± 1.071
1.306IleGln: 1.306 ± 0.556
1.306IleArg: 1.306 ± 0.671
3.483IleSer: 3.483 ± 0.673
3.918IleThr: 3.918 ± 0.966
3.483IleVal: 3.483 ± 1.286
0.871IleTrp: 0.871 ± 0.44
1.306IleTyr: 1.306 ± 0.834
0.0IleXaa: 0.0 ± 0.0
Lys
3.047LysAla: 3.047 ± 1.525
1.306LysCys: 1.306 ± 0.588
2.612LysAsp: 2.612 ± 0.893
2.612LysGlu: 2.612 ± 1.017
3.047LysPhe: 3.047 ± 1.191
2.177LysGly: 2.177 ± 0.613
1.306LysHis: 1.306 ± 0.609
0.871LysIle: 0.871 ± 0.44
1.741LysLys: 1.741 ± 0.496
3.483LysLeu: 3.483 ± 0.627
0.871LysMet: 0.871 ± 0.48
1.306LysAsn: 1.306 ± 0.704
1.741LysPro: 1.741 ± 0.479
1.741LysGln: 1.741 ± 1.15
5.224LysArg: 5.224 ± 1.095
2.612LysSer: 2.612 ± 1.112
2.612LysThr: 2.612 ± 0.984
3.918LysVal: 3.918 ± 1.081
0.0LysTrp: 0.0 ± 0.0
2.177LysTyr: 2.177 ± 0.548
0.0LysXaa: 0.0 ± 0.0
Leu
3.047LeuAla: 3.047 ± 0.793
3.047LeuCys: 3.047 ± 0.998
4.789LeuAsp: 4.789 ± 0.507
4.789LeuGlu: 4.789 ± 0.988
2.177LeuPhe: 2.177 ± 1.624
5.66LeuGly: 5.66 ± 0.812
3.047LeuHis: 3.047 ± 1.929
3.483LeuIle: 3.483 ± 1.377
2.612LeuLys: 2.612 ± 0.762
10.448LeuLeu: 10.448 ± 2.747
1.306LeuMet: 1.306 ± 0.583
1.741LeuAsn: 1.741 ± 0.946
2.612LeuPro: 2.612 ± 1.268
5.66LeuGln: 5.66 ± 1.484
6.53LeuArg: 6.53 ± 1.717
6.966LeuSer: 6.966 ± 1.406
5.224LeuThr: 5.224 ± 1.044
4.354LeuVal: 4.354 ± 1.102
1.306LeuTrp: 1.306 ± 0.35
5.224LeuTyr: 5.224 ± 1.141
0.0LeuXaa: 0.0 ± 0.0
Met
1.741MetAla: 1.741 ± 1.044
1.306MetCys: 1.306 ± 0.614
1.306MetAsp: 1.306 ± 0.35
0.435MetGlu: 0.435 ± 0.434
0.871MetPhe: 0.871 ± 0.482
0.435MetGly: 0.435 ± 0.361
0.435MetHis: 0.435 ± 0.44
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
1.741MetLeu: 1.741 ± 0.791
0.0MetMet: 0.0 ± 0.0
0.871MetAsn: 0.871 ± 0.362
0.0MetPro: 0.0 ± 0.0
0.435MetGln: 0.435 ± 0.361
0.435MetArg: 0.435 ± 0.354
2.177MetSer: 2.177 ± 0.596
1.306MetThr: 1.306 ± 0.35
1.741MetVal: 1.741 ± 0.597
0.435MetTrp: 0.435 ± 0.434
0.871MetTyr: 0.871 ± 1.108
0.0MetXaa: 0.0 ± 0.0
Asn
2.177AsnAla: 2.177 ± 0.758
0.871AsnCys: 0.871 ± 0.595
1.306AsnAsp: 1.306 ± 0.401
1.306AsnGlu: 1.306 ± 0.711
1.306AsnPhe: 1.306 ± 0.713
1.741AsnGly: 1.741 ± 0.651
0.0AsnHis: 0.0 ± 0.0
3.047AsnIle: 3.047 ± 0.81
3.047AsnLys: 3.047 ± 1.191
1.306AsnLeu: 1.306 ± 0.58
0.871AsnMet: 0.871 ± 0.362
1.306AsnAsn: 1.306 ± 0.713
2.177AsnPro: 2.177 ± 0.641
1.306AsnGln: 1.306 ± 0.388
0.871AsnArg: 0.871 ± 0.362
2.612AsnSer: 2.612 ± 0.66
1.741AsnThr: 1.741 ± 0.771
0.871AsnVal: 0.871 ± 0.385
0.435AsnTrp: 0.435 ± 0.297
0.435AsnTyr: 0.435 ± 0.354
0.0AsnXaa: 0.0 ± 0.0
Pro
4.354ProAla: 4.354 ± 1.868
0.435ProCys: 0.435 ± 0.361
5.66ProAsp: 5.66 ± 1.617
3.047ProGlu: 3.047 ± 0.801
3.483ProPhe: 3.483 ± 0.958
2.612ProGly: 2.612 ± 0.702
0.435ProHis: 0.435 ± 0.44
2.612ProIle: 2.612 ± 1.263
3.918ProLys: 3.918 ± 0.479
8.707ProLeu: 8.707 ± 0.794
1.306ProMet: 1.306 ± 0.35
1.741ProAsn: 1.741 ± 0.762
5.224ProPro: 5.224 ± 1.201
0.871ProGln: 0.871 ± 0.548
1.306ProArg: 1.306 ± 0.579
5.66ProSer: 5.66 ± 1.547
3.047ProThr: 3.047 ± 1.055
4.789ProVal: 4.789 ± 1.275
0.435ProTrp: 0.435 ± 0.434
2.612ProTyr: 2.612 ± 0.952
0.0ProXaa: 0.0 ± 0.0
Gln
3.047GlnAla: 3.047 ± 1.255
1.306GlnCys: 1.306 ± 0.671
3.047GlnAsp: 3.047 ± 0.93
2.177GlnGlu: 2.177 ± 1.234
2.177GlnPhe: 2.177 ± 0.89
2.177GlnGly: 2.177 ± 1.094
0.0GlnHis: 0.0 ± 0.0
1.741GlnIle: 1.741 ± 0.84
1.306GlnLys: 1.306 ± 0.442
3.918GlnLeu: 3.918 ± 0.715
0.871GlnMet: 0.871 ± 0.524
1.741GlnAsn: 1.741 ± 0.881
3.047GlnPro: 3.047 ± 0.652
2.177GlnGln: 2.177 ± 0.992
2.612GlnArg: 2.612 ± 2.601
2.177GlnSer: 2.177 ± 0.613
3.047GlnThr: 3.047 ± 0.393
4.354GlnVal: 4.354 ± 1.247
2.177GlnTrp: 2.177 ± 0.657
0.435GlnTyr: 0.435 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
4.789ArgAla: 4.789 ± 0.585
1.741ArgCys: 1.741 ± 1.045
3.047ArgAsp: 3.047 ± 1.646
2.612ArgGlu: 2.612 ± 1.207
1.741ArgPhe: 1.741 ± 0.597
3.483ArgGly: 3.483 ± 0.539
2.177ArgHis: 2.177 ± 0.781
1.741ArgIle: 1.741 ± 0.769
3.047ArgLys: 3.047 ± 0.393
6.966ArgLeu: 6.966 ± 1.404
0.435ArgMet: 0.435 ± 0.297
0.871ArgAsn: 0.871 ± 0.385
3.483ArgPro: 3.483 ± 1.039
3.483ArgGln: 3.483 ± 2.063
6.53ArgArg: 6.53 ± 1.944
4.354ArgSer: 4.354 ± 0.99
3.483ArgThr: 3.483 ± 0.855
4.354ArgVal: 4.354 ± 1.113
0.871ArgTrp: 0.871 ± 0.44
1.741ArgTyr: 1.741 ± 0.881
0.0ArgXaa: 0.0 ± 0.0
Ser
4.789SerAla: 4.789 ± 1.726
1.741SerCys: 1.741 ± 0.715
3.918SerAsp: 3.918 ± 0.867
1.741SerGlu: 1.741 ± 0.496
1.306SerPhe: 1.306 ± 0.531
6.966SerGly: 6.966 ± 1.721
1.741SerHis: 1.741 ± 0.528
2.612SerIle: 2.612 ± 0.652
2.177SerLys: 2.177 ± 0.92
5.66SerLeu: 5.66 ± 0.825
1.741SerMet: 1.741 ± 0.657
3.918SerAsn: 3.918 ± 1.016
4.354SerPro: 4.354 ± 1.255
2.177SerGln: 2.177 ± 1.094
6.53SerArg: 6.53 ± 1.473
6.966SerSer: 6.966 ± 1.74
9.142SerThr: 9.142 ± 2.737
2.177SerVal: 2.177 ± 0.758
0.0SerTrp: 0.0 ± 0.0
1.741SerTyr: 1.741 ± 0.663
0.0SerXaa: 0.0 ± 0.0
Thr
3.483ThrAla: 3.483 ± 1.456
2.177ThrCys: 2.177 ± 0.444
3.047ThrAsp: 3.047 ± 0.792
4.354ThrGlu: 4.354 ± 0.926
3.483ThrPhe: 3.483 ± 1.108
6.53ThrGly: 6.53 ± 1.712
2.177ThrHis: 2.177 ± 1.278
4.354ThrIle: 4.354 ± 0.96
3.483ThrLys: 3.483 ± 1.328
6.53ThrLeu: 6.53 ± 0.705
1.306ThrMet: 1.306 ± 0.272
2.177ThrAsn: 2.177 ± 0.727
6.095ThrPro: 6.095 ± 2.659
4.354ThrGln: 4.354 ± 0.936
2.612ThrArg: 2.612 ± 0.884
6.53ThrSer: 6.53 ± 1.867
5.66ThrThr: 5.66 ± 1.467
6.095ThrVal: 6.095 ± 0.943
0.435ThrTrp: 0.435 ± 0.434
2.177ThrTyr: 2.177 ± 0.823
0.0ThrXaa: 0.0 ± 0.0
Val
3.918ValAla: 3.918 ± 0.766
3.047ValCys: 3.047 ± 0.815
5.66ValAsp: 5.66 ± 1.312
4.354ValGlu: 4.354 ± 1.202
3.047ValPhe: 3.047 ± 0.792
5.66ValGly: 5.66 ± 1.042
2.177ValHis: 2.177 ± 0.821
3.483ValIle: 3.483 ± 1.097
3.047ValLys: 3.047 ± 1.157
2.612ValLeu: 2.612 ± 0.775
0.435ValMet: 0.435 ± 0.438
0.435ValAsn: 0.435 ± 0.354
6.095ValPro: 6.095 ± 1.92
3.483ValGln: 3.483 ± 1.987
3.918ValArg: 3.918 ± 1.425
5.66ValSer: 5.66 ± 2.359
5.224ValThr: 5.224 ± 1.604
3.918ValVal: 3.918 ± 1.113
0.871ValTrp: 0.871 ± 0.575
3.483ValTyr: 3.483 ± 0.604
0.0ValXaa: 0.0 ± 0.0
Trp
1.741TrpAla: 1.741 ± 0.725
0.435TrpCys: 0.435 ± 0.297
0.871TrpAsp: 0.871 ± 0.482
0.435TrpGlu: 0.435 ± 0.434
0.871TrpPhe: 0.871 ± 0.595
1.306TrpGly: 1.306 ± 0.35
0.0TrpHis: 0.0 ± 0.0
0.871TrpIle: 0.871 ± 0.595
2.177TrpLys: 2.177 ± 1.026
0.871TrpLeu: 0.871 ± 0.362
0.0TrpMet: 0.0 ± 0.0
0.435TrpAsn: 0.435 ± 0.361
0.871TrpPro: 0.871 ± 0.707
0.0TrpGln: 0.0 ± 0.0
1.741TrpArg: 1.741 ± 1.036
1.306TrpSer: 1.306 ± 1.04
0.435TrpThr: 0.435 ± 0.434
0.871TrpVal: 0.871 ± 0.44
0.0TrpTrp: 0.0 ± 0.0
1.306TrpTyr: 1.306 ± 0.822
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.741TyrAla: 1.741 ± 0.815
0.871TyrCys: 0.871 ± 0.522
2.177TyrAsp: 2.177 ± 0.444
1.306TyrGlu: 1.306 ± 0.401
1.741TyrPhe: 1.741 ± 0.485
4.354TyrGly: 4.354 ± 0.318
0.435TyrHis: 0.435 ± 0.354
1.741TyrIle: 1.741 ± 0.788
1.741TyrLys: 1.741 ± 0.858
4.789TyrLeu: 4.789 ± 1.652
0.0TyrMet: 0.0 ± 0.0
1.306TyrAsn: 1.306 ± 0.659
1.306TyrPro: 1.306 ± 0.749
1.306TyrGln: 1.306 ± 0.35
2.177TyrArg: 2.177 ± 0.953
1.306TyrSer: 1.306 ± 0.58
3.047TyrThr: 3.047 ± 1.547
4.789TyrVal: 4.789 ± 1.411
1.741TyrTrp: 1.741 ± 0.753
1.741TyrTyr: 1.741 ± 0.657
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2298 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski