Amino acid dipeptide frequency for Papio hamadryas papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
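As an illustration of the definitions above, the following minimal sketch counts overlapping residue pairs and reports them in per mille, next to the uniform expectation of 2.5‰. It is not the code used to build this page: the function names, the choice to count pairs within each protein separately, and the toy sequences in the usage note are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def three_letter(residue):
    """Map a one-letter residue to its three-letter code; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: pairs are counted inside each protein only (hypothetical choice).
    """
    counts = Counter()
    for seq in proteins:
        codes = [three_letter(r) for r in seq]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"
```

For example, `dipeptide_per_mille(["AAC", "CA"])` (two made-up sequences) yields three pairs in total, so `AlaAla`, `AlaCys` and `CysAla` each come out at about 333‰, while `classify(6.675)` labels the AlaAla value from the table as overrepresented.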

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.675 ± 2.233
AlaCys: 1.252 ± 1.322
AlaAsp: 4.172 ± 1.323
AlaGlu: 2.503 ± 0.706
AlaPhe: 1.669 ± 1.11
AlaGly: 6.258 ± 1.748
AlaHis: 1.669 ± 0.859
AlaIle: 3.755 ± 0.952
AlaLys: 3.755 ± 1.493
AlaLeu: 5.841 ± 2.544
AlaMet: 1.669 ± 0.586
AlaAsn: 1.669 ± 1.214
AlaPro: 5.006 ± 1.843
AlaGln: 2.503 ± 0.962
AlaArg: 4.589 ± 1.794
AlaSer: 5.841 ± 2.455
AlaThr: 5.006 ± 1.438
AlaVal: 2.92 ± 0.706
AlaTrp: 0.834 ± 0.537
AlaTyr: 2.503 ± 0.412
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.669 ± 0.586
CysCys: 0.417 ± 0.443
CysAsp: 0.834 ± 0.443
CysGlu: 1.252 ± 0.585
CysPhe: 1.669 ± 0.873
CysGly: 1.252 ± 0.674
CysHis: 0.417 ± 0.594
CysIle: 0.417 ± 0.338
CysLys: 2.503 ± 1.111
CysLeu: 3.338 ± 1.723
CysMet: 0.834 ± 0.426
CysAsn: 0.834 ± 0.443
CysPro: 2.086 ± 0.745
CysGln: 1.669 ± 0.95
CysArg: 0.834 ± 0.514
CysSer: 1.669 ± 1.311
CysThr: 2.086 ± 0.887
CysVal: 2.92 ± 1.449
CysTrp: 1.252 ± 0.538
CysTyr: 0.834 ± 0.814
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.589 ± 1.481
AspCys: 2.503 ± 1.025
AspAsp: 2.92 ± 1.441
AspGlu: 5.423 ± 1.485
AspPhe: 2.503 ± 0.573
AspGly: 2.92 ± 1.114
AspHis: 1.252 ± 0.655
AspIle: 4.172 ± 2.117
AspLys: 1.669 ± 0.885
AspLeu: 5.423 ± 1.826
AspMet: 0.834 ± 0.441
AspAsn: 2.086 ± 0.495
AspPro: 3.338 ± 0.985
AspGln: 1.669 ± 0.303
AspArg: 1.669 ± 0.731
AspSer: 3.755 ± 0.924
AspThr: 3.338 ± 1.332
AspVal: 5.006 ± 1.621
AspTrp: 0.834 ± 0.675
AspTyr: 2.086 ± 1.104
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.92 ± 0.51
GluCys: 2.503 ± 1.083
GluAsp: 6.258 ± 2.518
GluGlu: 3.755 ± 1.213
GluPhe: 1.252 ± 0.659
GluGly: 4.172 ± 1.32
GluHis: 2.086 ± 0.891
GluIle: 1.252 ± 1.255
GluLys: 1.669 ± 0.82
GluLeu: 4.172 ± 1.514
GluMet: 0.417 ± 0.379
GluAsn: 1.669 ± 0.616
GluPro: 5.006 ± 1.566
GluGln: 2.086 ± 0.749
GluArg: 2.92 ± 1.444
GluSer: 1.669 ± 0.82
GluThr: 2.503 ± 0.897
GluVal: 3.338 ± 1.121
GluTrp: 1.252 ± 0.447
GluTyr: 1.252 ± 1.077
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.252 ± 0.842
PheCys: 0.0 ± 0.0
PheAsp: 1.252 ± 0.642
PheGlu: 1.252 ± 0.659
PhePhe: 1.252 ± 0.711
PheGly: 3.338 ± 0.897
PheHis: 1.669 ± 0.731
PheIle: 2.086 ± 0.709
PheLys: 3.755 ± 1.802
PheLeu: 5.841 ± 2.051
PheMet: 0.417 ± 0.338
PheAsn: 0.834 ± 0.71
PhePro: 0.834 ± 0.426
PheGln: 0.834 ± 0.426
PheArg: 1.669 ± 0.586
PheSer: 1.252 ± 0.377
PheThr: 2.086 ± 1.557
PheVal: 2.503 ± 0.851
PheTrp: 1.252 ± 0.711
PheTyr: 2.086 ± 0.888
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.589 ± 1.655
GlyCys: 1.252 ± 0.818
GlyAsp: 6.258 ± 0.885
GlyGlu: 2.92 ± 0.926
GlyPhe: 0.834 ± 0.421
GlyGly: 7.509 ± 2.903
GlyHis: 2.503 ± 0.807
GlyIle: 3.338 ± 1.065
GlyLys: 4.172 ± 1.389
GlyLeu: 5.423 ± 1.473
GlyMet: 0.834 ± 0.426
GlyAsn: 4.589 ± 0.907
GlyPro: 3.338 ± 1.016
GlyGln: 2.503 ± 0.765
GlyArg: 3.755 ± 1.503
GlySer: 5.423 ± 1.607
GlyThr: 5.423 ± 1.715
GlyVal: 3.755 ± 0.976
GlyTrp: 0.834 ± 0.443
GlyTyr: 2.92 ± 0.496
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.503 ± 0.613
HisCys: 0.834 ± 0.656
HisAsp: 1.669 ± 0.748
HisGlu: 1.252 ± 0.526
HisPhe: 2.503 ± 1.125
HisGly: 1.252 ± 0.437
HisHis: 1.252 ± 0.683
HisIle: 2.503 ± 1.34
HisLys: 1.252 ± 0.806
HisLeu: 2.086 ± 0.984
HisMet: 0.417 ± 0.338
HisAsn: 0.834 ± 0.53
HisPro: 2.92 ± 1.255
HisGln: 0.834 ± 1.187
HisArg: 0.417 ± 0.414
HisSer: 4.172 ± 1.014
HisThr: 2.503 ± 0.94
HisVal: 2.503 ± 1.162
HisTrp: 0.834 ± 0.498
HisTyr: 0.417 ± 0.338
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.503 ± 0.962
IleCys: 1.669 ± 0.852
IleAsp: 2.503 ± 1.032
IleGlu: 3.755 ± 1.095
IlePhe: 0.834 ± 0.426
IleGly: 2.086 ± 0.967
IleHis: 1.252 ± 0.721
IleIle: 0.834 ± 0.421
IleLys: 0.417 ± 0.355
IleLeu: 2.92 ± 1.243
IleMet: 0.417 ± 0.522
IleAsn: 0.834 ± 0.443
IlePro: 3.755 ± 1.228
IleGln: 2.086 ± 0.69
IleArg: 2.086 ± 0.843
IleSer: 1.669 ± 0.935
IleThr: 3.338 ± 1.015
IleVal: 4.589 ± 1.583
IleTrp: 0.0 ± 0.0
IleTyr: 2.086 ± 0.854
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.503 ± 1.568
LysCys: 2.503 ± 1.318
LysAsp: 1.669 ± 0.885
LysGlu: 2.503 ± 1.108
LysPhe: 2.503 ± 1.277
LysGly: 3.338 ± 0.834
LysHis: 2.503 ± 0.926
LysIle: 0.834 ± 0.498
LysLys: 1.669 ± 0.552
LysLeu: 2.503 ± 0.646
LysMet: 0.417 ± 0.355
LysAsn: 1.669 ± 0.987
LysPro: 1.669 ± 1.098
LysGln: 2.086 ± 0.917
LysArg: 4.589 ± 1.045
LysSer: 2.503 ± 1.192
LysThr: 2.086 ± 1.026
LysVal: 4.172 ± 1.365
LysTrp: 0.0 ± 0.0
LysTyr: 2.503 ± 0.506
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.755 ± 1.412
LeuCys: 4.172 ± 2.43
LeuAsp: 4.172 ± 0.661
LeuGlu: 5.423 ± 2.803
LeuPhe: 3.755 ± 2.66
LeuGly: 4.589 ± 1.17
LeuHis: 4.589 ± 1.988
LeuIle: 2.086 ± 1.688
LeuLys: 5.006 ± 1.401
LeuLeu: 10.847 ± 5.9
LeuMet: 2.086 ± 0.905
LeuAsn: 2.503 ± 1.12
LeuPro: 3.755 ± 1.021
LeuGln: 6.258 ± 1.486
LeuArg: 6.675 ± 0.962
LeuSer: 6.258 ± 2.008
LeuThr: 4.589 ± 1.038
LeuVal: 4.589 ± 1.1
LeuTrp: 0.834 ± 0.498
LeuTyr: 5.006 ± 1.654
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.252 ± 0.751
MetCys: 0.417 ± 0.338
MetAsp: 2.503 ± 0.807
MetGlu: 0.834 ± 0.793
MetPhe: 0.834 ± 0.426
MetGly: 1.252 ± 0.708
MetHis: 0.417 ± 0.338
MetIle: 0.417 ± 0.594
MetLys: 0.834 ± 0.421
MetLeu: 1.252 ± 0.711
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.0 ± 0.0
MetGln: 0.417 ± 0.338
MetArg: 0.834 ± 0.426
MetSer: 2.086 ± 0.887
MetThr: 1.252 ± 0.447
MetVal: 2.086 ± 0.611
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.172 ± 1.653
AsnCys: 1.252 ± 0.655
AsnAsp: 1.669 ± 0.564
AsnGlu: 0.417 ± 0.577
AsnPhe: 1.252 ± 0.708
AsnGly: 2.086 ± 0.759
AsnHis: 0.417 ± 0.431
AsnIle: 0.834 ± 0.514
AsnLys: 2.086 ± 1.776
AsnLeu: 1.669 ± 1.007
AsnMet: 0.417 ± 0.355
AsnAsn: 1.252 ± 0.751
AsnPro: 2.503 ± 0.687
AsnGln: 0.834 ± 0.498
AsnArg: 1.252 ± 0.708
AsnSer: 2.503 ± 0.984
AsnThr: 3.755 ± 1.535
AsnVal: 2.92 ± 1.442
AsnTrp: 0.834 ± 0.675
AsnTyr: 0.417 ± 0.443
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.092 ± 3.635
ProCys: 0.834 ± 0.681
ProAsp: 3.755 ± 1.403
ProGlu: 2.086 ± 0.968
ProPhe: 0.834 ± 0.675
ProGly: 3.338 ± 1.815
ProHis: 1.252 ± 0.683
ProIle: 2.503 ± 1.06
ProLys: 2.92 ± 0.778
ProLeu: 7.927 ± 2.378
ProMet: 0.417 ± 0.414
ProAsn: 2.086 ± 0.854
ProPro: 10.013 ± 2.928
ProGln: 1.669 ± 0.679
ProArg: 2.92 ± 1.435
ProSer: 4.589 ± 1.876
ProThr: 5.841 ± 1.47
ProVal: 4.172 ± 1.58
ProTrp: 0.417 ± 0.577
ProTyr: 2.086 ± 0.982
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.92 ± 1.391
GlnCys: 0.834 ± 0.675
GlnAsp: 2.086 ± 0.906
GlnGlu: 2.086 ± 1.106
GlnPhe: 2.086 ± 0.495
GlnGly: 2.92 ± 1.2
GlnHis: 0.834 ± 0.514
GlnIle: 1.669 ± 0.727
GlnLys: 1.669 ± 0.59
GlnLeu: 4.172 ± 0.917
GlnMet: 1.669 ± 0.616
GlnAsn: 2.086 ± 0.973
GlnPro: 3.755 ± 0.821
GlnGln: 2.92 ± 2.069
GlnArg: 2.92 ± 1.347
GlnSer: 0.417 ± 0.355
GlnThr: 1.669 ± 0.303
GlnVal: 3.338 ± 0.798
GlnTrp: 0.834 ± 0.675
GlnTyr: 0.834 ± 0.426
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.675 ± 1.31
ArgCys: 2.503 ± 1.336
ArgAsp: 0.834 ± 0.679
ArgGlu: 1.669 ± 1.063
ArgPhe: 3.338 ± 0.984
ArgGly: 4.589 ± 1.982
ArgHis: 2.086 ± 0.863
ArgIle: 1.669 ± 0.964
ArgLys: 3.755 ± 0.937
ArgLeu: 6.258 ± 0.732
ArgMet: 0.834 ± 0.635
ArgAsn: 0.417 ± 0.338
ArgPro: 5.006 ± 2.413
ArgGln: 2.086 ± 0.825
ArgArg: 4.172 ± 1.366
ArgSer: 2.92 ± 0.697
ArgThr: 2.92 ± 0.62
ArgVal: 3.755 ± 1.567
ArgTrp: 1.669 ± 0.89
ArgTyr: 1.669 ± 0.658
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.589 ± 1.269
SerCys: 0.0 ± 0.0
SerAsp: 3.338 ± 1.847
SerGlu: 4.172 ± 0.804
SerPhe: 1.252 ± 0.764
SerGly: 5.841 ± 2.338
SerHis: 1.669 ± 0.616
SerIle: 4.172 ± 0.984
SerLys: 2.92 ± 1.116
SerLeu: 6.675 ± 0.949
SerMet: 2.086 ± 0.962
SerAsn: 3.755 ± 1.729
SerPro: 2.503 ± 0.863
SerGln: 2.92 ± 0.995
SerArg: 5.006 ± 0.724
SerSer: 5.006 ± 1.564
SerThr: 6.675 ± 1.772
SerVal: 3.755 ± 1.001
SerTrp: 0.0 ± 0.0
SerTyr: 1.669 ± 0.843
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.92 ± 0.844
ThrCys: 2.503 ± 0.9
ThrAsp: 2.086 ± 1.183
ThrGlu: 2.086 ± 0.381
ThrPhe: 3.338 ± 1.254
ThrGly: 5.006 ± 1.388
ThrHis: 2.503 ± 1.255
ThrIle: 2.503 ± 1.711
ThrLys: 0.0 ± 0.0
ThrLeu: 6.675 ± 1.354
ThrMet: 2.086 ± 0.887
ThrAsn: 2.503 ± 0.681
ThrPro: 7.092 ± 3.254
ThrGln: 2.92 ± 0.755
ThrArg: 2.92 ± 1.033
ThrSer: 5.841 ± 1.818
ThrThr: 4.589 ± 1.63
ThrVal: 5.841 ± 1.843
ThrTrp: 1.252 ± 0.806
ThrTyr: 1.669 ± 0.599
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.755 ± 1.519
ValCys: 2.086 ± 0.749
ValAsp: 5.423 ± 0.877
ValGlu: 5.841 ± 2.011
ValPhe: 2.503 ± 1.02
ValGly: 3.755 ± 1.391
ValHis: 3.338 ± 1.555
ValIle: 2.086 ± 0.381
ValLys: 1.252 ± 0.447
ValLeu: 3.755 ± 1.569
ValMet: 0.834 ± 0.546
ValAsn: 1.669 ± 0.599
ValPro: 2.92 ± 0.531
ValGln: 2.92 ± 1.905
ValArg: 5.006 ± 0.62
ValSer: 6.675 ± 1.288
ValThr: 5.006 ± 2.028
ValVal: 6.675 ± 1.826
ValTrp: 1.252 ± 0.632
ValTyr: 5.006 ± 1.408
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.669 ± 0.768
TrpCys: 0.0 ± 0.0
TrpAsp: 1.252 ± 0.656
TrpGlu: 0.834 ± 0.498
TrpPhe: 0.417 ± 0.338
TrpGly: 2.503 ± 1.217
TrpHis: 0.417 ± 0.431
TrpIle: 0.834 ± 0.675
TrpLys: 0.834 ± 0.443
TrpLeu: 0.834 ± 0.426
TrpMet: 0.0 ± 0.0
TrpAsn: 0.417 ± 0.355
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.086 ± 0.661
TrpSer: 0.834 ± 0.675
TrpThr: 1.252 ± 0.797
TrpVal: 0.834 ± 0.678
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.417 ± 0.338
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.086 ± 0.709
TyrCys: 1.252 ± 0.983
TyrAsp: 3.338 ± 0.442
TyrGlu: 2.086 ± 0.922
TyrPhe: 0.834 ± 0.828
TyrGly: 4.172 ± 1.079
TyrHis: 0.834 ± 0.681
TyrIle: 1.669 ± 0.675
TyrLys: 2.086 ± 0.885
TyrLeu: 3.338 ± 1.332
TyrMet: 0.0 ± 0.0
TyrAsn: 0.834 ± 0.563
TyrPro: 1.252 ± 0.92
TyrGln: 2.503 ± 0.741
TyrArg: 2.503 ± 0.884
TyrSer: 2.92 ± 1.879
TyrThr: 0.417 ± 0.355
TyrVal: 2.503 ± 0.617
TyrTrp: 1.252 ± 0.447
TyrTyr: 1.669 ± 0.997
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (2398 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
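A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequencies are recomputed for each replicate, and the spread is reported as the population standard deviation across replicates. The page does not say which spread statistic was actually used, and the function names and one-letter pair keys are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins):
    """Per mille frequency of each overlapping residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, n_rounds=100, seed=0):
    """Estimate the spread of each pair frequency by resampling whole proteins.

    Assumption: the error is the standard deviation over bootstrap replicates.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample))
    # A pair missing from a replicate has frequency 0 in that replicate.
    pairs = set().union(*replicates)
    return {
        pair: statistics.pstdev([rep.get(pair, 0.0) for rep in replicates])
        for pair in pairs
    }
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequencies depend on which proteins happen to be in a small proteome such as this one (8 proteins).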

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski