Amino acid dipepetide frequency for Panthera leo persica papillomavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.983AlaAla: 5.983 ± 1.872
0.855AlaCys: 0.855 ± 0.949
4.274AlaAsp: 4.274 ± 0.999
5.128AlaGlu: 5.128 ± 1.46
4.701AlaPhe: 4.701 ± 1.552
2.564AlaGly: 2.564 ± 0.641
0.0AlaHis: 0.0 ± 0.0
2.137AlaIle: 2.137 ± 0.745
3.846AlaLys: 3.846 ± 1.842
4.274AlaLeu: 4.274 ± 1.466
0.855AlaMet: 0.855 ± 0.409
1.709AlaAsn: 1.709 ± 0.63
2.564AlaPro: 2.564 ± 1.148
2.564AlaGln: 2.564 ± 1.141
3.419AlaArg: 3.419 ± 0.892
5.983AlaSer: 5.983 ± 1.081
3.419AlaThr: 3.419 ± 1.057
4.701AlaVal: 4.701 ± 0.955
0.427AlaTrp: 0.427 ± 0.39
1.709AlaTyr: 1.709 ± 0.941
0.0AlaXaa: 0.0 ± 0.0
Cys
2.137CysAla: 2.137 ± 1.088
0.855CysCys: 0.855 ± 0.992
0.855CysAsp: 0.855 ± 0.465
0.855CysGlu: 0.855 ± 0.677
1.709CysPhe: 1.709 ± 0.494
1.709CysGly: 1.709 ± 1.556
0.0CysHis: 0.0 ± 0.0
0.855CysIle: 0.855 ± 0.578
2.991CysLys: 2.991 ± 1.051
2.137CysLeu: 2.137 ± 1.806
0.855CysMet: 0.855 ± 0.578
0.427CysAsn: 0.427 ± 0.39
2.991CysPro: 2.991 ± 0.667
0.855CysGln: 0.855 ± 0.677
1.282CysArg: 1.282 ± 0.581
1.282CysSer: 1.282 ± 0.807
1.282CysThr: 1.282 ± 0.581
1.282CysVal: 1.282 ± 1.371
0.427CysTrp: 0.427 ± 0.338
0.427CysTyr: 0.427 ± 0.669
0.0CysXaa: 0.0 ± 0.0
Asp
2.564AspAla: 2.564 ± 1.007
2.137AspCys: 2.137 ± 1.325
2.564AspAsp: 2.564 ± 1.326
5.556AspGlu: 5.556 ± 1.24
1.709AspPhe: 1.709 ± 0.56
2.564AspGly: 2.564 ± 1.104
1.282AspHis: 1.282 ± 0.654
5.128AspIle: 5.128 ± 1.412
4.274AspLys: 4.274 ± 1.696
8.12AspLeu: 8.12 ± 2.54
0.855AspMet: 0.855 ± 0.442
1.709AspAsn: 1.709 ± 0.589
4.274AspPro: 4.274 ± 1.807
2.564AspGln: 2.564 ± 0.979
2.564AspArg: 2.564 ± 1.159
2.991AspSer: 2.991 ± 1.411
2.991AspThr: 2.991 ± 0.998
2.564AspVal: 2.564 ± 0.637
1.282AspTrp: 1.282 ± 1.015
0.427AspTyr: 0.427 ± 0.39
0.0AspXaa: 0.0 ± 0.0
Glu
2.991GluAla: 2.991 ± 1.186
2.564GluCys: 2.564 ± 1.341
5.556GluAsp: 5.556 ± 0.967
5.556GluGlu: 5.556 ± 3.031
2.137GluPhe: 2.137 ± 0.787
2.137GluGly: 2.137 ± 0.915
0.427GluHis: 0.427 ± 0.39
3.419GluIle: 3.419 ± 1.128
3.846GluLys: 3.846 ± 1.358
4.274GluLeu: 4.274 ± 1.245
0.427GluMet: 0.427 ± 0.338
4.274GluAsn: 4.274 ± 1.11
3.419GluPro: 3.419 ± 0.815
5.983GluGln: 5.983 ± 1.437
2.991GluArg: 2.991 ± 0.845
3.846GluSer: 3.846 ± 1.356
4.274GluThr: 4.274 ± 1.646
4.274GluVal: 4.274 ± 0.944
0.855GluTrp: 0.855 ± 0.677
0.855GluTyr: 0.855 ± 0.78
0.0GluXaa: 0.0 ± 0.0
Phe
2.137PheAla: 2.137 ± 1.036
1.709PheCys: 1.709 ± 0.772
3.419PheAsp: 3.419 ± 0.651
3.419PheGlu: 3.419 ± 1.326
3.419PhePhe: 3.419 ± 1.181
2.991PheGly: 2.991 ± 0.488
0.0PheHis: 0.0 ± 0.0
0.427PheIle: 0.427 ± 0.496
2.991PheLys: 2.991 ± 1.508
5.556PheLeu: 5.556 ± 2.017
0.855PheMet: 0.855 ± 0.595
1.282PheAsn: 1.282 ± 0.724
2.137PhePro: 2.137 ± 0.915
2.991PheGln: 2.991 ± 0.965
2.991PheArg: 2.991 ± 0.747
2.564PheSer: 2.564 ± 1.154
2.991PheThr: 2.991 ± 1.252
2.137PheVal: 2.137 ± 1.002
1.709PheTrp: 1.709 ± 0.941
1.709PheTyr: 1.709 ± 0.885
0.0PheXaa: 0.0 ± 0.0
Gly
3.846GlyAla: 3.846 ± 1.542
1.282GlyCys: 1.282 ± 0.561
4.701GlyAsp: 4.701 ± 0.937
5.128GlyGlu: 5.128 ± 1.334
1.709GlyPhe: 1.709 ± 0.509
5.128GlyGly: 5.128 ± 1.256
2.137GlyHis: 2.137 ± 1.05
2.564GlyIle: 2.564 ± 0.691
2.564GlyLys: 2.564 ± 1.394
5.128GlyLeu: 5.128 ± 0.933
0.855GlyMet: 0.855 ± 0.745
2.991GlyAsn: 2.991 ± 1.161
2.564GlyPro: 2.564 ± 0.774
4.701GlyGln: 4.701 ± 0.965
3.419GlyArg: 3.419 ± 1.098
6.41GlySer: 6.41 ± 1.959
2.991GlyThr: 2.991 ± 1.446
5.128GlyVal: 5.128 ± 1.396
0.0GlyTrp: 0.0 ± 0.0
0.855GlyTyr: 0.855 ± 0.574
0.0GlyXaa: 0.0 ± 0.0
His
1.282HisAla: 1.282 ± 1.17
0.427HisCys: 0.427 ± 0.338
0.855HisAsp: 0.855 ± 0.666
1.282HisGlu: 1.282 ± 0.602
0.0HisPhe: 0.0 ± 0.0
0.427HisGly: 0.427 ± 0.669
0.0HisHis: 0.0 ± 0.0
0.427HisIle: 0.427 ± 0.333
1.282HisLys: 1.282 ± 1.015
2.137HisLeu: 2.137 ± 0.993
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
1.282HisPro: 1.282 ± 0.679
1.709HisGln: 1.709 ± 0.913
0.0HisArg: 0.0 ± 0.0
0.855HisSer: 0.855 ± 0.35
1.282HisThr: 1.282 ± 0.764
1.282HisVal: 1.282 ± 0.328
0.855HisTrp: 0.855 ± 0.454
0.855HisTyr: 0.855 ± 0.506
0.0HisXaa: 0.0 ± 0.0
Ile
2.137IleAla: 2.137 ± 0.787
0.427IleCys: 0.427 ± 0.338
1.282IleAsp: 1.282 ± 0.328
2.564IleGlu: 2.564 ± 1.103
1.282IlePhe: 1.282 ± 0.642
4.274IleGly: 4.274 ± 1.723
0.855IleHis: 0.855 ± 0.409
1.282IleIle: 1.282 ± 0.764
0.855IleLys: 0.855 ± 0.409
2.991IleLeu: 2.991 ± 1.464
0.427IleMet: 0.427 ± 0.401
1.282IleAsn: 1.282 ± 0.724
2.137IlePro: 2.137 ± 1.432
1.709IleGln: 1.709 ± 0.52
0.855IleArg: 0.855 ± 0.992
4.701IleSer: 4.701 ± 1.644
0.855IleThr: 0.855 ± 0.465
2.991IleVal: 2.991 ± 0.666
0.427IleTrp: 0.427 ± 0.333
1.282IleTyr: 1.282 ± 0.483
0.0IleXaa: 0.0 ± 0.0
Lys
5.128LysAla: 5.128 ± 1.58
1.709LysCys: 1.709 ± 0.494
1.282LysAsp: 1.282 ± 0.8
3.419LysGlu: 3.419 ± 1.451
3.419LysPhe: 3.419 ± 1.27
5.128LysGly: 5.128 ± 1.898
1.282LysHis: 1.282 ± 0.864
0.855LysIle: 0.855 ± 0.666
2.991LysLys: 2.991 ± 1.602
4.274LysLeu: 4.274 ± 1.606
0.427LysMet: 0.427 ± 0.706
1.282LysAsn: 1.282 ± 0.724
2.137LysPro: 2.137 ± 0.792
3.419LysGln: 3.419 ± 1.079
5.983LysArg: 5.983 ± 1.208
2.991LysSer: 2.991 ± 1.241
2.564LysThr: 2.564 ± 1.088
3.419LysVal: 3.419 ± 0.866
0.427LysTrp: 0.427 ± 0.39
2.564LysTyr: 2.564 ± 0.68
0.0LysXaa: 0.0 ± 0.0
Leu
6.41LeuAla: 6.41 ± 1.115
1.709LeuCys: 1.709 ± 1.355
3.846LeuAsp: 3.846 ± 0.717
5.128LeuGlu: 5.128 ± 1.784
8.12LeuPhe: 8.12 ± 2.122
6.41LeuGly: 6.41 ± 2.073
2.991LeuHis: 2.991 ± 0.575
0.427LeuIle: 0.427 ± 0.338
3.846LeuLys: 3.846 ± 0.965
11.111LeuLeu: 11.111 ± 2.591
1.282LeuMet: 1.282 ± 0.783
3.846LeuAsn: 3.846 ± 0.932
4.701LeuPro: 4.701 ± 2.016
7.692LeuGln: 7.692 ± 1.695
6.838LeuArg: 6.838 ± 1.699
5.983LeuSer: 5.983 ± 2.091
5.983LeuThr: 5.983 ± 1.605
6.838LeuVal: 6.838 ± 1.369
0.427LeuTrp: 0.427 ± 0.401
2.991LeuTyr: 2.991 ± 1.037
0.0LeuXaa: 0.0 ± 0.0
Met
2.137MetAla: 2.137 ± 1.134
0.427MetCys: 0.427 ± 0.338
2.137MetAsp: 2.137 ± 0.705
0.855MetGlu: 0.855 ± 0.506
0.855MetPhe: 0.855 ± 0.409
0.427MetGly: 0.427 ± 0.338
0.427MetHis: 0.427 ± 0.401
0.855MetIle: 0.855 ± 0.709
0.427MetLys: 0.427 ± 0.496
1.282MetLeu: 1.282 ± 0.455
0.0MetMet: 0.0 ± 0.0
0.855MetAsn: 0.855 ± 0.454
0.427MetPro: 0.427 ± 0.669
0.427MetGln: 0.427 ± 0.338
0.427MetArg: 0.427 ± 0.333
0.427MetSer: 0.427 ± 0.338
0.855MetThr: 0.855 ± 0.677
0.855MetVal: 0.855 ± 0.78
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.282AsnAla: 1.282 ± 1.015
0.427AsnCys: 0.427 ± 0.338
2.564AsnAsp: 2.564 ± 1.374
0.855AsnGlu: 0.855 ± 0.409
1.282AsnPhe: 1.282 ± 0.328
1.709AsnGly: 1.709 ± 1.089
0.855AsnHis: 0.855 ± 0.709
1.709AsnIle: 1.709 ± 0.52
1.709AsnLys: 1.709 ± 1.13
2.564AsnLeu: 2.564 ± 0.901
0.427AsnMet: 0.427 ± 0.39
1.709AsnAsn: 1.709 ± 0.818
3.419AsnPro: 3.419 ± 1.221
1.709AsnGln: 1.709 ± 0.818
2.137AsnArg: 2.137 ± 0.789
3.846AsnSer: 3.846 ± 1.112
2.137AsnThr: 2.137 ± 0.809
2.991AsnVal: 2.991 ± 0.918
0.427AsnTrp: 0.427 ± 0.607
1.282AsnTyr: 1.282 ± 0.328
0.0AsnXaa: 0.0 ± 0.0
Pro
5.128ProAla: 5.128 ± 2.13
1.282ProCys: 1.282 ± 0.965
2.137ProAsp: 2.137 ± 0.848
4.701ProGlu: 4.701 ± 1.7
2.137ProPhe: 2.137 ± 1.022
2.137ProGly: 2.137 ± 0.87
0.427ProHis: 0.427 ± 0.333
2.991ProIle: 2.991 ± 1.646
4.701ProLys: 4.701 ± 0.904
4.274ProLeu: 4.274 ± 1.024
0.0ProMet: 0.0 ± 0.0
2.991ProAsn: 2.991 ± 1.243
9.402ProPro: 9.402 ± 5.1
2.564ProGln: 2.564 ± 0.936
5.556ProArg: 5.556 ± 3.022
3.846ProSer: 3.846 ± 1.423
5.128ProThr: 5.128 ± 1.711
5.556ProVal: 5.556 ± 1.774
0.427ProTrp: 0.427 ± 0.401
1.282ProTyr: 1.282 ± 0.746
0.0ProXaa: 0.0 ± 0.0
Gln
1.709GlnAla: 1.709 ± 0.929
1.709GlnCys: 1.709 ± 0.63
2.137GlnAsp: 2.137 ± 0.797
3.419GlnGlu: 3.419 ± 0.937
2.564GlnPhe: 2.564 ± 0.915
4.274GlnGly: 4.274 ± 0.831
0.855GlnHis: 0.855 ± 0.666
0.855GlnIle: 0.855 ± 0.465
1.709GlnLys: 1.709 ± 0.948
7.265GlnLeu: 7.265 ± 2.209
1.282GlnMet: 1.282 ± 0.688
2.564GlnAsn: 2.564 ± 1.009
3.419GlnPro: 3.419 ± 1.228
4.274GlnGln: 4.274 ± 0.617
2.137GlnArg: 2.137 ± 0.621
4.274GlnSer: 4.274 ± 1.624
3.846GlnThr: 3.846 ± 1.239
2.137GlnVal: 2.137 ± 0.481
1.709GlnTrp: 1.709 ± 0.814
1.282GlnTyr: 1.282 ± 0.328
0.0GlnXaa: 0.0 ± 0.0
Arg
2.137ArgAla: 2.137 ± 0.409
1.709ArgCys: 1.709 ± 1.355
4.274ArgAsp: 4.274 ± 1.052
2.991ArgGlu: 2.991 ± 1.398
2.564ArgPhe: 2.564 ± 1.044
5.556ArgGly: 5.556 ± 1.149
0.855ArgHis: 0.855 ± 0.78
1.709ArgIle: 1.709 ± 0.695
5.128ArgLys: 5.128 ± 1.431
8.974ArgLeu: 8.974 ± 1.082
1.282ArgMet: 1.282 ± 0.402
1.709ArgAsn: 1.709 ± 0.752
3.419ArgPro: 3.419 ± 1.362
0.855ArgGln: 0.855 ± 0.675
4.701ArgArg: 4.701 ± 0.904
5.556ArgSer: 5.556 ± 0.462
2.137ArgThr: 2.137 ± 1.097
5.556ArgVal: 5.556 ± 1.642
0.855ArgTrp: 0.855 ± 0.465
2.137ArgTyr: 2.137 ± 0.777
0.0ArgXaa: 0.0 ± 0.0
Ser
5.128SerAla: 5.128 ± 1.048
0.855SerCys: 0.855 ± 0.949
6.838SerAsp: 6.838 ± 2.156
1.709SerGlu: 1.709 ± 0.941
1.282SerPhe: 1.282 ± 0.455
8.12SerGly: 8.12 ± 2.707
1.282SerHis: 1.282 ± 0.455
0.427SerIle: 0.427 ± 0.338
3.419SerLys: 3.419 ± 0.999
8.547SerLeu: 8.547 ± 2.271
0.855SerMet: 0.855 ± 0.35
1.709SerAsn: 1.709 ± 0.52
5.556SerPro: 5.556 ± 0.588
2.564SerGln: 2.564 ± 0.747
5.556SerArg: 5.556 ± 1.478
5.128SerSer: 5.128 ± 2.311
6.41SerThr: 6.41 ± 2.085
7.265SerVal: 7.265 ± 2.025
0.855SerTrp: 0.855 ± 0.409
2.564SerTyr: 2.564 ± 0.768
0.0SerXaa: 0.0 ± 0.0
Thr
2.991ThrAla: 2.991 ± 1.059
0.855ThrCys: 0.855 ± 0.35
2.991ThrAsp: 2.991 ± 0.689
2.991ThrGlu: 2.991 ± 0.765
2.991ThrPhe: 2.991 ± 0.457
2.991ThrGly: 2.991 ± 1.18
0.427ThrHis: 0.427 ± 0.333
2.137ThrIle: 2.137 ± 0.987
2.991ThrLys: 2.991 ± 0.62
4.274ThrLeu: 4.274 ± 0.801
1.282ThrMet: 1.282 ± 0.642
1.282ThrAsn: 1.282 ± 0.724
5.128ThrPro: 5.128 ± 1.782
2.564ThrGln: 2.564 ± 0.656
5.983ThrArg: 5.983 ± 1.751
6.41ThrSer: 6.41 ± 1.491
4.701ThrThr: 4.701 ± 1.947
4.701ThrVal: 4.701 ± 0.743
0.427ThrTrp: 0.427 ± 0.401
0.855ThrTyr: 0.855 ± 0.78
0.0ThrXaa: 0.0 ± 0.0
Val
3.419ValAla: 3.419 ± 0.821
2.564ValCys: 2.564 ± 1.389
4.701ValAsp: 4.701 ± 0.657
6.41ValGlu: 6.41 ± 1.434
4.701ValPhe: 4.701 ± 1.606
3.419ValGly: 3.419 ± 1.779
0.855ValHis: 0.855 ± 0.35
3.419ValIle: 3.419 ± 1.118
3.419ValLys: 3.419 ± 0.931
5.556ValLeu: 5.556 ± 1.159
1.282ValMet: 1.282 ± 0.71
1.709ValAsn: 1.709 ± 0.499
5.983ValPro: 5.983 ± 1.214
2.991ValGln: 2.991 ± 1.044
4.701ValArg: 4.701 ± 1.336
6.41ValSer: 6.41 ± 1.746
3.419ValThr: 3.419 ± 1.002
2.137ValVal: 2.137 ± 0.97
1.282ValTrp: 1.282 ± 0.724
0.855ValTyr: 0.855 ± 0.506
0.0ValXaa: 0.0 ± 0.0
Trp
1.282TrpAla: 1.282 ± 0.642
0.427TrpCys: 0.427 ± 0.39
0.427TrpAsp: 0.427 ± 0.338
1.282TrpGlu: 1.282 ± 0.764
0.0TrpPhe: 0.0 ± 0.0
0.855TrpGly: 0.855 ± 0.465
0.427TrpHis: 0.427 ± 0.39
1.282TrpIle: 1.282 ± 0.642
0.855TrpLys: 0.855 ± 0.578
2.137TrpLeu: 2.137 ± 1.047
0.0TrpMet: 0.0 ± 0.0
0.855TrpAsn: 0.855 ± 0.465
0.0TrpPro: 0.0 ± 0.0
0.427TrpGln: 0.427 ± 0.401
0.427TrpArg: 0.427 ± 0.401
0.0TrpSer: 0.0 ± 0.0
1.282TrpThr: 1.282 ± 1.204
1.282TrpVal: 1.282 ± 0.642
0.0TrpTrp: 0.0 ± 0.0
0.427TrpTyr: 0.427 ± 0.338
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.282TyrAla: 1.282 ± 0.402
1.282TyrCys: 1.282 ± 0.833
1.282TyrAsp: 1.282 ± 0.455
0.855TyrGlu: 0.855 ± 0.757
0.855TyrPhe: 0.855 ± 0.409
1.282TyrGly: 1.282 ± 0.679
0.855TyrHis: 0.855 ± 0.454
1.709TyrIle: 1.709 ± 0.818
1.282TyrLys: 1.282 ± 0.402
1.709TyrLeu: 1.709 ± 0.818
0.427TyrMet: 0.427 ± 0.338
0.855TyrAsn: 0.855 ± 0.78
1.709TyrPro: 1.709 ± 0.885
0.855TyrGln: 0.855 ± 0.677
2.137TyrArg: 2.137 ± 0.774
2.564TyrSer: 2.564 ± 1.442
0.427TyrThr: 0.427 ± 0.39
2.137TyrVal: 2.137 ± 0.621
0.855TyrTrp: 0.855 ± 0.409
2.137TyrTyr: 2.137 ± 0.9
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (2341 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski