Amino acid dipeptide frequency for Cervus papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
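As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the use of one-letter codes, the pooling of non-standard residues as `X`, and the normalization by the total number of dipeptides are all assumptions made for illustration; the page does not state how the database computes its values.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: normalizes by the total number of dipeptides,
    which is an assumption, not necessarily the database's convention.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy sequences (not real viral proteins); "B" is treated as Xaa.
toy = ["MKKLA", "AAKB"]
freqs = dipeptide_per_mille(toy)

# Expectation if all 400 standard dipeptides were equally likely.
expected_uniform = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
overrepresented = [dp for dp, f in freqs.items() if f > expected_uniform]
```

With real data, `overrepresented` would correspond to the cells marked in red and the remaining non-zero entries below 2.5‰ to those marked in blue, under the uniform expectation described above.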

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

| First / Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.655 ± 1.972 | 2.218 ± 1.097 | 3.106 ± 0.869 | 6.655 ± 2.902 | 1.331 ± 0.371 | 5.768 ± 1.399 | 1.331 ± 0.401 | 2.662 ± 0.494 | 2.218 ± 0.707 | 5.324 ± 2.227 | 0.887 ± 0.658 | 3.549 ± 0.837 | 2.218 ± 0.983 | 1.775 ± 0.71 | 3.993 ± 1.187 | 2.662 ± 0.607 | 3.106 ± 0.653 | 6.211 ± 1.478 | 0.444 ± 0.329 | 0.444 ± 0.424 | 0.0 ± 0.0 |
| Cys | 1.331 ± 0.704 | 0.444 ± 0.329 | 0.887 ± 0.425 | 0.0 ± 0.0 | 1.331 ± 0.691 | 0.887 ± 0.694 | 0.0 ± 0.0 | 1.331 ± 0.861 | 1.331 ± 0.691 | 2.218 ± 1.278 | 0.444 ± 0.666 | 0.0 ± 0.0 | 2.218 ± 0.567 | 0.444 ± 0.329 | 1.331 ± 0.701 | 2.662 ± 1.131 | 1.775 ± 0.637 | 2.218 ± 1.983 | 0.444 ± 0.38 | 1.775 ± 1.452 | 0.0 ± 0.0 |
| Asp | 3.106 ± 0.755 | 2.218 ± 0.657 | 3.106 ± 0.65 | 3.106 ± 1.268 | 2.218 ± 1.268 | 3.549 ± 1.988 | 0.0 ± 0.0 | 3.549 ± 0.879 | 1.775 ± 0.958 | 7.098 ± 1.364 | 0.887 ± 0.505 | 3.106 ± 1.054 | 2.662 ± 1.28 | 3.549 ± 1.023 | 2.662 ± 1.185 | 6.655 ± 3.224 | 5.768 ± 1.908 | 5.324 ± 2.265 | 0.887 ± 0.425 | 1.775 ± 1.083 | 0.0 ± 0.0 |
| Glu | 4.437 ± 1.112 | 0.444 ± 0.329 | 5.324 ± 0.665 | 4.88 ± 2.04 | 0.0 ± 0.0 | 4.437 ± 0.783 | 0.887 ± 0.433 | 0.887 ± 0.628 | 3.993 ± 2.24 | 4.88 ± 0.941 | 0.887 ± 0.537 | 5.324 ± 0.83 | 5.324 ± 2.295 | 2.218 ± 1.121 | 2.662 ± 1.049 | 3.993 ± 0.593 | 2.662 ± 1.387 | 3.106 ± 1.683 | 0.0 ± 0.0 | 1.331 ± 0.828 | 0.0 ± 0.0 |
| Phe | 1.775 ± 0.734 | 0.444 ± 0.609 | 3.106 ± 1.409 | 1.775 ± 0.561 | 3.993 ± 1.537 | 2.662 ± 0.589 | 0.444 ± 0.424 | 2.218 ± 1.043 | 2.218 ± 0.781 | 3.549 ± 0.686 | 0.887 ± 0.922 | 1.775 ± 1.091 | 1.775 ± 0.7 | 2.218 ± 0.712 | 2.662 ± 0.88 | 3.106 ± 1.059 | 2.662 ± 1.073 | 2.218 ± 0.527 | 2.218 ± 0.336 | 2.662 ± 0.955 | 0.0 ± 0.0 |
| Gly | 2.662 ± 1.379 | 0.444 ± 0.424 | 5.324 ± 0.723 | 4.88 ± 0.911 | 3.993 ± 0.985 | 5.324 ± 2.724 | 0.444 ± 0.38 | 4.88 ± 2.074 | 2.218 ± 0.657 | 4.88 ± 1.336 | 0.887 ± 0.412 | 3.993 ± 1.187 | 4.88 ± 1.278 | 1.775 ± 0.825 | 8.429 ± 2.702 | 9.76 ± 1.875 | 5.768 ± 1.356 | 5.768 ± 1.602 | 0.444 ± 0.424 | 0.887 ± 0.658 | 0.0 ± 0.0 |
| His | 2.218 ± 0.61 | 1.331 ± 1.178 | 0.887 ± 0.433 | 1.331 ± 0.401 | 0.887 ± 0.433 | 0.444 ± 0.385 | 0.444 ± 0.385 | 0.887 ± 0.537 | 0.0 ± 0.0 | 0.887 ± 0.632 | 0.0 ± 0.0 | 1.331 ± 0.503 | 1.331 ± 0.503 | 0.444 ± 0.329 | 0.887 ± 0.378 | 1.775 ± 0.958 | 0.887 ± 0.378 | 1.331 ± 0.693 | 0.0 ± 0.0 | 0.887 ± 0.378 | 0.0 ± 0.0 |
| Ile | 3.106 ± 1.496 | 1.331 ± 1.322 | 4.437 ± 1.705 | 3.549 ± 0.585 | 1.331 ± 0.735 | 6.655 ± 3.309 | 0.0 ± 0.0 | 2.218 ± 1.29 | 1.331 ± 0.568 | 4.437 ± 2.131 | 1.331 ± 0.733 | 1.775 ± 1.002 | 3.993 ± 0.748 | 1.331 ± 0.401 | 1.775 ± 0.561 | 3.106 ± 1.339 | 0.887 ± 0.433 | 0.887 ± 0.378 | 0.0 ± 0.0 | 2.218 ± 0.716 | 0.0 ± 0.0 |
| Lys | 2.662 ± 1.293 | 1.775 ± 0.637 | 1.775 ± 0.85 | 2.662 ± 1.471 | 1.331 ± 0.568 | 3.549 ± 0.599 | 1.331 ± 0.423 | 1.331 ± 0.704 | 4.437 ± 1.487 | 2.662 ± 1.117 | 1.331 ± 1.067 | 4.88 ± 1.538 | 1.775 ± 1.392 | 2.218 ± 1.139 | 3.106 ± 1.298 | 3.549 ± 1.899 | 1.331 ± 0.401 | 3.549 ± 1.319 | 1.331 ± 0.401 | 1.775 ± 0.847 | 0.0 ± 0.0 |
| Leu | 3.549 ± 0.879 | 1.775 ± 0.968 | 4.88 ± 0.842 | 3.106 ± 1.199 | 3.993 ± 1.228 | 7.098 ± 1.74 | 2.662 ± 0.868 | 5.324 ± 1.629 | 4.88 ± 1.107 | 8.429 ± 1.573 | 2.218 ± 1.077 | 2.218 ± 1.077 | 6.211 ± 1.694 | 4.437 ± 0.659 | 4.437 ± 1.607 | 7.542 ± 2.12 | 5.768 ± 1.764 | 3.549 ± 1.424 | 0.444 ± 0.385 | 2.218 ± 1.034 | 0.0 ± 0.0 |
| Met | 2.662 ± 0.494 | 0.444 ± 0.329 | 1.331 ± 0.371 | 0.444 ± 0.424 | 0.444 ± 0.329 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.444 ± 0.329 | 0.444 ± 0.424 | 1.331 ± 1.068 | 0.444 ± 0.38 | 1.331 ± 0.658 | 1.331 ± 1.181 | 0.444 ± 0.424 | 0.887 ± 0.425 | 0.887 ± 0.628 | 0.444 ± 0.38 | 1.775 ± 0.787 | 0.444 ± 0.424 | 0.444 ± 0.38 | 0.0 ± 0.0 |
| Asn | 1.775 ± 0.848 | 1.775 ± 0.943 | 2.662 ± 0.97 | 4.437 ± 1.692 | 1.331 ± 0.658 | 2.662 ± 1.199 | 1.775 ± 0.535 | 3.106 ± 0.939 | 3.106 ± 1.26 | 2.662 ± 1.274 | 0.887 ± 0.537 | 1.775 ± 1.091 | 3.106 ± 1.048 | 0.887 ± 0.425 | 1.331 ± 1.178 | 3.993 ± 0.807 | 3.106 ± 0.494 | 3.106 ± 1.095 | 0.444 ± 0.385 | 0.887 ± 0.726 | 0.0 ± 0.0 |
| Pro | 6.655 ± 2.004 | 0.444 ± 0.609 | 4.88 ± 1.787 | 2.218 ± 0.61 | 1.775 ± 0.624 | 3.993 ± 1.998 | 1.775 ± 0.861 | 2.218 ± 0.527 | 3.993 ± 1.193 | 8.873 ± 1.342 | 0.0 ± 0.0 | 2.662 ± 0.85 | 8.429 ± 1.959 | 1.331 ± 0.423 | 3.993 ± 2.591 | 5.768 ± 1.408 | 1.331 ± 0.806 | 5.768 ± 1.266 | 0.887 ± 0.378 | 1.331 ± 0.503 | 0.0 ± 0.0 |
| Gln | 2.218 ± 0.85 | 0.444 ± 0.329 | 0.887 ± 0.782 | 2.662 ± 0.639 | 0.887 ± 0.412 | 4.88 ± 1.02 | 0.0 ± 0.0 | 1.775 ± 0.95 | 0.887 ± 0.425 | 2.218 ± 0.781 | 1.331 ± 0.681 | 0.444 ± 0.38 | 2.662 ± 1.015 | 2.218 ± 0.734 | 1.775 ± 0.521 | 0.887 ± 0.425 | 1.775 ± 0.567 | 2.218 ± 0.95 | 0.887 ± 0.433 | 2.218 ± 0.527 | 0.0 ± 0.0 |
| Arg | 4.437 ± 0.919 | 1.775 ± 1.496 | 2.218 ± 0.864 | 1.331 ± 0.701 | 2.218 ± 1.121 | 5.324 ± 1.397 | 1.331 ± 0.84 | 3.106 ± 0.897 | 6.211 ± 2.565 | 5.324 ± 1.139 | 0.0 ± 0.0 | 1.775 ± 0.535 | 5.768 ± 2.577 | 0.444 ± 0.424 | 3.549 ± 1.6 | 4.437 ± 1.113 | 3.106 ± 1.16 | 4.88 ± 1.309 | 0.887 ± 0.425 | 1.331 ± 0.701 | 0.0 ± 0.0 |
| Ser | 4.88 ± 0.474 | 2.218 ± 1.082 | 7.098 ± 1.058 | 2.218 ± 0.739 | 5.768 ± 0.802 | 8.429 ± 1.977 | 2.218 ± 0.677 | 3.993 ± 1.304 | 2.218 ± 0.516 | 6.211 ± 1.176 | 0.887 ± 0.516 | 2.662 ± 0.997 | 3.993 ± 1.407 | 2.662 ± 1.069 | 4.88 ± 0.908 | 8.429 ± 2.013 | 7.098 ± 2.091 | 3.993 ± 1.991 | 0.887 ± 0.658 | 0.887 ± 0.425 | 0.0 ± 0.0 |
| Thr | 3.106 ± 1.54 | 0.887 ± 0.433 | 4.437 ± 1.089 | 4.88 ± 0.854 | 4.88 ± 1.231 | 5.324 ± 0.82 | 0.887 ± 0.433 | 2.662 ± 1.028 | 1.775 ± 1.171 | 4.437 ± 1.06 | 0.444 ± 0.329 | 3.106 ± 1.181 | 4.437 ± 2.38 | 0.887 ± 0.425 | 3.993 ± 1.26 | 2.218 ± 0.669 | 3.549 ± 1.522 | 6.655 ± 1.133 | 1.331 ± 0.423 | 1.331 ± 0.503 | 0.0 ± 0.0 |
| Val | 3.993 ± 1.383 | 1.775 ± 0.71 | 4.437 ± 1.604 | 5.324 ± 2.662 | 5.324 ± 2.088 | 3.549 ± 0.979 | 1.331 ± 0.401 | 1.775 ± 0.709 | 3.106 ± 0.653 | 4.437 ± 1.089 | 0.444 ± 0.329 | 2.218 ± 0.734 | 3.549 ± 1.022 | 2.218 ± 1.261 | 4.88 ± 1.702 | 7.098 ± 1.895 | 5.768 ± 2.408 | 0.887 ± 0.412 | 1.331 ± 1.139 | 2.218 ± 0.527 | 0.0 ± 0.0 |
| Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.444 ± 0.38 | 0.887 ± 0.537 | 0.444 ± 0.385 | 2.218 ± 0.61 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.887 ± 0.425 | 2.218 ± 1.261 | 0.444 ± 0.329 | 0.444 ± 0.385 | 0.444 ± 0.385 | 0.444 ± 0.38 | 0.0 ± 0.0 | 1.331 ± 0.423 | 1.775 ± 1.171 | 1.331 ± 0.568 | 0.0 ± 0.0 | 1.331 ± 0.401 | 0.0 ± 0.0 |
| Tyr | 1.331 ± 0.658 | 0.887 ± 0.658 | 1.775 ± 0.843 | 1.331 ± 1.154 | 1.331 ± 0.75 | 0.887 ± 0.977 | 1.331 ± 0.684 | 1.331 ± 0.658 | 1.331 ± 0.658 | 2.662 ± 1.274 | 0.887 ± 0.444 | 0.444 ± 0.609 | 2.218 ± 0.91 | 1.331 ± 0.735 | 2.218 ± 0.683 | 1.775 ± 1.053 | 2.662 ± 1.247 | 0.887 ± 0.849 | 1.331 ± 0.828 | 1.775 ± 0.559 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 6 proteins (2255 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
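One way to read "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those replicates is taken as the error. This is only an illustration under stated assumptions: the helper names `dipeptide_per_mille` and `bootstrap_error`, the fixed seed, and the use of the sample standard deviation as the error measure are hypothetical choices, not details confirmed by the page.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Frequencies (per mille) of overlapping adjacent pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of one dipeptide's frequency over bootstrap resamples.

    Assumption: the resampling unit is a whole protein, drawn with
    replacement, and the error is the sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.stdev(replicates)


# Toy proteome (not the real Cervus papillomavirus 2 sequences).
toy_proteome = ["MKKLAAG", "AAKLLG", "GSGSAA", "MAAKK"]
point = dipeptide_per_mille(toy_proteome).get("AA", 0.0)
error = bootstrap_error(toy_proteome, "AA")
print(f"AlaAla: {point:.3f} ± {error:.3f} per mille")
```

With only a handful of proteins, as in the 6-protein proteome here, each resample can differ substantially from the original set, which is consistent with the relatively large errors reported in the table.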

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski