Amino acid dipeptide frequency for Pygoscelis adeliae papillomavirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
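The counting described above can be sketched in Python. This is only a minimal illustration, not the database's own code: the function names, the toy sequences, the choice to count overlapping pairs within each protein, and the use of the uniform 2.5‰ expectation as the red/blue threshold are all assumptions.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are overlapping residue pairs counted inside
    each protein; pairs spanning two proteins are not counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X so that 441 cells cover all cases.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total) if total else 0.0
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if freq > EXPECTED_PER_MILLE:
        return "over"
    if freq < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy input (not the real proteome): 5 + 3 = 8 dipeptides, two of them "LL".
toy = ["MKLLAG", "LLXA"]
freqs = dipeptide_per_mille(toy)
print(len(freqs), freqs["LL"], classify(freqs["LL"]), classify(freqs["AA"]))
```

With real data, `proteins` would hold the protein sequences of the proteome, and the resulting dictionary would correspond to the cells of the table.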

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
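As a small worked example of this conversion, the sketch below turns the LeuLeu value from the table (11.115‰) into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper name is hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


leu_leu = 11.115                           # LeuLeu from the table, in per mille
fraction = per_mille_to_fraction(leu_leu)  # about 0.011115
percent = fraction * 100                   # about 1.1115 %
fold = leu_leu / 2.5                       # about 4.4 times the uniform expectation

print(f"{fraction:.6f} {percent:.4f}% {fold:.2f}x")
```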

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Each cell: frequency ± error.

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 4.983 ± 0.924 | 0.383 ± 0.534 | 4.216 ± 1.48 | 3.833 ± 0.689 | 1.15 ± 0.648 | 4.599 ± 1.351 | 1.533 ± 1.022 | 2.3 ± 0.732 | 1.916 ± 0.738 | 4.599 ± 1.336 | 0.767 ± 0.747 | 1.15 ± 0.567 | 5.366 ± 1.37 | 1.533 ± 0.773 | 3.833 ± 1.372 | 4.216 ± 1.419 | 2.683 ± 0.697 | 4.599 ± 1.259 | 1.15 ± 0.382 | 1.15 ± 0.309 | 0.0 ± 0.0
Cys | 0.767 ± 0.579 | 0.0 ± 0.0 | 3.066 ± 1.293 | 1.15 ± 0.686 | 1.15 ± 0.413 | 1.533 ± 0.547 | 0.767 ± 0.697 | 1.533 ± 0.644 | 1.533 ± 0.97 | 0.383 ± 0.289 | 0.0 ± 0.0 | 1.533 ± 0.941 | 1.533 ± 0.811 | 0.767 ± 0.414 | 1.533 ± 0.816 | 2.3 ± 0.993 | 1.15 ± 0.855 | 0.383 ± 0.289 | 0.767 ± 0.545 | 0.383 ± 0.447 | 0.0 ± 0.0
Asp | 2.3 ± 1.121 | 1.916 ± 0.485 | 4.599 ± 1.292 | 2.3 ± 0.543 | 1.15 ± 0.716 | 5.366 ± 1.236 | 0.383 ± 0.333 | 3.066 ± 0.843 | 1.916 ± 0.429 | 6.516 ± 1.396 | 1.533 ± 0.438 | 3.833 ± 1.337 | 7.282 ± 1.839 | 1.15 ± 0.526 | 1.15 ± 1.12 | 4.599 ± 1.479 | 4.599 ± 1.23 | 4.216 ± 1.334 | 0.767 ± 0.353 | 1.916 ± 0.561 | 0.0 ± 0.0
Glu | 5.366 ± 0.84 | 1.15 ± 0.631 | 4.599 ± 0.936 | 8.049 ± 2.607 | 1.533 ± 0.407 | 4.983 ± 1.844 | 0.383 ± 0.289 | 3.066 ± 1.161 | 2.3 ± 0.556 | 8.049 ± 1.645 | 1.15 ± 0.775 | 2.683 ± 0.795 | 4.599 ± 2.26 | 2.3 ± 0.667 | 3.066 ± 1.21 | 5.366 ± 1.979 | 4.216 ± 1.01 | 1.15 ± 0.585 | 0.383 ± 0.289 | 2.683 ± 1.115 | 0.0 ± 0.0
Phe | 0.383 ± 0.329 | 0.383 ± 0.333 | 1.15 ± 0.716 | 2.683 ± 0.93 | 1.916 ± 0.53 | 1.15 ± 0.309 | 0.767 ± 0.667 | 1.533 ± 0.532 | 1.533 ± 0.547 | 4.599 ± 1.175 | 0.383 ± 0.329 | 1.916 ± 1.037 | 2.3 ± 0.718 | 0.767 ± 0.545 | 3.066 ± 0.842 | 3.066 ± 1.182 | 0.383 ± 0.329 | 1.533 ± 0.94 | 0.767 ± 0.353 | 1.15 ± 0.553 | 0.0 ± 0.0
Gly | 3.066 ± 1.252 | 1.15 ± 0.413 | 4.216 ± 1.235 | 4.599 ± 1.677 | 1.916 ± 0.485 | 6.516 ± 1.884 | 0.383 ± 0.333 | 1.916 ± 1.108 | 2.683 ± 1.15 | 8.049 ± 2.264 | 1.916 ± 1.211 | 4.216 ± 0.689 | 5.366 ± 0.955 | 5.366 ± 0.94 | 6.133 ± 1.214 | 4.983 ± 2.175 | 3.45 ± 1.113 | 4.983 ± 1.062 | 1.533 ± 0.735 | 0.767 ± 0.601 | 0.0 ± 0.0
His | 0.767 ± 0.353 | 0.767 ± 0.353 | 1.15 ± 0.638 | 0.767 ± 0.363 | 0.767 ± 0.353 | 2.3 ± 0.429 | 0.0 ± 0.0 | 1.15 ± 0.666 | 0.0 ± 0.0 | 1.916 ± 1.323 | 0.383 ± 0.333 | 0.383 ± 0.333 | 0.767 ± 0.353 | 1.15 ± 0.738 | 1.15 ± 0.593 | 1.15 ± 0.629 | 1.15 ± 0.382 | 0.767 ± 0.661 | 0.767 ± 0.458 | 0.767 ± 0.557 | 0.0 ± 0.0
Ile | 1.533 ± 0.825 | 0.767 ± 0.353 | 4.216 ± 1.451 | 2.3 ± 0.877 | 1.15 ± 0.382 | 3.833 ± 0.624 | 0.767 ± 0.383 | 2.683 ± 1.146 | 1.533 ± 0.714 | 3.066 ± 1.639 | 0.767 ± 0.661 | 0.383 ± 0.289 | 4.983 ± 1.896 | 0.383 ± 0.289 | 1.916 ± 1.003 | 3.833 ± 1.004 | 3.066 ± 0.646 | 1.533 ± 0.571 | 0.383 ± 0.329 | 1.916 ± 0.954 | 0.0 ± 0.0
Lys | 1.15 ± 0.681 | 1.15 ± 0.716 | 0.383 ± 0.333 | 1.15 ± 0.41 | 1.15 ± 0.631 | 1.15 ± 0.594 | 0.383 ± 0.333 | 3.066 ± 0.85 | 3.45 ± 1.062 | 2.3 ± 0.834 | 0.383 ± 0.329 | 1.15 ± 0.868 | 2.3 ± 0.554 | 0.767 ± 0.661 | 5.366 ± 1.361 | 2.3 ± 1.444 | 3.833 ± 1.601 | 1.533 ± 0.932 | 0.0 ± 0.0 | 3.066 ± 0.813 | 0.0 ± 0.0
Leu | 4.983 ± 1.342 | 2.683 ± 1.203 | 5.366 ± 1.283 | 9.966 ± 0.813 | 4.599 ± 1.181 | 4.599 ± 1.607 | 1.15 ± 0.681 | 2.3 ± 0.6 | 3.833 ± 1.074 | 11.115 ± 2.499 | 1.15 ± 0.885 | 3.833 ± 1.335 | 6.516 ± 1.52 | 5.749 ± 1.848 | 6.133 ± 1.163 | 8.432 ± 1.244 | 5.366 ± 1.622 | 3.833 ± 1.371 | 1.916 ± 0.776 | 3.45 ± 1.092 | 0.0 ± 0.0
Met | 1.15 ± 0.631 | 0.767 ± 0.511 | 0.383 ± 0.373 | 0.767 ± 0.363 | 0.767 ± 0.485 | 0.0 ± 0.0 | 1.15 ± 1.12 | 0.383 ± 0.329 | 0.383 ± 0.546 | 1.15 ± 0.681 | 0.767 ± 0.545 | 0.383 ± 0.329 | 1.15 ± 1.153 | 1.15 ± 0.631 | 1.916 ± 0.372 | 1.15 ± 0.666 | 1.533 ± 0.514 | 0.767 ± 0.579 | 0.0 ± 0.0 | 0.767 ± 0.697 | 0.0 ± 0.0
Asn | 2.683 ± 0.805 | 0.767 ± 0.545 | 0.767 ± 0.579 | 2.3 ± 0.764 | 0.383 ± 0.546 | 1.15 ± 0.666 | 0.0 ± 0.0 | 0.383 ± 0.447 | 1.15 ± 0.382 | 3.45 ± 0.578 | 0.767 ± 0.511 | 1.916 ± 0.999 | 3.066 ± 1.097 | 1.916 ± 1.003 | 4.599 ± 1.237 | 2.683 ± 1.675 | 4.599 ± 1.067 | 3.45 ± 1.125 | 0.0 ± 0.0 | 1.916 ± 1.041 | 0.0 ± 0.0
Pro | 8.049 ± 1.946 | 0.767 ± 0.579 | 5.749 ± 2.012 | 4.216 ± 1.056 | 2.683 ± 1.995 | 4.599 ± 2.385 | 1.15 ± 0.309 | 3.066 ± 1.598 | 1.916 ± 0.692 | 6.899 ± 2.518 | 0.383 ± 0.289 | 3.066 ± 0.644 | 7.282 ± 2.58 | 1.916 ± 0.576 | 3.45 ± 0.95 | 7.282 ± 2.209 | 7.282 ± 2.711 | 2.683 ± 1.195 | 0.767 ± 0.553 | 3.066 ± 1.334 | 0.0 ± 0.0
Gln | 1.916 ± 0.514 | 0.383 ± 0.289 | 0.767 ± 0.383 | 2.3 ± 1.262 | 1.15 ± 0.745 | 3.066 ± 1.097 | 1.533 ± 0.582 | 1.533 ± 0.433 | 1.533 ± 0.632 | 5.749 ± 1.165 | 0.383 ± 0.373 | 1.15 ± 0.941 | 1.916 ± 0.561 | 1.15 ± 0.693 | 2.683 ± 0.853 | 3.066 ± 0.739 | 1.533 ± 1.184 | 2.683 ± 1.495 | 0.383 ± 0.289 | 1.533 ± 0.72 | 0.0 ± 0.0
Arg | 3.066 ± 1.177 | 1.533 ± 0.547 | 2.683 ± 0.883 | 3.45 ± 1.412 | 1.15 ± 0.567 | 6.133 ± 1.886 | 2.3 ± 1.095 | 3.45 ± 1.274 | 4.216 ± 1.627 | 6.516 ± 2.802 | 1.533 ± 0.577 | 2.3 ± 0.702 | 6.133 ± 1.529 | 1.15 ± 1.0 | 8.432 ± 1.724 | 4.599 ± 0.622 | 5.749 ± 1.323 | 5.366 ± 1.467 | 0.383 ± 0.373 | 3.066 ± 1.118 | 0.0 ± 0.0
Ser | 4.216 ± 1.019 | 0.383 ± 0.373 | 6.133 ± 1.39 | 7.666 ± 1.985 | 2.683 ± 0.868 | 10.349 ± 1.788 | 2.683 ± 0.605 | 1.15 ± 0.514 | 3.066 ± 1.084 | 8.432 ± 2.237 | 0.767 ± 0.473 | 3.066 ± 1.946 | 3.066 ± 0.792 | 2.683 ± 1.073 | 4.216 ± 1.002 | 7.666 ± 1.3 | 7.282 ± 0.63 | 4.983 ± 1.596 | 0.383 ± 0.289 | 1.916 ± 1.597 | 0.0 ± 0.0
Thr | 5.366 ± 0.868 | 1.916 ± 1.674 | 4.599 ± 1.362 | 5.366 ± 1.681 | 2.683 ± 1.072 | 6.516 ± 1.716 | 0.767 ± 0.658 | 3.45 ± 1.479 | 1.15 ± 0.708 | 3.833 ± 1.422 | 1.533 ± 0.547 | 1.533 ± 0.51 | 3.833 ± 0.597 | 2.683 ± 1.383 | 4.216 ± 1.774 | 6.516 ± 0.473 | 5.749 ± 1.184 | 4.599 ± 1.562 | 1.533 ± 0.943 | 2.3 ± 1.207 | 0.0 ± 0.0
Val | 2.3 ± 0.967 | 0.383 ± 0.289 | 3.066 ± 1.122 | 3.066 ± 1.258 | 1.533 ± 0.263 | 4.599 ± 1.707 | 1.15 ± 0.591 | 2.3 ± 0.811 | 0.383 ± 0.373 | 3.833 ± 1.503 | 1.533 ± 0.644 | 1.916 ± 1.217 | 5.749 ± 3.078 | 1.916 ± 0.829 | 5.749 ± 1.348 | 4.983 ± 0.765 | 4.599 ± 0.772 | 3.066 ± 1.141 | 0.383 ± 0.373 | 1.916 ± 0.754 | 0.0 ± 0.0
Trp | 0.383 ± 0.289 | 0.767 ± 0.414 | 0.767 ± 0.363 | 0.383 ± 0.333 | 0.383 ± 0.329 | 0.383 ± 0.373 | 0.383 ± 0.333 | 1.15 ± 0.686 | 0.767 ± 0.414 | 2.3 ± 0.763 | 0.0 ± 0.0 | 0.767 ± 0.485 | 0.383 ± 0.329 | 0.767 ± 0.747 | 1.15 ± 0.556 | 1.15 ± 0.708 | 0.383 ± 0.447 | 0.767 ± 0.353 | 0.383 ± 0.329 | 0.383 ± 0.329 | 0.0 ± 0.0
Tyr | 1.533 ± 0.756 | 4.216 ± 1.792 | 2.683 ± 0.569 | 0.767 ± 0.553 | 1.533 ± 0.263 | 0.383 ± 0.289 | 0.383 ± 0.373 | 1.533 ± 0.706 | 0.767 ± 0.414 | 4.599 ± 1.156 | 0.0 ± 0.0 | 0.767 ± 0.58 | 2.683 ± 0.895 | 1.15 ± 0.571 | 3.45 ± 1.353 | 3.45 ± 0.722 | 1.533 ± 0.753 | 1.533 ± 0.532 | 1.15 ± 0.708 | 1.916 ± 0.714 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 7 proteins (2610 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
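One way such an error estimate could be obtained is sketched below. The source states only that bootstrapping (x100) was done at the protein level; resampling whole proteins with replacement and reporting the standard deviation across replicates is an assumption, as are the function names and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq(proteins, dipeptide):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return (1000.0 * counts[dipeptide] / total) if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so residues inside one protein are always kept together.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy sequences, not the real proteome.
toy = ["MKLLAG", "LLLL", "AGAG"]
err_ll = bootstrap_error(toy, "LL")
err_ww = bootstrap_error(toy, "WW")  # never observed, so every replicate gives 0
print(err_ll, err_ww)
```

A dipeptide that never occurs gives 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 cells in the table.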

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski