Amino acid dipeptide frequency for Phocoena phocoena papillomavirus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
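As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of protein sequences. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and counting pairs only within each protein (never across protein boundaries) is an assumption, not a documented detail of Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction of all counted dipeptides}.

    Assumption: pairs are counted within each protein only, so the last
    residue of one protein is never paired with the first of the next.
    """
    counts = Counter()
    for seq in proteins:
        # Every position except the last starts one overlapping pair.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input (not the real proteome): MA, AA, AG from the first sequence
# and AA, AC from the second give five dipeptides in total.
freqs = dipeptide_frequencies(["MAAG", "AAC"])
```

Multiplying each fraction by 1000 gives values in the same unit as the table.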

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
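The uniform expectation and the unit conversion described above can be sketched as follows. With 400 equally likely dipeptides the expected value is 1/400 = 0.25%, which is 2.5‰ in the unit of the table. The names `classify` and `per_mille_to_fraction` are hypothetical, and the plain comparison against 2.5‰ (ignoring the error estimates) is an assumption about how the red/blue marking is decided.

```python
# Expected frequency of each dipeptide if all 400 were equally likely,
# expressed in per mille: 1000 / 400 = 2.5 (i.e. 0.25 %).
EXPECTED_PER_MILLE = 1000 / 400


def per_mille_to_fraction(value):
    """Convert a table value in per mille to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation.

    Assumption: a simple comparison, without using the +/- error.
    """
    if value > expected:
        return "over"    # would be marked red
    if value < expected:
        return "under"   # would be marked blue
    return "expected"


# Values taken from the table: SerSer = 11.551, AlaCys = 1.65.
labels = {"SerSer": classify(11.551), "AlaCys": classify(1.65)}
```

Here SerSer comes out as overrepresented and AlaCys as underrepresented relative to 2.5‰.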

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.426 ± 1.208
AlaCys: 1.65 ± 0.501
AlaAsp: 4.125 ± 1.346
AlaGlu: 2.888 ± 1.199
AlaPhe: 2.475 ± 0.396
AlaGly: 3.3 ± 0.943
AlaHis: 2.063 ± 0.723
AlaIle: 2.888 ± 0.455
AlaLys: 3.713 ± 1.164
AlaLeu: 4.538 ± 1.389
AlaMet: 2.475 ± 0.463
AlaAsn: 1.238 ± 0.687
AlaPro: 5.776 ± 1.63
AlaGln: 1.65 ± 0.588
AlaArg: 4.125 ± 0.95
AlaSer: 3.713 ± 1.044
AlaThr: 4.538 ± 1.156
AlaVal: 2.888 ± 0.923
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.65 ± 0.711
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.65 ± 0.749
CysCys: 0.0 ± 0.0
CysAsp: 1.238 ± 0.785
CysGlu: 1.238 ± 1.078
CysPhe: 1.238 ± 0.751
CysGly: 0.413 ± 0.318
CysHis: 0.413 ± 0.359
CysIle: 1.65 ± 0.898
CysLys: 2.063 ± 0.673
CysLeu: 2.063 ± 0.933
CysMet: 1.65 ± 0.811
CysAsn: 1.238 ± 0.648
CysPro: 1.65 ± 0.588
CysGln: 0.0 ± 0.0
CysArg: 1.65 ± 0.508
CysSer: 1.65 ± 0.933
CysThr: 1.65 ± 0.641
CysVal: 0.825 ± 0.541
CysTrp: 2.063 ± 0.713
CysTyr: 0.413 ± 0.359
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.888 ± 0.406
AspCys: 2.888 ± 0.863
AspAsp: 3.713 ± 1.236
AspGlu: 2.888 ± 1.124
AspPhe: 3.3 ± 0.978
AspGly: 7.013 ± 1.478
AspHis: 1.65 ± 0.808
AspIle: 4.538 ± 0.896
AspLys: 2.063 ± 1.03
AspLeu: 4.125 ± 1.039
AspMet: 1.65 ± 0.588
AspAsn: 2.475 ± 0.821
AspPro: 5.776 ± 1.181
AspGln: 2.063 ± 0.772
AspArg: 3.3 ± 0.515
AspSer: 4.538 ± 1.06
AspThr: 3.713 ± 0.88
AspVal: 5.363 ± 1.814
AspTrp: 0.413 ± 0.318
AspTyr: 2.475 ± 1.248
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.475 ± 0.745
GluCys: 0.413 ± 0.318
GluAsp: 6.601 ± 2.567
GluGlu: 3.3 ± 0.768
GluPhe: 2.063 ± 0.943
GluGly: 4.95 ± 1.139
GluHis: 0.413 ± 0.386
GluIle: 1.65 ± 0.718
GluLys: 3.3 ± 1.687
GluLeu: 4.95 ± 0.816
GluMet: 1.238 ± 0.648
GluAsn: 0.825 ± 0.541
GluPro: 2.063 ± 0.995
GluGln: 3.3 ± 0.957
GluArg: 1.65 ± 1.072
GluSer: 7.013 ± 2.393
GluThr: 4.95 ± 1.131
GluVal: 2.888 ± 0.722
GluTrp: 0.0 ± 0.0
GluTyr: 1.238 ± 0.501
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.888 ± 0.822
PheCys: 1.65 ± 1.154
PheAsp: 3.713 ± 1.487
PheGlu: 2.475 ± 1.584
PhePhe: 1.238 ± 0.316
PheGly: 2.063 ± 0.43
PheHis: 0.413 ± 0.471
PheIle: 2.063 ± 0.807
PheLys: 2.888 ± 0.897
PheLeu: 4.95 ± 0.981
PheMet: 0.413 ± 0.441
PheAsn: 0.413 ± 0.386
PhePro: 2.063 ± 0.517
PheGln: 2.063 ± 0.595
PheArg: 1.238 ± 0.395
PheSer: 2.475 ± 0.778
PheThr: 3.3 ± 0.748
PheVal: 2.888 ± 1.415
PheTrp: 0.825 ± 0.637
PheTyr: 2.888 ± 1.134
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.125 ± 0.932
GlyCys: 2.063 ± 0.878
GlyAsp: 6.188 ± 0.893
GlyGlu: 5.363 ± 0.988
GlyPhe: 4.125 ± 1.468
GlyGly: 10.314 ± 2.892
GlyHis: 2.888 ± 0.73
GlyIle: 4.95 ± 0.667
GlyLys: 2.063 ± 0.746
GlyLeu: 4.125 ± 0.398
GlyMet: 1.238 ± 0.648
GlyAsn: 4.538 ± 1.179
GlyPro: 3.713 ± 1.387
GlyGln: 3.3 ± 0.642
GlyArg: 4.125 ± 0.573
GlySer: 5.776 ± 1.159
GlyThr: 4.538 ± 1.583
GlyVal: 4.95 ± 1.061
GlyTrp: 0.825 ± 0.54
GlyTyr: 0.0 ± 0.0
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.825 ± 0.473
HisCys: 0.413 ± 0.318
HisAsp: 1.238 ± 0.961
HisGlu: 1.238 ± 0.434
HisPhe: 0.825 ± 0.389
HisGly: 1.65 ± 0.606
HisHis: 0.825 ± 0.691
HisIle: 1.238 ± 0.652
HisLys: 1.65 ± 0.728
HisLeu: 1.238 ± 0.715
HisMet: 0.825 ± 0.771
HisAsn: 0.0 ± 0.0
HisPro: 4.125 ± 1.895
HisGln: 0.413 ± 0.359
HisArg: 1.65 ± 0.606
HisSer: 0.825 ± 0.426
HisThr: 2.888 ± 1.051
HisVal: 2.888 ± 0.57
HisTrp: 0.825 ± 0.541
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.65 ± 0.49
IleCys: 0.413 ± 0.318
IleAsp: 3.713 ± 0.711
IleGlu: 3.3 ± 0.815
IlePhe: 2.475 ± 0.928
IleGly: 2.475 ± 1.047
IleHis: 0.0 ± 0.0
IleIle: 0.413 ± 0.386
IleLys: 1.238 ± 0.648
IleLeu: 2.888 ± 1.329
IleMet: 0.413 ± 0.351
IleAsn: 0.825 ± 0.473
IlePro: 2.063 ± 1.105
IleGln: 1.238 ± 0.687
IleArg: 2.063 ± 0.779
IleSer: 4.538 ± 0.721
IleThr: 2.063 ± 1.034
IleVal: 2.063 ± 0.757
IleTrp: 0.413 ± 0.471
IleTyr: 1.238 ± 0.434
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.65 ± 0.606
LysCys: 1.65 ± 0.501
LysAsp: 1.65 ± 1.143
LysGlu: 2.888 ± 1.1
LysPhe: 2.475 ± 0.803
LysGly: 2.063 ± 0.903
LysHis: 1.65 ± 0.744
LysIle: 2.063 ± 1.075
LysLys: 5.363 ± 0.736
LysLeu: 1.238 ± 0.575
LysMet: 0.413 ± 0.318
LysAsn: 2.888 ± 1.018
LysPro: 1.65 ± 0.644
LysGln: 0.413 ± 0.386
LysArg: 5.776 ± 1.011
LysSer: 3.713 ± 1.327
LysThr: 3.3 ± 0.957
LysVal: 2.888 ± 1.239
LysTrp: 1.238 ± 0.316
LysTyr: 1.238 ± 0.746
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.776 ± 1.044
LeuCys: 2.475 ± 1.59
LeuAsp: 4.538 ± 1.138
LeuGlu: 4.125 ± 1.647
LeuPhe: 5.363 ± 1.157
LeuGly: 5.776 ± 0.928
LeuHis: 2.888 ± 1.642
LeuIle: 3.3 ± 0.506
LeuLys: 3.3 ± 1.212
LeuLeu: 9.901 ± 1.94
LeuMet: 0.825 ± 0.637
LeuAsn: 1.238 ± 1.157
LeuPro: 4.125 ± 1.73
LeuGln: 5.776 ± 0.985
LeuArg: 3.713 ± 1.764
LeuSer: 6.188 ± 1.825
LeuThr: 6.601 ± 1.959
LeuVal: 4.95 ± 1.529
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.3 ± 0.779
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.65 ± 0.712
MetCys: 0.825 ± 0.372
MetAsp: 2.888 ± 1.163
MetGlu: 0.825 ± 0.473
MetPhe: 0.413 ± 0.318
MetGly: 1.65 ± 0.711
MetHis: 0.413 ± 0.32
MetIle: 0.0 ± 0.0
MetLys: 0.413 ± 0.318
MetLeu: 2.063 ± 0.388
MetMet: 0.413 ± 0.32
MetAsn: 0.825 ± 0.372
MetPro: 0.413 ± 0.318
MetGln: 0.0 ± 0.0
MetArg: 0.413 ± 0.318
MetSer: 1.238 ± 0.652
MetThr: 1.238 ± 0.588
MetVal: 1.238 ± 0.635
MetTrp: 0.825 ± 0.771
MetTyr: 1.238 ± 0.785
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.65 ± 0.26
AsnCys: 1.238 ± 0.709
AsnAsp: 0.825 ± 0.771
AsnGlu: 0.825 ± 0.771
AsnPhe: 0.413 ± 0.386
AsnGly: 2.475 ± 0.633
AsnHis: 0.0 ± 0.0
AsnIle: 0.825 ± 0.771
AsnLys: 1.65 ± 1.051
AsnLeu: 1.238 ± 0.416
AsnMet: 0.0 ± 0.0
AsnAsn: 0.825 ± 0.473
AsnPro: 2.475 ± 0.898
AsnGln: 1.238 ± 0.395
AsnArg: 0.825 ± 0.771
AsnSer: 2.888 ± 1.082
AsnThr: 2.888 ± 1.396
AsnVal: 3.3 ± 1.403
AsnTrp: 0.825 ± 0.637
AsnTyr: 1.238 ± 0.434
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.776 ± 1.148
ProCys: 1.65 ± 0.661
ProAsp: 7.013 ± 2.851
ProGlu: 4.125 ± 1.131
ProPhe: 1.238 ± 0.592
ProGly: 4.538 ± 1.853
ProHis: 1.65 ± 0.79
ProIle: 0.825 ± 0.541
ProLys: 3.713 ± 1.39
ProLeu: 9.076 ± 1.025
ProMet: 0.413 ± 0.618
ProAsn: 1.65 ± 0.588
ProPro: 8.663 ± 1.664
ProGln: 2.063 ± 0.717
ProArg: 3.3 ± 1.295
ProSer: 6.601 ± 3.516
ProThr: 2.475 ± 1.136
ProVal: 4.125 ± 0.698
ProTrp: 0.0 ± 0.0
ProTyr: 0.825 ± 0.771
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.888 ± 0.722
GlnCys: 0.413 ± 0.318
GlnAsp: 3.713 ± 0.699
GlnGlu: 2.063 ± 0.595
GlnPhe: 0.825 ± 0.404
GlnGly: 2.475 ± 0.796
GlnHis: 0.413 ± 0.618
GlnIle: 0.413 ± 0.318
GlnLys: 0.413 ± 0.318
GlnLeu: 3.713 ± 1.327
GlnMet: 2.888 ± 1.02
GlnAsn: 1.65 ± 0.517
GlnPro: 3.3 ± 1.351
GlnGln: 2.888 ± 1.188
GlnArg: 2.475 ± 1.013
GlnSer: 1.238 ± 0.715
GlnThr: 2.475 ± 1.146
GlnVal: 2.475 ± 0.48
GlnTrp: 1.238 ± 0.955
GlnTyr: 0.413 ± 0.386
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.363 ± 1.183
ArgCys: 2.063 ± 1.311
ArgAsp: 1.65 ± 0.673
ArgGlu: 3.3 ± 0.515
ArgPhe: 2.475 ± 0.959
ArgGly: 4.95 ± 1.661
ArgHis: 1.238 ± 0.785
ArgIle: 0.825 ± 0.416
ArgLys: 3.713 ± 1.103
ArgLeu: 4.95 ± 0.962
ArgMet: 0.413 ± 0.361
ArgAsn: 1.65 ± 0.54
ArgPro: 2.888 ± 0.86
ArgGln: 2.475 ± 0.724
ArgArg: 4.538 ± 1.083
ArgSer: 3.713 ± 0.666
ArgThr: 3.3 ± 1.234
ArgVal: 2.888 ± 1.081
ArgTrp: 1.65 ± 0.832
ArgTyr: 1.238 ± 0.722
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.776 ± 1.325
SerCys: 1.238 ± 0.746
SerAsp: 2.888 ± 0.779
SerGlu: 7.013 ± 0.728
SerPhe: 5.776 ± 1.328
SerGly: 8.251 ± 1.567
SerHis: 2.888 ± 0.669
SerIle: 2.888 ± 1.051
SerLys: 2.063 ± 0.51
SerLeu: 9.076 ± 2.038
SerMet: 2.063 ± 0.541
SerAsn: 1.65 ± 0.54
SerPro: 4.95 ± 2.321
SerGln: 1.65 ± 0.959
SerArg: 3.713 ± 0.992
SerSer: 11.551 ± 2.608
SerThr: 8.251 ± 1.699
SerVal: 4.125 ± 1.686
SerTrp: 0.413 ± 0.471
SerTyr: 1.238 ± 0.592
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.475 ± 0.66
ThrCys: 1.238 ± 0.501
ThrAsp: 4.95 ± 1.169
ThrGlu: 2.888 ± 1.025
ThrPhe: 2.063 ± 0.995
ThrGly: 6.188 ± 0.592
ThrHis: 2.475 ± 0.81
ThrIle: 1.65 ± 0.644
ThrLys: 2.063 ± 1.007
ThrLeu: 4.538 ± 1.542
ThrMet: 0.825 ± 0.372
ThrAsn: 1.238 ± 0.687
ThrPro: 7.426 ± 1.228
ThrGln: 4.125 ± 1.856
ThrArg: 3.713 ± 1.182
ThrSer: 7.838 ± 2.768
ThrThr: 3.713 ± 1.075
ThrVal: 6.601 ± 1.468
ThrTrp: 2.063 ± 0.742
ThrTyr: 2.475 ± 0.778
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.825 ± 0.771
ValCys: 2.475 ± 1.326
ValAsp: 4.125 ± 0.869
ValGlu: 2.888 ± 0.791
ValPhe: 1.65 ± 0.517
ValGly: 5.776 ± 1.669
ValHis: 2.063 ± 0.968
ValIle: 1.238 ± 0.638
ValLys: 2.888 ± 0.891
ValLeu: 6.601 ± 1.47
ValMet: 0.0 ± 0.0
ValAsn: 0.825 ± 0.404
ValPro: 5.363 ± 1.369
ValGln: 2.888 ± 1.019
ValArg: 4.125 ± 0.721
ValSer: 8.663 ± 1.756
ValThr: 5.363 ± 1.346
ValVal: 5.363 ± 1.83
ValTrp: 1.65 ± 0.816
ValTyr: 1.65 ± 0.701
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.063 ± 0.57
TrpCys: 0.0 ± 0.0
TrpAsp: 0.825 ± 0.372
TrpGlu: 0.413 ± 0.359
TrpPhe: 1.238 ± 0.501
TrpGly: 1.238 ± 0.592
TrpHis: 0.0 ± 0.0
TrpIle: 1.238 ± 0.575
TrpLys: 1.238 ± 0.955
TrpLeu: 0.825 ± 0.372
TrpMet: 0.0 ± 0.0
TrpAsn: 0.825 ± 0.771
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.65 ± 1.439
TrpSer: 0.413 ± 0.318
TrpThr: 2.475 ± 1.434
TrpVal: 1.65 ± 0.933
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.413 ± 0.318
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.3 ± 1.107
TyrCys: 0.0 ± 0.0
TyrAsp: 1.238 ± 0.687
TyrGlu: 1.238 ± 0.687
TyrPhe: 1.238 ± 0.575
TyrGly: 1.65 ± 0.26
TyrHis: 1.238 ± 0.434
TyrIle: 0.825 ± 0.473
TyrLys: 0.413 ± 0.318
TyrLeu: 2.063 ± 0.753
TyrMet: 0.413 ± 0.319
TyrAsn: 0.825 ± 0.771
TyrPro: 1.238 ± 0.434
TyrGln: 0.825 ± 0.749
TyrArg: 1.65 ± 0.54
TyrSer: 2.888 ± 0.723
TyrThr: 0.825 ± 0.372
TyrVal: 2.063 ± 0.723
TyrTrp: 1.238 ± 1.157
TyrTyr: 0.413 ± 0.359
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2425 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
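A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation as the reported error are assumptions made for illustration; the exact procedure used by Proteome-pI is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the single residue.
    """
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, not the real proteins).
toy = ["MAAGLL", "MKRSS", "AASSTV", "MLLPP"]
error_aa = bootstrap_error(toy, "AA")
error_ww = bootstrap_error(toy, "WW")  # WW never occurs in the toy data
```

A dipeptide that is absent from every protein has a frequency of zero in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.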

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski