Amino acid dipepetide frequency for Gordonia phage Pollux

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.824AlaAla: 16.824 ± 1.717
0.948AlaCys: 0.948 ± 0.315
8.234AlaAsp: 8.234 ± 0.653
8.116AlaGlu: 8.116 ± 0.716
3.258AlaPhe: 3.258 ± 0.721
9.893AlaGly: 9.893 ± 1.049
2.666AlaHis: 2.666 ± 0.47
4.502AlaIle: 4.502 ± 0.555
4.206AlaLys: 4.206 ± 0.576
9.004AlaLeu: 9.004 ± 0.877
4.147AlaMet: 4.147 ± 0.75
3.436AlaAsn: 3.436 ± 0.729
4.798AlaPro: 4.798 ± 0.472
4.68AlaGln: 4.68 ± 0.782
8.116AlaArg: 8.116 ± 0.833
6.931AlaSer: 6.931 ± 0.777
8.234AlaThr: 8.234 ± 0.77
7.642AlaVal: 7.642 ± 0.614
2.31AlaTrp: 2.31 ± 0.333
2.192AlaTyr: 2.192 ± 0.312
0.0AlaXaa: 0.0 ± 0.0
Cys
1.126CysAla: 1.126 ± 0.348
0.059CysCys: 0.059 ± 0.057
0.652CysAsp: 0.652 ± 0.225
0.889CysGlu: 0.889 ± 0.287
0.237CysPhe: 0.237 ± 0.131
1.481CysGly: 1.481 ± 0.395
0.296CysHis: 0.296 ± 0.138
0.178CysIle: 0.178 ± 0.106
0.296CysLys: 0.296 ± 0.131
0.296CysLeu: 0.296 ± 0.121
0.296CysMet: 0.296 ± 0.126
0.415CysAsn: 0.415 ± 0.153
0.296CysPro: 0.296 ± 0.15
0.355CysGln: 0.355 ± 0.163
1.185CysArg: 1.185 ± 0.29
0.355CysSer: 0.355 ± 0.125
0.592CysThr: 0.592 ± 0.188
0.355CysVal: 0.355 ± 0.128
0.118CysTrp: 0.118 ± 0.085
0.296CysTyr: 0.296 ± 0.119
0.0CysXaa: 0.0 ± 0.0
Asp
8.471AspAla: 8.471 ± 0.654
0.77AspCys: 0.77 ± 0.201
5.45AspAsp: 5.45 ± 0.724
4.028AspGlu: 4.028 ± 0.606
1.836AspPhe: 1.836 ± 0.285
6.99AspGly: 6.99 ± 0.685
1.659AspHis: 1.659 ± 0.335
2.725AspIle: 2.725 ± 0.327
1.126AspLys: 1.126 ± 0.266
7.227AspLeu: 7.227 ± 0.643
1.066AspMet: 1.066 ± 0.256
1.481AspAsn: 1.481 ± 0.326
4.621AspPro: 4.621 ± 0.509
3.14AspGln: 3.14 ± 0.439
5.331AspArg: 5.331 ± 0.601
2.547AspSer: 2.547 ± 0.374
3.08AspThr: 3.08 ± 0.455
3.732AspVal: 3.732 ± 0.55
1.422AspTrp: 1.422 ± 0.316
1.303AspTyr: 1.303 ± 0.329
0.0AspXaa: 0.0 ± 0.0
Glu
7.049GluAla: 7.049 ± 0.664
0.711GluCys: 0.711 ± 0.241
2.547GluAsp: 2.547 ± 0.418
3.377GluGlu: 3.377 ± 0.497
2.014GluPhe: 2.014 ± 0.416
3.377GluGly: 3.377 ± 0.481
1.718GluHis: 1.718 ± 0.311
3.377GluIle: 3.377 ± 0.526
1.955GluLys: 1.955 ± 0.329
5.509GluLeu: 5.509 ± 0.584
1.481GluMet: 1.481 ± 0.295
1.599GluAsn: 1.599 ± 0.288
2.488GluPro: 2.488 ± 0.376
2.014GluGln: 2.014 ± 0.386
4.028GluArg: 4.028 ± 0.526
3.377GluSer: 3.377 ± 0.446
3.614GluThr: 3.614 ± 0.417
4.502GluVal: 4.502 ± 0.643
1.185GluTrp: 1.185 ± 0.292
1.54GluTyr: 1.54 ± 0.304
0.0GluXaa: 0.0 ± 0.0
Phe
2.784PheAla: 2.784 ± 0.44
0.474PheCys: 0.474 ± 0.15
2.192PheAsp: 2.192 ± 0.354
1.481PheGlu: 1.481 ± 0.332
0.415PhePhe: 0.415 ± 0.173
3.14PheGly: 3.14 ± 0.57
0.474PheHis: 0.474 ± 0.156
1.303PheIle: 1.303 ± 0.244
0.474PheLys: 0.474 ± 0.142
1.777PheLeu: 1.777 ± 0.339
0.355PheMet: 0.355 ± 0.156
0.533PheAsn: 0.533 ± 0.165
1.481PhePro: 1.481 ± 0.347
0.711PheGln: 0.711 ± 0.19
1.896PheArg: 1.896 ± 0.274
1.718PheSer: 1.718 ± 0.346
2.133PheThr: 2.133 ± 0.371
2.31PheVal: 2.31 ± 0.307
0.533PheTrp: 0.533 ± 0.183
0.474PheTyr: 0.474 ± 0.136
0.0PheXaa: 0.0 ± 0.0
Gly
8.59GlyAla: 8.59 ± 1.126
0.711GlyCys: 0.711 ± 0.271
5.628GlyAsp: 5.628 ± 0.604
4.265GlyGlu: 4.265 ± 0.415
2.37GlyPhe: 2.37 ± 0.348
8.649GlyGly: 8.649 ± 1.637
1.718GlyHis: 1.718 ± 0.345
3.436GlyIle: 3.436 ± 0.478
3.436GlyLys: 3.436 ± 0.483
5.746GlyLeu: 5.746 ± 0.606
1.54GlyMet: 1.54 ± 0.264
3.14GlyAsn: 3.14 ± 0.414
4.028GlyPro: 4.028 ± 0.406
3.91GlyGln: 3.91 ± 0.576
7.346GlyArg: 7.346 ± 0.625
4.621GlySer: 4.621 ± 0.805
5.628GlyThr: 5.628 ± 0.543
5.924GlyVal: 5.924 ± 0.641
1.659GlyTrp: 1.659 ± 0.289
1.955GlyTyr: 1.955 ± 0.346
0.0GlyXaa: 0.0 ± 0.0
His
1.955HisAla: 1.955 ± 0.373
0.355HisCys: 0.355 ± 0.164
1.422HisAsp: 1.422 ± 0.319
1.007HisGlu: 1.007 ± 0.198
0.178HisPhe: 0.178 ± 0.093
2.014HisGly: 2.014 ± 0.44
0.829HisHis: 0.829 ± 0.225
0.889HisIle: 0.889 ± 0.291
0.355HisLys: 0.355 ± 0.13
1.955HisLeu: 1.955 ± 0.459
0.118HisMet: 0.118 ± 0.078
0.77HisAsn: 0.77 ± 0.202
1.777HisPro: 1.777 ± 0.383
0.948HisGln: 0.948 ± 0.271
1.896HisArg: 1.896 ± 0.358
0.889HisSer: 0.889 ± 0.315
1.718HisThr: 1.718 ± 0.306
1.599HisVal: 1.599 ± 0.294
0.355HisTrp: 0.355 ± 0.137
0.829HisTyr: 0.829 ± 0.255
0.0HisXaa: 0.0 ± 0.0
Ile
5.805IleAla: 5.805 ± 0.544
0.296IleCys: 0.296 ± 0.151
3.554IleAsp: 3.554 ± 0.558
3.317IleGlu: 3.317 ± 0.518
0.711IlePhe: 0.711 ± 0.211
3.554IleGly: 3.554 ± 0.434
1.066IleHis: 1.066 ± 0.3
1.599IleIle: 1.599 ± 0.381
1.126IleLys: 1.126 ± 0.367
2.962IleLeu: 2.962 ± 0.503
0.178IleMet: 0.178 ± 0.103
1.126IleAsn: 1.126 ± 0.28
2.666IlePro: 2.666 ± 0.372
1.955IleGln: 1.955 ± 0.37
3.554IleArg: 3.554 ± 0.507
2.192IleSer: 2.192 ± 0.37
2.666IleThr: 2.666 ± 0.417
2.903IleVal: 2.903 ± 0.463
0.533IleTrp: 0.533 ± 0.172
1.185IleTyr: 1.185 ± 0.329
0.0IleXaa: 0.0 ± 0.0
Lys
3.377LysAla: 3.377 ± 0.531
0.178LysCys: 0.178 ± 0.11
1.599LysAsp: 1.599 ± 0.332
1.244LysGlu: 1.244 ± 0.253
0.711LysPhe: 0.711 ± 0.193
2.133LysGly: 2.133 ± 0.471
0.533LysHis: 0.533 ± 0.188
1.244LysIle: 1.244 ± 0.341
1.244LysLys: 1.244 ± 0.354
2.843LysLeu: 2.843 ± 0.434
0.652LysMet: 0.652 ± 0.198
1.066LysAsn: 1.066 ± 0.247
2.37LysPro: 2.37 ± 0.549
0.948LysGln: 0.948 ± 0.233
2.784LysArg: 2.784 ± 0.377
2.192LysSer: 2.192 ± 0.343
2.666LysThr: 2.666 ± 0.422
2.488LysVal: 2.488 ± 0.38
0.474LysTrp: 0.474 ± 0.164
0.415LysTyr: 0.415 ± 0.134
0.0LysXaa: 0.0 ± 0.0
Leu
10.307LeuAla: 10.307 ± 0.982
0.77LeuCys: 0.77 ± 0.284
5.45LeuAsp: 5.45 ± 0.622
4.739LeuGlu: 4.739 ± 0.584
2.547LeuPhe: 2.547 ± 0.385
5.805LeuGly: 5.805 ± 0.942
1.007LeuHis: 1.007 ± 0.249
3.614LeuIle: 3.614 ± 0.491
2.37LeuLys: 2.37 ± 0.507
4.443LeuLeu: 4.443 ± 0.45
1.777LeuMet: 1.777 ± 0.305
2.429LeuAsn: 2.429 ± 0.459
5.213LeuPro: 5.213 ± 0.643
2.31LeuGln: 2.31 ± 0.396
6.042LeuArg: 6.042 ± 0.864
5.035LeuSer: 5.035 ± 0.567
6.279LeuThr: 6.279 ± 0.593
4.68LeuVal: 4.68 ± 0.577
1.718LeuTrp: 1.718 ± 0.382
1.303LeuTyr: 1.303 ± 0.244
0.0LeuXaa: 0.0 ± 0.0
Met
2.37MetAla: 2.37 ± 0.378
0.237MetCys: 0.237 ± 0.123
0.829MetAsp: 0.829 ± 0.201
1.126MetGlu: 1.126 ± 0.285
0.355MetPhe: 0.355 ± 0.134
1.244MetGly: 1.244 ± 0.348
0.415MetHis: 0.415 ± 0.15
0.592MetIle: 0.592 ± 0.167
0.592MetLys: 0.592 ± 0.151
1.599MetLeu: 1.599 ± 0.261
0.415MetMet: 0.415 ± 0.181
0.652MetAsn: 0.652 ± 0.171
0.948MetPro: 0.948 ± 0.237
0.829MetGln: 0.829 ± 0.356
1.896MetArg: 1.896 ± 0.33
1.896MetSer: 1.896 ± 0.312
3.258MetThr: 3.258 ± 0.41
1.244MetVal: 1.244 ± 0.245
0.711MetTrp: 0.711 ± 0.201
0.296MetTyr: 0.296 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
4.087AsnAla: 4.087 ± 0.82
0.178AsnCys: 0.178 ± 0.114
1.777AsnAsp: 1.777 ± 0.343
1.185AsnGlu: 1.185 ± 0.255
0.948AsnPhe: 0.948 ± 0.261
3.14AsnGly: 3.14 ± 0.498
0.829AsnHis: 0.829 ± 0.257
1.126AsnIle: 1.126 ± 0.309
0.77AsnLys: 0.77 ± 0.233
2.192AsnLeu: 2.192 ± 0.414
0.474AsnMet: 0.474 ± 0.144
0.948AsnAsn: 0.948 ± 0.268
2.37AsnPro: 2.37 ± 0.372
1.126AsnGln: 1.126 ± 0.291
2.31AsnArg: 2.31 ± 0.348
2.37AsnSer: 2.37 ± 0.397
1.718AsnThr: 1.718 ± 0.362
1.599AsnVal: 1.599 ± 0.289
0.592AsnTrp: 0.592 ± 0.208
0.533AsnTyr: 0.533 ± 0.146
0.0AsnXaa: 0.0 ± 0.0
Pro
6.102ProAla: 6.102 ± 0.628
0.415ProCys: 0.415 ± 0.16
5.568ProAsp: 5.568 ± 0.773
4.265ProGlu: 4.265 ± 0.505
0.948ProPhe: 0.948 ± 0.204
5.094ProGly: 5.094 ± 0.491
1.185ProHis: 1.185 ± 0.276
2.014ProIle: 2.014 ± 0.352
2.133ProLys: 2.133 ± 0.426
3.791ProLeu: 3.791 ± 0.499
0.948ProMet: 0.948 ± 0.237
1.599ProAsn: 1.599 ± 0.342
3.436ProPro: 3.436 ± 0.516
1.54ProGln: 1.54 ± 0.223
3.85ProArg: 3.85 ± 0.592
3.199ProSer: 3.199 ± 0.427
3.554ProThr: 3.554 ± 0.412
3.732ProVal: 3.732 ± 0.559
1.422ProTrp: 1.422 ± 0.333
1.185ProTyr: 1.185 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
3.969GlnAla: 3.969 ± 0.658
0.237GlnCys: 0.237 ± 0.119
1.54GlnAsp: 1.54 ± 0.273
1.362GlnGlu: 1.362 ± 0.263
1.718GlnPhe: 1.718 ± 0.299
2.725GlnGly: 2.725 ± 0.655
0.889GlnHis: 0.889 ± 0.212
2.784GlnIle: 2.784 ± 0.523
1.007GlnLys: 1.007 ± 0.395
3.258GlnLeu: 3.258 ± 0.698
0.652GlnMet: 0.652 ± 0.19
1.362GlnAsn: 1.362 ± 0.325
1.659GlnPro: 1.659 ± 0.265
2.251GlnGln: 2.251 ± 0.698
3.436GlnArg: 3.436 ± 0.566
2.962GlnSer: 2.962 ± 0.454
2.31GlnThr: 2.31 ± 0.36
2.725GlnVal: 2.725 ± 0.416
1.007GlnTrp: 1.007 ± 0.216
0.77GlnTyr: 0.77 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
7.819ArgAla: 7.819 ± 0.791
1.066ArgCys: 1.066 ± 0.275
5.035ArgAsp: 5.035 ± 0.612
4.68ArgGlu: 4.68 ± 0.722
2.488ArgPhe: 2.488 ± 0.352
4.265ArgGly: 4.265 ± 0.503
1.896ArgHis: 1.896 ± 0.447
3.554ArgIle: 3.554 ± 0.469
3.199ArgLys: 3.199 ± 0.513
5.983ArgLeu: 5.983 ± 0.682
2.606ArgMet: 2.606 ± 0.354
2.784ArgAsn: 2.784 ± 0.406
4.502ArgPro: 4.502 ± 0.589
2.962ArgGln: 2.962 ± 0.463
6.99ArgArg: 6.99 ± 0.887
4.265ArgSer: 4.265 ± 0.409
4.798ArgThr: 4.798 ± 0.637
4.502ArgVal: 4.502 ± 0.653
1.718ArgTrp: 1.718 ± 0.338
2.251ArgTyr: 2.251 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
7.76SerAla: 7.76 ± 1.048
0.178SerCys: 0.178 ± 0.108
3.791SerAsp: 3.791 ± 0.337
3.554SerGlu: 3.554 ± 0.343
1.718SerPhe: 1.718 ± 0.266
7.049SerGly: 7.049 ± 0.779
0.948SerHis: 0.948 ± 0.21
2.31SerIle: 2.31 ± 0.402
1.836SerLys: 1.836 ± 0.442
4.265SerLeu: 4.265 ± 0.594
1.718SerMet: 1.718 ± 0.299
1.303SerAsn: 1.303 ± 0.29
2.666SerPro: 2.666 ± 0.32
2.843SerGln: 2.843 ± 0.407
3.554SerArg: 3.554 ± 0.457
3.436SerSer: 3.436 ± 0.565
3.614SerThr: 3.614 ± 0.506
3.732SerVal: 3.732 ± 0.439
1.185SerTrp: 1.185 ± 0.225
1.126SerTyr: 1.126 ± 0.221
0.0SerXaa: 0.0 ± 0.0
Thr
9.123ThrAla: 9.123 ± 0.794
0.77ThrCys: 0.77 ± 0.229
4.206ThrAsp: 4.206 ± 0.417
3.199ThrGlu: 3.199 ± 0.492
1.659ThrPhe: 1.659 ± 0.272
5.924ThrGly: 5.924 ± 0.776
1.126ThrHis: 1.126 ± 0.278
3.614ThrIle: 3.614 ± 0.437
1.481ThrLys: 1.481 ± 0.265
6.22ThrLeu: 6.22 ± 0.586
0.889ThrMet: 0.889 ± 0.268
2.014ThrAsn: 2.014 ± 0.323
4.798ThrPro: 4.798 ± 0.658
1.896ThrGln: 1.896 ± 0.345
4.858ThrArg: 4.858 ± 0.773
3.791ThrSer: 3.791 ± 0.458
5.45ThrThr: 5.45 ± 0.644
5.568ThrVal: 5.568 ± 0.678
1.244ThrTrp: 1.244 ± 0.219
1.066ThrTyr: 1.066 ± 0.236
0.0ThrXaa: 0.0 ± 0.0
Val
7.464ValAla: 7.464 ± 0.53
0.889ValCys: 0.889 ± 0.223
5.687ValAsp: 5.687 ± 0.674
3.673ValGlu: 3.673 ± 0.56
1.422ValPhe: 1.422 ± 0.322
4.917ValGly: 4.917 ± 0.542
1.422ValHis: 1.422 ± 0.33
2.666ValIle: 2.666 ± 0.461
2.192ValLys: 2.192 ± 0.458
4.798ValLeu: 4.798 ± 0.473
1.185ValMet: 1.185 ± 0.263
2.37ValAsn: 2.37 ± 0.422
4.384ValPro: 4.384 ± 0.46
2.37ValGln: 2.37 ± 0.468
4.798ValArg: 4.798 ± 0.55
4.561ValSer: 4.561 ± 0.496
5.331ValThr: 5.331 ± 0.551
5.331ValVal: 5.331 ± 0.531
1.244ValTrp: 1.244 ± 0.286
1.007ValTyr: 1.007 ± 0.234
0.0ValXaa: 0.0 ± 0.0
Trp
2.903TrpAla: 2.903 ± 0.506
0.296TrpCys: 0.296 ± 0.117
1.303TrpAsp: 1.303 ± 0.244
1.126TrpGlu: 1.126 ± 0.29
0.355TrpPhe: 0.355 ± 0.167
1.362TrpGly: 1.362 ± 0.22
0.77TrpHis: 0.77 ± 0.262
0.592TrpIle: 0.592 ± 0.192
0.711TrpLys: 0.711 ± 0.197
2.31TrpLeu: 2.31 ± 0.398
0.592TrpMet: 0.592 ± 0.192
0.592TrpAsn: 0.592 ± 0.188
0.592TrpPro: 0.592 ± 0.228
1.007TrpGln: 1.007 ± 0.253
1.422TrpArg: 1.422 ± 0.28
1.007TrpSer: 1.007 ± 0.221
0.889TrpThr: 0.889 ± 0.226
1.599TrpVal: 1.599 ± 0.273
0.533TrpTrp: 0.533 ± 0.218
0.415TrpTyr: 0.415 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.251TyrAla: 2.251 ± 0.404
0.237TyrCys: 0.237 ± 0.141
1.54TyrAsp: 1.54 ± 0.243
0.829TyrGlu: 0.829 ± 0.163
0.77TyrPhe: 0.77 ± 0.182
1.54TyrGly: 1.54 ± 0.229
0.415TyrHis: 0.415 ± 0.177
0.829TyrIle: 0.829 ± 0.245
0.652TyrLys: 0.652 ± 0.228
1.659TyrLeu: 1.659 ± 0.311
0.355TyrMet: 0.355 ± 0.131
0.652TyrAsn: 0.652 ± 0.154
0.889TyrPro: 0.889 ± 0.249
0.829TyrGln: 0.829 ± 0.196
2.073TyrArg: 2.073 ± 0.417
1.303TyrSer: 1.303 ± 0.306
1.362TyrThr: 1.362 ± 0.264
1.54TyrVal: 1.54 ± 0.265
0.415TyrTrp: 0.415 ± 0.163
0.415TyrTyr: 0.415 ± 0.149
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (16882 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski