Amino acid dipeptide frequency for Ralstonia phage Gervaise

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility overrepresented dipeptides are marked in red and underrepresented ones in blue in the table.
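The uniform expectation described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and it assumes that "more than expected" means "above the uniform 1/400 value", which the page does not state explicitly. The example values are copied from the table on this page.

```python
# Uniform expectation for dipeptide frequencies, expressed in per mille.
N_STANDARD = 20  # standard amino acids; use 21 to include Xaa


def uniform_expectation_permille(alphabet_size: int = N_STANDARD) -> float:
    """Expected frequency (per mille) if all dipeptides were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_permille: float, expected: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


expected = uniform_expectation_permille()  # 1000 / 400 per mille, i.e. 0.25%

# A few values copied from the table on this page (per mille).
examples = {"AlaAla": 21.931, "CysCys": 0.213, "GlyGly": 8.251, "TrpTrp": 0.053}
labels = {dp: classify(value, expected) for dp, value in examples.items()}

for dp, label in labels.items():
    print(f"{dp}: {examples[dp]} vs {expected} -> {label}")
```

With Xaa included, the same function can be called as `uniform_expectation_permille(21)`, which gives 1000/441 per mille.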

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
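As a rough illustration of how such per mille values could be obtained, the sketch below counts overlapping dipeptides in a set of protein sequences. It is not the database's own code: the function names are hypothetical, it uses one-letter codes rather than the three-letter codes of the table, and it assumes that dipeptides do not span protein boundaries and that frequencies are normalized by the total number of dipeptides counted.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_counts(sequence: str) -> Counter:
    """Count overlapping dipeptides in one protein; non-standard residues become 'X'."""
    seq = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def dipeptide_permille(proteins: list[str]) -> dict[str, float]:
    """Pool counts over all proteins and return frequencies in per mille."""
    total = Counter()
    for protein in proteins:
        total.update(dipeptide_counts(protein))
    n = sum(total.values())
    if n == 0:
        return {}
    return {dp: 1000.0 * count / n for dp, count in total.items()}


# Toy input, not real phage proteins.
toy = ["MAAG", "AAK"]
freqs = dipeptide_permille(toy)
for dp, value in sorted(freqs.items()):
    print(f"{dp}: {value:.1f} per mille = {value * 1e-3:.3f} as a fraction")
```

The last line of the loop shows the conversion mentioned above: a value in per mille multiplied by 10⁻³ gives the plain fraction.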

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.931 ± 1.627
AlaCys: 1.011 ± 0.303
AlaAsp: 6.228 ± 0.561
AlaGlu: 8.996 ± 0.873
AlaPhe: 3.673 ± 0.397
AlaGly: 11.232 ± 1.223
AlaHis: 2.502 ± 0.322
AlaIle: 5.855 ± 0.58
AlaLys: 6.175 ± 0.557
AlaLeu: 9.635 ± 0.733
AlaMet: 3.779 ± 0.509
AlaAsn: 4.684 ± 0.653
AlaPro: 7.506 ± 0.979
AlaGln: 5.909 ± 0.46
AlaArg: 8.783 ± 0.794
AlaSer: 7.719 ± 0.654
AlaThr: 5.962 ± 0.584
AlaVal: 7.559 ± 0.602
AlaTrp: 1.703 ± 0.319
AlaTyr: 3.513 ± 0.36
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.692 ± 0.246
CysCys: 0.213 ± 0.103
CysAsp: 0.266 ± 0.146
CysGlu: 0.213 ± 0.131
CysPhe: 0.16 ± 0.127
CysGly: 0.692 ± 0.27
CysHis: 0.266 ± 0.144
CysIle: 0.319 ± 0.15
CysLys: 0.426 ± 0.193
CysLeu: 0.586 ± 0.261
CysMet: 0.16 ± 0.111
CysAsn: 0.319 ± 0.144
CysPro: 0.266 ± 0.167
CysGln: 0.053 ± 0.063
CysArg: 0.692 ± 0.264
CysSer: 0.319 ± 0.155
CysThr: 0.319 ± 0.192
CysVal: 0.586 ± 0.247
CysTrp: 0.213 ± 0.103
CysTyr: 0.426 ± 0.236
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.623 ± 0.686
AspCys: 0.266 ± 0.149
AspAsp: 3.939 ± 0.782
AspGlu: 4.471 ± 0.505
AspPhe: 1.863 ± 0.323
AspGly: 7.346 ± 0.675
AspHis: 1.331 ± 0.249
AspIle: 3.354 ± 0.417
AspLys: 2.182 ± 0.36
AspLeu: 3.992 ± 0.461
AspMet: 1.224 ± 0.221
AspAsn: 2.023 ± 0.331
AspPro: 2.981 ± 0.468
AspGln: 1.49 ± 0.336
AspArg: 4.578 ± 0.471
AspSer: 2.502 ± 0.464
AspThr: 3.034 ± 0.269
AspVal: 3.566 ± 0.521
AspTrp: 0.958 ± 0.235
AspTyr: 1.863 ± 0.269
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.878 ± 0.79
GluCys: 0.373 ± 0.15
GluAsp: 3.566 ± 0.379
GluGlu: 3.779 ± 0.61
GluPhe: 2.874 ± 0.45
GluGly: 4.205 ± 0.653
GluHis: 1.384 ± 0.282
GluIle: 3.141 ± 0.349
GluLys: 2.715 ± 0.65
GluLeu: 5.642 ± 0.503
GluMet: 2.023 ± 0.317
GluAsn: 2.182 ± 0.288
GluPro: 3.194 ± 0.398
GluGln: 3.62 ± 0.485
GluArg: 6.547 ± 0.694
GluSer: 2.821 ± 0.422
GluThr: 2.874 ± 0.394
GluVal: 2.608 ± 0.373
GluTrp: 0.852 ± 0.204
GluTyr: 1.703 ± 0.381
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.513 ± 0.36
PheCys: 0.319 ± 0.132
PheAsp: 2.715 ± 0.352
PheGlu: 1.863 ± 0.277
PhePhe: 0.798 ± 0.263
PheGly: 2.449 ± 0.305
PheHis: 0.479 ± 0.169
PheIle: 1.65 ± 0.379
PheLys: 1.011 ± 0.271
PheLeu: 1.703 ± 0.276
PheMet: 1.011 ± 0.249
PheAsn: 1.171 ± 0.281
PhePro: 1.224 ± 0.258
PheGln: 1.171 ± 0.285
PheArg: 1.757 ± 0.424
PheSer: 1.544 ± 0.263
PheThr: 1.49 ± 0.303
PheVal: 2.981 ± 0.412
PheTrp: 0.479 ± 0.158
PheTyr: 1.171 ± 0.237
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.391 ± 1.341
GlyCys: 0.745 ± 0.299
GlyAsp: 6.388 ± 0.641
GlyGlu: 5.483 ± 0.587
GlyPhe: 2.182 ± 0.331
GlyGly: 8.251 ± 0.989
GlyHis: 1.703 ± 0.33
GlyIle: 3.513 ± 0.376
GlyLys: 4.578 ± 0.603
GlyLeu: 5.163 ± 0.582
GlyMet: 2.023 ± 0.424
GlyAsn: 3.673 ± 0.604
GlyPro: 2.555 ± 0.365
GlyGln: 3.141 ± 0.339
GlyArg: 5.749 ± 0.687
GlySer: 4.046 ± 0.57
GlyThr: 5.696 ± 0.709
GlyVal: 5.642 ± 0.575
GlyTrp: 1.011 ± 0.198
GlyTyr: 2.236 ± 0.324
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.023 ± 0.321
HisCys: 0.373 ± 0.226
HisAsp: 1.331 ± 0.277
HisGlu: 1.597 ± 0.281
HisPhe: 0.958 ± 0.198
HisGly: 1.384 ± 0.323
HisHis: 0.532 ± 0.185
HisIle: 1.224 ± 0.206
HisLys: 0.745 ± 0.244
HisLeu: 1.384 ± 0.253
HisMet: 0.479 ± 0.177
HisAsn: 0.905 ± 0.251
HisPro: 0.905 ± 0.242
HisGln: 0.479 ± 0.126
HisArg: 1.224 ± 0.28
HisSer: 1.331 ± 0.296
HisThr: 0.745 ± 0.136
HisVal: 0.958 ± 0.253
HisTrp: 0.053 ± 0.063
HisTyr: 0.373 ± 0.14
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.068 ± 0.486
IleCys: 0.053 ± 0.058
IleAsp: 3.141 ± 0.38
IleGlu: 4.897 ± 0.646
IlePhe: 1.118 ± 0.263
IleGly: 3.939 ± 0.448
IleHis: 0.586 ± 0.275
IleIle: 1.65 ± 0.314
IleLys: 2.182 ± 0.322
IleLeu: 2.289 ± 0.296
IleMet: 1.384 ± 0.326
IleAsn: 1.97 ± 0.267
IlePro: 1.81 ± 0.259
IleGln: 1.81 ± 0.35
IleArg: 2.928 ± 0.426
IleSer: 2.874 ± 0.437
IleThr: 3.194 ± 0.694
IleVal: 3.46 ± 0.464
IleTrp: 0.319 ± 0.094
IleTyr: 1.065 ± 0.25
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.707 ± 0.494
LysCys: 0.373 ± 0.178
LysAsp: 2.395 ± 0.448
LysGlu: 2.715 ± 0.479
LysPhe: 1.331 ± 0.276
LysGly: 3.141 ± 0.566
LysHis: 0.639 ± 0.151
LysIle: 1.81 ± 0.405
LysLys: 2.182 ± 0.393
LysLeu: 3.566 ± 0.409
LysMet: 1.171 ± 0.239
LysAsn: 1.384 ± 0.285
LysPro: 2.874 ± 0.44
LysGln: 2.449 ± 0.61
LysArg: 4.258 ± 0.531
LysSer: 2.395 ± 0.485
LysThr: 3.087 ± 0.407
LysVal: 3.034 ± 0.463
LysTrp: 0.798 ± 0.225
LysTyr: 0.639 ± 0.143
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.772 ± 0.636
LeuCys: 0.373 ± 0.18
LeuAsp: 6.122 ± 0.499
LeuGlu: 3.46 ± 0.459
LeuPhe: 2.449 ± 0.395
LeuGly: 6.015 ± 0.591
LeuHis: 1.278 ± 0.28
LeuIle: 3.3 ± 0.44
LeuLys: 3.992 ± 0.585
LeuLeu: 4.152 ± 0.42
LeuMet: 1.49 ± 0.296
LeuAsn: 2.129 ± 0.344
LeuPro: 4.152 ± 0.74
LeuGln: 2.289 ± 0.318
LeuArg: 5.057 ± 0.532
LeuSer: 4.631 ± 0.606
LeuThr: 4.95 ± 0.524
LeuVal: 3.726 ± 0.494
LeuTrp: 0.692 ± 0.234
LeuTyr: 1.65 ± 0.301
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.555 ± 0.378
MetCys: 0.106 ± 0.087
MetAsp: 1.278 ± 0.244
MetGlu: 1.437 ± 0.207
MetPhe: 0.692 ± 0.219
MetGly: 1.916 ± 0.334
MetHis: 0.426 ± 0.148
MetIle: 0.905 ± 0.18
MetLys: 1.65 ± 0.263
MetLeu: 2.129 ± 0.326
MetMet: 0.798 ± 0.179
MetAsn: 0.905 ± 0.214
MetPro: 1.81 ± 0.233
MetGln: 1.331 ± 0.314
MetArg: 2.342 ± 0.378
MetSer: 1.916 ± 0.343
MetThr: 2.076 ± 0.324
MetVal: 1.437 ± 0.338
MetTrp: 0.319 ± 0.111
MetTyr: 0.639 ± 0.183
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.525 ± 0.551
AsnCys: 0.16 ± 0.099
AsnAsp: 2.182 ± 0.279
AsnGlu: 1.81 ± 0.35
AsnPhe: 0.798 ± 0.233
AsnGly: 4.738 ± 0.727
AsnHis: 0.745 ± 0.243
AsnIle: 1.49 ± 0.346
AsnLys: 1.437 ± 0.294
AsnLeu: 2.129 ± 0.282
AsnMet: 1.118 ± 0.294
AsnAsn: 1.544 ± 0.283
AsnPro: 2.076 ± 0.35
AsnGln: 1.437 ± 0.402
AsnArg: 1.916 ± 0.278
AsnSer: 1.757 ± 0.323
AsnThr: 2.236 ± 0.482
AsnVal: 2.928 ± 0.415
AsnTrp: 0.745 ± 0.182
AsnTyr: 0.639 ± 0.213
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.677 ± 1.059
ProCys: 0.373 ± 0.164
ProAsp: 2.928 ± 0.365
ProGlu: 3.513 ± 0.431
ProPhe: 1.597 ± 0.342
ProGly: 3.992 ± 0.662
ProHis: 0.692 ± 0.22
ProIle: 2.289 ± 0.398
ProLys: 2.182 ± 0.382
ProLeu: 2.821 ± 0.322
ProMet: 1.065 ± 0.236
ProAsn: 1.384 ± 0.262
ProPro: 2.023 ± 0.313
ProGln: 1.81 ± 0.45
ProArg: 3.034 ± 0.437
ProSer: 3.46 ± 0.575
ProThr: 3.247 ± 0.496
ProVal: 2.555 ± 0.385
ProTrp: 0.373 ± 0.142
ProTyr: 0.852 ± 0.234
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.814 ± 0.68
GlnCys: 0.106 ± 0.075
GlnAsp: 1.757 ± 0.342
GlnGlu: 2.342 ± 0.369
GlnPhe: 0.852 ± 0.268
GlnGly: 2.821 ± 0.467
GlnHis: 0.905 ± 0.241
GlnIle: 1.863 ± 0.358
GlnLys: 1.703 ± 0.307
GlnLeu: 3.141 ± 0.422
GlnMet: 1.49 ± 0.342
GlnAsn: 1.118 ± 0.253
GlnPro: 1.863 ± 0.27
GlnGln: 3.141 ± 0.569
GlnArg: 2.236 ± 0.298
GlnSer: 2.076 ± 0.34
GlnThr: 1.757 ± 0.33
GlnVal: 2.928 ± 0.291
GlnTrp: 0.745 ± 0.163
GlnTyr: 1.224 ± 0.294
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.89 ± 0.721
ArgCys: 0.373 ± 0.136
ArgAsp: 4.365 ± 0.61
ArgGlu: 4.525 ± 0.575
ArgPhe: 3.034 ± 0.487
ArgGly: 4.152 ± 0.493
ArgHis: 1.278 ± 0.293
ArgIle: 3.886 ± 0.506
ArgLys: 3.779 ± 0.542
ArgLeu: 6.281 ± 0.527
ArgMet: 2.236 ± 0.324
ArgAsn: 2.662 ± 0.351
ArgPro: 2.768 ± 0.442
ArgGln: 3.247 ± 0.518
ArgArg: 4.578 ± 0.628
ArgSer: 2.981 ± 0.401
ArgThr: 2.768 ± 0.375
ArgVal: 4.365 ± 0.515
ArgTrp: 1.118 ± 0.258
ArgTyr: 2.555 ± 0.329
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.133 ± 0.896
SerCys: 0.479 ± 0.187
SerAsp: 3.354 ± 0.393
SerGlu: 2.874 ± 0.429
SerPhe: 1.437 ± 0.278
SerGly: 5.589 ± 0.679
SerHis: 1.278 ± 0.243
SerIle: 2.662 ± 0.368
SerLys: 2.449 ± 0.302
SerLeu: 3.886 ± 0.347
SerMet: 1.011 ± 0.247
SerAsn: 1.81 ± 0.43
SerPro: 2.608 ± 0.329
SerGln: 2.449 ± 0.329
SerArg: 3.087 ± 0.38
SerSer: 3.087 ± 0.368
SerThr: 3.726 ± 0.464
SerVal: 3.3 ± 0.433
SerTrp: 0.905 ± 0.181
SerTyr: 1.171 ± 0.208
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.973 ± 0.677
ThrCys: 0.266 ± 0.163
ThrAsp: 2.395 ± 0.369
ThrGlu: 3.939 ± 0.375
ThrPhe: 1.65 ± 0.28
ThrGly: 5.642 ± 0.657
ThrHis: 1.49 ± 0.315
ThrIle: 3.141 ± 0.477
ThrLys: 3.3 ± 0.375
ThrLeu: 4.631 ± 0.555
ThrMet: 1.597 ± 0.251
ThrAsn: 1.916 ± 0.359
ThrPro: 3.194 ± 0.333
ThrGln: 1.544 ± 0.343
ThrArg: 2.821 ± 0.607
ThrSer: 2.874 ± 0.52
ThrThr: 3.194 ± 0.677
ThrVal: 4.205 ± 0.467
ThrTrp: 0.692 ± 0.25
ThrTyr: 1.437 ± 0.257
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.144 ± 0.681
ValCys: 0.639 ± 0.238
ValAsp: 4.205 ± 0.528
ValGlu: 4.099 ± 0.663
ValPhe: 1.597 ± 0.388
ValGly: 4.631 ± 0.477
ValHis: 0.852 ± 0.213
ValIle: 2.662 ± 0.476
ValLys: 3.087 ± 0.64
ValLeu: 3.034 ± 0.278
ValMet: 1.544 ± 0.244
ValAsn: 2.928 ± 0.37
ValPro: 3.939 ± 0.496
ValGln: 2.023 ± 0.268
ValArg: 4.631 ± 0.514
ValSer: 3.833 ± 0.44
ValThr: 3.939 ± 0.428
ValVal: 4.258 ± 0.67
ValTrp: 0.798 ± 0.253
ValTyr: 1.544 ± 0.264
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.011 ± 0.22
TrpCys: 0.213 ± 0.111
TrpAsp: 0.745 ± 0.246
TrpGlu: 0.426 ± 0.171
TrpPhe: 0.426 ± 0.157
TrpGly: 1.065 ± 0.217
TrpHis: 0.213 ± 0.124
TrpIle: 0.692 ± 0.149
TrpLys: 0.426 ± 0.144
TrpLeu: 1.437 ± 0.243
TrpMet: 0.479 ± 0.166
TrpAsn: 0.692 ± 0.238
TrpPro: 0.213 ± 0.124
TrpGln: 0.692 ± 0.169
TrpArg: 1.278 ± 0.295
TrpSer: 0.852 ± 0.283
TrpThr: 1.065 ± 0.181
TrpVal: 0.905 ± 0.223
TrpTrp: 0.053 ± 0.06
TrpTyr: 0.266 ± 0.124
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.034 ± 0.412
TyrCys: 0.373 ± 0.157
TyrAsp: 2.289 ± 0.275
TyrGlu: 1.544 ± 0.24
TyrPhe: 1.011 ± 0.234
TyrGly: 1.97 ± 0.466
TyrHis: 0.426 ± 0.126
TyrIle: 1.331 ± 0.262
TyrLys: 0.745 ± 0.184
TyrLeu: 2.076 ± 0.363
TyrMet: 0.532 ± 0.196
TyrAsn: 1.011 ± 0.275
TyrPro: 0.905 ± 0.177
TyrGln: 0.852 ± 0.183
TyrArg: 2.342 ± 0.363
TyrSer: 1.278 ± 0.315
TyrThr: 1.544 ± 0.28
TyrVal: 1.384 ± 0.284
TyrTrp: 0.319 ± 0.194
TyrTyr: 0.479 ± 0.141
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (18787 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
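A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the pooled frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names are hypothetical, and reporting the standard deviation of the replicates is an assumption; the page does not say which spread measure is used.

```python
import random
import statistics


def pooled_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide pooled over a list of proteins."""
    hits = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pooled_permille(sample, dipeptide))
    return statistics.stdev(values)


# Toy input in one-letter code, not real phage proteins.
toy = ["MAAG", "AAK", "MKKL"]
print(pooled_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the error reflects variation between proteins.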

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski