Amino acid dipepetide frequency for Pseudomonas phage phiPsa381

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.438AlaAla: 8.438 ± 0.654
1.185AlaCys: 1.185 ± 0.221
5.16AlaAsp: 5.16 ± 0.397
5.37AlaGlu: 5.37 ± 0.579
2.824AlaPhe: 2.824 ± 0.271
6.067AlaGly: 6.067 ± 0.559
1.534AlaHis: 1.534 ± 0.261
5.439AlaIle: 5.439 ± 0.505
5.997AlaLys: 5.997 ± 0.521
8.298AlaLeu: 8.298 ± 0.607
2.162AlaMet: 2.162 ± 0.282
3.835AlaAsn: 3.835 ± 0.418
2.406AlaPro: 2.406 ± 0.339
4.358AlaGln: 4.358 ± 0.355
4.289AlaArg: 4.289 ± 0.359
4.533AlaSer: 4.533 ± 0.555
5.544AlaThr: 5.544 ± 0.56
5.893AlaVal: 5.893 ± 0.463
1.43AlaTrp: 1.43 ± 0.197
3.452AlaTyr: 3.452 ± 0.322
0.0AlaXaa: 0.0 ± 0.0
Cys
0.767CysAla: 0.767 ± 0.159
0.384CysCys: 0.384 ± 0.138
1.151CysAsp: 1.151 ± 0.202
0.662CysGlu: 0.662 ± 0.148
0.488CysPhe: 0.488 ± 0.126
0.837CysGly: 0.837 ± 0.191
0.523CysHis: 0.523 ± 0.126
0.384CysIle: 0.384 ± 0.102
0.767CysLys: 0.767 ± 0.202
0.837CysLeu: 0.837 ± 0.178
0.628CysMet: 0.628 ± 0.151
0.907CysAsn: 0.907 ± 0.203
0.697CysPro: 0.697 ± 0.185
0.384CysGln: 0.384 ± 0.125
0.802CysArg: 0.802 ± 0.153
0.593CysSer: 0.593 ± 0.147
0.941CysThr: 0.941 ± 0.186
0.697CysVal: 0.697 ± 0.168
0.07CysTrp: 0.07 ± 0.05
0.662CysTyr: 0.662 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
5.056AspAla: 5.056 ± 0.497
0.976AspCys: 0.976 ± 0.202
3.103AspAsp: 3.103 ± 0.33
3.731AspGlu: 3.731 ± 0.43
2.755AspPhe: 2.755 ± 0.273
4.637AspGly: 4.637 ± 0.427
1.151AspHis: 1.151 ± 0.216
3.138AspIle: 3.138 ± 0.301
4.184AspLys: 4.184 ± 0.48
5.091AspLeu: 5.091 ± 0.41
2.057AspMet: 2.057 ± 0.308
3.033AspAsn: 3.033 ± 0.361
2.092AspPro: 2.092 ± 0.253
2.127AspGln: 2.127 ± 0.289
2.999AspArg: 2.999 ± 0.347
3.382AspSer: 3.382 ± 0.348
3.347AspThr: 3.347 ± 0.404
4.393AspVal: 4.393 ± 0.373
1.185AspTrp: 1.185 ± 0.213
2.371AspTyr: 2.371 ± 0.263
0.0AspXaa: 0.0 ± 0.0
Glu
6.241GluAla: 6.241 ± 0.488
0.872GluCys: 0.872 ± 0.192
4.01GluAsp: 4.01 ± 0.397
5.37GluGlu: 5.37 ± 0.602
3.278GluPhe: 3.278 ± 0.3
3.975GluGly: 3.975 ± 0.416
1.464GluHis: 1.464 ± 0.186
4.114GluIle: 4.114 ± 0.376
3.243GluLys: 3.243 ± 0.382
5.614GluLeu: 5.614 ± 0.491
2.65GluMet: 2.65 ± 0.308
2.51GluAsn: 2.51 ± 0.295
1.639GluPro: 1.639 ± 0.258
2.789GluGln: 2.789 ± 0.275
3.452GluArg: 3.452 ± 0.41
3.626GluSer: 3.626 ± 0.363
2.824GluThr: 2.824 ± 0.317
4.498GluVal: 4.498 ± 0.46
1.011GluTrp: 1.011 ± 0.188
2.406GluTyr: 2.406 ± 0.257
0.0GluXaa: 0.0 ± 0.0
Phe
2.894PheAla: 2.894 ± 0.401
0.279PheCys: 0.279 ± 0.102
2.51PheAsp: 2.51 ± 0.279
3.103PheGlu: 3.103 ± 0.272
1.43PhePhe: 1.43 ± 0.207
2.999PheGly: 2.999 ± 0.341
0.732PheHis: 0.732 ± 0.161
1.987PheIle: 1.987 ± 0.31
1.883PheLys: 1.883 ± 0.251
2.72PheLeu: 2.72 ± 0.296
1.151PheMet: 1.151 ± 0.2
2.266PheAsn: 2.266 ± 0.285
1.639PhePro: 1.639 ± 0.243
1.395PheGln: 1.395 ± 0.198
1.569PheArg: 1.569 ± 0.235
2.476PheSer: 2.476 ± 0.262
2.685PheThr: 2.685 ± 0.31
2.545PheVal: 2.545 ± 0.317
0.279PheTrp: 0.279 ± 0.134
1.534PheTyr: 1.534 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
5.335GlyAla: 5.335 ± 0.545
1.081GlyCys: 1.081 ± 0.184
3.94GlyAsp: 3.94 ± 0.437
4.289GlyGlu: 4.289 ± 0.4
3.278GlyPhe: 3.278 ± 0.314
6.032GlyGly: 6.032 ± 0.65
1.36GlyHis: 1.36 ± 0.248
4.254GlyIle: 4.254 ± 0.421
4.463GlyLys: 4.463 ± 0.505
5.893GlyLeu: 5.893 ± 0.483
2.127GlyMet: 2.127 ± 0.313
3.312GlyAsn: 3.312 ± 0.48
1.604GlyPro: 1.604 ± 0.293
2.72GlyGln: 2.72 ± 0.344
3.138GlyArg: 3.138 ± 0.266
4.951GlySer: 4.951 ± 0.42
4.219GlyThr: 4.219 ± 0.473
5.927GlyVal: 5.927 ± 0.555
1.464GlyTrp: 1.464 ± 0.243
3.696GlyTyr: 3.696 ± 0.408
0.0GlyXaa: 0.0 ± 0.0
His
1.778HisAla: 1.778 ± 0.222
0.244HisCys: 0.244 ± 0.092
0.941HisAsp: 0.941 ± 0.157
1.081HisGlu: 1.081 ± 0.172
1.151HisPhe: 1.151 ± 0.224
1.709HisGly: 1.709 ± 0.223
0.418HisHis: 0.418 ± 0.148
1.116HisIle: 1.116 ± 0.155
1.151HisLys: 1.151 ± 0.221
1.185HisLeu: 1.185 ± 0.215
0.593HisMet: 0.593 ± 0.145
0.837HisAsn: 0.837 ± 0.172
0.732HisPro: 0.732 ± 0.15
0.593HisGln: 0.593 ± 0.147
0.732HisArg: 0.732 ± 0.161
0.907HisSer: 0.907 ± 0.162
1.499HisThr: 1.499 ± 0.261
1.36HisVal: 1.36 ± 0.202
0.558HisTrp: 0.558 ± 0.137
1.011HisTyr: 1.011 ± 0.191
0.0HisXaa: 0.0 ± 0.0
Ile
4.672IleAla: 4.672 ± 0.471
0.837IleCys: 0.837 ± 0.155
3.731IleAsp: 3.731 ± 0.41
3.696IleGlu: 3.696 ± 0.35
1.499IlePhe: 1.499 ± 0.238
4.114IleGly: 4.114 ± 0.422
1.29IleHis: 1.29 ± 0.223
2.824IleIle: 2.824 ± 0.314
4.114IleLys: 4.114 ± 0.359
3.801IleLeu: 3.801 ± 0.342
1.709IleMet: 1.709 ± 0.226
3.138IleAsn: 3.138 ± 0.38
2.197IlePro: 2.197 ± 0.258
2.266IleGln: 2.266 ± 0.301
2.789IleArg: 2.789 ± 0.28
3.208IleSer: 3.208 ± 0.322
4.358IleThr: 4.358 ± 0.389
4.01IleVal: 4.01 ± 0.405
0.628IleTrp: 0.628 ± 0.152
1.639IleTyr: 1.639 ± 0.179
0.0IleXaa: 0.0 ± 0.0
Lys
6.555LysAla: 6.555 ± 0.493
0.558LysCys: 0.558 ± 0.165
3.452LysAsp: 3.452 ± 0.342
4.114LysGlu: 4.114 ± 0.46
1.918LysPhe: 1.918 ± 0.244
4.254LysGly: 4.254 ± 0.512
1.011LysHis: 1.011 ± 0.229
3.766LysIle: 3.766 ± 0.379
3.103LysLys: 3.103 ± 0.384
4.986LysLeu: 4.986 ± 0.451
2.755LysMet: 2.755 ± 0.312
2.092LysAsn: 2.092 ± 0.303
2.371LysPro: 2.371 ± 0.355
2.72LysGln: 2.72 ± 0.309
2.755LysArg: 2.755 ± 0.366
3.138LysSer: 3.138 ± 0.32
3.138LysThr: 3.138 ± 0.403
4.637LysVal: 4.637 ± 0.444
0.872LysTrp: 0.872 ± 0.196
2.232LysTyr: 2.232 ± 0.231
0.0LysXaa: 0.0 ± 0.0
Leu
7.775LeuAla: 7.775 ± 0.57
0.872LeuCys: 0.872 ± 0.181
5.823LeuAsp: 5.823 ± 0.385
5.962LeuGlu: 5.962 ± 0.42
2.894LeuPhe: 2.894 ± 0.361
4.847LeuGly: 4.847 ± 0.467
1.569LeuHis: 1.569 ± 0.231
4.358LeuIle: 4.358 ± 0.359
4.986LeuLys: 4.986 ± 0.438
5.37LeuLeu: 5.37 ± 0.481
2.476LeuMet: 2.476 ± 0.278
3.766LeuAsn: 3.766 ± 0.416
3.522LeuPro: 3.522 ± 0.31
3.452LeuGln: 3.452 ± 0.323
3.417LeuArg: 3.417 ± 0.272
4.428LeuSer: 4.428 ± 0.426
6.137LeuThr: 6.137 ± 0.526
4.01LeuVal: 4.01 ± 0.355
1.081LeuTrp: 1.081 ± 0.197
2.301LeuTyr: 2.301 ± 0.249
0.0LeuXaa: 0.0 ± 0.0
Met
3.766MetAla: 3.766 ± 0.411
0.384MetCys: 0.384 ± 0.13
1.081MetAsp: 1.081 ± 0.179
1.883MetGlu: 1.883 ± 0.254
1.081MetPhe: 1.081 ± 0.209
1.499MetGly: 1.499 ± 0.228
0.488MetHis: 0.488 ± 0.174
2.057MetIle: 2.057 ± 0.244
2.58MetLys: 2.58 ± 0.329
2.406MetLeu: 2.406 ± 0.318
1.046MetMet: 1.046 ± 0.208
1.499MetAsn: 1.499 ± 0.229
0.837MetPro: 0.837 ± 0.157
1.325MetGln: 1.325 ± 0.227
1.255MetArg: 1.255 ± 0.171
1.743MetSer: 1.743 ± 0.255
2.51MetThr: 2.51 ± 0.272
1.953MetVal: 1.953 ± 0.269
0.349MetTrp: 0.349 ± 0.124
0.872MetTyr: 0.872 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
3.626AsnAla: 3.626 ± 0.346
0.523AsnCys: 0.523 ± 0.157
2.789AsnAsp: 2.789 ± 0.31
2.336AsnGlu: 2.336 ± 0.283
1.116AsnPhe: 1.116 ± 0.183
4.498AsnGly: 4.498 ± 0.508
1.046AsnHis: 1.046 ± 0.187
2.859AsnIle: 2.859 ± 0.354
2.894AsnLys: 2.894 ± 0.32
3.975AsnLeu: 3.975 ± 0.542
0.872AsnMet: 0.872 ± 0.199
2.406AsnAsn: 2.406 ± 0.351
2.58AsnPro: 2.58 ± 0.332
1.569AsnGln: 1.569 ± 0.268
2.301AsnArg: 2.301 ± 0.286
2.51AsnSer: 2.51 ± 0.334
3.173AsnThr: 3.173 ± 0.372
3.452AsnVal: 3.452 ± 0.295
0.558AsnTrp: 0.558 ± 0.135
1.778AsnTyr: 1.778 ± 0.284
0.0AsnXaa: 0.0 ± 0.0
Pro
2.755ProAla: 2.755 ± 0.312
0.523ProCys: 0.523 ± 0.131
2.336ProAsp: 2.336 ± 0.279
2.964ProGlu: 2.964 ± 0.303
1.325ProPhe: 1.325 ± 0.196
2.162ProGly: 2.162 ± 0.264
0.767ProHis: 0.767 ± 0.145
1.395ProIle: 1.395 ± 0.207
1.953ProLys: 1.953 ± 0.343
2.859ProLeu: 2.859 ± 0.327
0.976ProMet: 0.976 ± 0.189
1.918ProAsn: 1.918 ± 0.237
0.837ProPro: 0.837 ± 0.167
1.569ProGln: 1.569 ± 0.206
1.639ProArg: 1.639 ± 0.239
2.057ProSer: 2.057 ± 0.27
2.615ProThr: 2.615 ± 0.312
3.243ProVal: 3.243 ± 0.351
0.349ProTrp: 0.349 ± 0.109
1.43ProTyr: 1.43 ± 0.203
0.0ProXaa: 0.0 ± 0.0
Gln
4.358GlnAla: 4.358 ± 0.34
0.384GlnCys: 0.384 ± 0.11
2.162GlnAsp: 2.162 ± 0.296
2.476GlnGlu: 2.476 ± 0.271
1.569GlnPhe: 1.569 ± 0.227
2.545GlnGly: 2.545 ± 0.352
0.767GlnHis: 0.767 ± 0.134
2.441GlnIle: 2.441 ± 0.28
2.266GlnLys: 2.266 ± 0.297
3.626GlnLeu: 3.626 ± 0.382
1.569GlnMet: 1.569 ± 0.253
1.674GlnAsn: 1.674 ± 0.251
1.29GlnPro: 1.29 ± 0.168
1.848GlnGln: 1.848 ± 0.243
2.022GlnArg: 2.022 ± 0.28
1.778GlnSer: 1.778 ± 0.227
1.848GlnThr: 1.848 ± 0.272
3.173GlnVal: 3.173 ± 0.313
0.732GlnTrp: 0.732 ± 0.146
1.674GlnTyr: 1.674 ± 0.227
0.0GlnXaa: 0.0 ± 0.0
Arg
3.522ArgAla: 3.522 ± 0.39
0.523ArgCys: 0.523 ± 0.137
2.894ArgAsp: 2.894 ± 0.387
3.278ArgGlu: 3.278 ± 0.338
1.709ArgPhe: 1.709 ± 0.202
3.278ArgGly: 3.278 ± 0.343
0.732ArgHis: 0.732 ± 0.191
2.755ArgIle: 2.755 ± 0.318
2.929ArgLys: 2.929 ± 0.384
4.079ArgLeu: 4.079 ± 0.403
1.36ArgMet: 1.36 ± 0.241
2.545ArgAsn: 2.545 ± 0.316
1.743ArgPro: 1.743 ± 0.221
1.918ArgGln: 1.918 ± 0.215
2.336ArgArg: 2.336 ± 0.344
2.301ArgSer: 2.301 ± 0.246
3.068ArgThr: 3.068 ± 0.324
3.661ArgVal: 3.661 ± 0.356
0.697ArgTrp: 0.697 ± 0.169
1.464ArgTyr: 1.464 ± 0.212
0.0ArgXaa: 0.0 ± 0.0
Ser
4.812SerAla: 4.812 ± 0.497
1.046SerCys: 1.046 ± 0.225
4.01SerAsp: 4.01 ± 0.419
2.65SerGlu: 2.65 ± 0.35
2.022SerPhe: 2.022 ± 0.257
5.265SerGly: 5.265 ± 0.634
1.151SerHis: 1.151 ± 0.202
2.894SerIle: 2.894 ± 0.312
3.173SerLys: 3.173 ± 0.348
4.289SerLeu: 4.289 ± 0.371
1.255SerMet: 1.255 ± 0.188
2.615SerAsn: 2.615 ± 0.316
2.232SerPro: 2.232 ± 0.279
2.022SerGln: 2.022 ± 0.244
2.441SerArg: 2.441 ± 0.286
3.87SerSer: 3.87 ± 0.877
2.929SerThr: 2.929 ± 0.411
4.045SerVal: 4.045 ± 0.464
0.907SerTrp: 0.907 ± 0.186
2.545SerTyr: 2.545 ± 0.313
0.0SerXaa: 0.0 ± 0.0
Thr
5.683ThrAla: 5.683 ± 0.567
0.697ThrCys: 0.697 ± 0.165
3.766ThrAsp: 3.766 ± 0.355
3.591ThrGlu: 3.591 ± 0.287
2.72ThrPhe: 2.72 ± 0.264
5.718ThrGly: 5.718 ± 0.56
1.046ThrHis: 1.046 ± 0.197
4.637ThrIle: 4.637 ± 0.385
2.929ThrLys: 2.929 ± 0.337
5.544ThrLeu: 5.544 ± 0.575
1.464ThrMet: 1.464 ± 0.241
2.51ThrAsn: 2.51 ± 0.322
2.824ThrPro: 2.824 ± 0.321
2.58ThrGln: 2.58 ± 0.286
2.789ThrArg: 2.789 ± 0.309
3.731ThrSer: 3.731 ± 0.481
5.37ThrThr: 5.37 ± 0.597
5.195ThrVal: 5.195 ± 0.441
0.662ThrTrp: 0.662 ± 0.133
1.987ThrTyr: 1.987 ± 0.26
0.0ThrXaa: 0.0 ± 0.0
Val
6.102ValAla: 6.102 ± 0.571
1.151ValCys: 1.151 ± 0.234
4.463ValAsp: 4.463 ± 0.393
5.404ValGlu: 5.404 ± 0.47
3.173ValPhe: 3.173 ± 0.278
4.777ValGly: 4.777 ± 0.421
1.22ValHis: 1.22 ± 0.204
3.382ValIle: 3.382 ± 0.372
4.637ValLys: 4.637 ± 0.538
4.603ValLeu: 4.603 ± 0.387
2.406ValMet: 2.406 ± 0.271
2.72ValAsn: 2.72 ± 0.355
2.685ValPro: 2.685 ± 0.303
2.685ValGln: 2.685 ± 0.26
3.417ValArg: 3.417 ± 0.321
4.079ValSer: 4.079 ± 0.365
5.3ValThr: 5.3 ± 0.458
5.335ValVal: 5.335 ± 0.476
1.22ValTrp: 1.22 ± 0.2
2.65ValTyr: 2.65 ± 0.268
0.0ValXaa: 0.0 ± 0.0
Trp
1.255TrpAla: 1.255 ± 0.211
0.07TrpCys: 0.07 ± 0.041
0.802TrpAsp: 0.802 ± 0.149
1.325TrpGlu: 1.325 ± 0.228
0.523TrpPhe: 0.523 ± 0.138
0.907TrpGly: 0.907 ± 0.17
0.488TrpHis: 0.488 ± 0.122
0.802TrpIle: 0.802 ± 0.18
1.116TrpLys: 1.116 ± 0.234
1.499TrpLeu: 1.499 ± 0.234
0.209TrpMet: 0.209 ± 0.09
0.837TrpAsn: 0.837 ± 0.168
0.384TrpPro: 0.384 ± 0.132
0.314TrpGln: 0.314 ± 0.105
0.384TrpArg: 0.384 ± 0.14
0.732TrpSer: 0.732 ± 0.15
0.837TrpThr: 0.837 ± 0.174
1.185TrpVal: 1.185 ± 0.218
0.244TrpTrp: 0.244 ± 0.083
0.732TrpTyr: 0.732 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.824TyrAla: 2.824 ± 0.342
0.593TyrCys: 0.593 ± 0.138
2.685TyrAsp: 2.685 ± 0.329
2.301TyrGlu: 2.301 ± 0.282
1.325TyrPhe: 1.325 ± 0.203
2.964TyrGly: 2.964 ± 0.408
0.837TyrHis: 0.837 ± 0.157
1.918TyrIle: 1.918 ± 0.253
2.057TyrLys: 2.057 ± 0.253
2.371TyrLeu: 2.371 ± 0.337
1.116TyrMet: 1.116 ± 0.217
2.441TyrAsn: 2.441 ± 0.3
1.36TyrPro: 1.36 ± 0.179
1.569TyrGln: 1.569 ± 0.254
2.266TyrArg: 2.266 ± 0.298
2.127TyrSer: 2.127 ± 0.291
3.138TyrThr: 3.138 ± 0.358
2.197TyrVal: 2.197 ± 0.299
0.349TyrTrp: 0.349 ± 0.109
1.22TyrTyr: 1.22 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 173 proteins (28681 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski