Amino acid dipepetide frequency for Pseudomonas phage MR18

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.17AlaAla: 13.17 ± 1.422
1.19AlaCys: 1.19 ± 0.286
6.696AlaAsp: 6.696 ± 0.689
6.399AlaGlu: 6.399 ± 0.815
2.53AlaPhe: 2.53 ± 0.524
9.375AlaGly: 9.375 ± 0.991
1.637AlaHis: 1.637 ± 0.446
4.018AlaIle: 4.018 ± 0.55
5.655AlaLys: 5.655 ± 0.666
10.491AlaLeu: 10.491 ± 0.918
3.795AlaMet: 3.795 ± 0.597
4.167AlaAsn: 4.167 ± 0.596
4.315AlaPro: 4.315 ± 0.736
5.357AlaGln: 5.357 ± 0.647
5.952AlaArg: 5.952 ± 0.607
5.952AlaSer: 5.952 ± 0.671
6.176AlaThr: 6.176 ± 1.033
7.738AlaVal: 7.738 ± 0.782
1.042AlaTrp: 1.042 ± 0.214
3.199AlaTyr: 3.199 ± 0.528
0.0AlaXaa: 0.0 ± 0.0
Cys
1.116CysAla: 1.116 ± 0.297
0.223CysCys: 0.223 ± 0.113
0.521CysAsp: 0.521 ± 0.186
0.67CysGlu: 0.67 ± 0.269
0.298CysPhe: 0.298 ± 0.166
1.042CysGly: 1.042 ± 0.279
0.372CysHis: 0.372 ± 0.164
0.446CysIle: 0.446 ± 0.201
0.372CysLys: 0.372 ± 0.136
1.339CysLeu: 1.339 ± 0.413
0.298CysMet: 0.298 ± 0.152
0.744CysAsn: 0.744 ± 0.244
0.595CysPro: 0.595 ± 0.259
0.446CysGln: 0.446 ± 0.204
0.744CysArg: 0.744 ± 0.283
0.595CysSer: 0.595 ± 0.278
0.893CysThr: 0.893 ± 0.271
1.042CysVal: 1.042 ± 0.253
0.298CysTrp: 0.298 ± 0.158
0.521CysTyr: 0.521 ± 0.198
0.0CysXaa: 0.0 ± 0.0
Asp
5.06AspAla: 5.06 ± 0.635
0.521AspCys: 0.521 ± 0.241
3.199AspAsp: 3.199 ± 0.4
3.199AspGlu: 3.199 ± 0.43
1.86AspPhe: 1.86 ± 0.387
4.092AspGly: 4.092 ± 0.571
0.744AspHis: 0.744 ± 0.277
3.646AspIle: 3.646 ± 0.547
2.679AspLys: 2.679 ± 0.457
3.943AspLeu: 3.943 ± 0.524
2.307AspMet: 2.307 ± 0.306
2.232AspAsn: 2.232 ± 0.455
2.753AspPro: 2.753 ± 0.544
1.637AspGln: 1.637 ± 0.336
3.571AspArg: 3.571 ± 0.51
3.646AspSer: 3.646 ± 0.565
3.646AspThr: 3.646 ± 0.414
4.241AspVal: 4.241 ± 0.606
0.67AspTrp: 0.67 ± 0.188
2.009AspTyr: 2.009 ± 0.383
0.0AspXaa: 0.0 ± 0.0
Glu
7.143GluAla: 7.143 ± 0.82
0.595GluCys: 0.595 ± 0.203
2.753GluAsp: 2.753 ± 0.351
2.753GluGlu: 2.753 ± 0.382
1.86GluPhe: 1.86 ± 0.423
2.604GluGly: 2.604 ± 0.448
1.265GluHis: 1.265 ± 0.296
1.339GluIle: 1.339 ± 0.236
2.381GluLys: 2.381 ± 0.486
5.729GluLeu: 5.729 ± 0.699
1.637GluMet: 1.637 ± 0.367
1.265GluAsn: 1.265 ± 0.28
1.042GluPro: 1.042 ± 0.296
3.497GluGln: 3.497 ± 0.475
3.646GluArg: 3.646 ± 0.636
2.902GluSer: 2.902 ± 0.465
2.53GluThr: 2.53 ± 0.486
3.646GluVal: 3.646 ± 0.639
1.042GluTrp: 1.042 ± 0.262
2.307GluTyr: 2.307 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.753PheAla: 2.753 ± 0.366
0.595PheCys: 0.595 ± 0.235
2.009PheAsp: 2.009 ± 0.459
1.339PheGlu: 1.339 ± 0.361
0.818PhePhe: 0.818 ± 0.213
2.455PheGly: 2.455 ± 0.425
0.67PheHis: 0.67 ± 0.23
1.265PheIle: 1.265 ± 0.216
1.488PheLys: 1.488 ± 0.371
3.199PheLeu: 3.199 ± 0.674
0.818PheMet: 0.818 ± 0.287
1.414PheAsn: 1.414 ± 0.43
1.116PhePro: 1.116 ± 0.255
0.67PheGln: 0.67 ± 0.245
2.009PheArg: 2.009 ± 0.462
2.307PheSer: 2.307 ± 0.38
2.009PheThr: 2.009 ± 0.361
2.158PheVal: 2.158 ± 0.396
0.298PheTrp: 0.298 ± 0.182
0.67PheTyr: 0.67 ± 0.238
0.0PheXaa: 0.0 ± 0.0
Gly
7.44GlyAla: 7.44 ± 0.778
1.19GlyCys: 1.19 ± 0.39
4.464GlyAsp: 4.464 ± 0.557
3.72GlyGlu: 3.72 ± 0.564
2.604GlyPhe: 2.604 ± 0.64
6.696GlyGly: 6.696 ± 0.991
0.744GlyHis: 0.744 ± 0.235
2.902GlyIle: 2.902 ± 0.556
4.39GlyLys: 4.39 ± 0.696
7.44GlyLeu: 7.44 ± 0.82
2.976GlyMet: 2.976 ± 0.501
2.902GlyAsn: 2.902 ± 0.381
2.381GlyPro: 2.381 ± 0.42
3.795GlyGln: 3.795 ± 0.791
4.092GlyArg: 4.092 ± 0.46
4.911GlySer: 4.911 ± 0.621
5.208GlyThr: 5.208 ± 0.513
7.887GlyVal: 7.887 ± 0.822
1.488GlyTrp: 1.488 ± 0.312
3.051GlyTyr: 3.051 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
1.711HisAla: 1.711 ± 0.324
0.298HisCys: 0.298 ± 0.178
1.786HisAsp: 1.786 ± 0.453
0.967HisGlu: 0.967 ± 0.26
0.67HisPhe: 0.67 ± 0.224
1.711HisGly: 1.711 ± 0.355
0.372HisHis: 0.372 ± 0.188
0.818HisIle: 0.818 ± 0.242
1.637HisLys: 1.637 ± 0.321
1.339HisLeu: 1.339 ± 0.332
0.744HisMet: 0.744 ± 0.234
0.67HisAsn: 0.67 ± 0.226
0.967HisPro: 0.967 ± 0.329
0.595HisGln: 0.595 ± 0.173
1.637HisArg: 1.637 ± 0.427
0.67HisSer: 0.67 ± 0.254
1.19HisThr: 1.19 ± 0.371
1.637HisVal: 1.637 ± 0.427
0.372HisTrp: 0.372 ± 0.131
0.744HisTyr: 0.744 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
3.943IleAla: 3.943 ± 0.598
0.372IleCys: 0.372 ± 0.163
2.232IleAsp: 2.232 ± 0.437
2.679IleGlu: 2.679 ± 0.414
0.521IlePhe: 0.521 ± 0.188
3.423IleGly: 3.423 ± 0.454
1.116IleHis: 1.116 ± 0.244
2.307IleIle: 2.307 ± 0.379
2.604IleLys: 2.604 ± 0.499
2.604IleLeu: 2.604 ± 0.47
0.893IleMet: 0.893 ± 0.234
1.86IleAsn: 1.86 ± 0.337
2.53IlePro: 2.53 ± 0.414
2.455IleGln: 2.455 ± 0.379
3.051IleArg: 3.051 ± 0.467
1.488IleSer: 1.488 ± 0.331
2.827IleThr: 2.827 ± 0.529
2.902IleVal: 2.902 ± 0.526
0.372IleTrp: 0.372 ± 0.153
0.595IleTyr: 0.595 ± 0.16
0.0IleXaa: 0.0 ± 0.0
Lys
5.134LysAla: 5.134 ± 0.946
0.67LysCys: 0.67 ± 0.223
2.53LysAsp: 2.53 ± 0.348
2.307LysGlu: 2.307 ± 0.478
1.637LysPhe: 1.637 ± 0.335
3.125LysGly: 3.125 ± 0.519
1.19LysHis: 1.19 ± 0.33
2.009LysIle: 2.009 ± 0.402
1.042LysLys: 1.042 ± 0.328
4.018LysLeu: 4.018 ± 0.522
0.818LysMet: 0.818 ± 0.157
1.19LysAsn: 1.19 ± 0.347
2.232LysPro: 2.232 ± 0.519
2.381LysGln: 2.381 ± 0.367
2.455LysArg: 2.455 ± 0.464
1.935LysSer: 1.935 ± 0.488
2.679LysThr: 2.679 ± 0.445
4.315LysVal: 4.315 ± 0.577
0.595LysTrp: 0.595 ± 0.206
1.339LysTyr: 1.339 ± 0.342
0.0LysXaa: 0.0 ± 0.0
Leu
9.896LeuAla: 9.896 ± 1.082
1.265LeuCys: 1.265 ± 0.353
5.878LeuAsp: 5.878 ± 0.588
5.208LeuGlu: 5.208 ± 0.587
2.009LeuPhe: 2.009 ± 0.409
7.738LeuGly: 7.738 ± 0.644
2.307LeuHis: 2.307 ± 0.487
2.976LeuIle: 2.976 ± 0.425
2.753LeuLys: 2.753 ± 0.483
6.548LeuLeu: 6.548 ± 0.87
2.455LeuMet: 2.455 ± 0.306
4.464LeuAsn: 4.464 ± 0.761
4.539LeuPro: 4.539 ± 0.74
3.423LeuGln: 3.423 ± 0.473
6.473LeuArg: 6.473 ± 0.88
5.878LeuSer: 5.878 ± 0.707
5.58LeuThr: 5.58 ± 0.698
6.324LeuVal: 6.324 ± 0.853
1.414LeuTrp: 1.414 ± 0.501
2.158LeuTyr: 2.158 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
3.125MetAla: 3.125 ± 0.603
0.223MetCys: 0.223 ± 0.121
1.339MetAsp: 1.339 ± 0.391
1.265MetGlu: 1.265 ± 0.335
1.042MetPhe: 1.042 ± 0.241
1.86MetGly: 1.86 ± 0.308
0.521MetHis: 0.521 ± 0.171
0.744MetIle: 0.744 ± 0.237
1.116MetLys: 1.116 ± 0.324
2.679MetLeu: 2.679 ± 0.327
0.595MetMet: 0.595 ± 0.241
1.042MetAsn: 1.042 ± 0.321
1.116MetPro: 1.116 ± 0.264
2.232MetGln: 2.232 ± 0.469
1.86MetArg: 1.86 ± 0.263
2.158MetSer: 2.158 ± 0.383
2.232MetThr: 2.232 ± 0.366
2.083MetVal: 2.083 ± 0.39
0.595MetTrp: 0.595 ± 0.209
1.19MetTyr: 1.19 ± 0.311
0.0MetXaa: 0.0 ± 0.0
Asn
3.943AsnAla: 3.943 ± 0.795
0.521AsnCys: 0.521 ± 0.188
1.786AsnAsp: 1.786 ± 0.328
2.232AsnGlu: 2.232 ± 0.318
0.893AsnPhe: 0.893 ± 0.32
3.571AsnGly: 3.571 ± 0.535
1.116AsnHis: 1.116 ± 0.334
1.488AsnIle: 1.488 ± 0.332
1.488AsnLys: 1.488 ± 0.307
3.795AsnLeu: 3.795 ± 0.522
1.265AsnMet: 1.265 ± 0.358
1.116AsnAsn: 1.116 ± 0.297
2.158AsnPro: 2.158 ± 0.316
1.935AsnGln: 1.935 ± 0.472
1.786AsnArg: 1.786 ± 0.409
2.53AsnSer: 2.53 ± 0.504
2.604AsnThr: 2.604 ± 0.542
3.423AsnVal: 3.423 ± 0.567
0.595AsnTrp: 0.595 ± 0.207
1.562AsnTyr: 1.562 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
5.952ProAla: 5.952 ± 0.743
0.521ProCys: 0.521 ± 0.221
1.935ProAsp: 1.935 ± 0.411
2.976ProGlu: 2.976 ± 0.586
1.042ProPhe: 1.042 ± 0.275
3.497ProGly: 3.497 ± 0.431
0.67ProHis: 0.67 ± 0.236
2.232ProIle: 2.232 ± 0.376
1.935ProLys: 1.935 ± 0.445
2.53ProLeu: 2.53 ± 0.389
0.521ProMet: 0.521 ± 0.193
2.083ProAsn: 2.083 ± 0.365
1.265ProPro: 1.265 ± 0.319
1.637ProGln: 1.637 ± 0.261
2.604ProArg: 2.604 ± 0.474
2.455ProSer: 2.455 ± 0.428
2.753ProThr: 2.753 ± 0.582
3.646ProVal: 3.646 ± 0.505
0.818ProTrp: 0.818 ± 0.22
1.86ProTyr: 1.86 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
5.729GlnAla: 5.729 ± 1.002
0.298GlnCys: 0.298 ± 0.145
1.562GlnAsp: 1.562 ± 0.396
2.604GlnGlu: 2.604 ± 0.629
1.562GlnPhe: 1.562 ± 0.258
3.943GlnGly: 3.943 ± 0.481
1.414GlnHis: 1.414 ± 0.344
1.265GlnIle: 1.265 ± 0.326
1.488GlnLys: 1.488 ± 0.299
5.357GlnLeu: 5.357 ± 0.515
1.19GlnMet: 1.19 ± 0.362
1.935GlnAsn: 1.935 ± 0.32
1.414GlnPro: 1.414 ± 0.288
3.199GlnGln: 3.199 ± 0.565
3.72GlnArg: 3.72 ± 0.491
2.604GlnSer: 2.604 ± 0.421
2.158GlnThr: 2.158 ± 0.436
2.232GlnVal: 2.232 ± 0.491
0.595GlnTrp: 0.595 ± 0.201
1.711GlnTyr: 1.711 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
5.878ArgAla: 5.878 ± 0.583
0.818ArgCys: 0.818 ± 0.287
4.167ArgAsp: 4.167 ± 0.444
4.018ArgGlu: 4.018 ± 0.675
2.381ArgPhe: 2.381 ± 0.44
4.762ArgGly: 4.762 ± 0.538
1.711ArgHis: 1.711 ± 0.37
3.348ArgIle: 3.348 ± 0.543
2.604ArgLys: 2.604 ± 0.501
5.729ArgLeu: 5.729 ± 0.676
2.53ArgMet: 2.53 ± 0.549
2.158ArgAsn: 2.158 ± 0.375
1.711ArgPro: 1.711 ± 0.477
2.604ArgGln: 2.604 ± 0.499
4.539ArgArg: 4.539 ± 0.591
3.72ArgSer: 3.72 ± 0.51
2.455ArgThr: 2.455 ± 0.41
4.539ArgVal: 4.539 ± 0.574
1.042ArgTrp: 1.042 ± 0.275
2.232ArgTyr: 2.232 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
7.366SerAla: 7.366 ± 0.583
0.967SerCys: 0.967 ± 0.326
2.976SerAsp: 2.976 ± 0.477
2.679SerGlu: 2.679 ± 0.534
2.679SerPhe: 2.679 ± 0.441
4.911SerGly: 4.911 ± 0.522
0.967SerHis: 0.967 ± 0.265
3.274SerIle: 3.274 ± 0.451
3.199SerLys: 3.199 ± 0.561
5.729SerLeu: 5.729 ± 0.709
2.158SerMet: 2.158 ± 0.386
2.158SerAsn: 2.158 ± 0.478
2.455SerPro: 2.455 ± 0.513
1.711SerGln: 1.711 ± 0.426
2.679SerArg: 2.679 ± 0.484
4.241SerSer: 4.241 ± 0.691
3.869SerThr: 3.869 ± 0.649
4.241SerVal: 4.241 ± 0.589
1.488SerTrp: 1.488 ± 0.348
1.562SerTyr: 1.562 ± 0.43
0.0SerXaa: 0.0 ± 0.0
Thr
7.068ThrAla: 7.068 ± 0.918
0.595ThrCys: 0.595 ± 0.236
3.199ThrAsp: 3.199 ± 0.446
1.562ThrGlu: 1.562 ± 0.328
1.935ThrPhe: 1.935 ± 0.328
6.92ThrGly: 6.92 ± 1.145
0.893ThrHis: 0.893 ± 0.253
2.307ThrIle: 2.307 ± 0.455
2.009ThrLys: 2.009 ± 0.412
4.911ThrLeu: 4.911 ± 0.715
1.042ThrMet: 1.042 ± 0.261
2.307ThrAsn: 2.307 ± 0.526
4.167ThrPro: 4.167 ± 0.643
3.199ThrGln: 3.199 ± 0.433
3.051ThrArg: 3.051 ± 0.452
4.167ThrSer: 4.167 ± 0.609
3.943ThrThr: 3.943 ± 0.718
4.688ThrVal: 4.688 ± 0.709
0.595ThrTrp: 0.595 ± 0.231
2.158ThrTyr: 2.158 ± 0.4
0.0ThrXaa: 0.0 ± 0.0
Val
7.664ValAla: 7.664 ± 0.817
0.893ValCys: 0.893 ± 0.291
5.208ValAsp: 5.208 ± 0.618
3.497ValGlu: 3.497 ± 0.594
2.307ValPhe: 2.307 ± 0.638
5.506ValGly: 5.506 ± 0.727
1.637ValHis: 1.637 ± 0.421
2.083ValIle: 2.083 ± 0.4
2.902ValLys: 2.902 ± 0.448
7.515ValLeu: 7.515 ± 0.756
2.083ValMet: 2.083 ± 0.414
3.646ValAsn: 3.646 ± 0.841
3.943ValPro: 3.943 ± 0.558
2.976ValGln: 2.976 ± 0.552
5.208ValArg: 5.208 ± 0.622
5.506ValSer: 5.506 ± 0.806
4.39ValThr: 4.39 ± 0.897
5.208ValVal: 5.208 ± 0.86
1.339ValTrp: 1.339 ± 0.245
2.158ValTyr: 2.158 ± 0.405
0.0ValXaa: 0.0 ± 0.0
Trp
2.232TrpAla: 2.232 ± 0.456
0.521TrpCys: 0.521 ± 0.185
0.67TrpAsp: 0.67 ± 0.244
0.744TrpGlu: 0.744 ± 0.19
0.595TrpPhe: 0.595 ± 0.169
0.67TrpGly: 0.67 ± 0.204
0.372TrpHis: 0.372 ± 0.144
0.744TrpIle: 0.744 ± 0.268
0.521TrpLys: 0.521 ± 0.203
1.265TrpLeu: 1.265 ± 0.365
0.298TrpMet: 0.298 ± 0.173
0.67TrpAsn: 0.67 ± 0.224
0.818TrpPro: 0.818 ± 0.244
0.818TrpGln: 0.818 ± 0.267
0.818TrpArg: 0.818 ± 0.284
1.19TrpSer: 1.19 ± 0.309
0.967TrpThr: 0.967 ± 0.217
0.67TrpVal: 0.67 ± 0.202
0.372TrpTrp: 0.372 ± 0.157
0.595TrpTyr: 0.595 ± 0.235
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.53TyrAla: 2.53 ± 0.483
0.298TyrCys: 0.298 ± 0.173
0.744TyrAsp: 0.744 ± 0.254
1.116TyrGlu: 1.116 ± 0.327
0.893TyrPhe: 0.893 ± 0.269
2.381TyrGly: 2.381 ± 0.348
0.818TyrHis: 0.818 ± 0.284
1.562TyrIle: 1.562 ± 0.268
1.339TyrLys: 1.339 ± 0.31
3.125TyrLeu: 3.125 ± 0.517
0.521TyrMet: 0.521 ± 0.153
1.786TyrAsn: 1.786 ± 0.35
1.414TyrPro: 1.414 ± 0.283
1.414TyrGln: 1.414 ± 0.299
3.051TyrArg: 3.051 ± 0.48
2.455TyrSer: 2.455 ± 0.44
2.604TyrThr: 2.604 ± 0.388
3.051TyrVal: 3.051 ± 0.579
0.521TyrTrp: 0.521 ± 0.197
0.893TyrTyr: 0.893 ± 0.278
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13441 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski