Amino acid dipepetide frequency for Proteus phage PM 75

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.992AlaAla: 4.992 ± 0.94
0.471AlaCys: 0.471 ± 0.238
5.369AlaAsp: 5.369 ± 0.852
3.485AlaGlu: 3.485 ± 0.612
2.731AlaPhe: 2.731 ± 0.667
7.158AlaGly: 7.158 ± 0.925
1.13AlaHis: 1.13 ± 0.391
2.826AlaIle: 2.826 ± 0.527
4.05AlaLys: 4.05 ± 0.738
7.347AlaLeu: 7.347 ± 1.22
1.79AlaMet: 1.79 ± 0.437
3.485AlaAsn: 3.485 ± 0.484
4.333AlaPro: 4.333 ± 1.128
3.673AlaGln: 3.673 ± 0.654
4.144AlaArg: 4.144 ± 0.94
5.18AlaSer: 5.18 ± 0.779
3.956AlaThr: 3.956 ± 0.578
5.84AlaVal: 5.84 ± 0.898
0.471AlaTrp: 0.471 ± 0.18
3.014AlaTyr: 3.014 ± 0.53
0.0AlaXaa: 0.0 ± 0.0
Cys
0.471CysAla: 0.471 ± 0.183
0.283CysCys: 0.283 ± 0.148
0.754CysAsp: 0.754 ± 0.348
0.283CysGlu: 0.283 ± 0.138
0.094CysPhe: 0.094 ± 0.098
0.283CysGly: 0.283 ± 0.158
0.471CysHis: 0.471 ± 0.251
1.036CysIle: 1.036 ± 0.383
0.377CysLys: 0.377 ± 0.166
0.565CysLeu: 0.565 ± 0.275
0.848CysMet: 0.848 ± 0.338
0.471CysAsn: 0.471 ± 0.209
0.377CysPro: 0.377 ± 0.165
0.283CysGln: 0.283 ± 0.162
0.377CysArg: 0.377 ± 0.201
0.471CysSer: 0.471 ± 0.202
1.224CysThr: 1.224 ± 0.326
0.659CysVal: 0.659 ± 0.323
0.188CysTrp: 0.188 ± 0.163
0.471CysTyr: 0.471 ± 0.247
0.0CysXaa: 0.0 ± 0.0
Asp
5.557AspAla: 5.557 ± 0.828
0.848AspCys: 0.848 ± 0.319
3.391AspAsp: 3.391 ± 0.657
3.297AspGlu: 3.297 ± 0.486
2.637AspPhe: 2.637 ± 0.62
4.615AspGly: 4.615 ± 0.667
0.565AspHis: 0.565 ± 0.201
3.862AspIle: 3.862 ± 0.563
3.673AspLys: 3.673 ± 0.427
6.122AspLeu: 6.122 ± 0.59
1.413AspMet: 1.413 ± 0.628
1.601AspAsn: 1.601 ± 0.451
2.637AspPro: 2.637 ± 0.34
1.13AspGln: 1.13 ± 0.286
1.695AspArg: 1.695 ± 0.451
4.238AspSer: 4.238 ± 0.767
6.216AspThr: 6.216 ± 0.727
3.202AspVal: 3.202 ± 0.676
0.754AspTrp: 0.754 ± 0.249
2.261AspTyr: 2.261 ± 0.38
0.0AspXaa: 0.0 ± 0.0
Glu
5.18GluAla: 5.18 ± 0.82
0.754GluCys: 0.754 ± 0.292
1.978GluAsp: 1.978 ± 0.334
2.543GluGlu: 2.543 ± 0.658
3.297GluPhe: 3.297 ± 0.515
3.579GluGly: 3.579 ± 0.622
1.695GluHis: 1.695 ± 0.545
3.202GluIle: 3.202 ± 0.432
4.05GluLys: 4.05 ± 0.751
4.992GluLeu: 4.992 ± 0.541
1.884GluMet: 1.884 ± 0.32
1.224GluAsn: 1.224 ± 0.282
1.79GluPro: 1.79 ± 0.396
3.768GluGln: 3.768 ± 0.735
3.108GluArg: 3.108 ± 0.578
3.862GluSer: 3.862 ± 0.542
2.92GluThr: 2.92 ± 0.728
4.144GluVal: 4.144 ± 0.584
1.036GluTrp: 1.036 ± 0.282
3.768GluTyr: 3.768 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
2.166PheAla: 2.166 ± 0.54
0.283PheCys: 0.283 ± 0.17
2.543PheAsp: 2.543 ± 0.553
1.224PheGlu: 1.224 ± 0.455
1.601PhePhe: 1.601 ± 0.397
3.014PheGly: 3.014 ± 0.491
0.659PheHis: 0.659 ± 0.34
1.884PheIle: 1.884 ± 0.337
3.108PheLys: 3.108 ± 0.498
2.261PheLeu: 2.261 ± 0.502
0.659PheMet: 0.659 ± 0.185
2.543PheAsn: 2.543 ± 0.386
0.848PhePro: 0.848 ± 0.331
1.319PheGln: 1.319 ± 0.302
1.978PheArg: 1.978 ± 0.456
3.108PheSer: 3.108 ± 0.564
1.79PheThr: 1.79 ± 0.308
1.507PheVal: 1.507 ± 0.379
0.283PheTrp: 0.283 ± 0.141
1.224PheTyr: 1.224 ± 0.266
0.0PheXaa: 0.0 ± 0.0
Gly
7.253GlyAla: 7.253 ± 0.644
0.942GlyCys: 0.942 ± 0.427
4.238GlyAsp: 4.238 ± 0.71
3.485GlyGlu: 3.485 ± 0.635
2.543GlyPhe: 2.543 ± 0.384
4.238GlyGly: 4.238 ± 0.82
0.848GlyHis: 0.848 ± 0.31
5.086GlyIle: 5.086 ± 0.526
3.297GlyLys: 3.297 ± 0.638
6.028GlyLeu: 6.028 ± 0.794
3.202GlyMet: 3.202 ± 0.592
3.391GlyAsn: 3.391 ± 0.504
0.0GlyPro: 0.0 ± 0.0
1.884GlyGln: 1.884 ± 0.471
3.391GlyArg: 3.391 ± 0.553
6.782GlySer: 6.782 ± 0.607
7.064GlyThr: 7.064 ± 0.839
5.275GlyVal: 5.275 ± 0.61
0.942GlyTrp: 0.942 ± 0.208
3.014GlyTyr: 3.014 ± 0.5
0.0GlyXaa: 0.0 ± 0.0
His
0.942HisAla: 0.942 ± 0.278
0.283HisCys: 0.283 ± 0.155
1.13HisAsp: 1.13 ± 0.212
1.507HisGlu: 1.507 ± 0.369
1.224HisPhe: 1.224 ± 0.444
1.036HisGly: 1.036 ± 0.407
0.283HisHis: 0.283 ± 0.163
0.942HisIle: 0.942 ± 0.204
1.13HisLys: 1.13 ± 0.302
2.826HisLeu: 2.826 ± 0.387
0.283HisMet: 0.283 ± 0.16
0.377HisAsn: 0.377 ± 0.213
0.471HisPro: 0.471 ± 0.214
0.942HisGln: 0.942 ± 0.411
1.036HisArg: 1.036 ± 0.241
1.695HisSer: 1.695 ± 0.455
1.319HisThr: 1.319 ± 0.319
0.565HisVal: 0.565 ± 0.334
0.188HisTrp: 0.188 ± 0.121
0.848HisTyr: 0.848 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
2.731IleAla: 2.731 ± 0.473
0.377IleCys: 0.377 ± 0.175
4.427IleAsp: 4.427 ± 0.585
1.884IleGlu: 1.884 ± 0.386
0.942IlePhe: 0.942 ± 0.3
3.579IleGly: 3.579 ± 0.741
1.507IleHis: 1.507 ± 0.349
2.355IleIle: 2.355 ± 0.543
4.333IleLys: 4.333 ± 0.564
4.615IleLeu: 4.615 ± 0.51
1.507IleMet: 1.507 ± 0.278
2.261IleAsn: 2.261 ± 0.574
2.449IlePro: 2.449 ± 0.37
3.673IleGln: 3.673 ± 0.824
3.108IleArg: 3.108 ± 0.494
3.014IleSer: 3.014 ± 0.465
3.956IleThr: 3.956 ± 0.67
3.108IleVal: 3.108 ± 0.442
0.188IleTrp: 0.188 ± 0.131
1.036IleTyr: 1.036 ± 0.38
0.0IleXaa: 0.0 ± 0.0
Lys
4.804LysAla: 4.804 ± 0.586
0.565LysCys: 0.565 ± 0.237
5.18LysAsp: 5.18 ± 0.529
5.086LysGlu: 5.086 ± 0.779
2.072LysPhe: 2.072 ± 0.489
4.992LysGly: 4.992 ± 0.598
1.036LysHis: 1.036 ± 0.272
2.449LysIle: 2.449 ± 0.472
1.695LysLys: 1.695 ± 0.303
5.84LysLeu: 5.84 ± 0.802
1.036LysMet: 1.036 ± 0.374
2.261LysAsn: 2.261 ± 0.479
2.731LysPro: 2.731 ± 0.533
2.92LysGln: 2.92 ± 0.562
2.826LysArg: 2.826 ± 0.62
3.579LysSer: 3.579 ± 0.556
2.543LysThr: 2.543 ± 0.468
4.333LysVal: 4.333 ± 0.909
1.224LysTrp: 1.224 ± 0.269
3.297LysTyr: 3.297 ± 0.452
0.0LysXaa: 0.0 ± 0.0
Leu
5.275LeuAla: 5.275 ± 0.743
1.601LeuCys: 1.601 ± 0.503
5.369LeuAsp: 5.369 ± 0.543
6.687LeuGlu: 6.687 ± 0.711
2.826LeuPhe: 2.826 ± 0.529
6.122LeuGly: 6.122 ± 0.622
1.224LeuHis: 1.224 ± 0.342
5.557LeuIle: 5.557 ± 0.668
6.97LeuLys: 6.97 ± 0.647
6.97LeuLeu: 6.97 ± 0.742
2.261LeuMet: 2.261 ± 0.447
4.804LeuAsn: 4.804 ± 0.674
3.485LeuPro: 3.485 ± 0.437
5.086LeuGln: 5.086 ± 0.981
4.709LeuArg: 4.709 ± 0.893
6.687LeuSer: 6.687 ± 0.673
4.615LeuThr: 4.615 ± 0.589
6.311LeuVal: 6.311 ± 1.094
0.377LeuTrp: 0.377 ± 0.165
3.862LeuTyr: 3.862 ± 0.638
0.0LeuXaa: 0.0 ± 0.0
Met
2.072MetAla: 2.072 ± 0.426
0.094MetCys: 0.094 ± 0.082
1.224MetAsp: 1.224 ± 0.363
0.942MetGlu: 0.942 ± 0.266
0.659MetPhe: 0.659 ± 0.238
2.449MetGly: 2.449 ± 0.396
0.659MetHis: 0.659 ± 0.246
0.942MetIle: 0.942 ± 0.272
1.224MetLys: 1.224 ± 0.279
3.202MetLeu: 3.202 ± 0.673
0.471MetMet: 0.471 ± 0.2
1.413MetAsn: 1.413 ± 0.466
0.754MetPro: 0.754 ± 0.322
2.261MetGln: 2.261 ± 0.761
1.13MetArg: 1.13 ± 0.238
2.826MetSer: 2.826 ± 0.474
0.942MetThr: 0.942 ± 0.335
1.79MetVal: 1.79 ± 0.463
0.188MetTrp: 0.188 ± 0.127
1.224MetTyr: 1.224 ± 0.292
0.0MetXaa: 0.0 ± 0.0
Asn
3.202AsnAla: 3.202 ± 0.461
0.283AsnCys: 0.283 ± 0.178
1.695AsnAsp: 1.695 ± 0.433
2.261AsnGlu: 2.261 ± 0.446
1.319AsnPhe: 1.319 ± 0.481
2.543AsnGly: 2.543 ± 0.545
0.848AsnHis: 0.848 ± 0.289
2.355AsnIle: 2.355 ± 0.447
3.202AsnLys: 3.202 ± 0.423
4.804AsnLeu: 4.804 ± 0.561
1.036AsnMet: 1.036 ± 0.297
2.637AsnAsn: 2.637 ± 0.495
2.543AsnPro: 2.543 ± 0.54
1.036AsnGln: 1.036 ± 0.236
1.884AsnArg: 1.884 ± 0.303
3.202AsnSer: 3.202 ± 0.497
3.862AsnThr: 3.862 ± 0.479
3.673AsnVal: 3.673 ± 0.539
0.377AsnTrp: 0.377 ± 0.186
1.319AsnTyr: 1.319 ± 0.289
0.0AsnXaa: 0.0 ± 0.0
Pro
3.014ProAla: 3.014 ± 0.541
0.188ProCys: 0.188 ± 0.11
3.391ProAsp: 3.391 ± 0.579
2.826ProGlu: 2.826 ± 0.646
0.848ProPhe: 0.848 ± 0.254
0.094ProGly: 0.094 ± 0.095
0.754ProHis: 0.754 ± 0.232
1.695ProIle: 1.695 ± 0.433
3.297ProLys: 3.297 ± 0.4
2.449ProLeu: 2.449 ± 0.387
1.13ProMet: 1.13 ± 0.331
1.884ProAsn: 1.884 ± 0.399
0.565ProPro: 0.565 ± 0.261
1.413ProGln: 1.413 ± 0.378
1.224ProArg: 1.224 ± 0.306
4.333ProSer: 4.333 ± 0.693
3.108ProThr: 3.108 ± 0.56
2.92ProVal: 2.92 ± 0.414
0.471ProTrp: 0.471 ± 0.215
1.319ProTyr: 1.319 ± 0.285
0.0ProXaa: 0.0 ± 0.0
Gln
5.369GlnAla: 5.369 ± 1.174
0.471GlnCys: 0.471 ± 0.325
3.202GlnAsp: 3.202 ± 0.644
4.992GlnGlu: 4.992 ± 0.805
1.413GlnPhe: 1.413 ± 0.387
4.521GlnGly: 4.521 ± 0.512
1.319GlnHis: 1.319 ± 0.396
1.319GlnIle: 1.319 ± 0.37
1.978GlnLys: 1.978 ± 0.443
3.485GlnLeu: 3.485 ± 0.717
1.224GlnMet: 1.224 ± 0.343
2.261GlnAsn: 2.261 ± 0.486
0.471GlnPro: 0.471 ± 0.202
3.108GlnGln: 3.108 ± 1.0
2.261GlnArg: 2.261 ± 0.515
3.485GlnSer: 3.485 ± 0.534
2.166GlnThr: 2.166 ± 0.505
3.485GlnVal: 3.485 ± 0.65
0.565GlnTrp: 0.565 ± 0.207
3.108GlnTyr: 3.108 ± 0.397
0.0GlnXaa: 0.0 ± 0.0
Arg
3.862ArgAla: 3.862 ± 1.006
0.283ArgCys: 0.283 ± 0.226
2.92ArgAsp: 2.92 ± 0.418
3.297ArgGlu: 3.297 ± 0.518
1.79ArgPhe: 1.79 ± 0.355
3.391ArgGly: 3.391 ± 0.859
0.754ArgHis: 0.754 ± 0.253
2.166ArgIle: 2.166 ± 0.444
2.731ArgLys: 2.731 ± 0.457
5.463ArgLeu: 5.463 ± 0.81
1.224ArgMet: 1.224 ± 0.358
1.413ArgAsn: 1.413 ± 0.467
1.13ArgPro: 1.13 ± 0.337
1.319ArgGln: 1.319 ± 0.363
3.014ArgArg: 3.014 ± 0.661
3.485ArgSer: 3.485 ± 0.637
3.014ArgThr: 3.014 ± 0.439
4.615ArgVal: 4.615 ± 0.523
0.754ArgTrp: 0.754 ± 0.205
1.413ArgTyr: 1.413 ± 0.388
0.0ArgXaa: 0.0 ± 0.0
Ser
5.463SerAla: 5.463 ± 0.782
0.283SerCys: 0.283 ± 0.157
3.862SerAsp: 3.862 ± 0.478
4.144SerGlu: 4.144 ± 0.37
1.695SerPhe: 1.695 ± 0.377
6.122SerGly: 6.122 ± 0.861
1.413SerHis: 1.413 ± 0.437
4.333SerIle: 4.333 ± 0.472
4.521SerLys: 4.521 ± 0.8
6.499SerLeu: 6.499 ± 0.699
2.355SerMet: 2.355 ± 0.516
3.391SerAsn: 3.391 ± 0.582
3.297SerPro: 3.297 ± 0.406
2.637SerGln: 2.637 ± 0.649
2.731SerArg: 2.731 ± 0.369
5.18SerSer: 5.18 ± 0.72
8.194SerThr: 8.194 ± 1.143
5.84SerVal: 5.84 ± 0.782
0.848SerTrp: 0.848 ± 0.303
1.601SerTyr: 1.601 ± 0.372
0.0SerXaa: 0.0 ± 0.0
Thr
5.651ThrAla: 5.651 ± 0.976
0.471ThrCys: 0.471 ± 0.197
3.862ThrAsp: 3.862 ± 0.754
4.709ThrGlu: 4.709 ± 0.644
2.543ThrPhe: 2.543 ± 0.522
6.499ThrGly: 6.499 ± 0.809
1.13ThrHis: 1.13 ± 0.309
2.543ThrIle: 2.543 ± 0.621
3.862ThrLys: 3.862 ± 0.604
4.709ThrLeu: 4.709 ± 0.908
1.79ThrMet: 1.79 ± 0.395
3.108ThrAsn: 3.108 ± 0.52
3.956ThrPro: 3.956 ± 0.495
3.673ThrGln: 3.673 ± 0.571
2.449ThrArg: 2.449 ± 0.429
4.709ThrSer: 4.709 ± 0.637
2.261ThrThr: 2.261 ± 0.566
6.028ThrVal: 6.028 ± 0.939
0.565ThrTrp: 0.565 ± 0.218
2.355ThrTyr: 2.355 ± 0.431
0.0ThrXaa: 0.0 ± 0.0
Val
5.086ValAla: 5.086 ± 0.7
0.471ValCys: 0.471 ± 0.19
2.637ValAsp: 2.637 ± 0.357
2.826ValGlu: 2.826 ± 0.584
2.072ValPhe: 2.072 ± 0.467
5.275ValGly: 5.275 ± 0.787
1.978ValHis: 1.978 ± 0.569
2.826ValIle: 2.826 ± 0.525
3.956ValLys: 3.956 ± 0.679
7.064ValLeu: 7.064 ± 0.834
0.754ValMet: 0.754 ± 0.278
2.731ValAsn: 2.731 ± 0.507
3.862ValPro: 3.862 ± 0.673
7.441ValGln: 7.441 ± 1.219
4.238ValArg: 4.238 ± 0.673
4.615ValSer: 4.615 ± 0.496
4.521ValThr: 4.521 ± 0.679
4.992ValVal: 4.992 ± 0.87
0.754ValTrp: 0.754 ± 0.216
2.166ValTyr: 2.166 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.377TrpAla: 0.377 ± 0.181
0.377TrpCys: 0.377 ± 0.214
0.754TrpAsp: 0.754 ± 0.231
0.942TrpGlu: 0.942 ± 0.335
0.848TrpPhe: 0.848 ± 0.212
0.848TrpGly: 0.848 ± 0.294
0.377TrpHis: 0.377 ± 0.131
0.377TrpIle: 0.377 ± 0.201
0.565TrpLys: 0.565 ± 0.245
1.413TrpLeu: 1.413 ± 0.316
0.094TrpMet: 0.094 ± 0.084
0.565TrpAsn: 0.565 ± 0.184
0.0TrpPro: 0.0 ± 0.0
0.659TrpGln: 0.659 ± 0.238
0.377TrpArg: 0.377 ± 0.153
0.754TrpSer: 0.754 ± 0.177
0.283TrpThr: 0.283 ± 0.16
0.754TrpVal: 0.754 ± 0.282
0.471TrpTrp: 0.471 ± 0.247
0.659TrpTyr: 0.659 ± 0.276
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.978TyrAla: 1.978 ± 0.348
0.659TyrCys: 0.659 ± 0.26
1.79TyrAsp: 1.79 ± 0.329
2.355TyrGlu: 2.355 ± 0.546
0.848TyrPhe: 0.848 ± 0.27
2.543TyrGly: 2.543 ± 0.548
0.377TyrHis: 0.377 ± 0.203
3.297TyrIle: 3.297 ± 0.617
2.449TyrLys: 2.449 ± 0.429
4.144TyrLeu: 4.144 ± 0.877
1.413TyrMet: 1.413 ± 0.369
2.261TyrAsn: 2.261 ± 0.474
1.224TyrPro: 1.224 ± 0.291
1.978TyrGln: 1.978 ± 0.4
2.355TyrArg: 2.355 ± 0.59
3.297TyrSer: 3.297 ± 0.308
2.92TyrThr: 2.92 ± 0.512
1.224TyrVal: 1.224 ± 0.288
0.848TyrTrp: 0.848 ± 0.31
1.13TyrTyr: 1.13 ± 0.445
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (10618 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski