Amino acid dipeptide frequency for Pseudomonas phage MD8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
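As an illustration of what such a calculation might look like, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent residue pairs within each protein and that frequencies are normalized by the total number of pairs; the exact procedure used by Proteome-pI is not stated on this page and may differ. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and sequences use one-letter codes rather than the three-letter codes of the table.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: dipeptides are overlapping adjacent residue pairs, and
    pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalize by the total number of pairs and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input for illustration only, not real phage proteins.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
```

In the toy input there are five pairs in total (MA, AA, AL, AA, AK), so "AA" accounts for two of five.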

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
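The comparison against the uniform baseline can be sketched in a few lines of Python. The values in the table are given in per mille, so the baseline of 1/400 corresponds to 2.5‰. The function name `classify` and the simple greater-than or less-than rule are assumptions for illustration; the page does not say whether the colouring uses a threshold or takes the error estimate into account.

```python
# Uniform-random baseline: 400 standard dipeptides, equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform baseline (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# A few values copied from the table (per mille).
observed = {"AlaAla": 14.259, "AlaLeu": 12.487, "CysTyr": 0.077, "TrpHis": 0.077}

labels = {name: classify(value) for name, value in observed.items()}
# Fold change relative to the baseline, e.g. AlaAla is about 5.7 times expected.
ratios = {name: value / EXPECTED_PER_MILLE for name, value in observed.items()}
```

A baseline built from the single amino acid frequencies (the product of the two residue frequencies) would be a stricter alternative, but that is not what the text above describes.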

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.259 ± 1.455
AlaCys: 0.925 ± 0.283
AlaAsp: 7.245 ± 0.722
AlaGlu: 8.17 ± 0.693
AlaPhe: 2.698 ± 0.451
AlaGly: 9.48 ± 0.871
AlaHis: 1.464 ± 0.341
AlaIle: 5.01 ± 0.651
AlaLys: 5.087 ± 0.565
AlaLeu: 12.487 ± 1.219
AlaMet: 3.237 ± 0.492
AlaAsn: 3.391 ± 0.547
AlaPro: 5.781 ± 0.762
AlaGln: 4.548 ± 0.608
AlaArg: 8.17 ± 0.826
AlaSer: 6.629 ± 0.664
AlaThr: 5.781 ± 0.838
AlaVal: 6.397 ± 0.673
AlaTrp: 2.544 ± 0.421
AlaTyr: 2.466 ± 0.509
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.079 ± 0.32
CysCys: 0.308 ± 0.166
CysAsp: 0.694 ± 0.265
CysGlu: 0.462 ± 0.208
CysPhe: 0.154 ± 0.144
CysGly: 0.617 ± 0.262
CysHis: 0.231 ± 0.124
CysIle: 0.308 ± 0.156
CysLys: 0.617 ± 0.231
CysLeu: 0.694 ± 0.217
CysMet: 0.154 ± 0.106
CysAsn: 0.308 ± 0.187
CysPro: 0.617 ± 0.258
CysGln: 0.308 ± 0.154
CysArg: 1.079 ± 0.31
CysSer: 0.54 ± 0.184
CysThr: 0.54 ± 0.186
CysVal: 0.617 ± 0.22
CysTrp: 0.462 ± 0.239
CysTyr: 0.077 ± 0.079
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.012 ± 0.731
AspCys: 0.925 ± 0.3
AspAsp: 3.931 ± 0.549
AspGlu: 3.546 ± 0.611
AspPhe: 1.464 ± 0.33
AspGly: 6.166 ± 0.607
AspHis: 1.31 ± 0.311
AspIle: 1.773 ± 0.337
AspLys: 1.85 ± 0.437
AspLeu: 5.395 ± 0.553
AspMet: 0.848 ± 0.266
AspAsn: 1.002 ± 0.297
AspPro: 3.083 ± 0.526
AspGln: 2.389 ± 0.399
AspArg: 3.623 ± 0.503
AspSer: 2.929 ± 0.545
AspThr: 2.466 ± 0.39
AspVal: 4.47 ± 0.682
AspTrp: 1.464 ± 0.275
AspTyr: 1.233 ± 0.339
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.399 ± 0.582
GluCys: 0.617 ± 0.228
GluAsp: 2.389 ± 0.359
GluGlu: 4.316 ± 0.569
GluPhe: 2.389 ± 0.311
GluGly: 3.931 ± 0.491
GluHis: 1.002 ± 0.315
GluIle: 3.623 ± 0.535
GluLys: 2.389 ± 0.462
GluLeu: 7.631 ± 0.721
GluMet: 1.85 ± 0.351
GluAsn: 1.542 ± 0.287
GluPro: 3.314 ± 0.539
GluGln: 3.314 ± 0.536
GluArg: 6.86 ± 0.673
GluSer: 3.546 ± 0.486
GluThr: 2.466 ± 0.413
GluVal: 4.393 ± 0.631
GluTrp: 1.079 ± 0.303
GluTyr: 1.464 ± 0.377
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.544 ± 0.453
PheCys: 0.385 ± 0.16
PheAsp: 1.619 ± 0.326
PheGlu: 2.312 ± 0.442
PhePhe: 0.925 ± 0.25
PheGly: 2.621 ± 0.589
PheHis: 0.771 ± 0.213
PheIle: 1.233 ± 0.25
PheLys: 1.31 ± 0.272
PheLeu: 1.31 ± 0.274
PheMet: 0.54 ± 0.196
PheAsn: 1.387 ± 0.36
PhePro: 1.079 ± 0.247
PheGln: 0.617 ± 0.183
PheArg: 1.927 ± 0.448
PheSer: 2.235 ± 0.482
PheThr: 1.542 ± 0.305
PheVal: 2.389 ± 0.385
PheTrp: 0.462 ± 0.207
PheTyr: 0.848 ± 0.255
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.71 ± 0.844
GlyCys: 0.462 ± 0.179
GlyAsp: 4.085 ± 0.555
GlyGlu: 5.318 ± 0.628
GlyPhe: 2.698 ± 0.496
GlyGly: 7.168 ± 1.038
GlyHis: 1.387 ± 0.299
GlyIle: 4.008 ± 0.504
GlyLys: 3.237 ± 0.523
GlyLeu: 7.245 ± 0.65
GlyMet: 2.621 ± 0.402
GlyAsn: 2.235 ± 0.388
GlyPro: 2.698 ± 0.393
GlyGln: 2.621 ± 0.359
GlyArg: 6.166 ± 0.663
GlySer: 5.472 ± 0.741
GlyThr: 3.546 ± 0.628
GlyVal: 5.935 ± 0.616
GlyTrp: 1.773 ± 0.3
GlyTyr: 1.85 ± 0.395
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.927 ± 0.341
HisCys: 0.231 ± 0.175
HisAsp: 0.848 ± 0.204
HisGlu: 1.387 ± 0.299
HisPhe: 0.694 ± 0.225
HisGly: 1.233 ± 0.384
HisHis: 0.385 ± 0.154
HisIle: 1.002 ± 0.257
HisLys: 1.233 ± 0.307
HisLeu: 1.387 ± 0.405
HisMet: 0.308 ± 0.147
HisAsn: 0.231 ± 0.15
HisPro: 1.696 ± 0.338
HisGln: 1.002 ± 0.24
HisArg: 1.387 ± 0.406
HisSer: 1.156 ± 0.366
HisThr: 0.771 ± 0.247
HisVal: 1.002 ± 0.28
HisTrp: 0.308 ± 0.149
HisTyr: 0.54 ± 0.189
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.164 ± 0.511
IleCys: 0.385 ± 0.18
IleAsp: 3.391 ± 0.576
IleGlu: 3.314 ± 0.5
IlePhe: 0.771 ± 0.282
IleGly: 4.548 ± 0.582
IleHis: 0.925 ± 0.254
IleIle: 1.31 ± 0.322
IleLys: 2.544 ± 0.388
IleLeu: 2.389 ± 0.468
IleMet: 0.771 ± 0.264
IleAsn: 2.081 ± 0.406
IlePro: 2.312 ± 0.458
IleGln: 1.464 ± 0.274
IleArg: 3.7 ± 0.497
IleSer: 2.852 ± 0.556
IleThr: 3.391 ± 0.555
IleVal: 3.006 ± 0.543
IleTrp: 0.308 ± 0.157
IleTyr: 1.464 ± 0.352
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.779 ± 0.625
LysCys: 0.54 ± 0.19
LysAsp: 1.542 ± 0.337
LysGlu: 2.312 ± 0.469
LysPhe: 0.617 ± 0.204
LysGly: 2.852 ± 0.447
LysHis: 1.002 ± 0.318
LysIle: 2.081 ± 0.418
LysLys: 1.619 ± 0.331
LysLeu: 4.162 ± 0.577
LysMet: 0.771 ± 0.229
LysAsn: 1.002 ± 0.321
LysPro: 1.85 ± 0.359
LysGln: 2.004 ± 0.451
LysArg: 2.621 ± 0.509
LysSer: 2.621 ± 0.579
LysThr: 2.235 ± 0.4
LysVal: 3.006 ± 0.596
LysTrp: 0.771 ± 0.243
LysTyr: 0.925 ± 0.234
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.945 ± 0.862
LeuCys: 0.308 ± 0.141
LeuAsp: 5.395 ± 0.459
LeuGlu: 6.629 ± 0.635
LeuPhe: 2.389 ± 0.398
LeuGly: 5.395 ± 0.645
LeuHis: 1.464 ± 0.329
LeuIle: 3.854 ± 0.501
LeuLys: 3.7 ± 0.507
LeuLeu: 6.937 ± 0.85
LeuMet: 2.389 ± 0.404
LeuAsn: 3.854 ± 0.624
LeuPro: 4.933 ± 0.626
LeuGln: 4.085 ± 0.623
LeuArg: 8.247 ± 0.845
LeuSer: 6.629 ± 0.66
LeuThr: 5.704 ± 0.633
LeuVal: 5.704 ± 0.544
LeuTrp: 0.54 ± 0.172
LeuTyr: 2.004 ± 0.424
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.391 ± 0.499
MetCys: 0.231 ± 0.13
MetAsp: 0.925 ± 0.256
MetGlu: 1.542 ± 0.325
MetPhe: 0.617 ± 0.199
MetGly: 1.156 ± 0.256
MetHis: 0.771 ± 0.253
MetIle: 1.156 ± 0.273
MetLys: 0.617 ± 0.244
MetLeu: 2.235 ± 0.365
MetMet: 0.694 ± 0.187
MetAsn: 1.079 ± 0.287
MetPro: 0.925 ± 0.221
MetGln: 0.925 ± 0.261
MetArg: 1.542 ± 0.353
MetSer: 1.464 ± 0.322
MetThr: 1.773 ± 0.351
MetVal: 1.387 ± 0.325
MetTrp: 0.385 ± 0.162
MetTyr: 0.54 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.546 ± 0.491
AsnCys: 0.308 ± 0.134
AsnAsp: 2.389 ± 0.403
AsnGlu: 1.619 ± 0.357
AsnPhe: 0.617 ± 0.224
AsnGly: 3.546 ± 0.643
AsnHis: 0.54 ± 0.177
AsnIle: 0.925 ± 0.221
AsnLys: 0.771 ± 0.228
AsnLeu: 2.698 ± 0.458
AsnMet: 0.462 ± 0.208
AsnAsn: 1.542 ± 0.34
AsnPro: 1.927 ± 0.421
AsnGln: 1.156 ± 0.307
AsnArg: 2.544 ± 0.42
AsnSer: 1.85 ± 0.378
AsnThr: 2.004 ± 0.397
AsnVal: 1.31 ± 0.252
AsnTrp: 0.694 ± 0.283
AsnTyr: 0.694 ± 0.198
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.474 ± 0.931
ProCys: 0.462 ± 0.206
ProAsp: 3.546 ± 0.503
ProGlu: 4.239 ± 0.457
ProPhe: 1.464 ± 0.319
ProGly: 5.164 ± 0.551
ProHis: 0.925 ± 0.293
ProIle: 2.544 ± 0.502
ProLys: 1.31 ± 0.348
ProLeu: 4.548 ± 0.654
ProMet: 1.079 ± 0.292
ProAsn: 1.387 ± 0.335
ProPro: 2.698 ± 0.482
ProGln: 1.619 ± 0.283
ProArg: 2.929 ± 0.508
ProSer: 3.314 ± 0.526
ProThr: 3.237 ± 0.528
ProVal: 2.775 ± 0.527
ProTrp: 1.156 ± 0.376
ProTyr: 1.156 ± 0.342
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.089 ± 0.945
GlnCys: 0.617 ± 0.255
GlnAsp: 1.31 ± 0.391
GlnGlu: 2.775 ± 0.522
GlnPhe: 1.079 ± 0.277
GlnGly: 2.698 ± 0.429
GlnHis: 1.156 ± 0.247
GlnIle: 2.312 ± 0.396
GlnLys: 1.387 ± 0.302
GlnLeu: 4.008 ± 0.459
GlnMet: 1.387 ± 0.315
GlnAsn: 0.925 ± 0.338
GlnPro: 2.698 ± 0.393
GlnGln: 3.006 ± 0.499
GlnArg: 3.083 ± 0.746
GlnSer: 2.158 ± 0.515
GlnThr: 1.85 ± 0.299
GlnVal: 2.775 ± 0.441
GlnTrp: 0.385 ± 0.187
GlnTyr: 0.771 ± 0.214
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.014 ± 0.687
ArgCys: 0.54 ± 0.204
ArgAsp: 4.008 ± 0.588
ArgGlu: 5.318 ± 0.713
ArgPhe: 2.235 ± 0.4
ArgGly: 4.933 ± 0.657
ArgHis: 1.387 ± 0.407
ArgIle: 4.393 ± 0.506
ArgLys: 3.623 ± 0.619
ArgLeu: 8.093 ± 0.761
ArgMet: 1.233 ± 0.291
ArgAsn: 2.621 ± 0.392
ArgPro: 3.314 ± 0.49
ArgGln: 4.316 ± 0.617
ArgArg: 7.554 ± 1.186
ArgSer: 4.162 ± 0.584
ArgThr: 3.777 ± 0.572
ArgVal: 4.933 ± 0.755
ArgTrp: 1.464 ± 0.474
ArgTyr: 2.004 ± 0.34
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.091 ± 0.686
SerCys: 0.462 ± 0.169
SerAsp: 4.316 ± 0.544
SerGlu: 2.544 ± 0.458
SerPhe: 2.081 ± 0.424
SerGly: 6.629 ± 0.994
SerHis: 1.233 ± 0.332
SerIle: 3.314 ± 0.445
SerLys: 1.773 ± 0.356
SerLeu: 5.241 ± 0.759
SerMet: 1.233 ± 0.289
SerAsn: 1.927 ± 0.435
SerPro: 3.468 ± 0.576
SerGln: 2.466 ± 0.433
SerArg: 3.854 ± 0.608
SerSer: 4.085 ± 0.642
SerThr: 3.468 ± 0.473
SerVal: 5.241 ± 0.712
SerTrp: 0.694 ± 0.288
SerTyr: 2.235 ± 0.418
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.012 ± 0.638
ThrCys: 0.54 ± 0.164
ThrAsp: 2.389 ± 0.43
ThrGlu: 2.544 ± 0.515
ThrPhe: 1.773 ± 0.35
ThrGly: 4.548 ± 0.592
ThrHis: 1.156 ± 0.269
ThrIle: 2.389 ± 0.493
ThrLys: 2.081 ± 0.392
ThrLeu: 4.856 ± 0.663
ThrMet: 1.079 ± 0.301
ThrAsn: 1.233 ± 0.355
ThrPro: 3.083 ± 0.505
ThrGln: 2.081 ± 0.345
ThrArg: 3.16 ± 0.498
ThrSer: 3.083 ± 0.541
ThrThr: 2.312 ± 0.426
ThrVal: 4.393 ± 0.647
ThrTrp: 1.464 ± 0.35
ThrTyr: 1.696 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.939 ± 0.764
ValCys: 0.848 ± 0.263
ValAsp: 3.623 ± 0.559
ValGlu: 5.164 ± 0.604
ValPhe: 2.004 ± 0.38
ValGly: 3.931 ± 0.581
ValHis: 1.156 ± 0.288
ValIle: 2.621 ± 0.458
ValLys: 2.929 ± 0.469
ValLeu: 5.55 ± 0.552
ValMet: 1.696 ± 0.372
ValAsn: 1.696 ± 0.293
ValPro: 4.702 ± 0.649
ValGln: 2.929 ± 0.455
ValArg: 3.931 ± 0.486
ValSer: 5.241 ± 0.729
ValThr: 3.083 ± 0.486
ValVal: 4.393 ± 0.714
ValTrp: 0.925 ± 0.273
ValTyr: 2.235 ± 0.419
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.696 ± 0.41
TrpCys: 0.385 ± 0.16
TrpAsp: 0.694 ± 0.251
TrpGlu: 0.925 ± 0.329
TrpPhe: 0.617 ± 0.197
TrpGly: 0.848 ± 0.255
TrpHis: 0.077 ± 0.069
TrpIle: 1.542 ± 0.431
TrpLys: 0.385 ± 0.158
TrpLeu: 2.081 ± 0.415
TrpMet: 0.617 ± 0.212
TrpAsn: 0.694 ± 0.237
TrpPro: 0.771 ± 0.262
TrpGln: 0.617 ± 0.221
TrpArg: 1.387 ± 0.33
TrpSer: 1.31 ± 0.247
TrpThr: 0.848 ± 0.206
TrpVal: 1.387 ± 0.278
TrpTrp: 0.231 ± 0.137
TrpTyr: 0.462 ± 0.186
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.7 ± 0.564
TyrCys: 0.385 ± 0.149
TyrAsp: 1.542 ± 0.283
TyrGlu: 1.156 ± 0.312
TyrPhe: 0.771 ± 0.268
TyrGly: 1.773 ± 0.332
TyrHis: 0.385 ± 0.176
TyrIle: 0.694 ± 0.197
TyrLys: 0.848 ± 0.242
TyrLeu: 2.158 ± 0.421
TyrMet: 0.308 ± 0.144
TyrAsn: 1.002 ± 0.297
TyrPro: 1.079 ± 0.28
TyrGln: 1.002 ± 0.27
TyrArg: 3.16 ± 0.481
TyrSer: 2.081 ± 0.406
TyrThr: 1.079 ± 0.293
TyrVal: 1.079 ± 0.343
TyrTrp: 0.462 ± 0.154
TyrTyr: 0.462 ± 0.198
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12975 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
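A protein-level bootstrap of this kind could be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the reported error is taken to be the standard deviation of those estimates. The page does not specify which statistic is reported, and the function names, seed, and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences in one-letter code, for illustration only.
proteins = ["MAALKAA", "MKKLAAG", "MGGSAAL", "MSTVAAR"]
point = pair_frequency(proteins, "AA")
error = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.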

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski