Amino acid dipeptide frequency for Haemophilus phage SuMu

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
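The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the toy sequences, and the choice to normalise by the number of counted adjacent pairs are not taken from this page, which does not state its denominator.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for the 400 standard dipeptides.

    Assumption: pairs are counted within each protein only, and the
    denominator is the total number of counted pairs.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(STANDARD, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"    # shown in red on the page
    if freq_permille < EXPECTED_PERMILLE:
        return "under"   # shown in blue on the page
    return "expected"


# Hypothetical toy input, not real phage proteins.
toy = ["MKLLA", "ALLK"]
freqs = dipeptide_permille(toy)
print("LL", round(freqs["LL"], 2), classify(freqs["LL"]))
print("WW", round(freqs["WW"], 2), classify(freqs["WW"]))
```

With real data the input would be the protein sequences of the proteome; the toy list here only keeps the example self-contained.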

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
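As a small worked example of the unit conversion, the snippet below converts one table value (AlaLeu, 9.381‰) into a fraction and a percentage and compares it with the 2.5‰ uniform expectation. The helper name is hypothetical and used for illustration only.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


ala_leu_permille = 9.381            # AlaLeu entry from the table
uniform_permille = 1000.0 / 400     # 2.5 per mille for 400 equally likely dipeptides

fraction = permille_to_fraction(ala_leu_permille)   # about 0.009381
percent = fraction * 100                            # about 0.9381 %
ratio = ala_leu_permille / uniform_permille         # about 3.75 times the uniform expectation

print(f"fraction={fraction:.6f} percent={percent:.4f} ratio={ratio:.2f}")
```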

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.033 ± 1.047
AlaCys: 0.964 ± 0.302
AlaAsp: 5.786 ± 0.609
AlaGlu: 7.891 ± 0.873
AlaPhe: 3.858 ± 0.382
AlaGly: 6.839 ± 0.997
AlaHis: 1.929 ± 0.373
AlaIle: 5.26 ± 0.712
AlaLys: 6.663 ± 0.763
AlaLeu: 9.381 ± 1.178
AlaMet: 2.455 ± 0.51
AlaAsn: 4.121 ± 0.605
AlaPro: 2.192 ± 0.447
AlaGln: 4.033 ± 0.544
AlaArg: 4.033 ± 0.652
AlaSer: 5.348 ± 0.835
AlaThr: 5.26 ± 0.674
AlaVal: 6.839 ± 0.691
AlaTrp: 0.964 ± 0.211
AlaTyr: 2.016 ± 0.383
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.438 ± 0.179
CysCys: 0.175 ± 0.136
CysAsp: 0.438 ± 0.201
CysGlu: 1.227 ± 0.332
CysPhe: 0.789 ± 0.293
CysGly: 0.526 ± 0.242
CysHis: 0.351 ± 0.201
CysIle: 0.351 ± 0.181
CysLys: 0.701 ± 0.273
CysLeu: 0.964 ± 0.335
CysMet: 0.175 ± 0.12
CysAsn: 0.438 ± 0.192
CysPro: 0.438 ± 0.195
CysGln: 0.175 ± 0.111
CysArg: 0.526 ± 0.219
CysSer: 0.964 ± 0.253
CysThr: 0.701 ± 0.3
CysVal: 0.701 ± 0.249
CysTrp: 0.263 ± 0.132
CysTyr: 0.526 ± 0.214
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.208 ± 0.549
AspCys: 0.614 ± 0.207
AspAsp: 3.156 ± 0.634
AspGlu: 3.858 ± 0.71
AspPhe: 2.192 ± 0.404
AspGly: 5.611 ± 0.689
AspHis: 0.789 ± 0.253
AspIle: 3.156 ± 0.497
AspLys: 4.559 ± 0.61
AspLeu: 5.26 ± 0.652
AspMet: 0.964 ± 0.338
AspAsn: 2.543 ± 0.438
AspPro: 2.28 ± 0.521
AspGln: 0.789 ± 0.249
AspArg: 2.192 ± 0.367
AspSer: 3.156 ± 0.671
AspThr: 2.016 ± 0.441
AspVal: 4.296 ± 0.623
AspTrp: 1.14 ± 0.289
AspTyr: 2.192 ± 0.413
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.839 ± 0.638
GluCys: 0.351 ± 0.171
GluAsp: 2.543 ± 0.462
GluGlu: 4.559 ± 0.548
GluPhe: 2.806 ± 0.473
GluGly: 2.893 ± 0.525
GluHis: 1.49 ± 0.31
GluIle: 5.786 ± 0.611
GluLys: 5.786 ± 0.945
GluLeu: 8.417 ± 0.895
GluMet: 2.63 ± 0.44
GluAsn: 3.244 ± 0.462
GluPro: 1.666 ± 0.408
GluGln: 4.647 ± 0.868
GluArg: 4.559 ± 0.608
GluSer: 2.806 ± 0.547
GluThr: 2.192 ± 0.407
GluVal: 4.647 ± 0.798
GluTrp: 1.49 ± 0.366
GluTyr: 2.192 ± 0.4
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.507 ± 0.519
PheCys: 0.614 ± 0.235
PheAsp: 3.156 ± 0.496
PheGlu: 3.244 ± 0.474
PhePhe: 1.052 ± 0.263
PheGly: 2.718 ± 0.44
PheHis: 0.263 ± 0.145
PheIle: 2.192 ± 0.484
PheLys: 3.682 ± 0.58
PheLeu: 2.455 ± 0.483
PheMet: 0.789 ± 0.228
PheAsn: 2.367 ± 0.537
PhePro: 0.964 ± 0.292
PheGln: 0.964 ± 0.251
PheArg: 1.14 ± 0.292
PheSer: 2.367 ± 0.606
PheThr: 2.016 ± 0.417
PheVal: 2.455 ± 0.436
PheTrp: 0.438 ± 0.181
PheTyr: 1.578 ± 0.39
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.173 ± 0.815
GlyCys: 0.614 ± 0.215
GlyAsp: 3.419 ± 0.499
GlyGlu: 4.647 ± 0.504
GlyPhe: 2.63 ± 0.48
GlyGly: 4.471 ± 0.657
GlyHis: 0.964 ± 0.364
GlyIle: 4.208 ± 0.659
GlyLys: 6.049 ± 0.824
GlyLeu: 5.085 ± 0.561
GlyMet: 0.789 ± 0.205
GlyAsn: 3.77 ± 0.674
GlyPro: 0.175 ± 0.1
GlyGln: 3.156 ± 0.52
GlyArg: 5.523 ± 0.709
GlySer: 2.893 ± 0.495
GlyThr: 2.806 ± 0.51
GlyVal: 5.523 ± 0.77
GlyTrp: 1.841 ± 0.325
GlyTyr: 1.578 ± 0.397
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.315 ± 0.321
HisCys: 0.438 ± 0.168
HisAsp: 0.877 ± 0.253
HisGlu: 0.175 ± 0.126
HisPhe: 0.701 ± 0.289
HisGly: 1.315 ± 0.3
HisHis: 0.263 ± 0.133
HisIle: 0.789 ± 0.318
HisLys: 1.49 ± 0.305
HisLeu: 2.192 ± 0.576
HisMet: 0.088 ± 0.084
HisAsn: 1.14 ± 0.296
HisPro: 0.789 ± 0.237
HisGln: 1.052 ± 0.304
HisArg: 1.315 ± 0.374
HisSer: 1.052 ± 0.255
HisThr: 0.964 ± 0.237
HisVal: 0.351 ± 0.146
HisTrp: 0.175 ± 0.107
HisTyr: 0.789 ± 0.24
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.734 ± 0.657
IleCys: 1.052 ± 0.303
IleAsp: 3.858 ± 0.604
IleGlu: 6.488 ± 0.584
IlePhe: 1.666 ± 0.343
IleGly: 3.945 ± 0.538
IleHis: 1.403 ± 0.3
IleIle: 3.156 ± 0.521
IleLys: 4.647 ± 0.578
IleLeu: 3.595 ± 0.66
IleMet: 1.227 ± 0.268
IleAsn: 2.806 ± 0.572
IlePro: 1.49 ± 0.402
IleGln: 2.455 ± 0.368
IleArg: 3.595 ± 0.439
IleSer: 2.981 ± 0.51
IleThr: 3.069 ± 0.469
IleVal: 2.981 ± 0.533
IleTrp: 0.964 ± 0.274
IleTyr: 1.753 ± 0.433
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.348 ± 0.75
LysCys: 0.263 ± 0.155
LysAsp: 2.981 ± 0.549
LysGlu: 5.26 ± 0.922
LysPhe: 1.753 ± 0.423
LysGly: 4.384 ± 0.701
LysHis: 1.14 ± 0.326
LysIle: 3.77 ± 0.562
LysLys: 4.559 ± 0.618
LysLeu: 6.751 ± 0.87
LysMet: 2.192 ± 0.48
LysAsn: 3.507 ± 0.508
LysPro: 2.981 ± 0.404
LysGln: 5.26 ± 0.706
LysArg: 4.296 ± 0.684
LysSer: 5.26 ± 0.575
LysThr: 4.471 ± 0.542
LysVal: 5.26 ± 0.705
LysTrp: 1.227 ± 0.343
LysTyr: 1.929 ± 0.409
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.241 ± 1.054
LeuCys: 0.964 ± 0.301
LeuAsp: 4.208 ± 0.527
LeuGlu: 5.962 ± 0.574
LeuPhe: 3.682 ± 0.688
LeuGly: 6.137 ± 0.715
LeuHis: 1.929 ± 0.506
LeuIle: 5.085 ± 0.692
LeuLys: 6.488 ± 0.704
LeuLeu: 8.241 ± 0.778
LeuMet: 2.718 ± 0.594
LeuAsn: 5.085 ± 0.908
LeuPro: 3.77 ± 0.508
LeuGln: 4.208 ± 0.568
LeuArg: 4.121 ± 0.645
LeuSer: 7.803 ± 0.781
LeuThr: 5.699 ± 0.632
LeuVal: 5.436 ± 0.519
LeuTrp: 0.701 ± 0.282
LeuTyr: 2.016 ± 0.306
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.595 ± 0.628
MetCys: 0.263 ± 0.141
MetAsp: 1.052 ± 0.318
MetGlu: 0.964 ± 0.244
MetPhe: 0.263 ± 0.158
MetGly: 1.227 ± 0.301
MetHis: 0.088 ± 0.106
MetIle: 1.666 ± 0.378
MetLys: 1.49 ± 0.322
MetLeu: 2.455 ± 0.439
MetMet: 0.614 ± 0.252
MetAsn: 0.789 ± 0.276
MetPro: 0.964 ± 0.24
MetGln: 1.403 ± 0.33
MetArg: 1.929 ± 0.391
MetSer: 1.578 ± 0.356
MetThr: 1.052 ± 0.279
MetVal: 1.49 ± 0.329
MetTrp: 0.263 ± 0.145
MetTyr: 0.175 ± 0.134
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.699 ± 0.797
AsnCys: 0.263 ± 0.169
AsnAsp: 2.192 ± 0.433
AsnGlu: 2.806 ± 0.46
AsnPhe: 1.578 ± 0.395
AsnGly: 5.611 ± 0.628
AsnHis: 1.052 ± 0.321
AsnIle: 2.016 ± 0.356
AsnLys: 2.367 ± 0.475
AsnLeu: 4.471 ± 0.759
AsnMet: 0.877 ± 0.276
AsnAsn: 2.981 ± 0.416
AsnPro: 2.28 ± 0.472
AsnGln: 2.367 ± 0.392
AsnArg: 1.929 ± 0.382
AsnSer: 2.455 ± 0.51
AsnThr: 2.192 ± 0.447
AsnVal: 3.332 ± 0.632
AsnTrp: 0.964 ± 0.257
AsnTyr: 0.789 ± 0.275
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.244 ± 0.563
ProCys: 0.263 ± 0.145
ProAsp: 2.367 ± 0.487
ProGlu: 2.104 ± 0.436
ProPhe: 1.14 ± 0.265
ProGly: 0.263 ± 0.148
ProHis: 0.614 ± 0.237
ProIle: 2.104 ± 0.396
ProLys: 2.455 ± 0.451
ProLeu: 3.244 ± 0.506
ProMet: 0.701 ± 0.251
ProAsn: 2.63 ± 0.446
ProPro: 1.403 ± 0.352
ProGln: 1.929 ± 0.347
ProArg: 1.052 ± 0.23
ProSer: 2.104 ± 0.407
ProThr: 1.578 ± 0.367
ProVal: 1.753 ± 0.461
ProTrp: 0.175 ± 0.14
ProTyr: 0.964 ± 0.31
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.137 ± 0.814
GlnCys: 0.351 ± 0.171
GlnAsp: 2.016 ± 0.446
GlnGlu: 2.893 ± 0.465
GlnPhe: 2.367 ± 0.317
GlnGly: 2.192 ± 0.445
GlnHis: 0.877 ± 0.228
GlnIle: 3.156 ± 0.474
GlnLys: 3.507 ± 0.509
GlnLeu: 4.647 ± 0.652
GlnMet: 1.227 ± 0.385
GlnAsn: 2.104 ± 0.424
GlnPro: 1.315 ± 0.314
GlnGln: 2.981 ± 0.552
GlnArg: 2.893 ± 0.445
GlnSer: 3.332 ± 0.393
GlnThr: 2.63 ± 0.462
GlnVal: 3.244 ± 0.383
GlnTrp: 0.701 ± 0.217
GlnTyr: 1.666 ± 0.313
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.858 ± 0.486
ArgCys: 1.403 ± 0.494
ArgAsp: 3.244 ± 0.428
ArgGlu: 3.858 ± 0.721
ArgPhe: 2.718 ± 0.528
ArgGly: 3.332 ± 0.466
ArgHis: 0.614 ± 0.281
ArgIle: 3.77 ± 0.463
ArgLys: 3.507 ± 0.568
ArgLeu: 6.049 ± 0.625
ArgMet: 0.789 ± 0.28
ArgAsn: 2.543 ± 0.491
ArgPro: 2.104 ± 0.382
ArgGln: 3.244 ± 0.551
ArgArg: 2.981 ± 0.748
ArgSer: 2.104 ± 0.459
ArgThr: 2.367 ± 0.451
ArgVal: 3.595 ± 0.465
ArgTrp: 0.351 ± 0.159
ArgTyr: 2.104 ± 0.563
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.575 ± 0.553
SerCys: 0.526 ± 0.223
SerAsp: 4.121 ± 0.652
SerGlu: 4.208 ± 0.617
SerPhe: 2.455 ± 0.498
SerGly: 3.858 ± 0.716
SerHis: 0.964 ± 0.295
SerIle: 2.718 ± 0.431
SerLys: 3.419 ± 0.488
SerLeu: 5.26 ± 0.718
SerMet: 1.578 ± 0.38
SerAsn: 2.016 ± 0.452
SerPro: 1.841 ± 0.375
SerGln: 2.893 ± 0.456
SerArg: 2.806 ± 0.434
SerSer: 4.033 ± 0.693
SerThr: 2.104 ± 0.472
SerVal: 3.77 ± 0.7
SerTrp: 0.877 ± 0.238
SerTyr: 2.806 ± 0.504
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.4 ± 0.724
ThrCys: 0.175 ± 0.107
ThrAsp: 2.63 ± 0.458
ThrGlu: 3.595 ± 0.527
ThrPhe: 1.929 ± 0.481
ThrGly: 3.419 ± 0.575
ThrHis: 0.789 ± 0.334
ThrIle: 2.455 ± 0.464
ThrLys: 3.244 ± 0.595
ThrLeu: 4.734 ± 0.656
ThrMet: 1.14 ± 0.351
ThrAsn: 2.016 ± 0.406
ThrPro: 2.192 ± 0.39
ThrGln: 2.543 ± 0.389
ThrArg: 2.016 ± 0.312
ThrSer: 2.367 ± 0.506
ThrThr: 2.806 ± 0.657
ThrVal: 3.507 ± 0.621
ThrTrp: 0.175 ± 0.136
ThrTyr: 1.753 ± 0.327
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.014 ± 0.679
ValCys: 0.614 ± 0.18
ValAsp: 4.471 ± 0.625
ValGlu: 4.997 ± 0.709
ValPhe: 2.016 ± 0.384
ValGly: 4.647 ± 0.685
ValHis: 0.877 ± 0.279
ValIle: 4.471 ± 0.747
ValLys: 5.085 ± 0.717
ValLeu: 4.734 ± 0.543
ValMet: 1.403 ± 0.335
ValAsn: 2.367 ± 0.471
ValPro: 1.49 ± 0.274
ValGln: 2.806 ± 0.602
ValArg: 4.559 ± 0.596
ValSer: 3.858 ± 0.585
ValThr: 4.121 ± 0.712
ValVal: 5.523 ± 0.835
ValTrp: 0.789 ± 0.227
ValTyr: 1.841 ± 0.363
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.052 ± 0.282
TrpCys: 0.526 ± 0.166
TrpAsp: 0.614 ± 0.224
TrpGlu: 1.666 ± 0.468
TrpPhe: 0.789 ± 0.232
TrpGly: 0.614 ± 0.208
TrpHis: 0.351 ± 0.154
TrpIle: 0.438 ± 0.182
TrpLys: 0.789 ± 0.223
TrpLeu: 1.49 ± 0.36
TrpMet: 0.175 ± 0.106
TrpAsn: 0.351 ± 0.149
TrpPro: 0.088 ± 0.068
TrpGln: 1.227 ± 0.345
TrpArg: 1.14 ± 0.324
TrpSer: 0.526 ± 0.219
TrpThr: 0.614 ± 0.259
TrpVal: 1.052 ± 0.313
TrpTrp: 0.438 ± 0.197
TrpTyr: 0.438 ± 0.164
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.455 ± 0.44
TyrCys: 0.438 ± 0.182
TyrAsp: 1.841 ± 0.369
TyrGlu: 1.14 ± 0.222
TyrPhe: 1.578 ± 0.365
TyrGly: 1.315 ± 0.341
TyrHis: 0.526 ± 0.193
TyrIle: 1.403 ± 0.375
TyrLys: 2.104 ± 0.372
TyrLeu: 3.069 ± 0.486
TyrMet: 0.701 ± 0.204
TyrAsn: 1.403 ± 0.309
TyrPro: 1.666 ± 0.319
TyrGln: 2.104 ± 0.423
TyrArg: 1.929 ± 0.36
TyrSer: 1.841 ± 0.387
TyrThr: 1.227 ± 0.321
TyrVal: 1.929 ± 0.354
TyrTrp: 0.438 ± 0.212
TyrTyr: 0.964 ± 0.335
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11407 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski