Amino acid dipeptide frequency for Sinorhizobium phage HMSP1-Susan

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
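As an illustration of the calculation described above, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of overlapping two-residue windows, the normalisation by the total number of dipeptides, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"          # X stands for Xaa (assumed convention)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: counts overlapping pairs inside each protein
    (pairs never span two proteins) and normalises by the total pair count.
    """
    counts = Counter()
    for seq in proteins:
        # Replace anything outside the 20 standard letters by X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


# Toy input (not the phage proteome): pairs are MA, AA, AK, AA, AG.
toy = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy)
uniform = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely
print(freqs["AA"], uniform)
```

In the toy input two of the five pairs are AlaAla, so its frequency is 400‰, far above the uniform expectation of 2.5‰.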

All values are presented in per mille (‰); to obtain fractions, multiply them by 10^-3.
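The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical, and the `representation` function assumes that "expected" means the uniform value of 2.5‰ (1/400); the page does not state the exact threshold used for the red and blue marking.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value * 0.1


def representation(value, expected=UNIFORM_PERMILLE):
    """Classify a per mille value against the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# AlaAla (13.285 per mille) and CysCys (0.06 per mille) from the table.
print(permille_to_fraction(13.285), permille_to_percent(13.285))
print(representation(13.285), representation(0.06))
```

Under this assumption AlaAla is classified as overrepresented and CysCys as underrepresented.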

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.285 ± 1.453
AlaCys: 1.022 ± 0.257
AlaAsp: 7.093 ± 0.979
AlaGlu: 6.312 ± 0.766
AlaPhe: 2.585 ± 0.366
AlaGly: 6.432 ± 0.699
AlaHis: 1.984 ± 0.309
AlaIle: 6.372 ± 0.6
AlaLys: 4.268 ± 0.431
AlaLeu: 7.214 ± 0.644
AlaMet: 2.705 ± 0.42
AlaAsn: 3.547 ± 0.414
AlaPro: 5.17 ± 0.82
AlaGln: 3.907 ± 0.479
AlaArg: 6.011 ± 0.641
AlaSer: 4.328 ± 0.61
AlaThr: 6.312 ± 0.59
AlaVal: 6.913 ± 0.628
AlaTrp: 1.383 ± 0.303
AlaTyr: 2.825 ± 0.326
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.962 ± 0.26
CysCys: 0.06 ± 0.058
CysAsp: 1.202 ± 0.258
CysGlu: 0.962 ± 0.269
CysPhe: 0.12 ± 0.079
CysGly: 1.022 ± 0.228
CysHis: 0.361 ± 0.122
CysIle: 0.721 ± 0.208
CysLys: 0.24 ± 0.112
CysLeu: 0.601 ± 0.2
CysMet: 0.18 ± 0.102
CysAsn: 0.12 ± 0.088
CysPro: 0.301 ± 0.132
CysGln: 0.18 ± 0.106
CysArg: 0.842 ± 0.225
CysSer: 0.481 ± 0.19
CysThr: 0.661 ± 0.208
CysVal: 0.781 ± 0.197
CysTrp: 0.12 ± 0.087
CysTyr: 0.421 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.492 ± 0.822
AspCys: 0.661 ± 0.195
AspAsp: 3.306 ± 0.409
AspGlu: 3.727 ± 0.417
AspPhe: 1.984 ± 0.4
AspGly: 6.072 ± 0.659
AspHis: 1.623 ± 0.326
AspIle: 3.727 ± 0.505
AspLys: 3.366 ± 0.417
AspLeu: 4.208 ± 0.449
AspMet: 1.323 ± 0.308
AspAsn: 1.503 ± 0.282
AspPro: 2.885 ± 0.467
AspGln: 1.803 ± 0.294
AspArg: 3.186 ± 0.424
AspSer: 2.525 ± 0.471
AspThr: 3.186 ± 0.413
AspVal: 3.727 ± 0.524
AspTrp: 1.443 ± 0.357
AspTyr: 2.224 ± 0.379
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.831 ± 0.639
GluCys: 0.962 ± 0.278
GluAsp: 3.006 ± 0.464
GluGlu: 3.066 ± 0.413
GluPhe: 3.126 ± 0.403
GluGly: 3.427 ± 0.465
GluHis: 1.202 ± 0.277
GluIle: 5.29 ± 0.605
GluLys: 3.366 ± 0.444
GluLeu: 4.749 ± 0.525
GluMet: 2.405 ± 0.351
GluAsn: 2.465 ± 0.401
GluPro: 1.864 ± 0.293
GluGln: 2.044 ± 0.408
GluArg: 4.509 ± 0.567
GluSer: 1.924 ± 0.376
GluThr: 3.968 ± 0.481
GluVal: 2.825 ± 0.377
GluTrp: 1.202 ± 0.248
GluTyr: 3.306 ± 0.361
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.246 ± 0.386
PheCys: 0.661 ± 0.228
PheAsp: 2.405 ± 0.38
PheGlu: 2.465 ± 0.335
PhePhe: 1.443 ± 0.334
PheGly: 2.765 ± 0.399
PheHis: 0.781 ± 0.252
PheIle: 1.984 ± 0.39
PheLys: 1.383 ± 0.238
PheLeu: 2.284 ± 0.342
PheMet: 0.962 ± 0.261
PheAsn: 2.044 ± 0.371
PhePro: 2.885 ± 0.471
PheGln: 1.623 ± 0.353
PheArg: 2.525 ± 0.355
PheSer: 1.563 ± 0.267
PheThr: 2.765 ± 0.423
PheVal: 2.344 ± 0.355
PheTrp: 0.481 ± 0.236
PheTyr: 1.864 ± 0.327
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.574 ± 0.597
GlyCys: 0.661 ± 0.25
GlyAsp: 4.388 ± 0.465
GlyGlu: 4.809 ± 0.562
GlyPhe: 2.284 ± 0.298
GlyGly: 5.47 ± 1.018
GlyHis: 0.962 ± 0.247
GlyIle: 4.268 ± 0.501
GlyLys: 5.29 ± 0.517
GlyLeu: 4.629 ± 0.542
GlyMet: 2.765 ± 0.502
GlyAsn: 3.126 ± 0.586
GlyPro: 2.885 ± 0.383
GlyGln: 1.984 ± 0.375
GlyArg: 4.028 ± 0.536
GlySer: 3.787 ± 0.579
GlyThr: 4.809 ± 0.684
GlyVal: 4.509 ± 0.437
GlyTrp: 1.262 ± 0.367
GlyTyr: 3.727 ± 0.399
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.864 ± 0.324
HisCys: 0.361 ± 0.137
HisAsp: 1.262 ± 0.262
HisGlu: 1.563 ± 0.439
HisPhe: 0.962 ± 0.241
HisGly: 1.864 ± 0.37
HisHis: 0.842 ± 0.243
HisIle: 0.902 ± 0.203
HisLys: 1.262 ± 0.259
HisLeu: 1.262 ± 0.337
HisMet: 0.421 ± 0.172
HisAsn: 0.781 ± 0.233
HisPro: 1.262 ± 0.27
HisGln: 0.661 ± 0.187
HisArg: 1.323 ± 0.275
HisSer: 1.082 ± 0.229
HisThr: 1.202 ± 0.267
HisVal: 1.924 ± 0.435
HisTrp: 0.481 ± 0.18
HisTyr: 0.421 ± 0.162
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.033 ± 0.783
IleCys: 0.661 ± 0.226
IleAsp: 3.126 ± 0.397
IleGlu: 4.148 ± 0.594
IlePhe: 2.104 ± 0.377
IleGly: 4.328 ± 0.566
IleHis: 1.022 ± 0.227
IleIle: 3.487 ± 0.415
IleLys: 2.705 ± 0.429
IleLeu: 3.607 ± 0.465
IleMet: 1.984 ± 0.301
IleAsn: 2.224 ± 0.324
IlePro: 3.126 ± 0.388
IleGln: 1.984 ± 0.299
IleArg: 4.749 ± 0.603
IleSer: 3.126 ± 0.482
IleThr: 4.629 ± 0.583
IleVal: 3.066 ± 0.417
IleTrp: 0.962 ± 0.262
IleTyr: 2.344 ± 0.36
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.869 ± 0.456
LysCys: 0.301 ± 0.152
LysAsp: 3.366 ± 0.514
LysGlu: 2.645 ± 0.375
LysPhe: 2.044 ± 0.289
LysGly: 2.825 ± 0.489
LysHis: 0.661 ± 0.174
LysIle: 3.727 ± 0.467
LysLys: 2.104 ± 0.339
LysLeu: 3.968 ± 0.474
LysMet: 1.803 ± 0.28
LysAsn: 1.623 ± 0.309
LysPro: 2.465 ± 0.376
LysGln: 1.503 ± 0.253
LysArg: 2.885 ± 0.365
LysSer: 2.645 ± 0.487
LysThr: 3.306 ± 0.533
LysVal: 2.525 ± 0.357
LysTrp: 1.022 ± 0.204
LysTyr: 2.705 ± 0.427
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.552 ± 0.607
LeuCys: 0.721 ± 0.203
LeuAsp: 3.667 ± 0.389
LeuGlu: 4.268 ± 0.485
LeuPhe: 1.743 ± 0.305
LeuGly: 4.448 ± 0.449
LeuHis: 2.104 ± 0.375
LeuIle: 4.869 ± 0.54
LeuLys: 3.727 ± 0.435
LeuLeu: 3.907 ± 0.674
LeuMet: 1.984 ± 0.319
LeuAsn: 3.547 ± 0.442
LeuPro: 4.749 ± 0.532
LeuGln: 3.186 ± 0.391
LeuArg: 4.088 ± 0.509
LeuSer: 4.268 ± 0.498
LeuThr: 5.891 ± 0.668
LeuVal: 3.607 ± 0.562
LeuTrp: 1.082 ± 0.278
LeuTyr: 1.743 ± 0.272
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.405 ± 0.364
MetCys: 0.18 ± 0.109
MetAsp: 1.864 ± 0.333
MetGlu: 1.743 ± 0.298
MetPhe: 1.503 ± 0.295
MetGly: 0.962 ± 0.255
MetHis: 0.481 ± 0.159
MetIle: 1.803 ± 0.339
MetLys: 1.803 ± 0.327
MetLeu: 1.623 ± 0.282
MetMet: 0.721 ± 0.227
MetAsn: 1.262 ± 0.267
MetPro: 1.683 ± 0.534
MetGln: 1.022 ± 0.218
MetArg: 3.487 ± 0.449
MetSer: 2.104 ± 0.325
MetThr: 2.585 ± 0.442
MetVal: 1.984 ± 0.33
MetTrp: 0.24 ± 0.115
MetTyr: 0.902 ± 0.246
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.148 ± 0.485
AsnCys: 0.361 ± 0.137
AsnAsp: 2.705 ± 0.374
AsnGlu: 2.044 ± 0.361
AsnPhe: 1.262 ± 0.304
AsnGly: 3.487 ± 0.486
AsnHis: 0.781 ± 0.217
AsnIle: 2.104 ± 0.383
AsnLys: 1.743 ± 0.321
AsnLeu: 2.645 ± 0.441
AsnMet: 1.443 ± 0.299
AsnAsn: 1.503 ± 0.325
AsnPro: 1.743 ± 0.33
AsnGln: 1.022 ± 0.253
AsnArg: 2.946 ± 0.367
AsnSer: 2.525 ± 0.465
AsnThr: 2.465 ± 0.474
AsnVal: 2.765 ± 0.365
AsnTrp: 0.661 ± 0.219
AsnTyr: 1.262 ± 0.301
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.17 ± 0.875
ProCys: 0.301 ± 0.133
ProAsp: 3.847 ± 0.531
ProGlu: 3.427 ± 0.487
ProPhe: 1.803 ± 0.312
ProGly: 5.11 ± 0.547
ProHis: 0.962 ± 0.226
ProIle: 2.224 ± 0.352
ProLys: 2.344 ± 0.404
ProLeu: 4.208 ± 0.41
ProMet: 1.202 ± 0.284
ProAsn: 1.864 ± 0.28
ProPro: 3.186 ± 0.432
ProGln: 2.104 ± 0.5
ProArg: 2.705 ± 0.355
ProSer: 1.924 ± 0.388
ProThr: 2.825 ± 0.375
ProVal: 4.088 ± 0.495
ProTrp: 0.902 ± 0.269
ProTyr: 1.443 ± 0.269
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.186 ± 0.512
GlnCys: 0.661 ± 0.189
GlnAsp: 1.202 ± 0.238
GlnGlu: 1.984 ± 0.293
GlnPhe: 1.803 ± 0.298
GlnGly: 2.284 ± 0.458
GlnHis: 0.661 ± 0.285
GlnIle: 2.164 ± 0.315
GlnLys: 1.262 ± 0.292
GlnLeu: 2.525 ± 0.464
GlnMet: 0.962 ± 0.191
GlnAsn: 1.443 ± 0.232
GlnPro: 1.743 ± 0.473
GlnGln: 1.864 ± 0.452
GlnArg: 3.427 ± 0.49
GlnSer: 1.623 ± 0.289
GlnThr: 2.044 ± 0.342
GlnVal: 2.284 ± 0.325
GlnTrp: 0.781 ± 0.234
GlnTyr: 1.323 ± 0.292
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.17 ± 0.739
ArgCys: 0.481 ± 0.151
ArgAsp: 4.569 ± 0.544
ArgGlu: 4.268 ± 0.5
ArgPhe: 2.885 ± 0.487
ArgGly: 4.088 ± 0.469
ArgHis: 1.683 ± 0.397
ArgIle: 3.427 ± 0.398
ArgLys: 3.787 ± 0.523
ArgLeu: 5.891 ± 0.619
ArgMet: 2.525 ± 0.39
ArgAsn: 3.006 ± 0.452
ArgPro: 2.765 ± 0.378
ArgGln: 2.104 ± 0.364
ArgArg: 4.509 ± 0.659
ArgSer: 3.427 ± 0.322
ArgThr: 2.885 ± 0.391
ArgVal: 5.29 ± 0.645
ArgTrp: 1.323 ± 0.329
ArgTyr: 2.164 ± 0.313
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.192 ± 0.809
SerCys: 0.421 ± 0.178
SerAsp: 2.645 ± 0.401
SerGlu: 2.224 ± 0.378
SerPhe: 2.645 ± 0.393
SerGly: 4.448 ± 0.548
SerHis: 1.082 ± 0.273
SerIle: 2.645 ± 0.375
SerLys: 2.525 ± 0.511
SerLeu: 3.907 ± 0.725
SerMet: 1.503 ± 0.307
SerAsn: 2.224 ± 0.353
SerPro: 2.164 ± 0.313
SerGln: 1.503 ± 0.292
SerArg: 2.465 ± 0.429
SerSer: 2.946 ± 0.546
SerThr: 3.246 ± 0.529
SerVal: 3.306 ± 0.422
SerTrp: 0.842 ± 0.219
SerTyr: 1.142 ± 0.286
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.552 ± 1.237
ThrCys: 0.541 ± 0.172
ThrAsp: 2.705 ± 0.429
ThrGlu: 3.667 ± 0.553
ThrPhe: 2.405 ± 0.399
ThrGly: 6.673 ± 0.581
ThrHis: 1.443 ± 0.323
ThrIle: 3.667 ± 0.458
ThrLys: 2.705 ± 0.443
ThrLeu: 4.809 ± 0.464
ThrMet: 1.803 ± 0.307
ThrAsn: 2.465 ± 0.363
ThrPro: 4.448 ± 0.565
ThrGln: 1.864 ± 0.313
ThrArg: 4.208 ± 0.492
ThrSer: 3.186 ± 0.61
ThrThr: 3.968 ± 0.645
ThrVal: 6.192 ± 0.736
ThrTrp: 1.202 ± 0.232
ThrTyr: 1.683 ± 0.313
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.35 ± 0.653
ValCys: 0.601 ± 0.165
ValAsp: 3.968 ± 0.455
ValGlu: 3.847 ± 0.42
ValPhe: 2.825 ± 0.369
ValGly: 3.427 ± 0.338
ValHis: 1.443 ± 0.328
ValIle: 4.148 ± 0.494
ValLys: 3.366 ± 0.493
ValLeu: 4.629 ± 0.565
ValMet: 2.104 ± 0.394
ValAsn: 2.164 ± 0.345
ValPro: 3.487 ± 0.415
ValGln: 2.044 ± 0.356
ValArg: 4.569 ± 0.471
ValSer: 3.847 ± 0.607
ValThr: 5.47 ± 0.584
ValVal: 3.306 ± 0.403
ValTrp: 1.022 ± 0.279
ValTyr: 2.705 ± 0.437
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.842 ± 0.236
TrpCys: 0.361 ± 0.138
TrpAsp: 0.661 ± 0.182
TrpGlu: 0.781 ± 0.195
TrpPhe: 1.323 ± 0.293
TrpGly: 1.142 ± 0.274
TrpHis: 0.721 ± 0.196
TrpIle: 1.082 ± 0.254
TrpLys: 0.661 ± 0.187
TrpLeu: 1.202 ± 0.279
TrpMet: 0.301 ± 0.134
TrpAsn: 1.142 ± 0.218
TrpPro: 0.481 ± 0.182
TrpGln: 1.142 ± 0.228
TrpArg: 1.623 ± 0.334
TrpSer: 1.082 ± 0.24
TrpThr: 1.142 ± 0.443
TrpVal: 0.721 ± 0.217
TrpTrp: 0.301 ± 0.131
TrpTyr: 0.601 ± 0.168
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.006 ± 0.397
TyrCys: 0.361 ± 0.145
TyrAsp: 1.984 ± 0.393
TyrGlu: 2.585 ± 0.437
TyrPhe: 1.623 ± 0.339
TyrGly: 2.946 ± 0.391
TyrHis: 0.902 ± 0.248
TyrIle: 1.803 ± 0.295
TyrLys: 1.082 ± 0.276
TyrLeu: 2.284 ± 0.329
TyrMet: 1.202 ± 0.227
TyrAsn: 1.563 ± 0.377
TyrPro: 2.344 ± 0.382
TyrGln: 1.683 ± 0.318
TyrArg: 2.164 ± 0.4
TyrSer: 1.683 ± 0.298
TyrThr: 2.825 ± 0.394
TyrVal: 2.164 ± 0.325
TyrTrp: 0.601 ± 0.181
TyrTyr: 1.743 ± 0.32
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (16636 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
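To make the note concrete, the following Python sketch shows one common way to bootstrap at the protein level: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, the use of the sample standard deviation, and the normalisation by the number of dipeptides in each replicate are assumptions for illustration; the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping two-residue windows in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) in per mille over n_boot replicates.

    Hypothetical helper: each replicate draws len(proteins) proteins
    with replacement, so the protein is the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not the phage proteome.
toy_proteins = ["MAAK", "AAG", "MKKL", "GAAV"]
mean_aa, err_aa = bootstrap_permille(toy_proteins, "AA")
print(f"AlaAla: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps residues from the same protein together, so the error reflects variation between proteins.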

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski