Amino acid dipepetide frequency for Pseudomonas phage Nerthus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.011AlaAla: 10.011 ± 0.92
0.72AlaCys: 0.72 ± 0.328
5.113AlaAsp: 5.113 ± 0.648
6.554AlaGlu: 6.554 ± 0.741
3.169AlaPhe: 3.169 ± 0.425
7.85AlaGly: 7.85 ± 1.139
2.089AlaHis: 2.089 ± 0.364
4.753AlaIle: 4.753 ± 0.414
5.113AlaLys: 5.113 ± 0.639
6.698AlaLeu: 6.698 ± 0.615
3.385AlaMet: 3.385 ± 0.689
4.033AlaAsn: 4.033 ± 0.379
4.033AlaPro: 4.033 ± 0.831
3.961AlaGln: 3.961 ± 0.73
5.834AlaArg: 5.834 ± 0.686
4.825AlaSer: 4.825 ± 0.821
5.546AlaThr: 5.546 ± 0.649
6.266AlaVal: 6.266 ± 0.725
0.936AlaTrp: 0.936 ± 0.272
2.953AlaTyr: 2.953 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
0.576CysAla: 0.576 ± 0.231
0.072CysCys: 0.072 ± 0.074
1.008CysAsp: 1.008 ± 0.278
0.216CysGlu: 0.216 ± 0.153
0.288CysPhe: 0.288 ± 0.177
0.648CysGly: 0.648 ± 0.26
0.144CysHis: 0.144 ± 0.095
0.648CysIle: 0.648 ± 0.207
0.648CysLys: 0.648 ± 0.239
1.008CysLeu: 1.008 ± 0.307
0.216CysMet: 0.216 ± 0.131
0.576CysAsn: 0.576 ± 0.2
0.432CysPro: 0.432 ± 0.183
0.432CysGln: 0.432 ± 0.16
0.72CysArg: 0.72 ± 0.286
0.648CysSer: 0.648 ± 0.201
0.288CysThr: 0.288 ± 0.124
0.72CysVal: 0.72 ± 0.206
0.0CysTrp: 0.0 ± 0.0
0.288CysTyr: 0.288 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
5.69AspAla: 5.69 ± 0.741
0.864AspCys: 0.864 ± 0.29
4.537AspAsp: 4.537 ± 0.619
4.609AspGlu: 4.609 ± 0.655
2.305AspPhe: 2.305 ± 0.493
5.978AspGly: 5.978 ± 0.877
1.152AspHis: 1.152 ± 0.289
3.457AspIle: 3.457 ± 0.445
4.537AspLys: 4.537 ± 0.511
4.825AspLeu: 4.825 ± 0.644
1.296AspMet: 1.296 ± 0.36
2.305AspAsn: 2.305 ± 0.417
3.097AspPro: 3.097 ± 0.512
2.737AspGln: 2.737 ± 0.364
3.601AspArg: 3.601 ± 0.707
3.457AspSer: 3.457 ± 0.425
3.169AspThr: 3.169 ± 0.457
3.889AspVal: 3.889 ± 0.53
1.44AspTrp: 1.44 ± 0.334
2.377AspTyr: 2.377 ± 0.34
0.0AspXaa: 0.0 ± 0.0
Glu
7.922GluAla: 7.922 ± 0.813
0.72GluCys: 0.72 ± 0.222
4.321GluAsp: 4.321 ± 0.557
3.745GluGlu: 3.745 ± 0.576
2.809GluPhe: 2.809 ± 0.42
4.393GluGly: 4.393 ± 0.586
1.584GluHis: 1.584 ± 0.31
1.873GluIle: 1.873 ± 0.309
3.313GluLys: 3.313 ± 0.465
6.338GluLeu: 6.338 ± 0.623
2.377GluMet: 2.377 ± 0.431
2.017GluAsn: 2.017 ± 0.305
1.801GluPro: 1.801 ± 0.29
2.665GluGln: 2.665 ± 0.387
4.033GluArg: 4.033 ± 0.731
2.809GluSer: 2.809 ± 0.547
2.881GluThr: 2.881 ± 0.44
4.753GluVal: 4.753 ± 0.685
1.152GluTrp: 1.152 ± 0.311
2.665GluTyr: 2.665 ± 0.518
0.0GluXaa: 0.0 ± 0.0
Phe
3.097PheAla: 3.097 ± 0.601
0.36PheCys: 0.36 ± 0.2
3.097PheAsp: 3.097 ± 0.376
1.873PheGlu: 1.873 ± 0.355
1.584PhePhe: 1.584 ± 0.315
2.377PheGly: 2.377 ± 0.403
0.72PheHis: 0.72 ± 0.256
2.161PheIle: 2.161 ± 0.358
1.873PheLys: 1.873 ± 0.362
2.809PheLeu: 2.809 ± 0.454
1.08PheMet: 1.08 ± 0.292
1.728PheAsn: 1.728 ± 0.293
1.656PhePro: 1.656 ± 0.397
1.368PheGln: 1.368 ± 0.365
2.233PheArg: 2.233 ± 0.377
1.801PheSer: 1.801 ± 0.341
2.449PheThr: 2.449 ± 0.506
3.025PheVal: 3.025 ± 0.594
0.504PheTrp: 0.504 ± 0.214
1.296PheTyr: 1.296 ± 0.334
0.0PheXaa: 0.0 ± 0.0
Gly
6.194GlyAla: 6.194 ± 0.708
0.648GlyCys: 0.648 ± 0.228
5.402GlyAsp: 5.402 ± 0.62
4.897GlyGlu: 4.897 ± 0.717
3.385GlyPhe: 3.385 ± 0.626
6.266GlyGly: 6.266 ± 0.78
1.945GlyHis: 1.945 ± 0.485
3.817GlyIle: 3.817 ± 0.418
5.69GlyLys: 5.69 ± 0.612
5.257GlyLeu: 5.257 ± 0.604
2.377GlyMet: 2.377 ± 0.402
3.529GlyAsn: 3.529 ± 0.539
1.512GlyPro: 1.512 ± 0.254
2.953GlyGln: 2.953 ± 0.368
4.681GlyArg: 4.681 ± 0.517
4.825GlySer: 4.825 ± 0.654
4.393GlyThr: 4.393 ± 0.634
5.257GlyVal: 5.257 ± 0.522
1.584GlyTrp: 1.584 ± 0.326
2.881GlyTyr: 2.881 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
1.945HisAla: 1.945 ± 0.292
0.36HisCys: 0.36 ± 0.159
1.728HisAsp: 1.728 ± 0.358
1.08HisGlu: 1.08 ± 0.261
0.648HisPhe: 0.648 ± 0.202
2.089HisGly: 2.089 ± 0.47
0.216HisHis: 0.216 ± 0.131
0.792HisIle: 0.792 ± 0.284
1.801HisLys: 1.801 ± 0.396
2.521HisLeu: 2.521 ± 0.419
0.72HisMet: 0.72 ± 0.244
1.008HisAsn: 1.008 ± 0.27
1.08HisPro: 1.08 ± 0.355
1.08HisGln: 1.08 ± 0.232
1.008HisArg: 1.008 ± 0.233
0.792HisSer: 0.792 ± 0.204
0.936HisThr: 0.936 ± 0.285
1.008HisVal: 1.008 ± 0.239
0.648HisTrp: 0.648 ± 0.234
1.296HisTyr: 1.296 ± 0.266
0.0HisXaa: 0.0 ± 0.0
Ile
4.825IleAla: 4.825 ± 0.419
0.288IleCys: 0.288 ± 0.157
3.385IleAsp: 3.385 ± 0.405
3.313IleGlu: 3.313 ± 0.577
1.584IlePhe: 1.584 ± 0.367
3.529IleGly: 3.529 ± 0.497
1.44IleHis: 1.44 ± 0.352
2.017IleIle: 2.017 ± 0.667
3.673IleLys: 3.673 ± 0.499
2.233IleLeu: 2.233 ± 0.418
0.792IleMet: 0.792 ± 0.241
1.945IleAsn: 1.945 ± 0.409
2.521IlePro: 2.521 ± 0.441
1.945IleGln: 1.945 ± 0.461
2.953IleArg: 2.953 ± 0.351
2.665IleSer: 2.665 ± 0.439
2.881IleThr: 2.881 ± 0.485
3.169IleVal: 3.169 ± 0.44
0.792IleTrp: 0.792 ± 0.259
0.936IleTyr: 0.936 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
5.257LysAla: 5.257 ± 0.773
0.648LysCys: 0.648 ± 0.308
4.321LysAsp: 4.321 ± 0.659
4.753LysGlu: 4.753 ± 0.642
1.873LysPhe: 1.873 ± 0.452
4.609LysGly: 4.609 ± 0.805
1.368LysHis: 1.368 ± 0.285
1.728LysIle: 1.728 ± 0.372
2.593LysLys: 2.593 ± 0.335
6.122LysLeu: 6.122 ± 0.661
1.224LysMet: 1.224 ± 0.321
1.873LysAsn: 1.873 ± 0.374
2.593LysPro: 2.593 ± 0.362
2.737LysGln: 2.737 ± 0.484
3.241LysArg: 3.241 ± 0.496
2.593LysSer: 2.593 ± 0.363
3.097LysThr: 3.097 ± 0.418
3.457LysVal: 3.457 ± 0.455
0.792LysTrp: 0.792 ± 0.283
2.377LysTyr: 2.377 ± 0.426
0.0LysXaa: 0.0 ± 0.0
Leu
6.914LeuAla: 6.914 ± 0.665
1.152LeuCys: 1.152 ± 0.282
6.194LeuAsp: 6.194 ± 0.669
5.257LeuGlu: 5.257 ± 0.634
2.305LeuPhe: 2.305 ± 0.383
5.474LeuGly: 5.474 ± 0.538
2.017LeuHis: 2.017 ± 0.45
3.961LeuIle: 3.961 ± 0.544
3.529LeuLys: 3.529 ± 0.57
6.482LeuLeu: 6.482 ± 0.936
2.737LeuMet: 2.737 ± 0.431
3.097LeuAsn: 3.097 ± 0.443
3.817LeuPro: 3.817 ± 0.545
3.961LeuGln: 3.961 ± 0.525
4.897LeuArg: 4.897 ± 0.618
4.465LeuSer: 4.465 ± 0.549
6.122LeuThr: 6.122 ± 0.564
4.465LeuVal: 4.465 ± 0.548
0.648LeuTrp: 0.648 ± 0.174
2.449LeuTyr: 2.449 ± 0.41
0.0LeuXaa: 0.0 ± 0.0
Met
2.665MetAla: 2.665 ± 0.484
0.072MetCys: 0.072 ± 0.087
1.728MetAsp: 1.728 ± 0.271
2.089MetGlu: 2.089 ± 0.442
1.008MetPhe: 1.008 ± 0.257
2.377MetGly: 2.377 ± 0.459
0.576MetHis: 0.576 ± 0.197
0.792MetIle: 0.792 ± 0.173
1.801MetLys: 1.801 ± 0.398
2.233MetLeu: 2.233 ± 0.351
1.368MetMet: 1.368 ± 0.34
1.512MetAsn: 1.512 ± 0.377
1.44MetPro: 1.44 ± 0.344
1.44MetGln: 1.44 ± 0.351
1.44MetArg: 1.44 ± 0.232
2.449MetSer: 2.449 ± 0.605
2.737MetThr: 2.737 ± 0.354
1.801MetVal: 1.801 ± 0.294
0.216MetTrp: 0.216 ± 0.165
0.576MetTyr: 0.576 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
3.241AsnAla: 3.241 ± 0.666
0.072AsnCys: 0.072 ± 0.066
2.521AsnAsp: 2.521 ± 0.409
2.161AsnGlu: 2.161 ± 0.283
1.801AsnPhe: 1.801 ± 0.377
3.601AsnGly: 3.601 ± 0.62
0.864AsnHis: 0.864 ± 0.237
2.089AsnIle: 2.089 ± 0.459
2.953AsnLys: 2.953 ± 0.357
4.033AsnLeu: 4.033 ± 0.5
0.576AsnMet: 0.576 ± 0.134
1.368AsnAsn: 1.368 ± 0.319
2.809AsnPro: 2.809 ± 0.431
1.656AsnGln: 1.656 ± 0.319
2.809AsnArg: 2.809 ± 0.518
2.233AsnSer: 2.233 ± 0.377
3.169AsnThr: 3.169 ± 0.385
2.665AsnVal: 2.665 ± 0.379
0.792AsnTrp: 0.792 ± 0.273
1.224AsnTyr: 1.224 ± 0.284
0.0AsnXaa: 0.0 ± 0.0
Pro
3.673ProAla: 3.673 ± 0.603
0.432ProCys: 0.432 ± 0.238
2.521ProAsp: 2.521 ± 0.524
3.025ProGlu: 3.025 ± 0.403
1.728ProPhe: 1.728 ± 0.382
1.224ProGly: 1.224 ± 0.292
0.792ProHis: 0.792 ± 0.212
2.017ProIle: 2.017 ± 0.45
2.449ProLys: 2.449 ± 0.33
3.169ProLeu: 3.169 ± 0.347
1.44ProMet: 1.44 ± 0.284
2.233ProAsn: 2.233 ± 0.297
1.368ProPro: 1.368 ± 0.228
2.305ProGln: 2.305 ± 0.279
1.512ProArg: 1.512 ± 0.353
1.728ProSer: 1.728 ± 0.431
3.169ProThr: 3.169 ± 0.386
3.673ProVal: 3.673 ± 0.455
0.864ProTrp: 0.864 ± 0.199
1.873ProTyr: 1.873 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
6.338GlnAla: 6.338 ± 0.887
0.648GlnCys: 0.648 ± 0.171
1.873GlnAsp: 1.873 ± 0.445
3.097GlnGlu: 3.097 ± 0.471
2.089GlnPhe: 2.089 ± 0.344
3.169GlnGly: 3.169 ± 0.449
0.792GlnHis: 0.792 ± 0.254
1.801GlnIle: 1.801 ± 0.275
1.801GlnLys: 1.801 ± 0.482
3.745GlnLeu: 3.745 ± 0.55
1.368GlnMet: 1.368 ± 0.379
1.224GlnAsn: 1.224 ± 0.332
1.008GlnPro: 1.008 ± 0.262
3.529GlnGln: 3.529 ± 0.674
2.665GlnArg: 2.665 ± 0.37
2.737GlnSer: 2.737 ± 0.644
2.809GlnThr: 2.809 ± 0.541
2.881GlnVal: 2.881 ± 0.473
0.432GlnTrp: 0.432 ± 0.158
1.296GlnTyr: 1.296 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
4.825ArgAla: 4.825 ± 0.745
0.288ArgCys: 0.288 ± 0.118
3.025ArgAsp: 3.025 ± 0.391
3.385ArgGlu: 3.385 ± 0.668
2.017ArgPhe: 2.017 ± 0.467
4.321ArgGly: 4.321 ± 0.512
1.584ArgHis: 1.584 ± 0.337
3.241ArgIle: 3.241 ± 0.488
3.385ArgLys: 3.385 ± 0.54
4.393ArgLeu: 4.393 ± 0.438
2.305ArgMet: 2.305 ± 0.352
3.097ArgAsn: 3.097 ± 0.491
1.945ArgPro: 1.945 ± 0.394
2.521ArgGln: 2.521 ± 0.529
2.737ArgArg: 2.737 ± 0.476
2.665ArgSer: 2.665 ± 0.445
3.601ArgThr: 3.601 ± 0.637
4.681ArgVal: 4.681 ± 0.523
0.792ArgTrp: 0.792 ± 0.213
1.801ArgTyr: 1.801 ± 0.353
0.0ArgXaa: 0.0 ± 0.0
Ser
5.185SerAla: 5.185 ± 0.619
0.432SerCys: 0.432 ± 0.131
3.673SerAsp: 3.673 ± 0.395
2.665SerGlu: 2.665 ± 0.532
2.089SerPhe: 2.089 ± 0.328
4.249SerGly: 4.249 ± 0.632
1.152SerHis: 1.152 ± 0.337
3.025SerIle: 3.025 ± 0.582
3.169SerLys: 3.169 ± 0.46
5.041SerLeu: 5.041 ± 0.621
1.873SerMet: 1.873 ± 0.447
2.017SerAsn: 2.017 ± 0.446
2.305SerPro: 2.305 ± 0.565
2.881SerGln: 2.881 ± 0.616
2.953SerArg: 2.953 ± 0.409
2.377SerSer: 2.377 ± 0.439
2.665SerThr: 2.665 ± 0.439
3.313SerVal: 3.313 ± 0.448
0.792SerTrp: 0.792 ± 0.251
1.512SerTyr: 1.512 ± 0.311
0.0SerXaa: 0.0 ± 0.0
Thr
4.609ThrAla: 4.609 ± 0.764
0.648ThrCys: 0.648 ± 0.277
3.673ThrAsp: 3.673 ± 0.594
4.393ThrGlu: 4.393 ± 0.474
2.881ThrPhe: 2.881 ± 0.468
6.05ThrGly: 6.05 ± 0.59
1.873ThrHis: 1.873 ± 0.446
3.889ThrIle: 3.889 ± 0.558
3.169ThrLys: 3.169 ± 0.555
4.465ThrLeu: 4.465 ± 0.577
1.728ThrMet: 1.728 ± 0.394
2.953ThrAsn: 2.953 ± 0.437
2.521ThrPro: 2.521 ± 0.366
2.449ThrGln: 2.449 ± 0.416
2.593ThrArg: 2.593 ± 0.431
3.241ThrSer: 3.241 ± 0.561
4.033ThrThr: 4.033 ± 0.675
3.529ThrVal: 3.529 ± 0.661
0.432ThrTrp: 0.432 ± 0.138
2.233ThrTyr: 2.233 ± 0.45
0.0ThrXaa: 0.0 ± 0.0
Val
6.626ValAla: 6.626 ± 0.868
0.72ValCys: 0.72 ± 0.261
4.105ValAsp: 4.105 ± 0.627
4.105ValGlu: 4.105 ± 0.444
2.017ValPhe: 2.017 ± 0.473
4.609ValGly: 4.609 ± 0.642
1.728ValHis: 1.728 ± 0.507
3.313ValIle: 3.313 ± 0.435
4.105ValLys: 4.105 ± 0.593
4.681ValLeu: 4.681 ± 0.736
1.728ValMet: 1.728 ± 0.283
3.385ValAsn: 3.385 ± 0.541
2.881ValPro: 2.881 ± 0.438
2.161ValGln: 2.161 ± 0.466
3.313ValArg: 3.313 ± 0.456
3.817ValSer: 3.817 ± 0.454
4.177ValThr: 4.177 ± 0.592
5.329ValVal: 5.329 ± 0.658
1.224ValTrp: 1.224 ± 0.365
2.305ValTyr: 2.305 ± 0.452
0.0ValXaa: 0.0 ± 0.0
Trp
1.584TrpAla: 1.584 ± 0.324
0.216TrpCys: 0.216 ± 0.174
0.576TrpAsp: 0.576 ± 0.189
1.224TrpGlu: 1.224 ± 0.241
0.504TrpPhe: 0.504 ± 0.219
1.368TrpGly: 1.368 ± 0.325
0.288TrpHis: 0.288 ± 0.117
0.504TrpIle: 0.504 ± 0.177
0.36TrpLys: 0.36 ± 0.178
1.296TrpLeu: 1.296 ± 0.28
0.36TrpMet: 0.36 ± 0.136
0.648TrpAsn: 0.648 ± 0.197
0.792TrpPro: 0.792 ± 0.274
0.72TrpGln: 0.72 ± 0.178
1.08TrpArg: 1.08 ± 0.223
1.08TrpSer: 1.08 ± 0.29
0.72TrpThr: 0.72 ± 0.243
0.792TrpVal: 0.792 ± 0.23
0.216TrpTrp: 0.216 ± 0.118
0.576TrpTyr: 0.576 ± 0.288
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.737TyrAla: 2.737 ± 0.52
0.216TyrCys: 0.216 ± 0.108
2.521TyrAsp: 2.521 ± 0.445
1.728TyrGlu: 1.728 ± 0.428
1.008TyrPhe: 1.008 ± 0.264
3.313TyrGly: 3.313 ± 0.427
0.432TyrHis: 0.432 ± 0.149
1.008TyrIle: 1.008 ± 0.287
1.296TyrLys: 1.296 ± 0.272
2.593TyrLeu: 2.593 ± 0.34
1.296TyrMet: 1.296 ± 0.288
2.089TyrAsn: 2.089 ± 0.353
1.584TyrPro: 1.584 ± 0.457
1.801TyrGln: 1.801 ± 0.343
2.305TyrArg: 2.305 ± 0.467
2.161TyrSer: 2.161 ± 0.39
2.521TyrThr: 2.521 ± 0.603
1.656TyrVal: 1.656 ± 0.362
0.648TyrTrp: 0.648 ± 0.228
0.72TyrTyr: 0.72 ± 0.25
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (13886 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski