Amino acid dipepetide frequency for Salmonella phage Akira

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.514AlaAla: 8.514 ± 1.687
0.433AlaCys: 0.433 ± 0.169
4.618AlaAsp: 4.618 ± 0.564
6.278AlaGlu: 6.278 ± 0.967
3.031AlaPhe: 3.031 ± 0.404
6.35AlaGly: 6.35 ± 0.792
0.866AlaHis: 0.866 ± 0.24
6.133AlaIle: 6.133 ± 0.743
6.494AlaLys: 6.494 ± 0.741
6.999AlaLeu: 6.999 ± 0.818
2.814AlaMet: 2.814 ± 0.421
3.969AlaAsn: 3.969 ± 0.684
2.309AlaPro: 2.309 ± 0.35
3.391AlaGln: 3.391 ± 0.864
5.339AlaArg: 5.339 ± 0.633
5.195AlaSer: 5.195 ± 0.67
5.051AlaThr: 5.051 ± 0.813
6.061AlaVal: 6.061 ± 0.709
1.01AlaTrp: 1.01 ± 0.283
2.886AlaTyr: 2.886 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
0.722CysAla: 0.722 ± 0.227
0.433CysCys: 0.433 ± 0.166
0.938CysAsp: 0.938 ± 0.266
1.082CysGlu: 1.082 ± 0.297
0.505CysPhe: 0.505 ± 0.206
1.804CysGly: 1.804 ± 0.444
0.505CysHis: 0.505 ± 0.171
0.649CysIle: 0.649 ± 0.245
1.082CysLys: 1.082 ± 0.282
0.866CysLeu: 0.866 ± 0.236
0.072CysMet: 0.072 ± 0.064
0.722CysAsn: 0.722 ± 0.259
0.938CysPro: 0.938 ± 0.341
0.289CysGln: 0.289 ± 0.132
0.577CysArg: 0.577 ± 0.183
0.577CysSer: 0.577 ± 0.283
0.433CysThr: 0.433 ± 0.171
0.577CysVal: 0.577 ± 0.232
0.289CysTrp: 0.289 ± 0.126
0.649CysTyr: 0.649 ± 0.265
0.0CysXaa: 0.0 ± 0.0
Asp
5.845AspAla: 5.845 ± 0.603
0.866AspCys: 0.866 ± 0.299
4.979AspAsp: 4.979 ± 0.919
4.762AspGlu: 4.762 ± 0.547
2.958AspPhe: 2.958 ± 0.395
6.927AspGly: 6.927 ± 0.768
1.01AspHis: 1.01 ± 0.236
4.69AspIle: 4.69 ± 0.572
3.969AspLys: 3.969 ± 0.624
3.175AspLeu: 3.175 ± 0.422
2.381AspMet: 2.381 ± 0.454
2.165AspAsn: 2.165 ± 0.357
1.587AspPro: 1.587 ± 0.306
1.154AspGln: 1.154 ± 0.314
2.814AspArg: 2.814 ± 0.415
3.463AspSer: 3.463 ± 0.508
3.031AspThr: 3.031 ± 0.436
4.185AspVal: 4.185 ± 0.573
0.722AspTrp: 0.722 ± 0.202
3.319AspTyr: 3.319 ± 0.43
0.0AspXaa: 0.0 ± 0.0
Glu
5.989GluAla: 5.989 ± 0.751
1.082GluCys: 1.082 ± 0.318
3.391GluAsp: 3.391 ± 0.644
4.546GluGlu: 4.546 ± 0.565
2.525GluPhe: 2.525 ± 0.423
3.608GluGly: 3.608 ± 0.525
0.794GluHis: 0.794 ± 0.248
4.618GluIle: 4.618 ± 0.58
4.329GluLys: 4.329 ± 0.675
6.278GluLeu: 6.278 ± 0.711
2.886GluMet: 2.886 ± 0.483
2.67GluAsn: 2.67 ± 0.469
2.309GluPro: 2.309 ± 0.437
3.969GluGln: 3.969 ± 0.581
3.608GluArg: 3.608 ± 0.511
3.752GluSer: 3.752 ± 0.526
2.886GluThr: 2.886 ± 0.344
3.969GluVal: 3.969 ± 0.64
1.587GluTrp: 1.587 ± 0.266
2.67GluTyr: 2.67 ± 0.515
0.0GluXaa: 0.0 ± 0.0
Phe
2.093PheAla: 2.093 ± 0.427
0.938PheCys: 0.938 ± 0.269
3.247PheAsp: 3.247 ± 0.41
2.309PheGlu: 2.309 ± 0.428
1.01PhePhe: 1.01 ± 0.252
2.598PheGly: 2.598 ± 0.341
0.794PheHis: 0.794 ± 0.254
2.67PheIle: 2.67 ± 0.462
1.804PheLys: 1.804 ± 0.326
2.237PheLeu: 2.237 ± 0.317
1.443PheMet: 1.443 ± 0.297
2.309PheAsn: 2.309 ± 0.502
1.227PhePro: 1.227 ± 0.323
0.938PheGln: 0.938 ± 0.249
1.66PheArg: 1.66 ± 0.395
2.093PheSer: 2.093 ± 0.35
2.525PheThr: 2.525 ± 0.502
1.587PheVal: 1.587 ± 0.325
0.938PheTrp: 0.938 ± 0.27
1.371PheTyr: 1.371 ± 0.311
0.0PheXaa: 0.0 ± 0.0
Gly
6.855GlyAla: 6.855 ± 0.797
1.66GlyCys: 1.66 ± 0.457
4.329GlyAsp: 4.329 ± 0.5
5.628GlyGlu: 5.628 ± 0.66
2.598GlyPhe: 2.598 ± 0.493
5.123GlyGly: 5.123 ± 0.689
1.227GlyHis: 1.227 ± 0.262
4.834GlyIle: 4.834 ± 0.597
6.71GlyLys: 6.71 ± 0.686
4.041GlyLeu: 4.041 ± 0.44
2.165GlyMet: 2.165 ± 0.369
3.463GlyAsn: 3.463 ± 0.618
0.649GlyPro: 0.649 ± 0.213
2.525GlyGln: 2.525 ± 0.433
3.68GlyArg: 3.68 ± 0.49
5.051GlySer: 5.051 ± 0.913
3.175GlyThr: 3.175 ± 0.543
4.618GlyVal: 4.618 ± 0.834
1.227GlyTrp: 1.227 ± 0.331
3.896GlyTyr: 3.896 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
1.299HisAla: 1.299 ± 0.387
0.289HisCys: 0.289 ± 0.13
1.299HisAsp: 1.299 ± 0.25
1.01HisGlu: 1.01 ± 0.243
0.794HisPhe: 0.794 ± 0.206
2.02HisGly: 2.02 ± 0.554
0.361HisHis: 0.361 ± 0.161
0.866HisIle: 0.866 ± 0.272
1.299HisLys: 1.299 ± 0.326
1.299HisLeu: 1.299 ± 0.308
0.505HisMet: 0.505 ± 0.171
0.794HisAsn: 0.794 ± 0.253
0.866HisPro: 0.866 ± 0.258
0.577HisGln: 0.577 ± 0.165
0.938HisArg: 0.938 ± 0.229
1.227HisSer: 1.227 ± 0.27
0.722HisThr: 0.722 ± 0.218
1.732HisVal: 1.732 ± 0.423
0.216HisTrp: 0.216 ± 0.13
1.082HisTyr: 1.082 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
6.422IleAla: 6.422 ± 0.661
0.866IleCys: 0.866 ± 0.25
4.762IleAsp: 4.762 ± 0.559
4.546IleGlu: 4.546 ± 0.525
2.165IlePhe: 2.165 ± 0.357
4.834IleGly: 4.834 ± 0.579
1.804IleHis: 1.804 ± 0.38
4.834IleIle: 4.834 ± 0.571
3.969IleLys: 3.969 ± 0.504
4.329IleLeu: 4.329 ± 0.486
1.66IleMet: 1.66 ± 0.332
3.319IleAsn: 3.319 ± 0.434
1.876IlePro: 1.876 ± 0.313
1.587IleGln: 1.587 ± 0.325
3.68IleArg: 3.68 ± 0.511
4.329IleSer: 4.329 ± 0.633
4.113IleThr: 4.113 ± 0.492
4.401IleVal: 4.401 ± 0.397
0.649IleTrp: 0.649 ± 0.233
1.804IleTyr: 1.804 ± 0.424
0.0IleXaa: 0.0 ± 0.0
Lys
6.133LysAla: 6.133 ± 0.698
1.299LysCys: 1.299 ± 0.391
3.969LysAsp: 3.969 ± 0.586
3.608LysGlu: 3.608 ± 0.469
2.381LysPhe: 2.381 ± 0.484
3.896LysGly: 3.896 ± 0.524
1.587LysHis: 1.587 ± 0.36
4.474LysIle: 4.474 ± 0.567
4.546LysLys: 4.546 ± 0.549
5.412LysLeu: 5.412 ± 0.672
2.453LysMet: 2.453 ± 0.392
2.886LysAsn: 2.886 ± 0.456
2.742LysPro: 2.742 ± 0.508
2.67LysGln: 2.67 ± 0.431
3.175LysArg: 3.175 ± 0.458
3.752LysSer: 3.752 ± 0.435
3.463LysThr: 3.463 ± 0.401
4.041LysVal: 4.041 ± 0.51
0.794LysTrp: 0.794 ± 0.232
2.02LysTyr: 2.02 ± 0.368
0.0LysXaa: 0.0 ± 0.0
Leu
5.845LeuAla: 5.845 ± 0.771
0.866LeuCys: 0.866 ± 0.252
3.68LeuAsp: 3.68 ± 0.513
3.824LeuGlu: 3.824 ± 0.56
2.02LeuPhe: 2.02 ± 0.445
4.257LeuGly: 4.257 ± 0.533
1.227LeuHis: 1.227 ± 0.252
4.907LeuIle: 4.907 ± 0.686
5.051LeuLys: 5.051 ± 0.666
4.618LeuLeu: 4.618 ± 0.624
1.371LeuMet: 1.371 ± 0.285
3.463LeuAsn: 3.463 ± 0.569
3.391LeuPro: 3.391 ± 0.47
2.598LeuGln: 2.598 ± 0.483
4.907LeuArg: 4.907 ± 0.624
4.834LeuSer: 4.834 ± 0.481
4.907LeuThr: 4.907 ± 0.634
5.195LeuVal: 5.195 ± 0.615
1.443LeuTrp: 1.443 ± 0.24
2.309LeuTyr: 2.309 ± 0.465
0.0LeuXaa: 0.0 ± 0.0
Met
2.814MetAla: 2.814 ± 0.483
0.289MetCys: 0.289 ± 0.14
1.515MetAsp: 1.515 ± 0.294
1.515MetGlu: 1.515 ± 0.295
1.082MetPhe: 1.082 ± 0.353
1.154MetGly: 1.154 ± 0.276
0.577MetHis: 0.577 ± 0.207
2.381MetIle: 2.381 ± 0.401
2.453MetLys: 2.453 ± 0.423
2.381MetLeu: 2.381 ± 0.347
0.794MetMet: 0.794 ± 0.258
1.732MetAsn: 1.732 ± 0.296
1.66MetPro: 1.66 ± 0.385
1.227MetGln: 1.227 ± 0.303
1.515MetArg: 1.515 ± 0.284
2.67MetSer: 2.67 ± 0.438
1.876MetThr: 1.876 ± 0.365
1.587MetVal: 1.587 ± 0.349
0.649MetTrp: 0.649 ± 0.175
1.227MetTyr: 1.227 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
5.267AsnAla: 5.267 ± 0.85
0.361AsnCys: 0.361 ± 0.146
3.031AsnAsp: 3.031 ± 0.556
2.814AsnGlu: 2.814 ± 0.406
1.01AsnPhe: 1.01 ± 0.342
5.556AsnGly: 5.556 ± 0.674
1.227AsnHis: 1.227 ± 0.305
2.598AsnIle: 2.598 ± 0.415
2.958AsnLys: 2.958 ± 0.448
3.031AsnLeu: 3.031 ± 0.352
1.082AsnMet: 1.082 ± 0.306
1.876AsnAsn: 1.876 ± 0.458
1.587AsnPro: 1.587 ± 0.378
1.587AsnGln: 1.587 ± 0.278
2.165AsnArg: 2.165 ± 0.379
2.525AsnSer: 2.525 ± 0.441
2.453AsnThr: 2.453 ± 0.537
3.319AsnVal: 3.319 ± 0.591
0.649AsnTrp: 0.649 ± 0.2
1.587AsnTyr: 1.587 ± 0.305
0.0AsnXaa: 0.0 ± 0.0
Pro
3.463ProAla: 3.463 ± 0.545
0.433ProCys: 0.433 ± 0.198
2.525ProAsp: 2.525 ± 0.482
3.608ProGlu: 3.608 ± 0.584
1.299ProPhe: 1.299 ± 0.294
1.732ProGly: 1.732 ± 0.334
0.577ProHis: 0.577 ± 0.2
1.732ProIle: 1.732 ± 0.289
1.804ProLys: 1.804 ± 0.323
1.66ProLeu: 1.66 ± 0.338
1.01ProMet: 1.01 ± 0.251
1.299ProAsn: 1.299 ± 0.256
1.082ProPro: 1.082 ± 0.295
1.443ProGln: 1.443 ± 0.376
1.587ProArg: 1.587 ± 0.302
2.237ProSer: 2.237 ± 0.435
2.381ProThr: 2.381 ± 0.489
2.742ProVal: 2.742 ± 0.381
0.361ProTrp: 0.361 ± 0.201
1.587ProTyr: 1.587 ± 0.335
0.0ProXaa: 0.0 ± 0.0
Gln
3.391GlnAla: 3.391 ± 0.81
0.216GlnCys: 0.216 ± 0.098
1.443GlnAsp: 1.443 ± 0.368
2.598GlnGlu: 2.598 ± 0.502
1.732GlnPhe: 1.732 ± 0.306
1.443GlnGly: 1.443 ± 0.297
1.082GlnHis: 1.082 ± 0.294
2.598GlnIle: 2.598 ± 0.479
1.804GlnLys: 1.804 ± 0.358
3.319GlnLeu: 3.319 ± 0.553
1.371GlnMet: 1.371 ± 0.284
1.01GlnAsn: 1.01 ± 0.293
1.443GlnPro: 1.443 ± 0.378
2.093GlnGln: 2.093 ± 0.638
1.876GlnArg: 1.876 ± 0.382
2.381GlnSer: 2.381 ± 0.405
2.165GlnThr: 2.165 ± 0.553
1.66GlnVal: 1.66 ± 0.347
0.722GlnTrp: 0.722 ± 0.201
1.299GlnTyr: 1.299 ± 0.269
0.0GlnXaa: 0.0 ± 0.0
Arg
4.185ArgAla: 4.185 ± 0.538
1.082ArgCys: 1.082 ± 0.273
3.608ArgAsp: 3.608 ± 0.588
3.68ArgGlu: 3.68 ± 0.559
1.876ArgPhe: 1.876 ± 0.33
3.319ArgGly: 3.319 ± 0.53
0.722ArgHis: 0.722 ± 0.196
3.608ArgIle: 3.608 ± 0.385
3.463ArgLys: 3.463 ± 0.511
3.463ArgLeu: 3.463 ± 0.397
1.66ArgMet: 1.66 ± 0.291
3.608ArgAsn: 3.608 ± 0.567
1.515ArgPro: 1.515 ± 0.297
2.309ArgGln: 2.309 ± 0.471
2.598ArgArg: 2.598 ± 0.411
2.453ArgSer: 2.453 ± 0.464
1.804ArgThr: 1.804 ± 0.352
3.536ArgVal: 3.536 ± 0.6
0.433ArgTrp: 0.433 ± 0.202
2.525ArgTyr: 2.525 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
4.834SerAla: 4.834 ± 0.617
0.794SerCys: 0.794 ± 0.227
4.979SerAsp: 4.979 ± 0.48
3.969SerGlu: 3.969 ± 0.555
2.525SerPhe: 2.525 ± 0.409
5.772SerGly: 5.772 ± 0.552
0.722SerHis: 0.722 ± 0.309
3.247SerIle: 3.247 ± 0.544
3.463SerLys: 3.463 ± 0.468
5.195SerLeu: 5.195 ± 0.628
2.309SerMet: 2.309 ± 0.472
3.031SerAsn: 3.031 ± 0.373
2.165SerPro: 2.165 ± 0.381
1.732SerGln: 1.732 ± 0.34
3.247SerArg: 3.247 ± 0.735
3.463SerSer: 3.463 ± 0.707
3.896SerThr: 3.896 ± 0.829
4.041SerVal: 4.041 ± 0.697
0.794SerTrp: 0.794 ± 0.188
2.381SerTyr: 2.381 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
3.752ThrAla: 3.752 ± 0.583
0.505ThrCys: 0.505 ± 0.163
2.886ThrAsp: 2.886 ± 0.498
3.824ThrGlu: 3.824 ± 0.513
2.309ThrPhe: 2.309 ± 0.501
5.772ThrGly: 5.772 ± 0.759
0.794ThrHis: 0.794 ± 0.209
3.319ThrIle: 3.319 ± 0.59
3.319ThrLys: 3.319 ± 0.409
3.824ThrLeu: 3.824 ± 0.632
1.371ThrMet: 1.371 ± 0.265
2.165ThrAsn: 2.165 ± 0.491
3.608ThrPro: 3.608 ± 0.467
1.587ThrGln: 1.587 ± 0.345
2.598ThrArg: 2.598 ± 0.435
3.969ThrSer: 3.969 ± 0.78
3.247ThrThr: 3.247 ± 0.655
3.752ThrVal: 3.752 ± 0.681
0.794ThrTrp: 0.794 ± 0.264
2.237ThrTyr: 2.237 ± 0.491
0.0ThrXaa: 0.0 ± 0.0
Val
6.205ValAla: 6.205 ± 0.648
0.216ValCys: 0.216 ± 0.115
4.329ValAsp: 4.329 ± 0.714
4.185ValGlu: 4.185 ± 0.63
1.732ValPhe: 1.732 ± 0.459
4.185ValGly: 4.185 ± 0.631
1.371ValHis: 1.371 ± 0.293
5.051ValIle: 5.051 ± 0.571
3.319ValLys: 3.319 ± 0.534
4.474ValLeu: 4.474 ± 0.547
2.525ValMet: 2.525 ± 0.355
3.824ValAsn: 3.824 ± 0.647
1.587ValPro: 1.587 ± 0.305
1.876ValGln: 1.876 ± 0.278
3.103ValArg: 3.103 ± 0.502
5.195ValSer: 5.195 ± 0.665
3.969ValThr: 3.969 ± 0.928
4.185ValVal: 4.185 ± 0.723
1.227ValTrp: 1.227 ± 0.287
2.381ValTyr: 2.381 ± 0.387
0.0ValXaa: 0.0 ± 0.0
Trp
0.938TrpAla: 0.938 ± 0.239
0.433TrpCys: 0.433 ± 0.17
1.01TrpAsp: 1.01 ± 0.278
0.938TrpGlu: 0.938 ± 0.286
0.866TrpPhe: 0.866 ± 0.261
0.505TrpGly: 0.505 ± 0.228
0.649TrpHis: 0.649 ± 0.235
0.866TrpIle: 0.866 ± 0.277
1.371TrpLys: 1.371 ± 0.312
1.154TrpLeu: 1.154 ± 0.309
0.361TrpMet: 0.361 ± 0.183
0.794TrpAsn: 0.794 ± 0.228
0.361TrpPro: 0.361 ± 0.149
0.866TrpGln: 0.866 ± 0.257
1.082TrpArg: 1.082 ± 0.293
0.938TrpSer: 0.938 ± 0.25
0.794TrpThr: 0.794 ± 0.196
1.01TrpVal: 1.01 ± 0.247
0.216TrpTrp: 0.216 ± 0.113
0.361TrpTyr: 0.361 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.175TyrAla: 3.175 ± 0.395
0.649TyrCys: 0.649 ± 0.202
3.463TyrAsp: 3.463 ± 0.568
2.742TyrGlu: 2.742 ± 0.534
1.371TyrPhe: 1.371 ± 0.346
2.598TyrGly: 2.598 ± 0.504
1.299TyrHis: 1.299 ± 0.342
1.876TyrIle: 1.876 ± 0.38
2.165TyrLys: 2.165 ± 0.424
2.958TyrLeu: 2.958 ± 0.44
0.794TyrMet: 0.794 ± 0.227
1.587TyrAsn: 1.587 ± 0.302
1.66TyrPro: 1.66 ± 0.346
1.227TyrGln: 1.227 ± 0.273
1.299TyrArg: 1.299 ± 0.315
2.453TyrSer: 2.453 ± 0.396
2.742TyrThr: 2.742 ± 0.477
2.742TyrVal: 2.742 ± 0.525
0.794TyrTrp: 0.794 ± 0.237
1.01TyrTyr: 1.01 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (13860 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski