Amino acid dipepetide frequency for Aeromonas phage 2_D05

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.249AlaAla: 12.249 ± 1.411
1.099AlaCys: 1.099 ± 0.251
6.046AlaAsp: 6.046 ± 0.764
7.695AlaGlu: 7.695 ± 0.759
3.062AlaPhe: 3.062 ± 0.425
9.501AlaGly: 9.501 ± 1.004
1.256AlaHis: 1.256 ± 0.3
5.732AlaIle: 5.732 ± 0.805
6.36AlaLys: 6.36 ± 0.934
8.166AlaLeu: 8.166 ± 1.041
3.533AlaMet: 3.533 ± 0.565
3.219AlaAsn: 3.219 ± 0.511
4.083AlaPro: 4.083 ± 0.503
4.083AlaGln: 4.083 ± 0.561
5.889AlaArg: 5.889 ± 0.778
5.339AlaSer: 5.339 ± 0.682
6.988AlaThr: 6.988 ± 0.872
6.517AlaVal: 6.517 ± 0.885
1.57AlaTrp: 1.57 ± 0.382
3.533AlaTyr: 3.533 ± 0.504
0.0AlaXaa: 0.0 ± 0.0
Cys
1.099CysAla: 1.099 ± 0.347
0.393CysCys: 0.393 ± 0.2
0.707CysAsp: 0.707 ± 0.218
1.335CysGlu: 1.335 ± 0.289
0.471CysPhe: 0.471 ± 0.18
1.178CysGly: 1.178 ± 0.305
0.628CysHis: 0.628 ± 0.263
0.628CysIle: 0.628 ± 0.189
0.785CysLys: 0.785 ± 0.258
0.628CysLeu: 0.628 ± 0.229
0.236CysMet: 0.236 ± 0.132
0.55CysAsn: 0.55 ± 0.192
0.314CysPro: 0.314 ± 0.156
0.785CysGln: 0.785 ± 0.231
1.021CysArg: 1.021 ± 0.265
0.628CysSer: 0.628 ± 0.211
0.707CysThr: 0.707 ± 0.257
0.628CysVal: 0.628 ± 0.216
0.393CysTrp: 0.393 ± 0.176
0.471CysTyr: 0.471 ± 0.246
0.0CysXaa: 0.0 ± 0.0
Asp
6.831AspAla: 6.831 ± 0.798
0.942AspCys: 0.942 ± 0.281
2.591AspAsp: 2.591 ± 0.397
3.062AspGlu: 3.062 ± 0.469
2.513AspPhe: 2.513 ± 0.534
7.302AspGly: 7.302 ± 0.747
0.785AspHis: 0.785 ± 0.25
2.67AspIle: 2.67 ± 0.424
2.591AspLys: 2.591 ± 0.62
3.926AspLeu: 3.926 ± 0.561
1.806AspMet: 1.806 ± 0.417
2.277AspAsn: 2.277 ± 0.347
2.513AspPro: 2.513 ± 0.399
1.806AspGln: 1.806 ± 0.386
2.041AspArg: 2.041 ± 0.444
4.79AspSer: 4.79 ± 0.612
2.827AspThr: 2.827 ± 0.433
3.926AspVal: 3.926 ± 0.578
1.492AspTrp: 1.492 ± 0.37
2.356AspTyr: 2.356 ± 0.415
0.0AspXaa: 0.0 ± 0.0
Glu
6.517GluAla: 6.517 ± 0.643
0.864GluCys: 0.864 ± 0.296
3.455GluAsp: 3.455 ± 0.549
2.434GluGlu: 2.434 ± 0.438
2.905GluPhe: 2.905 ± 0.569
3.376GluGly: 3.376 ± 0.446
1.021GluHis: 1.021 ± 0.289
3.455GluIle: 3.455 ± 0.521
3.141GluLys: 3.141 ± 0.49
5.967GluLeu: 5.967 ± 0.742
3.141GluMet: 3.141 ± 0.668
1.099GluAsn: 1.099 ± 0.287
2.434GluPro: 2.434 ± 0.428
3.298GluGln: 3.298 ± 0.489
3.769GluArg: 3.769 ± 0.572
2.434GluSer: 2.434 ± 0.426
3.376GluThr: 3.376 ± 0.502
3.455GluVal: 3.455 ± 0.594
2.12GluTrp: 2.12 ± 0.422
1.727GluTyr: 1.727 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
3.533PheAla: 3.533 ± 0.54
0.628PheCys: 0.628 ± 0.246
2.905PheAsp: 2.905 ± 0.522
1.727PheGlu: 1.727 ± 0.372
1.099PhePhe: 1.099 ± 0.296
2.198PheGly: 2.198 ± 0.428
0.393PheHis: 0.393 ± 0.172
0.942PheIle: 0.942 ± 0.326
2.041PheLys: 2.041 ± 0.355
2.12PheLeu: 2.12 ± 0.323
1.57PheMet: 1.57 ± 0.374
2.434PheAsn: 2.434 ± 0.479
1.727PhePro: 1.727 ± 0.365
1.256PheGln: 1.256 ± 0.254
2.041PheArg: 2.041 ± 0.344
1.963PheSer: 1.963 ± 0.383
2.356PheThr: 2.356 ± 0.384
1.178PheVal: 1.178 ± 0.272
0.628PheTrp: 0.628 ± 0.2
1.021PheTyr: 1.021 ± 0.26
0.0PheXaa: 0.0 ± 0.0
Gly
5.967GlyAla: 5.967 ± 0.606
1.335GlyCys: 1.335 ± 0.303
4.947GlyAsp: 4.947 ± 0.606
3.926GlyGlu: 3.926 ± 0.458
3.533GlyPhe: 3.533 ± 0.654
5.496GlyGly: 5.496 ± 0.665
1.335GlyHis: 1.335 ± 0.364
4.004GlyIle: 4.004 ± 0.61
4.476GlyLys: 4.476 ± 0.738
7.302GlyLeu: 7.302 ± 0.748
2.277GlyMet: 2.277 ± 0.486
2.905GlyAsn: 2.905 ± 0.578
2.905GlyPro: 2.905 ± 0.44
3.69GlyGln: 3.69 ± 0.528
4.947GlyArg: 4.947 ± 0.654
5.182GlySer: 5.182 ± 0.632
4.868GlyThr: 4.868 ± 0.754
6.988GlyVal: 6.988 ± 0.779
1.806GlyTrp: 1.806 ± 0.424
2.356GlyTyr: 2.356 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
1.649HisAla: 1.649 ± 0.313
0.471HisCys: 0.471 ± 0.177
0.707HisAsp: 0.707 ± 0.273
1.099HisGlu: 1.099 ± 0.277
0.471HisPhe: 0.471 ± 0.178
1.649HisGly: 1.649 ± 0.357
0.393HisHis: 0.393 ± 0.18
0.864HisIle: 0.864 ± 0.265
0.785HisLys: 0.785 ± 0.241
1.178HisLeu: 1.178 ± 0.281
0.707HisMet: 0.707 ± 0.266
0.55HisAsn: 0.55 ± 0.226
0.55HisPro: 0.55 ± 0.171
0.471HisGln: 0.471 ± 0.19
0.785HisArg: 0.785 ± 0.248
0.471HisSer: 0.471 ± 0.236
0.864HisThr: 0.864 ± 0.259
0.864HisVal: 0.864 ± 0.297
0.157HisTrp: 0.157 ± 0.122
0.393HisTyr: 0.393 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
5.889IleAla: 5.889 ± 0.641
0.864IleCys: 0.864 ± 0.31
4.476IleAsp: 4.476 ± 0.513
4.318IleGlu: 4.318 ± 0.634
1.099IlePhe: 1.099 ± 0.311
4.083IleGly: 4.083 ± 0.616
0.628IleHis: 0.628 ± 0.21
2.277IleIle: 2.277 ± 0.341
2.984IleLys: 2.984 ± 0.546
2.905IleLeu: 2.905 ± 0.426
0.707IleMet: 0.707 ± 0.253
2.827IleAsn: 2.827 ± 0.671
2.513IlePro: 2.513 ± 0.497
1.806IleGln: 1.806 ± 0.401
2.356IleArg: 2.356 ± 0.473
3.376IleSer: 3.376 ± 0.408
4.083IleThr: 4.083 ± 0.591
2.827IleVal: 2.827 ± 0.602
0.393IleTrp: 0.393 ± 0.169
1.256IleTyr: 1.256 ± 0.296
0.0IleXaa: 0.0 ± 0.0
Lys
6.36LysAla: 6.36 ± 0.797
0.393LysCys: 0.393 ± 0.158
2.984LysAsp: 2.984 ± 0.533
3.298LysGlu: 3.298 ± 0.448
1.806LysPhe: 1.806 ± 0.351
3.455LysGly: 3.455 ± 0.449
0.785LysHis: 0.785 ± 0.202
1.649LysIle: 1.649 ± 0.325
2.905LysLys: 2.905 ± 0.471
4.79LysLeu: 4.79 ± 0.706
2.513LysMet: 2.513 ± 0.416
1.649LysAsn: 1.649 ± 0.364
3.219LysPro: 3.219 ± 0.536
3.141LysGln: 3.141 ± 0.586
2.513LysArg: 2.513 ± 0.511
2.513LysSer: 2.513 ± 0.48
2.356LysThr: 2.356 ± 0.45
3.219LysVal: 3.219 ± 0.464
0.471LysTrp: 0.471 ± 0.197
1.649LysTyr: 1.649 ± 0.38
0.0LysXaa: 0.0 ± 0.0
Leu
9.972LeuAla: 9.972 ± 0.94
0.942LeuCys: 0.942 ± 0.247
4.397LeuAsp: 4.397 ± 0.639
4.554LeuGlu: 4.554 ± 0.486
1.884LeuPhe: 1.884 ± 0.333
5.653LeuGly: 5.653 ± 0.665
1.178LeuHis: 1.178 ± 0.252
4.161LeuIle: 4.161 ± 0.648
3.769LeuLys: 3.769 ± 0.45
4.868LeuLeu: 4.868 ± 0.757
1.727LeuMet: 1.727 ± 0.344
4.397LeuAsn: 4.397 ± 0.58
3.769LeuPro: 3.769 ± 0.702
3.533LeuGln: 3.533 ± 0.43
4.161LeuArg: 4.161 ± 0.559
5.496LeuSer: 5.496 ± 0.835
5.025LeuThr: 5.025 ± 0.644
5.81LeuVal: 5.81 ± 0.743
0.942LeuTrp: 0.942 ± 0.255
2.041LeuTyr: 2.041 ± 0.349
0.0LeuXaa: 0.0 ± 0.0
Met
4.554MetAla: 4.554 ± 0.512
0.707MetCys: 0.707 ± 0.255
1.099MetAsp: 1.099 ± 0.31
1.492MetGlu: 1.492 ± 0.32
0.864MetPhe: 0.864 ± 0.22
2.198MetGly: 2.198 ± 0.383
0.236MetHis: 0.236 ± 0.125
1.649MetIle: 1.649 ± 0.383
1.492MetLys: 1.492 ± 0.346
1.806MetLeu: 1.806 ± 0.424
1.335MetMet: 1.335 ± 0.391
1.099MetAsn: 1.099 ± 0.306
1.256MetPro: 1.256 ± 0.343
1.492MetGln: 1.492 ± 0.365
2.041MetArg: 2.041 ± 0.395
2.67MetSer: 2.67 ± 0.495
2.67MetThr: 2.67 ± 0.474
1.335MetVal: 1.335 ± 0.47
0.157MetTrp: 0.157 ± 0.139
0.707MetTyr: 0.707 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
3.298AsnAla: 3.298 ± 0.62
0.785AsnCys: 0.785 ± 0.268
1.649AsnAsp: 1.649 ± 0.317
2.277AsnGlu: 2.277 ± 0.389
1.099AsnPhe: 1.099 ± 0.284
2.827AsnGly: 2.827 ± 0.54
0.471AsnHis: 0.471 ± 0.188
1.727AsnIle: 1.727 ± 0.439
1.413AsnLys: 1.413 ± 0.333
3.219AsnLeu: 3.219 ± 0.547
0.864AsnMet: 0.864 ± 0.295
1.727AsnAsn: 1.727 ± 0.358
2.591AsnPro: 2.591 ± 0.502
1.256AsnGln: 1.256 ± 0.304
2.277AsnArg: 2.277 ± 0.433
3.062AsnSer: 3.062 ± 0.508
2.591AsnThr: 2.591 ± 0.427
2.67AsnVal: 2.67 ± 0.585
0.628AsnTrp: 0.628 ± 0.203
1.178AsnTyr: 1.178 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
4.79ProAla: 4.79 ± 0.566
0.471ProCys: 0.471 ± 0.189
3.298ProAsp: 3.298 ± 0.533
3.455ProGlu: 3.455 ± 0.566
1.57ProPhe: 1.57 ± 0.28
4.083ProGly: 4.083 ± 0.673
0.785ProHis: 0.785 ± 0.243
1.884ProIle: 1.884 ± 0.4
2.12ProLys: 2.12 ± 0.366
3.141ProLeu: 3.141 ± 0.527
0.628ProMet: 0.628 ± 0.234
0.864ProAsn: 0.864 ± 0.226
1.806ProPro: 1.806 ± 0.371
1.649ProGln: 1.649 ± 0.374
1.492ProArg: 1.492 ± 0.302
3.062ProSer: 3.062 ± 0.477
3.062ProThr: 3.062 ± 0.557
3.376ProVal: 3.376 ± 0.621
0.55ProTrp: 0.55 ± 0.238
1.492ProTyr: 1.492 ± 0.37
0.0ProXaa: 0.0 ± 0.0
Gln
6.438GlnAla: 6.438 ± 0.655
0.55GlnCys: 0.55 ± 0.227
2.513GlnAsp: 2.513 ± 0.45
2.277GlnGlu: 2.277 ± 0.417
1.178GlnPhe: 1.178 ± 0.317
3.062GlnGly: 3.062 ± 0.527
0.393GlnHis: 0.393 ± 0.171
3.298GlnIle: 3.298 ± 0.367
1.806GlnLys: 1.806 ± 0.387
3.612GlnLeu: 3.612 ± 0.743
1.178GlnMet: 1.178 ± 0.375
1.492GlnAsn: 1.492 ± 0.292
1.178GlnPro: 1.178 ± 0.32
2.434GlnGln: 2.434 ± 0.474
2.748GlnArg: 2.748 ± 0.459
2.356GlnSer: 2.356 ± 0.404
2.277GlnThr: 2.277 ± 0.427
2.984GlnVal: 2.984 ± 0.495
0.864GlnTrp: 0.864 ± 0.295
1.649GlnTyr: 1.649 ± 0.344
0.0GlnXaa: 0.0 ± 0.0
Arg
4.711ArgAla: 4.711 ± 0.694
1.021ArgCys: 1.021 ± 0.328
2.591ArgAsp: 2.591 ± 0.473
3.533ArgGlu: 3.533 ± 0.578
1.963ArgPhe: 1.963 ± 0.336
3.219ArgGly: 3.219 ± 0.575
0.785ArgHis: 0.785 ± 0.273
3.141ArgIle: 3.141 ± 0.485
3.533ArgLys: 3.533 ± 0.508
4.868ArgLeu: 4.868 ± 0.583
1.884ArgMet: 1.884 ± 0.405
1.57ArgAsn: 1.57 ± 0.341
2.277ArgPro: 2.277 ± 0.414
2.67ArgGln: 2.67 ± 0.501
3.612ArgArg: 3.612 ± 0.563
4.161ArgSer: 4.161 ± 0.612
2.513ArgThr: 2.513 ± 0.448
3.926ArgVal: 3.926 ± 0.586
1.413ArgTrp: 1.413 ± 0.295
1.884ArgTyr: 1.884 ± 0.507
0.0ArgXaa: 0.0 ± 0.0
Ser
6.438SerAla: 6.438 ± 0.786
0.471SerCys: 0.471 ± 0.18
3.219SerAsp: 3.219 ± 0.382
3.376SerGlu: 3.376 ± 0.477
2.041SerPhe: 2.041 ± 0.473
6.046SerGly: 6.046 ± 0.808
1.178SerHis: 1.178 ± 0.299
3.376SerIle: 3.376 ± 0.482
3.062SerLys: 3.062 ± 0.451
4.318SerLeu: 4.318 ± 0.657
1.806SerMet: 1.806 ± 0.344
2.356SerAsn: 2.356 ± 0.44
2.277SerPro: 2.277 ± 0.385
2.984SerGln: 2.984 ± 0.485
2.984SerArg: 2.984 ± 0.366
4.24SerSer: 4.24 ± 0.598
3.062SerThr: 3.062 ± 0.446
5.339SerVal: 5.339 ± 0.61
1.178SerTrp: 1.178 ± 0.244
2.356SerTyr: 2.356 ± 0.489
0.0SerXaa: 0.0 ± 0.0
Thr
5.418ThrAla: 5.418 ± 0.73
0.393ThrCys: 0.393 ± 0.153
3.219ThrAsp: 3.219 ± 0.405
3.376ThrGlu: 3.376 ± 0.601
2.277ThrPhe: 2.277 ± 0.588
6.203ThrGly: 6.203 ± 0.819
1.256ThrHis: 1.256 ± 0.331
3.298ThrIle: 3.298 ± 0.652
2.198ThrLys: 2.198 ± 0.483
5.496ThrLeu: 5.496 ± 0.801
1.021ThrMet: 1.021 ± 0.267
2.041ThrAsn: 2.041 ± 0.342
4.004ThrPro: 4.004 ± 0.411
2.434ThrGln: 2.434 ± 0.468
3.062ThrArg: 3.062 ± 0.569
3.533ThrSer: 3.533 ± 0.616
3.533ThrThr: 3.533 ± 0.586
4.633ThrVal: 4.633 ± 0.644
1.256ThrTrp: 1.256 ± 0.34
1.57ThrTyr: 1.57 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
6.988ValAla: 6.988 ± 0.892
0.393ValCys: 0.393 ± 0.174
4.24ValAsp: 4.24 ± 0.541
4.947ValGlu: 4.947 ± 0.646
1.727ValPhe: 1.727 ± 0.332
4.711ValGly: 4.711 ± 0.651
0.707ValHis: 0.707 ± 0.208
4.161ValIle: 4.161 ± 0.52
4.161ValLys: 4.161 ± 0.649
5.261ValLeu: 5.261 ± 0.731
2.905ValMet: 2.905 ± 0.453
2.434ValAsn: 2.434 ± 0.477
2.356ValPro: 2.356 ± 0.447
2.12ValGln: 2.12 ± 0.489
3.69ValArg: 3.69 ± 0.52
4.868ValSer: 4.868 ± 0.596
4.633ValThr: 4.633 ± 0.651
4.79ValVal: 4.79 ± 0.688
0.707ValTrp: 0.707 ± 0.276
1.806ValTyr: 1.806 ± 0.384
0.0ValXaa: 0.0 ± 0.0
Trp
1.413TrpAla: 1.413 ± 0.36
0.314TrpCys: 0.314 ± 0.148
1.57TrpAsp: 1.57 ± 0.375
0.864TrpGlu: 0.864 ± 0.24
0.785TrpPhe: 0.785 ± 0.254
1.099TrpGly: 1.099 ± 0.282
0.471TrpHis: 0.471 ± 0.207
1.099TrpIle: 1.099 ± 0.314
0.55TrpLys: 0.55 ± 0.215
2.198TrpLeu: 2.198 ± 0.414
0.236TrpMet: 0.236 ± 0.137
0.55TrpAsn: 0.55 ± 0.205
0.864TrpPro: 0.864 ± 0.289
1.335TrpGln: 1.335 ± 0.239
1.256TrpArg: 1.256 ± 0.381
0.471TrpSer: 0.471 ± 0.2
0.628TrpThr: 0.628 ± 0.193
1.099TrpVal: 1.099 ± 0.246
0.393TrpTrp: 0.393 ± 0.141
0.707TrpTyr: 0.707 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.806TyrAla: 1.806 ± 0.319
0.55TyrCys: 0.55 ± 0.21
2.434TyrAsp: 2.434 ± 0.519
0.942TyrGlu: 0.942 ± 0.31
1.413TyrPhe: 1.413 ± 0.349
2.905TyrGly: 2.905 ± 0.414
0.393TyrHis: 0.393 ± 0.152
1.649TyrIle: 1.649 ± 0.335
1.492TyrLys: 1.492 ± 0.373
2.513TyrLeu: 2.513 ± 0.41
0.707TyrMet: 0.707 ± 0.259
1.649TyrAsn: 1.649 ± 0.292
0.942TyrPro: 0.942 ± 0.257
2.041TyrGln: 2.041 ± 0.402
2.591TyrArg: 2.591 ± 0.521
1.492TyrSer: 1.492 ± 0.411
1.806TyrThr: 1.806 ± 0.362
2.041TyrVal: 2.041 ± 0.372
0.785TyrTrp: 0.785 ± 0.256
1.178TyrTyr: 1.178 ± 0.336
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (12737 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski