Amino acid dipeptide frequency for Salinivibrio phage CW02

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that are more frequent than expected are marked in red in the table and underrepresented ones in blue, for better visibility.
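The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be one-letter-code strings, and frequencies are normalised by the total number of dipeptides, which may differ from the normalisation used for the table.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: the denominator is the total number of dipeptides counted.
    """
    counts = Counter()
    for seq in sequences:
        # Anything outside the 20 standard residues is mapped to X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"   # would be marked red
    if freq_per_mille < expected:
        return "under"  # would be marked blue
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAC", "AA"])  # toy input, not real data
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

With the uniform expectation of 2.5‰, a tabulated value such as AlaAla (5.209‰) would be classified as overrepresented and AlaCys (0.791‰) as underrepresented.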

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
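As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


if __name__ == "__main__":
    ala_ala = 5.209  # AlaAla value from the table, in per mille
    print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
    print(per_mille_to_percent(ala_ala))   # the same value in percent
```

For comparison, the uniform expectation of 0.25% corresponds to 2.5‰ in the units of the table.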

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.209AlaAla: 5.209 ± 0.745
0.791AlaCys: 0.791 ± 0.241
4.286AlaAsp: 4.286 ± 0.497
5.275AlaGlu: 5.275 ± 0.756
2.176AlaPhe: 2.176 ± 0.337
5.407AlaGly: 5.407 ± 0.64
1.385AlaHis: 1.385 ± 0.309
3.693AlaIle: 3.693 ± 0.479
4.484AlaLys: 4.484 ± 0.605
7.319AlaLeu: 7.319 ± 0.913
2.242AlaMet: 2.242 ± 0.432
4.022AlaAsn: 4.022 ± 0.561
2.11AlaPro: 2.11 ± 0.389
3.297AlaGln: 3.297 ± 0.603
3.693AlaArg: 3.693 ± 0.581
3.759AlaSer: 3.759 ± 0.448
3.759AlaThr: 3.759 ± 0.567
5.275AlaVal: 5.275 ± 0.715
1.055AlaTrp: 1.055 ± 0.278
3.165AlaTyr: 3.165 ± 0.556
0.0AlaXaa: 0.0 ± 0.0
Cys
0.659CysAla: 0.659 ± 0.205
0.264CysCys: 0.264 ± 0.132
0.33CysAsp: 0.33 ± 0.135
1.055CysGlu: 1.055 ± 0.349
0.198CysPhe: 0.198 ± 0.108
0.923CysGly: 0.923 ± 0.253
0.396CysHis: 0.396 ± 0.161
0.198CysIle: 0.198 ± 0.13
0.396CysLys: 0.396 ± 0.176
1.253CysLeu: 1.253 ± 0.305
0.396CysMet: 0.396 ± 0.166
0.528CysAsn: 0.528 ± 0.198
0.593CysPro: 0.593 ± 0.266
0.725CysGln: 0.725 ± 0.204
0.593CysArg: 0.593 ± 0.223
0.264CysSer: 0.264 ± 0.134
0.659CysThr: 0.659 ± 0.232
0.659CysVal: 0.659 ± 0.211
0.132CysTrp: 0.132 ± 0.089
0.462CysTyr: 0.462 ± 0.183
0.0CysXaa: 0.0 ± 0.0
Asp
4.748AspAla: 4.748 ± 0.663
0.857AspCys: 0.857 ± 0.256
4.022AspAsp: 4.022 ± 0.59
4.154AspGlu: 4.154 ± 0.383
2.704AspPhe: 2.704 ± 0.398
5.143AspGly: 5.143 ± 0.567
0.659AspHis: 0.659 ± 0.201
3.891AspIle: 3.891 ± 0.559
4.88AspLys: 4.88 ± 0.563
5.407AspLeu: 5.407 ± 0.509
1.583AspMet: 1.583 ± 0.274
3.495AspAsn: 3.495 ± 0.51
2.704AspPro: 2.704 ± 0.468
1.319AspGln: 1.319 ± 0.277
3.099AspArg: 3.099 ± 0.348
3.891AspSer: 3.891 ± 0.723
3.956AspThr: 3.956 ± 0.418
5.077AspVal: 5.077 ± 0.56
0.791AspTrp: 0.791 ± 0.243
2.77AspTyr: 2.77 ± 0.466
0.0AspXaa: 0.0 ± 0.0
Glu
5.737GluAla: 5.737 ± 0.763
0.725GluCys: 0.725 ± 0.235
5.671GluAsp: 5.671 ± 0.767
7.649GluGlu: 7.649 ± 0.841
3.231GluPhe: 3.231 ± 0.394
5.473GluGly: 5.473 ± 0.724
0.923GluHis: 0.923 ± 0.28
4.22GluIle: 4.22 ± 0.567
4.484GluLys: 4.484 ± 0.691
6.396GluLeu: 6.396 ± 0.504
1.978GluMet: 1.978 ± 0.3
2.704GluAsn: 2.704 ± 0.374
2.374GluPro: 2.374 ± 0.332
3.825GluGln: 3.825 ± 0.597
4.286GluArg: 4.286 ± 0.41
3.561GluSer: 3.561 ± 0.419
3.429GluThr: 3.429 ± 0.561
6.067GluVal: 6.067 ± 0.913
1.121GluTrp: 1.121 ± 0.275
3.165GluTyr: 3.165 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.176PheAla: 2.176 ± 0.395
0.528PheCys: 0.528 ± 0.223
2.835PheAsp: 2.835 ± 0.464
2.572PheGlu: 2.572 ± 0.48
1.319PhePhe: 1.319 ± 0.269
3.429PheGly: 3.429 ± 0.451
1.121PheHis: 1.121 ± 0.305
1.978PheIle: 1.978 ± 0.404
2.638PheLys: 2.638 ± 0.408
2.506PheLeu: 2.506 ± 0.471
0.725PheMet: 0.725 ± 0.197
1.846PheAsn: 1.846 ± 0.369
1.055PhePro: 1.055 ± 0.249
1.583PheGln: 1.583 ± 0.266
1.846PheArg: 1.846 ± 0.288
2.901PheSer: 2.901 ± 0.479
1.846PheThr: 1.846 ± 0.416
2.308PheVal: 2.308 ± 0.416
0.396PheTrp: 0.396 ± 0.138
1.253PheTyr: 1.253 ± 0.333
0.0PheXaa: 0.0 ± 0.0
Gly
4.814GlyAla: 4.814 ± 0.592
0.33GlyCys: 0.33 ± 0.144
5.275GlyAsp: 5.275 ± 0.654
5.869GlyGlu: 5.869 ± 0.587
3.429GlyPhe: 3.429 ± 0.529
6.396GlyGly: 6.396 ± 0.834
1.649GlyHis: 1.649 ± 0.386
3.495GlyIle: 3.495 ± 0.52
4.55GlyLys: 4.55 ± 0.521
6.528GlyLeu: 6.528 ± 0.885
1.649GlyMet: 1.649 ± 0.364
3.825GlyAsn: 3.825 ± 0.456
1.78GlyPro: 1.78 ± 0.425
2.374GlyGln: 2.374 ± 0.45
3.429GlyArg: 3.429 ± 0.366
5.869GlySer: 5.869 ± 0.63
3.231GlyThr: 3.231 ± 0.585
6.462GlyVal: 6.462 ± 0.721
1.649GlyTrp: 1.649 ± 0.276
2.835GlyTyr: 2.835 ± 0.391
0.0GlyXaa: 0.0 ± 0.0
His
1.319HisAla: 1.319 ± 0.304
0.264HisCys: 0.264 ± 0.121
1.121HisAsp: 1.121 ± 0.251
0.989HisGlu: 0.989 ± 0.275
0.857HisPhe: 0.857 ± 0.242
0.857HisGly: 0.857 ± 0.25
0.593HisHis: 0.593 ± 0.204
0.923HisIle: 0.923 ± 0.228
1.121HisLys: 1.121 ± 0.292
1.583HisLeu: 1.583 ± 0.274
0.989HisMet: 0.989 ± 0.247
1.055HisAsn: 1.055 ± 0.358
0.989HisPro: 0.989 ± 0.278
0.264HisGln: 0.264 ± 0.12
0.857HisArg: 0.857 ± 0.209
1.385HisSer: 1.385 ± 0.267
1.055HisThr: 1.055 ± 0.261
1.319HisVal: 1.319 ± 0.28
0.198HisTrp: 0.198 ± 0.101
0.791HisTyr: 0.791 ± 0.23
0.0HisXaa: 0.0 ± 0.0
Ile
3.429IleAla: 3.429 ± 0.529
0.593IleCys: 0.593 ± 0.238
3.429IleAsp: 3.429 ± 0.438
3.429IleGlu: 3.429 ± 0.432
1.517IlePhe: 1.517 ± 0.254
2.374IleGly: 2.374 ± 0.417
0.989IleHis: 0.989 ± 0.244
1.846IleIle: 1.846 ± 0.329
4.682IleLys: 4.682 ± 0.567
3.627IleLeu: 3.627 ± 0.426
1.517IleMet: 1.517 ± 0.394
2.11IleAsn: 2.11 ± 0.429
2.044IlePro: 2.044 ± 0.286
1.846IleGln: 1.846 ± 0.314
2.506IleArg: 2.506 ± 0.435
3.759IleSer: 3.759 ± 0.503
2.967IleThr: 2.967 ± 0.479
3.495IleVal: 3.495 ± 0.47
0.659IleTrp: 0.659 ± 0.237
1.649IleTyr: 1.649 ± 0.318
0.0IleXaa: 0.0 ± 0.0
Lys
6.133LysAla: 6.133 ± 0.736
0.33LysCys: 0.33 ± 0.135
4.484LysAsp: 4.484 ± 0.557
5.539LysGlu: 5.539 ± 0.803
1.846LysPhe: 1.846 ± 0.391
5.473LysGly: 5.473 ± 0.768
1.451LysHis: 1.451 ± 0.325
3.231LysIle: 3.231 ± 0.559
4.286LysLys: 4.286 ± 0.668
5.341LysLeu: 5.341 ± 0.426
2.11LysMet: 2.11 ± 0.401
2.374LysAsn: 2.374 ± 0.378
2.638LysPro: 2.638 ± 0.517
2.967LysGln: 2.967 ± 0.378
3.561LysArg: 3.561 ± 0.491
3.956LysSer: 3.956 ± 0.577
3.363LysThr: 3.363 ± 0.499
4.484LysVal: 4.484 ± 0.468
0.857LysTrp: 0.857 ± 0.261
1.583LysTyr: 1.583 ± 0.3
0.0LysXaa: 0.0 ± 0.0
Leu
5.539LeuAla: 5.539 ± 0.547
1.385LeuCys: 1.385 ± 0.334
5.539LeuAsp: 5.539 ± 0.562
6.858LeuGlu: 6.858 ± 0.85
2.77LeuPhe: 2.77 ± 0.583
5.803LeuGly: 5.803 ± 0.64
1.253LeuHis: 1.253 ± 0.249
4.352LeuIle: 4.352 ± 0.477
5.012LeuLys: 5.012 ± 0.66
6.528LeuLeu: 6.528 ± 0.688
1.714LeuMet: 1.714 ± 0.308
3.891LeuAsn: 3.891 ± 0.566
3.099LeuPro: 3.099 ± 0.439
2.967LeuGln: 2.967 ± 0.42
4.352LeuArg: 4.352 ± 0.538
6.264LeuSer: 6.264 ± 0.76
5.275LeuThr: 5.275 ± 0.465
5.143LeuVal: 5.143 ± 0.61
1.121LeuTrp: 1.121 ± 0.261
3.231LeuTyr: 3.231 ± 0.381
0.0LeuXaa: 0.0 ± 0.0
Met
2.704MetAla: 2.704 ± 0.603
0.528MetCys: 0.528 ± 0.24
1.912MetAsp: 1.912 ± 0.323
1.78MetGlu: 1.78 ± 0.291
0.989MetPhe: 0.989 ± 0.261
1.978MetGly: 1.978 ± 0.474
0.264MetHis: 0.264 ± 0.139
0.396MetIle: 0.396 ± 0.178
1.517MetLys: 1.517 ± 0.35
2.11MetLeu: 2.11 ± 0.374
0.725MetMet: 0.725 ± 0.248
0.791MetAsn: 0.791 ± 0.202
1.121MetPro: 1.121 ± 0.307
0.857MetGln: 0.857 ± 0.195
1.451MetArg: 1.451 ± 0.339
2.374MetSer: 2.374 ± 0.369
1.714MetThr: 1.714 ± 0.381
1.517MetVal: 1.517 ± 0.276
0.462MetTrp: 0.462 ± 0.174
0.659MetTyr: 0.659 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
3.495AsnAla: 3.495 ± 0.408
0.462AsnCys: 0.462 ± 0.19
1.451AsnAsp: 1.451 ± 0.392
2.967AsnGlu: 2.967 ± 0.414
1.187AsnPhe: 1.187 ± 0.353
3.561AsnGly: 3.561 ± 0.509
0.791AsnHis: 0.791 ± 0.204
3.891AsnIle: 3.891 ± 0.452
3.561AsnLys: 3.561 ± 0.416
4.418AsnLeu: 4.418 ± 0.547
1.055AsnMet: 1.055 ± 0.26
1.846AsnAsn: 1.846 ± 0.386
2.506AsnPro: 2.506 ± 0.388
1.649AsnGln: 1.649 ± 0.302
1.846AsnArg: 1.846 ± 0.311
3.297AsnSer: 3.297 ± 0.615
3.099AsnThr: 3.099 ± 0.324
3.363AsnVal: 3.363 ± 0.483
0.725AsnTrp: 0.725 ± 0.266
2.704AsnTyr: 2.704 ± 0.375
0.0AsnXaa: 0.0 ± 0.0
Pro
1.78ProAla: 1.78 ± 0.355
0.33ProCys: 0.33 ± 0.132
3.033ProAsp: 3.033 ± 0.45
2.967ProGlu: 2.967 ± 0.45
1.846ProPhe: 1.846 ± 0.333
1.649ProGly: 1.649 ± 0.293
0.659ProHis: 0.659 ± 0.18
1.846ProIle: 1.846 ± 0.339
2.901ProLys: 2.901 ± 0.43
2.704ProLeu: 2.704 ± 0.403
0.462ProMet: 0.462 ± 0.171
2.11ProAsn: 2.11 ± 0.275
0.857ProPro: 0.857 ± 0.231
2.176ProGln: 2.176 ± 0.414
1.846ProArg: 1.846 ± 0.339
2.308ProSer: 2.308 ± 0.298
2.242ProThr: 2.242 ± 0.359
2.308ProVal: 2.308 ± 0.431
0.593ProTrp: 0.593 ± 0.204
1.517ProTyr: 1.517 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
3.693GlnAla: 3.693 ± 0.808
0.396GlnCys: 0.396 ± 0.167
2.77GlnAsp: 2.77 ± 0.395
3.099GlnGlu: 3.099 ± 0.449
1.385GlnPhe: 1.385 ± 0.311
3.297GlnGly: 3.297 ± 0.54
0.528GlnHis: 0.528 ± 0.192
1.78GlnIle: 1.78 ± 0.322
1.846GlnLys: 1.846 ± 0.368
2.901GlnLeu: 2.901 ± 0.496
1.253GlnMet: 1.253 ± 0.346
1.649GlnAsn: 1.649 ± 0.398
1.187GlnPro: 1.187 ± 0.257
1.978GlnGln: 1.978 ± 0.514
2.242GlnArg: 2.242 ± 0.447
2.308GlnSer: 2.308 ± 0.457
2.11GlnThr: 2.11 ± 0.399
2.638GlnVal: 2.638 ± 0.365
0.659GlnTrp: 0.659 ± 0.191
1.055GlnTyr: 1.055 ± 0.25
0.0GlnXaa: 0.0 ± 0.0
Arg
3.231ArgAla: 3.231 ± 0.441
0.462ArgCys: 0.462 ± 0.174
3.099ArgAsp: 3.099 ± 0.435
4.286ArgGlu: 4.286 ± 0.581
2.176ArgPhe: 2.176 ± 0.402
3.956ArgGly: 3.956 ± 0.581
1.451ArgHis: 1.451 ± 0.349
2.638ArgIle: 2.638 ± 0.377
3.825ArgLys: 3.825 ± 0.659
3.956ArgLeu: 3.956 ± 0.462
0.989ArgMet: 0.989 ± 0.215
2.638ArgAsn: 2.638 ± 0.45
1.912ArgPro: 1.912 ± 0.325
1.121ArgGln: 1.121 ± 0.31
3.033ArgArg: 3.033 ± 0.518
3.363ArgSer: 3.363 ± 0.455
2.308ArgThr: 2.308 ± 0.354
3.363ArgVal: 3.363 ± 0.442
0.989ArgTrp: 0.989 ± 0.227
1.978ArgTyr: 1.978 ± 0.351
0.0ArgXaa: 0.0 ± 0.0
Ser
4.154SerAla: 4.154 ± 0.663
0.791SerCys: 0.791 ± 0.252
3.825SerAsp: 3.825 ± 0.52
4.616SerGlu: 4.616 ± 0.568
2.242SerPhe: 2.242 ± 0.477
5.539SerGly: 5.539 ± 0.959
1.583SerHis: 1.583 ± 0.356
2.835SerIle: 2.835 ± 0.471
4.55SerLys: 4.55 ± 0.638
5.473SerLeu: 5.473 ± 0.788
1.583SerMet: 1.583 ± 0.302
3.429SerAsn: 3.429 ± 0.493
2.77SerPro: 2.77 ± 0.37
3.231SerGln: 3.231 ± 0.517
3.627SerArg: 3.627 ± 0.502
5.605SerSer: 5.605 ± 0.712
3.956SerThr: 3.956 ± 0.612
5.407SerVal: 5.407 ± 0.503
0.659SerTrp: 0.659 ± 0.19
2.176SerTyr: 2.176 ± 0.405
0.0SerXaa: 0.0 ± 0.0
Thr
3.429ThrAla: 3.429 ± 0.62
0.264ThrCys: 0.264 ± 0.152
3.495ThrAsp: 3.495 ± 0.534
4.022ThrGlu: 4.022 ± 0.468
2.044ThrPhe: 2.044 ± 0.412
4.55ThrGly: 4.55 ± 0.549
0.857ThrHis: 0.857 ± 0.282
2.704ThrIle: 2.704 ± 0.396
3.363ThrLys: 3.363 ± 0.465
4.484ThrLeu: 4.484 ± 0.638
0.923ThrMet: 0.923 ± 0.289
2.835ThrAsn: 2.835 ± 0.371
2.44ThrPro: 2.44 ± 0.324
2.374ThrGln: 2.374 ± 0.357
2.374ThrArg: 2.374 ± 0.372
4.946ThrSer: 4.946 ± 0.803
3.561ThrThr: 3.561 ± 0.579
3.693ThrVal: 3.693 ± 0.506
0.593ThrTrp: 0.593 ± 0.189
1.912ThrTyr: 1.912 ± 0.436
0.0ThrXaa: 0.0 ± 0.0
Val
6.33ValAla: 6.33 ± 0.571
0.923ValCys: 0.923 ± 0.243
5.275ValAsp: 5.275 ± 0.576
5.737ValGlu: 5.737 ± 0.677
2.901ValPhe: 2.901 ± 0.497
6.133ValGly: 6.133 ± 0.723
0.923ValHis: 0.923 ± 0.227
2.901ValIle: 2.901 ± 0.477
4.022ValLys: 4.022 ± 0.52
5.012ValLeu: 5.012 ± 0.697
2.506ValMet: 2.506 ± 0.384
3.956ValAsn: 3.956 ± 0.474
2.44ValPro: 2.44 ± 0.42
2.44ValGln: 2.44 ± 0.46
3.891ValArg: 3.891 ± 0.602
4.352ValSer: 4.352 ± 0.592
3.297ValThr: 3.297 ± 0.449
6.924ValVal: 6.924 ± 0.734
1.055ValTrp: 1.055 ± 0.244
2.308ValTyr: 2.308 ± 0.493
0.0ValXaa: 0.0 ± 0.0
Trp
0.989TrpAla: 0.989 ± 0.229
0.066TrpCys: 0.066 ± 0.07
1.253TrpAsp: 1.253 ± 0.381
1.385TrpGlu: 1.385 ± 0.24
0.659TrpPhe: 0.659 ± 0.193
0.923TrpGly: 0.923 ± 0.255
0.33TrpHis: 0.33 ± 0.18
0.396TrpIle: 0.396 ± 0.155
0.857TrpLys: 0.857 ± 0.333
1.649TrpLeu: 1.649 ± 0.329
0.528TrpMet: 0.528 ± 0.201
0.264TrpAsn: 0.264 ± 0.124
0.396TrpPro: 0.396 ± 0.165
0.593TrpGln: 0.593 ± 0.185
0.659TrpArg: 0.659 ± 0.212
0.923TrpSer: 0.923 ± 0.273
0.857TrpThr: 0.857 ± 0.204
0.923TrpVal: 0.923 ± 0.239
0.198TrpTrp: 0.198 ± 0.122
0.593TrpTyr: 0.593 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.835TyrAla: 2.835 ± 0.497
0.396TyrCys: 0.396 ± 0.179
2.044TyrAsp: 2.044 ± 0.292
2.77TyrGlu: 2.77 ± 0.492
1.385TyrPhe: 1.385 ± 0.304
2.572TyrGly: 2.572 ± 0.397
0.725TyrHis: 0.725 ± 0.194
1.253TyrIle: 1.253 ± 0.311
3.033TyrLys: 3.033 ± 0.508
2.901TyrLeu: 2.901 ± 0.516
0.857TyrMet: 0.857 ± 0.22
2.506TyrAsn: 2.506 ± 0.411
1.121TyrPro: 1.121 ± 0.269
1.187TyrGln: 1.187 ± 0.302
1.583TyrArg: 1.583 ± 0.263
2.967TyrSer: 2.967 ± 0.407
2.176TyrThr: 2.176 ± 0.443
2.967TyrVal: 2.967 ± 0.423
0.528TyrTrp: 0.528 ± 0.15
1.451TyrTyr: 1.451 ± 0.3
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (15166 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
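A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure actually used for this page: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the sample standard deviation of those estimates is taken as the error. The function names, the fixed random seed and the normalisation by total dipeptide count are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "AACC"]  # toy input, not real data
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual positions.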

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski