Amino acid dipepetide frequency for Pseudoalteromonas phage pYD6-A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.29AlaAla: 6.29 ± 1.008
0.474AlaCys: 0.474 ± 0.171
4.179AlaAsp: 4.179 ± 0.39
5.299AlaGlu: 5.299 ± 0.62
2.283AlaPhe: 2.283 ± 0.304
4.696AlaGly: 4.696 ± 0.566
1.551AlaHis: 1.551 ± 0.264
5.515AlaIle: 5.515 ± 0.567
5.299AlaLys: 5.299 ± 0.575
6.894AlaLeu: 6.894 ± 0.605
1.853AlaMet: 1.853 ± 0.308
4.481AlaAsn: 4.481 ± 0.766
2.757AlaPro: 2.757 ± 0.385
3.662AlaGln: 3.662 ± 0.538
3.188AlaArg: 3.188 ± 0.753
4.093AlaSer: 4.093 ± 0.681
5.515AlaThr: 5.515 ± 0.521
5.17AlaVal: 5.17 ± 0.562
1.034AlaTrp: 1.034 ± 0.208
2.628AlaTyr: 2.628 ± 0.33
0.0AlaXaa: 0.0 ± 0.0
Cys
0.819CysAla: 0.819 ± 0.273
0.086CysCys: 0.086 ± 0.062
0.776CysAsp: 0.776 ± 0.197
0.776CysGlu: 0.776 ± 0.246
0.302CysPhe: 0.302 ± 0.12
0.689CysGly: 0.689 ± 0.26
0.086CysHis: 0.086 ± 0.064
0.517CysIle: 0.517 ± 0.181
0.56CysLys: 0.56 ± 0.211
0.646CysLeu: 0.646 ± 0.213
0.388CysMet: 0.388 ± 0.136
0.646CysAsn: 0.646 ± 0.236
0.517CysPro: 0.517 ± 0.148
0.129CysGln: 0.129 ± 0.107
0.431CysArg: 0.431 ± 0.144
0.388CysSer: 0.388 ± 0.136
0.732CysThr: 0.732 ± 0.212
0.388CysVal: 0.388 ± 0.145
0.129CysTrp: 0.129 ± 0.075
0.259CysTyr: 0.259 ± 0.135
0.0CysXaa: 0.0 ± 0.0
Asp
5.429AspAla: 5.429 ± 0.659
0.689AspCys: 0.689 ± 0.218
3.619AspAsp: 3.619 ± 0.619
4.739AspGlu: 4.739 ± 0.455
3.274AspPhe: 3.274 ± 0.467
5.213AspGly: 5.213 ± 0.81
1.163AspHis: 1.163 ± 0.226
5.256AspIle: 5.256 ± 0.448
3.791AspLys: 3.791 ± 0.41
5.903AspLeu: 5.903 ± 0.496
1.939AspMet: 1.939 ± 0.333
3.662AspAsn: 3.662 ± 0.501
3.016AspPro: 3.016 ± 0.359
1.896AspGln: 1.896 ± 0.29
2.37AspArg: 2.37 ± 0.332
3.921AspSer: 3.921 ± 0.382
4.696AspThr: 4.696 ± 0.468
3.835AspVal: 3.835 ± 0.438
1.206AspTrp: 1.206 ± 0.245
2.887AspTyr: 2.887 ± 0.411
0.0AspXaa: 0.0 ± 0.0
Glu
5.084GluAla: 5.084 ± 0.599
0.603GluCys: 0.603 ± 0.19
5.343GluAsp: 5.343 ± 0.523
5.687GluGlu: 5.687 ± 0.746
3.447GluPhe: 3.447 ± 0.379
4.826GluGly: 4.826 ± 0.446
1.034GluHis: 1.034 ± 0.168
5.213GluIle: 5.213 ± 0.594
3.533GluLys: 3.533 ± 0.474
6.678GluLeu: 6.678 ± 0.589
2.413GluMet: 2.413 ± 0.361
3.188GluAsn: 3.188 ± 0.399
1.896GluPro: 1.896 ± 0.222
2.456GluGln: 2.456 ± 0.424
3.231GluArg: 3.231 ± 0.32
4.05GluSer: 4.05 ± 0.377
3.447GluThr: 3.447 ± 0.383
4.61GluVal: 4.61 ± 0.531
1.034GluTrp: 1.034 ± 0.266
2.757GluTyr: 2.757 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
1.939PheAla: 1.939 ± 0.266
0.603PheCys: 0.603 ± 0.229
3.102PheAsp: 3.102 ± 0.444
2.111PheGlu: 2.111 ± 0.284
1.422PhePhe: 1.422 ± 0.25
2.542PheGly: 2.542 ± 0.355
0.646PheHis: 0.646 ± 0.161
3.016PheIle: 3.016 ± 0.34
2.844PheLys: 2.844 ± 0.388
2.801PheLeu: 2.801 ± 0.362
1.293PheMet: 1.293 ± 0.213
2.93PheAsn: 2.93 ± 0.439
1.12PhePro: 1.12 ± 0.276
1.551PheGln: 1.551 ± 0.238
1.766PheArg: 1.766 ± 0.252
2.628PheSer: 2.628 ± 0.431
2.327PheThr: 2.327 ± 0.354
1.594PheVal: 1.594 ± 0.277
0.259PheTrp: 0.259 ± 0.094
1.465PheTyr: 1.465 ± 0.239
0.0PheXaa: 0.0 ± 0.0
Gly
4.61GlyAla: 4.61 ± 0.624
0.474GlyCys: 0.474 ± 0.177
4.826GlyAsp: 4.826 ± 0.67
4.352GlyGlu: 4.352 ± 0.379
3.016GlyPhe: 3.016 ± 0.385
5.127GlyGly: 5.127 ± 0.685
0.819GlyHis: 0.819 ± 0.233
4.265GlyIle: 4.265 ± 0.488
3.921GlyLys: 3.921 ± 0.426
4.352GlyLeu: 4.352 ± 0.567
2.025GlyMet: 2.025 ± 0.296
4.136GlyAsn: 4.136 ± 0.577
1.336GlyPro: 1.336 ± 0.381
2.197GlyGln: 2.197 ± 0.38
2.628GlyArg: 2.628 ± 0.372
4.395GlySer: 4.395 ± 0.405
5.773GlyThr: 5.773 ± 0.669
4.265GlyVal: 4.265 ± 0.462
0.991GlyTrp: 0.991 ± 0.286
3.188GlyTyr: 3.188 ± 0.358
0.0GlyXaa: 0.0 ± 0.0
His
1.379HisAla: 1.379 ± 0.291
0.388HisCys: 0.388 ± 0.178
1.12HisAsp: 1.12 ± 0.259
1.163HisGlu: 1.163 ± 0.252
0.732HisPhe: 0.732 ± 0.176
1.077HisGly: 1.077 ± 0.226
0.474HisHis: 0.474 ± 0.142
1.249HisIle: 1.249 ± 0.254
1.723HisLys: 1.723 ± 0.318
1.81HisLeu: 1.81 ± 0.294
0.603HisMet: 0.603 ± 0.123
0.905HisAsn: 0.905 ± 0.214
0.905HisPro: 0.905 ± 0.209
0.517HisGln: 0.517 ± 0.143
0.905HisArg: 0.905 ± 0.232
0.732HisSer: 0.732 ± 0.185
0.862HisThr: 0.862 ± 0.212
1.12HisVal: 1.12 ± 0.171
0.431HisTrp: 0.431 ± 0.157
0.776HisTyr: 0.776 ± 0.243
0.0HisXaa: 0.0 ± 0.0
Ile
4.653IleAla: 4.653 ± 0.412
0.603IleCys: 0.603 ± 0.158
5.773IleAsp: 5.773 ± 0.545
5.343IleGlu: 5.343 ± 0.653
1.637IlePhe: 1.637 ± 0.256
4.093IleGly: 4.093 ± 0.506
1.637IleHis: 1.637 ± 0.326
2.973IleIle: 2.973 ± 0.41
4.438IleLys: 4.438 ± 0.437
3.748IleLeu: 3.748 ± 0.448
1.336IleMet: 1.336 ± 0.251
3.964IleAsn: 3.964 ± 0.37
2.283IlePro: 2.283 ± 0.282
2.93IleGln: 2.93 ± 0.368
2.714IleArg: 2.714 ± 0.325
4.265IleSer: 4.265 ± 0.464
3.748IleThr: 3.748 ± 0.438
3.921IleVal: 3.921 ± 0.439
0.732IleTrp: 0.732 ± 0.185
1.896IleTyr: 1.896 ± 0.267
0.0IleXaa: 0.0 ± 0.0
Lys
5.989LysAla: 5.989 ± 0.617
0.474LysCys: 0.474 ± 0.136
4.136LysAsp: 4.136 ± 0.461
5.213LysGlu: 5.213 ± 0.609
1.853LysPhe: 1.853 ± 0.317
4.136LysGly: 4.136 ± 0.391
1.723LysHis: 1.723 ± 0.289
3.361LysIle: 3.361 ± 0.323
3.447LysLys: 3.447 ± 0.616
5.773LysLeu: 5.773 ± 0.667
2.154LysMet: 2.154 ± 0.458
3.102LysAsn: 3.102 ± 0.444
2.111LysPro: 2.111 ± 0.239
2.973LysGln: 2.973 ± 0.411
3.016LysArg: 3.016 ± 0.371
3.533LysSer: 3.533 ± 0.477
3.748LysThr: 3.748 ± 0.434
4.438LysVal: 4.438 ± 0.342
1.077LysTrp: 1.077 ± 0.183
2.887LysTyr: 2.887 ± 0.44
0.0LysXaa: 0.0 ± 0.0
Leu
6.592LeuAla: 6.592 ± 0.651
0.517LeuCys: 0.517 ± 0.157
6.42LeuAsp: 6.42 ± 0.466
6.075LeuGlu: 6.075 ± 0.743
2.757LeuPhe: 2.757 ± 0.441
5.299LeuGly: 5.299 ± 0.692
1.249LeuHis: 1.249 ± 0.251
5.515LeuIle: 5.515 ± 0.712
5.256LeuLys: 5.256 ± 0.56
6.807LeuLeu: 6.807 ± 0.581
2.111LeuMet: 2.111 ± 0.266
4.308LeuAsn: 4.308 ± 0.376
3.016LeuPro: 3.016 ± 0.385
3.274LeuGln: 3.274 ± 0.501
3.921LeuArg: 3.921 ± 0.361
5.86LeuSer: 5.86 ± 0.418
4.696LeuThr: 4.696 ± 0.413
5.343LeuVal: 5.343 ± 0.488
0.603LeuTrp: 0.603 ± 0.191
2.671LeuTyr: 2.671 ± 0.382
0.0LeuXaa: 0.0 ± 0.0
Met
2.37MetAla: 2.37 ± 0.338
0.172MetCys: 0.172 ± 0.086
1.81MetAsp: 1.81 ± 0.337
1.249MetGlu: 1.249 ± 0.222
0.862MetPhe: 0.862 ± 0.23
1.465MetGly: 1.465 ± 0.274
0.905MetHis: 0.905 ± 0.206
1.594MetIle: 1.594 ± 0.261
1.465MetLys: 1.465 ± 0.205
2.197MetLeu: 2.197 ± 0.275
0.603MetMet: 0.603 ± 0.123
1.853MetAsn: 1.853 ± 0.278
0.948MetPro: 0.948 ± 0.206
0.905MetGln: 0.905 ± 0.19
1.206MetArg: 1.206 ± 0.225
2.37MetSer: 2.37 ± 0.303
1.853MetThr: 1.853 ± 0.297
1.379MetVal: 1.379 ± 0.235
0.345MetTrp: 0.345 ± 0.119
1.206MetTyr: 1.206 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
3.835AsnAla: 3.835 ± 0.635
0.474AsnCys: 0.474 ± 0.187
3.533AsnAsp: 3.533 ± 0.43
3.662AsnGlu: 3.662 ± 0.363
1.594AsnPhe: 1.594 ± 0.299
3.447AsnGly: 3.447 ± 0.397
0.948AsnHis: 0.948 ± 0.249
3.49AsnIle: 3.49 ± 0.416
3.705AsnLys: 3.705 ± 0.515
5.041AsnLeu: 5.041 ± 0.482
1.465AsnMet: 1.465 ± 0.34
4.136AsnAsn: 4.136 ± 0.452
2.757AsnPro: 2.757 ± 0.561
2.327AsnGln: 2.327 ± 0.389
2.542AsnArg: 2.542 ± 0.35
4.05AsnSer: 4.05 ± 0.525
3.791AsnThr: 3.791 ± 0.563
3.404AsnVal: 3.404 ± 0.418
0.776AsnTrp: 0.776 ± 0.151
2.456AsnTyr: 2.456 ± 0.332
0.0AsnXaa: 0.0 ± 0.0
Pro
2.327ProAla: 2.327 ± 0.405
0.215ProCys: 0.215 ± 0.116
2.628ProAsp: 2.628 ± 0.442
2.93ProGlu: 2.93 ± 0.361
1.422ProPhe: 1.422 ± 0.262
1.766ProGly: 1.766 ± 0.323
0.474ProHis: 0.474 ± 0.147
1.982ProIle: 1.982 ± 0.285
2.671ProLys: 2.671 ± 0.287
2.801ProLeu: 2.801 ± 0.336
0.862ProMet: 0.862 ± 0.18
2.154ProAsn: 2.154 ± 0.278
0.948ProPro: 0.948 ± 0.211
1.551ProGln: 1.551 ± 0.251
0.948ProArg: 0.948 ± 0.215
2.499ProSer: 2.499 ± 0.319
2.499ProThr: 2.499 ± 0.253
2.413ProVal: 2.413 ± 0.303
0.345ProTrp: 0.345 ± 0.17
1.206ProTyr: 1.206 ± 0.228
0.0ProXaa: 0.0 ± 0.0
Gln
3.619GlnAla: 3.619 ± 0.597
0.517GlnCys: 0.517 ± 0.177
2.111GlnAsp: 2.111 ± 0.314
3.188GlnGlu: 3.188 ± 0.593
1.551GlnPhe: 1.551 ± 0.289
2.844GlnGly: 2.844 ± 0.325
0.689GlnHis: 0.689 ± 0.164
1.551GlnIle: 1.551 ± 0.255
2.671GlnLys: 2.671 ± 0.432
3.533GlnLeu: 3.533 ± 0.487
1.034GlnMet: 1.034 ± 0.211
1.206GlnAsn: 1.206 ± 0.253
0.905GlnPro: 0.905 ± 0.246
2.283GlnGln: 2.283 ± 0.513
1.81GlnArg: 1.81 ± 0.409
2.714GlnSer: 2.714 ± 0.285
2.37GlnThr: 2.37 ± 0.413
2.283GlnVal: 2.283 ± 0.253
0.56GlnTrp: 0.56 ± 0.151
1.81GlnTyr: 1.81 ± 0.336
0.0GlnXaa: 0.0 ± 0.0
Arg
3.361ArgAla: 3.361 ± 0.55
0.56ArgCys: 0.56 ± 0.196
2.887ArgAsp: 2.887 ± 0.478
3.145ArgGlu: 3.145 ± 0.529
1.68ArgPhe: 1.68 ± 0.313
2.628ArgGly: 2.628 ± 0.298
0.517ArgHis: 0.517 ± 0.133
2.671ArgIle: 2.671 ± 0.317
3.274ArgLys: 3.274 ± 0.391
3.964ArgLeu: 3.964 ± 0.5
0.948ArgMet: 0.948 ± 0.181
2.37ArgAsn: 2.37 ± 0.361
1.12ArgPro: 1.12 ± 0.269
1.637ArgGln: 1.637 ± 0.268
1.422ArgArg: 1.422 ± 0.213
2.542ArgSer: 2.542 ± 0.298
2.197ArgThr: 2.197 ± 0.264
2.671ArgVal: 2.671 ± 0.305
0.819ArgTrp: 0.819 ± 0.141
1.379ArgTyr: 1.379 ± 0.232
0.0ArgXaa: 0.0 ± 0.0
Ser
4.352SerAla: 4.352 ± 0.367
0.732SerCys: 0.732 ± 0.203
4.136SerAsp: 4.136 ± 0.429
3.404SerGlu: 3.404 ± 0.391
2.714SerPhe: 2.714 ± 0.352
4.912SerGly: 4.912 ± 0.536
1.077SerHis: 1.077 ± 0.234
3.662SerIle: 3.662 ± 0.438
5.127SerLys: 5.127 ± 0.46
5.386SerLeu: 5.386 ± 0.447
1.379SerMet: 1.379 ± 0.249
3.49SerAsn: 3.49 ± 0.463
2.542SerPro: 2.542 ± 0.342
2.24SerGln: 2.24 ± 0.312
2.37SerArg: 2.37 ± 0.315
3.318SerSer: 3.318 ± 0.542
3.835SerThr: 3.835 ± 0.455
4.395SerVal: 4.395 ± 0.463
0.991SerTrp: 0.991 ± 0.24
1.939SerTyr: 1.939 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
5.73ThrAla: 5.73 ± 0.767
0.603ThrCys: 0.603 ± 0.191
4.136ThrAsp: 4.136 ± 0.487
4.308ThrGlu: 4.308 ± 0.418
3.016ThrPhe: 3.016 ± 0.352
4.222ThrGly: 4.222 ± 0.416
1.336ThrHis: 1.336 ± 0.295
3.964ThrIle: 3.964 ± 0.391
3.921ThrLys: 3.921 ± 0.468
5.041ThrLeu: 5.041 ± 0.434
1.422ThrMet: 1.422 ± 0.251
3.878ThrAsn: 3.878 ± 0.304
2.887ThrPro: 2.887 ± 0.495
2.154ThrGln: 2.154 ± 0.21
1.982ThrArg: 1.982 ± 0.189
3.964ThrSer: 3.964 ± 0.536
4.179ThrThr: 4.179 ± 0.625
3.964ThrVal: 3.964 ± 0.551
0.646ThrTrp: 0.646 ± 0.173
2.542ThrTyr: 2.542 ± 0.415
0.0ThrXaa: 0.0 ± 0.0
Val
4.869ValAla: 4.869 ± 0.52
0.56ValCys: 0.56 ± 0.178
5.084ValAsp: 5.084 ± 0.456
4.438ValGlu: 4.438 ± 0.405
2.24ValPhe: 2.24 ± 0.33
4.653ValGly: 4.653 ± 0.406
1.249ValHis: 1.249 ± 0.2
3.748ValIle: 3.748 ± 0.332
4.007ValLys: 4.007 ± 0.389
4.136ValLeu: 4.136 ± 0.386
1.379ValMet: 1.379 ± 0.211
3.964ValAsn: 3.964 ± 0.499
2.068ValPro: 2.068 ± 0.276
2.327ValGln: 2.327 ± 0.331
3.016ValArg: 3.016 ± 0.315
3.447ValSer: 3.447 ± 0.375
4.782ValThr: 4.782 ± 0.546
4.352ValVal: 4.352 ± 0.427
0.732ValTrp: 0.732 ± 0.186
2.327ValTyr: 2.327 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
1.12TrpAla: 1.12 ± 0.186
0.129TrpCys: 0.129 ± 0.079
0.991TrpAsp: 0.991 ± 0.242
0.776TrpGlu: 0.776 ± 0.154
0.991TrpPhe: 0.991 ± 0.224
0.646TrpGly: 0.646 ± 0.133
0.345TrpHis: 0.345 ± 0.134
0.862TrpIle: 0.862 ± 0.17
0.776TrpLys: 0.776 ± 0.195
1.249TrpLeu: 1.249 ± 0.295
0.302TrpMet: 0.302 ± 0.088
0.862TrpAsn: 0.862 ± 0.225
0.172TrpPro: 0.172 ± 0.081
0.56TrpGln: 0.56 ± 0.165
0.603TrpArg: 0.603 ± 0.163
0.905TrpSer: 0.905 ± 0.209
0.776TrpThr: 0.776 ± 0.186
0.905TrpVal: 0.905 ± 0.202
0.043TrpTrp: 0.043 ± 0.051
0.603TrpTyr: 0.603 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.499TyrAla: 2.499 ± 0.361
0.388TyrCys: 0.388 ± 0.172
1.896TyrAsp: 1.896 ± 0.26
2.671TyrGlu: 2.671 ± 0.347
1.163TyrPhe: 1.163 ± 0.253
2.413TyrGly: 2.413 ± 0.336
0.948TyrHis: 0.948 ± 0.213
2.283TyrIle: 2.283 ± 0.306
2.757TyrLys: 2.757 ± 0.372
3.533TyrLeu: 3.533 ± 0.367
1.206TyrMet: 1.206 ± 0.232
2.197TyrAsn: 2.197 ± 0.225
1.336TyrPro: 1.336 ± 0.337
1.68TyrGln: 1.68 ± 0.246
1.723TyrArg: 1.723 ± 0.218
2.327TyrSer: 2.327 ± 0.33
2.068TyrThr: 2.068 ± 0.332
2.93TyrVal: 2.93 ± 0.349
0.862TyrTrp: 0.862 ± 0.193
1.551TyrTyr: 1.551 ± 0.334
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (23211 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski