Amino acid dipepetide frequency for Serratia phage vB_SspS_OS31

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.709AlaAla: 10.709 ± 1.475
0.306AlaCys: 0.306 ± 0.157
5.966AlaAsp: 5.966 ± 0.579
6.272AlaGlu: 6.272 ± 0.73
3.595AlaPhe: 3.595 ± 0.594
7.573AlaGly: 7.573 ± 0.867
1.3AlaHis: 1.3 ± 0.388
6.196AlaIle: 6.196 ± 0.705
4.896AlaLys: 4.896 ± 0.633
7.802AlaLeu: 7.802 ± 0.69
3.672AlaMet: 3.672 ± 0.598
4.131AlaAsn: 4.131 ± 0.547
3.901AlaPro: 3.901 ± 0.652
3.748AlaGln: 3.748 ± 0.734
4.666AlaArg: 4.666 ± 0.574
6.578AlaSer: 6.578 ± 0.903
5.278AlaThr: 5.278 ± 1.146
6.884AlaVal: 6.884 ± 0.86
1.683AlaTrp: 1.683 ± 0.479
2.983AlaTyr: 2.983 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
0.841CysAla: 0.841 ± 0.305
0.229CysCys: 0.229 ± 0.138
1.147CysAsp: 1.147 ± 0.287
0.612CysGlu: 0.612 ± 0.241
0.382CysPhe: 0.382 ± 0.165
1.147CysGly: 1.147 ± 0.376
0.229CysHis: 0.229 ± 0.128
1.147CysIle: 1.147 ± 0.308
0.688CysLys: 0.688 ± 0.25
0.918CysLeu: 0.918 ± 0.297
0.229CysMet: 0.229 ± 0.116
0.459CysAsn: 0.459 ± 0.242
0.382CysPro: 0.382 ± 0.18
0.535CysGln: 0.535 ± 0.201
0.765CysArg: 0.765 ± 0.266
0.688CysSer: 0.688 ± 0.231
0.688CysThr: 0.688 ± 0.208
0.765CysVal: 0.765 ± 0.193
0.153CysTrp: 0.153 ± 0.097
0.535CysTyr: 0.535 ± 0.22
0.0CysXaa: 0.0 ± 0.0
Asp
5.508AspAla: 5.508 ± 0.642
0.765AspCys: 0.765 ± 0.269
3.289AspAsp: 3.289 ± 0.566
3.519AspGlu: 3.519 ± 0.68
2.371AspPhe: 2.371 ± 0.505
5.966AspGly: 5.966 ± 1.002
1.147AspHis: 1.147 ± 0.299
3.06AspIle: 3.06 ± 0.565
2.371AspLys: 2.371 ± 0.474
4.819AspLeu: 4.819 ± 0.629
1.606AspMet: 1.606 ± 0.326
2.371AspAsn: 2.371 ± 0.447
2.524AspPro: 2.524 ± 0.364
1.759AspGln: 1.759 ± 0.415
2.524AspArg: 2.524 ± 0.398
2.983AspSer: 2.983 ± 0.427
2.448AspThr: 2.448 ± 0.445
3.901AspVal: 3.901 ± 0.572
0.765AspTrp: 0.765 ± 0.213
2.295AspTyr: 2.295 ± 0.359
0.0AspXaa: 0.0 ± 0.0
Glu
5.814GluAla: 5.814 ± 0.713
1.071GluCys: 1.071 ± 0.32
3.366GluAsp: 3.366 ± 0.479
4.284GluGlu: 4.284 ± 0.756
2.83GluPhe: 2.83 ± 0.43
3.901GluGly: 3.901 ± 0.528
0.994GluHis: 0.994 ± 0.294
4.207GluIle: 4.207 ± 0.464
3.213GluLys: 3.213 ± 0.667
5.278GluLeu: 5.278 ± 0.629
1.224GluMet: 1.224 ± 0.297
2.677GluAsn: 2.677 ± 0.504
2.448GluPro: 2.448 ± 0.512
3.442GluGln: 3.442 ± 0.774
3.672GluArg: 3.672 ± 0.408
3.901GluSer: 3.901 ± 0.52
2.907GluThr: 2.907 ± 0.347
2.907GluVal: 2.907 ± 0.444
0.841GluTrp: 0.841 ± 0.246
2.754GluTyr: 2.754 ± 0.418
0.0GluXaa: 0.0 ± 0.0
Phe
2.524PheAla: 2.524 ± 0.407
0.535PheCys: 0.535 ± 0.167
2.677PheAsp: 2.677 ± 0.529
2.371PheGlu: 2.371 ± 0.359
1.147PhePhe: 1.147 ± 0.315
2.754PheGly: 2.754 ± 0.422
0.688PheHis: 0.688 ± 0.248
1.912PheIle: 1.912 ± 0.413
2.601PheLys: 2.601 ± 0.48
1.989PheLeu: 1.989 ± 0.387
0.688PheMet: 0.688 ± 0.207
2.065PheAsn: 2.065 ± 0.406
1.683PhePro: 1.683 ± 0.312
1.224PheGln: 1.224 ± 0.268
1.836PheArg: 1.836 ± 0.396
2.83PheSer: 2.83 ± 0.388
2.142PheThr: 2.142 ± 0.567
1.759PheVal: 1.759 ± 0.381
0.765PheTrp: 0.765 ± 0.239
1.453PheTyr: 1.453 ± 0.356
0.0PheXaa: 0.0 ± 0.0
Gly
5.661GlyAla: 5.661 ± 0.943
0.688GlyCys: 0.688 ± 0.242
5.202GlyAsp: 5.202 ± 0.776
4.59GlyGlu: 4.59 ± 0.691
2.524GlyPhe: 2.524 ± 0.479
6.272GlyGly: 6.272 ± 0.976
1.606GlyHis: 1.606 ± 0.368
3.442GlyIle: 3.442 ± 0.615
5.966GlyLys: 5.966 ± 0.854
6.196GlyLeu: 6.196 ± 0.71
2.677GlyMet: 2.677 ± 0.508
3.519GlyAsn: 3.519 ± 0.597
1.989GlyPro: 1.989 ± 0.368
3.06GlyGln: 3.06 ± 0.597
4.743GlyArg: 4.743 ± 0.672
5.202GlySer: 5.202 ± 0.728
5.202GlyThr: 5.202 ± 0.973
5.508GlyVal: 5.508 ± 0.71
1.377GlyTrp: 1.377 ± 0.294
1.606GlyTyr: 1.606 ± 0.338
0.0GlyXaa: 0.0 ± 0.0
His
1.071HisAla: 1.071 ± 0.362
0.306HisCys: 0.306 ± 0.155
0.841HisAsp: 0.841 ± 0.283
0.688HisGlu: 0.688 ± 0.271
0.229HisPhe: 0.229 ± 0.129
1.224HisGly: 1.224 ± 0.305
0.535HisHis: 0.535 ± 0.19
0.612HisIle: 0.612 ± 0.218
0.918HisLys: 0.918 ± 0.281
1.377HisLeu: 1.377 ± 0.291
0.153HisMet: 0.153 ± 0.101
1.224HisAsn: 1.224 ± 0.246
0.535HisPro: 0.535 ± 0.217
0.918HisGln: 0.918 ± 0.251
1.147HisArg: 1.147 ± 0.284
0.841HisSer: 0.841 ± 0.307
0.612HisThr: 0.612 ± 0.202
0.841HisVal: 0.841 ± 0.257
0.382HisTrp: 0.382 ± 0.198
0.765HisTyr: 0.765 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
6.119IleAla: 6.119 ± 0.622
0.612IleCys: 0.612 ± 0.221
3.672IleAsp: 3.672 ± 0.531
3.672IleGlu: 3.672 ± 0.445
2.295IlePhe: 2.295 ± 0.428
3.978IleGly: 3.978 ± 0.536
0.382IleHis: 0.382 ± 0.191
3.442IleIle: 3.442 ± 0.566
3.748IleLys: 3.748 ± 0.509
3.213IleLeu: 3.213 ± 0.598
1.453IleMet: 1.453 ± 0.33
2.524IleAsn: 2.524 ± 0.348
1.759IlePro: 1.759 ± 0.336
2.218IleGln: 2.218 ± 0.348
2.83IleArg: 2.83 ± 0.49
4.284IleSer: 4.284 ± 0.532
4.513IleThr: 4.513 ± 0.666
2.907IleVal: 2.907 ± 0.379
0.765IleTrp: 0.765 ± 0.23
1.224IleTyr: 1.224 ± 0.312
0.0IleXaa: 0.0 ± 0.0
Lys
5.278LysAla: 5.278 ± 0.661
0.459LysCys: 0.459 ± 0.17
2.677LysAsp: 2.677 ± 0.356
3.825LysGlu: 3.825 ± 0.649
1.683LysPhe: 1.683 ± 0.459
3.672LysGly: 3.672 ± 0.543
0.688LysHis: 0.688 ± 0.257
2.907LysIle: 2.907 ± 0.362
4.743LysLys: 4.743 ± 0.896
4.743LysLeu: 4.743 ± 0.565
1.683LysMet: 1.683 ± 0.375
1.989LysAsn: 1.989 ± 0.361
3.442LysPro: 3.442 ± 0.713
3.06LysGln: 3.06 ± 0.513
2.907LysArg: 2.907 ± 0.701
3.825LysSer: 3.825 ± 0.551
4.207LysThr: 4.207 ± 0.485
3.672LysVal: 3.672 ± 0.47
0.994LysTrp: 0.994 ± 0.269
1.683LysTyr: 1.683 ± 0.34
0.0LysXaa: 0.0 ± 0.0
Leu
8.873LeuAla: 8.873 ± 0.906
1.224LeuCys: 1.224 ± 0.333
4.284LeuAsp: 4.284 ± 0.596
3.825LeuGlu: 3.825 ± 0.636
2.524LeuPhe: 2.524 ± 0.403
5.355LeuGly: 5.355 ± 0.617
0.688LeuHis: 0.688 ± 0.22
4.36LeuIle: 4.36 ± 0.561
4.054LeuLys: 4.054 ± 0.574
5.966LeuLeu: 5.966 ± 0.829
1.989LeuMet: 1.989 ± 0.362
4.513LeuAsn: 4.513 ± 0.531
3.519LeuPro: 3.519 ± 0.444
3.672LeuGln: 3.672 ± 0.529
5.202LeuArg: 5.202 ± 0.535
5.125LeuSer: 5.125 ± 0.653
5.814LeuThr: 5.814 ± 0.668
4.972LeuVal: 4.972 ± 0.732
0.841LeuTrp: 0.841 ± 0.291
2.142LeuTyr: 2.142 ± 0.356
0.0LeuXaa: 0.0 ± 0.0
Met
3.213MetAla: 3.213 ± 0.49
0.459MetCys: 0.459 ± 0.23
0.841MetAsp: 0.841 ± 0.319
1.3MetGlu: 1.3 ± 0.282
1.147MetPhe: 1.147 ± 0.328
1.912MetGly: 1.912 ± 0.383
0.382MetHis: 0.382 ± 0.179
1.224MetIle: 1.224 ± 0.289
1.453MetLys: 1.453 ± 0.36
2.448MetLeu: 2.448 ± 0.468
1.071MetMet: 1.071 ± 0.248
0.688MetAsn: 0.688 ± 0.212
1.453MetPro: 1.453 ± 0.272
1.377MetGln: 1.377 ± 0.271
1.606MetArg: 1.606 ± 0.324
1.989MetSer: 1.989 ± 0.487
2.065MetThr: 2.065 ± 0.393
1.606MetVal: 1.606 ± 0.344
0.306MetTrp: 0.306 ± 0.15
0.765MetTyr: 0.765 ± 0.236
0.0MetXaa: 0.0 ± 0.0
Asn
3.672AsnAla: 3.672 ± 0.474
0.688AsnCys: 0.688 ± 0.209
1.836AsnAsp: 1.836 ± 0.353
2.754AsnGlu: 2.754 ± 0.543
1.453AsnPhe: 1.453 ± 0.313
4.819AsnGly: 4.819 ± 0.535
0.459AsnHis: 0.459 ± 0.201
2.065AsnIle: 2.065 ± 0.383
2.448AsnLys: 2.448 ± 0.383
3.519AsnLeu: 3.519 ± 0.46
0.918AsnMet: 0.918 ± 0.295
2.601AsnAsn: 2.601 ± 0.595
2.448AsnPro: 2.448 ± 0.489
1.912AsnGln: 1.912 ± 0.383
2.142AsnArg: 2.142 ± 0.365
3.06AsnSer: 3.06 ± 0.428
2.295AsnThr: 2.295 ± 0.463
2.448AsnVal: 2.448 ± 0.469
0.765AsnTrp: 0.765 ± 0.234
1.224AsnTyr: 1.224 ± 0.311
0.0AsnXaa: 0.0 ± 0.0
Pro
4.054ProAla: 4.054 ± 0.61
0.535ProCys: 0.535 ± 0.209
2.065ProAsp: 2.065 ± 0.359
3.366ProGlu: 3.366 ± 0.646
1.453ProPhe: 1.453 ± 0.33
3.06ProGly: 3.06 ± 0.492
0.841ProHis: 0.841 ± 0.265
1.912ProIle: 1.912 ± 0.39
2.218ProLys: 2.218 ± 0.339
4.207ProLeu: 4.207 ± 0.677
1.224ProMet: 1.224 ± 0.333
2.371ProAsn: 2.371 ± 0.459
2.524ProPro: 2.524 ± 0.479
1.071ProGln: 1.071 ± 0.247
1.836ProArg: 1.836 ± 0.397
3.366ProSer: 3.366 ± 0.505
2.295ProThr: 2.295 ± 0.576
2.754ProVal: 2.754 ± 0.528
0.841ProTrp: 0.841 ± 0.233
1.377ProTyr: 1.377 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
4.437GlnAla: 4.437 ± 0.472
0.918GlnCys: 0.918 ± 0.273
1.453GlnAsp: 1.453 ± 0.319
2.524GlnGlu: 2.524 ± 0.35
1.453GlnPhe: 1.453 ± 0.424
3.06GlnGly: 3.06 ± 0.507
0.765GlnHis: 0.765 ± 0.217
2.907GlnIle: 2.907 ± 0.487
2.065GlnLys: 2.065 ± 0.549
3.748GlnLeu: 3.748 ± 0.653
1.3GlnMet: 1.3 ± 0.333
1.683GlnAsn: 1.683 ± 0.349
1.453GlnPro: 1.453 ± 0.273
2.754GlnGln: 2.754 ± 0.705
3.825GlnArg: 3.825 ± 0.806
3.213GlnSer: 3.213 ± 0.471
2.448GlnThr: 2.448 ± 0.602
2.907GlnVal: 2.907 ± 0.521
0.612GlnTrp: 0.612 ± 0.238
1.3GlnTyr: 1.3 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
5.661ArgAla: 5.661 ± 0.843
0.459ArgCys: 0.459 ± 0.187
4.054ArgAsp: 4.054 ± 0.721
4.131ArgGlu: 4.131 ± 0.59
2.371ArgPhe: 2.371 ± 0.471
2.754ArgGly: 2.754 ± 0.462
0.765ArgHis: 0.765 ± 0.266
3.595ArgIle: 3.595 ± 0.545
3.289ArgLys: 3.289 ± 0.488
4.972ArgLeu: 4.972 ± 0.597
1.224ArgMet: 1.224 ± 0.26
2.065ArgAsn: 2.065 ± 0.406
2.065ArgPro: 2.065 ± 0.42
2.754ArgGln: 2.754 ± 0.475
3.289ArgArg: 3.289 ± 0.624
2.218ArgSer: 2.218 ± 0.368
3.06ArgThr: 3.06 ± 0.514
3.289ArgVal: 3.289 ± 0.432
1.759ArgTrp: 1.759 ± 0.404
2.524ArgTyr: 2.524 ± 0.427
0.0ArgXaa: 0.0 ± 0.0
Ser
7.802SerAla: 7.802 ± 0.65
1.071SerCys: 1.071 ± 0.344
4.513SerAsp: 4.513 ± 0.665
4.131SerGlu: 4.131 ± 0.498
1.53SerPhe: 1.53 ± 0.324
5.431SerGly: 5.431 ± 0.683
1.224SerHis: 1.224 ± 0.329
3.136SerIle: 3.136 ± 0.587
3.366SerLys: 3.366 ± 0.519
5.278SerLeu: 5.278 ± 0.662
1.683SerMet: 1.683 ± 0.33
2.524SerAsn: 2.524 ± 0.405
2.83SerPro: 2.83 ± 0.438
3.442SerGln: 3.442 ± 0.849
2.754SerArg: 2.754 ± 0.484
4.36SerSer: 4.36 ± 0.822
3.672SerThr: 3.672 ± 0.722
4.284SerVal: 4.284 ± 0.595
0.918SerTrp: 0.918 ± 0.234
1.3SerTyr: 1.3 ± 0.337
0.0SerXaa: 0.0 ± 0.0
Thr
6.043ThrAla: 6.043 ± 1.17
0.535ThrCys: 0.535 ± 0.202
2.677ThrAsp: 2.677 ± 0.49
3.136ThrGlu: 3.136 ± 0.496
2.83ThrPhe: 2.83 ± 0.396
6.043ThrGly: 6.043 ± 0.899
0.841ThrHis: 0.841 ± 0.331
3.366ThrIle: 3.366 ± 0.495
3.519ThrLys: 3.519 ± 0.333
4.437ThrLeu: 4.437 ± 0.685
1.3ThrMet: 1.3 ± 0.257
1.836ThrAsn: 1.836 ± 0.452
3.442ThrPro: 3.442 ± 0.464
3.136ThrGln: 3.136 ± 0.468
3.289ThrArg: 3.289 ± 0.448
3.901ThrSer: 3.901 ± 0.528
3.289ThrThr: 3.289 ± 0.604
4.131ThrVal: 4.131 ± 0.618
0.612ThrTrp: 0.612 ± 0.204
1.989ThrTyr: 1.989 ± 0.367
0.0ThrXaa: 0.0 ± 0.0
Val
6.043ValAla: 6.043 ± 0.556
0.459ValCys: 0.459 ± 0.288
3.136ValAsp: 3.136 ± 0.37
4.207ValGlu: 4.207 ± 0.545
2.218ValPhe: 2.218 ± 0.409
4.896ValGly: 4.896 ± 0.668
0.918ValHis: 0.918 ± 0.273
2.754ValIle: 2.754 ± 0.501
4.437ValLys: 4.437 ± 0.704
4.284ValLeu: 4.284 ± 0.528
1.836ValMet: 1.836 ± 0.459
2.371ValAsn: 2.371 ± 0.436
2.83ValPro: 2.83 ± 0.472
1.836ValGln: 1.836 ± 0.434
3.748ValArg: 3.748 ± 0.415
4.131ValSer: 4.131 ± 0.591
4.513ValThr: 4.513 ± 0.808
4.054ValVal: 4.054 ± 0.542
1.224ValTrp: 1.224 ± 0.278
2.142ValTyr: 2.142 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
0.918TrpAla: 0.918 ± 0.309
0.306TrpCys: 0.306 ± 0.178
1.071TrpAsp: 1.071 ± 0.286
0.841TrpGlu: 0.841 ± 0.241
0.459TrpPhe: 0.459 ± 0.167
1.147TrpGly: 1.147 ± 0.346
0.153TrpHis: 0.153 ± 0.121
1.224TrpIle: 1.224 ± 0.359
0.918TrpLys: 0.918 ± 0.286
1.377TrpLeu: 1.377 ± 0.315
0.688TrpMet: 0.688 ± 0.275
0.688TrpAsn: 0.688 ± 0.227
0.841TrpPro: 0.841 ± 0.264
0.841TrpGln: 0.841 ± 0.222
1.071TrpArg: 1.071 ± 0.227
0.841TrpSer: 0.841 ± 0.257
0.994TrpThr: 0.994 ± 0.234
0.918TrpVal: 0.918 ± 0.21
0.535TrpTrp: 0.535 ± 0.164
0.688TrpTyr: 0.688 ± 0.197
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.901TyrAla: 3.901 ± 0.569
0.918TyrCys: 0.918 ± 0.253
1.224TyrAsp: 1.224 ± 0.299
1.683TyrGlu: 1.683 ± 0.274
1.147TyrPhe: 1.147 ± 0.252
2.065TyrGly: 2.065 ± 0.374
0.612TyrHis: 0.612 ± 0.186
2.142TyrIle: 2.142 ± 0.355
1.224TyrLys: 1.224 ± 0.321
2.371TyrLeu: 2.371 ± 0.453
0.612TyrMet: 0.612 ± 0.208
1.224TyrAsn: 1.224 ± 0.303
1.3TyrPro: 1.3 ± 0.3
2.065TyrGln: 2.065 ± 0.384
2.448TyrArg: 2.448 ± 0.355
1.912TyrSer: 1.912 ± 0.367
1.912TyrThr: 1.912 ± 0.354
1.606TyrVal: 1.606 ± 0.342
0.382TyrTrp: 0.382 ± 0.174
0.765TyrTyr: 0.765 ± 0.235
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (13074 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski