Amino acid dipeptide frequency for Psychrobacillus phage Spoks

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
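The comparison described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, and normalising by the total number of dipeptides counted is an assumption, since the page does not state which denominator it uses.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
UNIFORM_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: any residue outside the 20 standard ones is
    mapped to 'X' (Xaa), giving 21 * 21 = 441 possible dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())  # assumed denominator (see lead-in)
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_permille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
toy = ["MKKLLA", "MKXA"]
freqs = dipeptide_permille(toy)
```

With the values from the table, `classify(8.435)` (LysLys) gives "over" and `classify(0.444)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.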

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
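As a small worked example of this unit conversion (the helper names are hypothetical and only serve the illustration), a per-mille value can be turned into a fraction or a percentage as follows:

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


# LysLys is listed at 8.435 per mille in the table.
lyslys_fraction = permille_to_fraction(8.435)
lyslys_percent = permille_to_percent(8.435)

# The uniform expectation of 0.25 % corresponds to 2.5 per mille.
uniform_percent = permille_to_percent(2.5)
```

Here LysLys comes out at roughly 0.0084 as a fraction, or about 0.84%, compared with the 0.25% expected for a uniform distribution over 400 dipeptides.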

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.351 ± 0.895
AlaCys: 0.444 ± 0.195
AlaAsp: 4.085 ± 0.621
AlaGlu: 4.617 ± 0.904
AlaPhe: 3.108 ± 0.516
AlaGly: 3.108 ± 0.674
AlaHis: 0.888 ± 0.297
AlaIle: 6.304 ± 1.189
AlaLys: 6.216 ± 1.283
AlaLeu: 5.15 ± 0.649
AlaMet: 1.598 ± 0.372
AlaAsn: 5.061 ± 1.427
AlaPro: 1.865 ± 0.461
AlaGln: 2.042 ± 0.39
AlaArg: 2.042 ± 0.401
AlaSer: 3.818 ± 0.779
AlaThr: 3.463 ± 0.781
AlaVal: 4.351 ± 0.632
AlaTrp: 0.444 ± 0.169
AlaTyr: 2.486 ± 0.523
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.089 ± 0.087
CysCys: 0.089 ± 0.078
CysAsp: 0.178 ± 0.132
CysGlu: 0.355 ± 0.213
CysPhe: 0.355 ± 0.166
CysGly: 0.266 ± 0.151
CysHis: 0.089 ± 0.088
CysIle: 0.533 ± 0.204
CysLys: 0.799 ± 0.26
CysLeu: 0.266 ± 0.201
CysMet: 0.355 ± 0.162
CysAsn: 0.178 ± 0.111
CysPro: 0.178 ± 0.118
CysGln: 0.0 ± 0.0
CysArg: 0.266 ± 0.171
CysSer: 0.622 ± 0.317
CysThr: 0.178 ± 0.127
CysVal: 0.266 ± 0.131
CysTrp: 0.089 ± 0.082
CysTyr: 0.178 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.818 ± 0.553
AspCys: 0.178 ± 0.141
AspAsp: 3.463 ± 0.64
AspGlu: 6.127 ± 1.137
AspPhe: 2.93 ± 0.431
AspGly: 4.706 ± 0.783
AspHis: 0.977 ± 0.313
AspIle: 5.505 ± 0.713
AspLys: 3.996 ± 0.506
AspLeu: 4.085 ± 0.542
AspMet: 2.309 ± 0.541
AspAsn: 2.575 ± 0.33
AspPro: 1.953 ± 0.357
AspGln: 1.51 ± 0.315
AspArg: 2.397 ± 0.507
AspSer: 3.818 ± 0.598
AspThr: 2.309 ± 0.328
AspVal: 3.463 ± 0.574
AspTrp: 0.799 ± 0.307
AspTyr: 1.687 ± 0.397
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.795 ± 0.791
GluCys: 0.533 ± 0.255
GluAsp: 3.907 ± 0.728
GluGlu: 6.482 ± 1.021
GluPhe: 4.44 ± 0.839
GluGly: 3.552 ± 0.694
GluHis: 0.888 ± 0.296
GluIle: 7.104 ± 1.089
GluLys: 6.216 ± 0.893
GluLeu: 8.613 ± 0.916
GluMet: 2.397 ± 0.344
GluAsn: 4.529 ± 0.665
GluPro: 1.953 ± 0.378
GluGln: 3.374 ± 0.661
GluArg: 3.197 ± 0.824
GluSer: 4.617 ± 0.741
GluThr: 3.463 ± 0.477
GluVal: 6.393 ± 0.555
GluTrp: 1.154 ± 0.317
GluTyr: 2.575 ± 0.619
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.753 ± 0.519
PheCys: 0.178 ± 0.114
PheAsp: 2.575 ± 0.487
PheGlu: 3.907 ± 0.708
PhePhe: 1.953 ± 0.52
PheGly: 3.108 ± 0.643
PheHis: 0.622 ± 0.193
PheIle: 3.108 ± 0.713
PheLys: 5.772 ± 0.89
PheLeu: 4.262 ± 0.702
PheMet: 1.154 ± 0.332
PheAsn: 3.019 ± 0.401
PhePro: 0.799 ± 0.251
PheGln: 1.066 ± 0.324
PheArg: 1.421 ± 0.396
PheSer: 2.664 ± 0.462
PheThr: 2.93 ± 0.356
PheVal: 2.397 ± 0.493
PheTrp: 0.533 ± 0.221
PheTyr: 1.776 ± 0.359
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.818 ± 0.799
GlyCys: 0.178 ± 0.116
GlyAsp: 3.552 ± 0.611
GlyGlu: 3.285 ± 0.586
GlyPhe: 2.753 ± 0.619
GlyGly: 3.285 ± 0.645
GlyHis: 0.888 ± 0.209
GlyIle: 4.173 ± 0.596
GlyLys: 4.351 ± 0.579
GlyLeu: 4.351 ± 0.763
GlyMet: 1.687 ± 0.523
GlyAsn: 2.841 ± 0.437
GlyPro: 1.776 ± 0.464
GlyGln: 1.953 ± 0.41
GlyArg: 2.131 ± 0.456
GlySer: 3.818 ± 0.714
GlyThr: 4.617 ± 0.814
GlyVal: 4.884 ± 1.051
GlyTrp: 1.332 ± 0.32
GlyTyr: 2.22 ± 0.461
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.066 ± 0.307
HisCys: 0.0 ± 0.0
HisAsp: 0.888 ± 0.286
HisGlu: 1.421 ± 0.372
HisPhe: 0.71 ± 0.211
HisGly: 0.888 ± 0.24
HisHis: 0.444 ± 0.274
HisIle: 1.776 ± 0.458
HisLys: 1.332 ± 0.479
HisLeu: 1.243 ± 0.398
HisMet: 0.622 ± 0.236
HisAsn: 1.066 ± 0.264
HisPro: 0.533 ± 0.191
HisGln: 0.71 ± 0.221
HisArg: 0.71 ± 0.273
HisSer: 0.799 ± 0.251
HisThr: 0.622 ± 0.232
HisVal: 0.71 ± 0.259
HisTrp: 0.444 ± 0.242
HisTyr: 0.622 ± 0.254
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.772 ± 0.684
IleCys: 0.444 ± 0.242
IleAsp: 4.795 ± 0.652
IleGlu: 7.814 ± 0.694
IlePhe: 2.486 ± 0.595
IleGly: 4.262 ± 0.758
IleHis: 1.421 ± 0.336
IleIle: 4.884 ± 0.613
IleLys: 7.281 ± 0.895
IleLeu: 4.884 ± 0.681
IleMet: 1.687 ± 0.367
IleAsn: 4.795 ± 0.783
IlePro: 2.841 ± 0.543
IleGln: 2.753 ± 0.565
IleArg: 2.93 ± 0.564
IleSer: 5.86 ± 0.707
IleThr: 4.262 ± 0.751
IleVal: 5.239 ± 0.625
IleTrp: 0.533 ± 0.218
IleTyr: 2.397 ± 0.439
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.061 ± 0.65
LysCys: 0.266 ± 0.136
LysAsp: 5.949 ± 0.741
LysGlu: 7.015 ± 1.057
LysPhe: 3.374 ± 0.413
LysGly: 4.706 ± 0.578
LysHis: 1.776 ± 0.417
LysIle: 6.482 ± 0.822
LysLys: 8.435 ± 1.202
LysLeu: 7.015 ± 0.841
LysMet: 2.664 ± 0.434
LysAsn: 5.416 ± 0.656
LysPro: 1.687 ± 0.425
LysGln: 3.729 ± 0.63
LysArg: 3.996 ± 0.731
LysSer: 3.729 ± 0.633
LysThr: 6.393 ± 0.974
LysVal: 5.505 ± 0.568
LysTrp: 1.421 ± 0.331
LysTyr: 2.397 ± 0.493
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.949 ± 0.723
LeuCys: 0.355 ± 0.19
LeuAsp: 5.416 ± 0.592
LeuGlu: 7.37 ± 0.905
LeuPhe: 3.996 ± 0.681
LeuGly: 4.351 ± 0.542
LeuHis: 1.066 ± 0.334
LeuIle: 5.239 ± 0.74
LeuLys: 7.015 ± 0.891
LeuLeu: 6.837 ± 1.031
LeuMet: 1.598 ± 0.277
LeuAsn: 4.529 ± 0.517
LeuPro: 2.664 ± 0.442
LeuGln: 2.753 ± 0.561
LeuArg: 2.042 ± 0.507
LeuSer: 6.393 ± 0.755
LeuThr: 4.884 ± 0.537
LeuVal: 4.617 ± 0.504
LeuTrp: 0.71 ± 0.239
LeuTyr: 2.664 ± 0.482
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.776 ± 0.348
MetCys: 0.178 ± 0.126
MetAsp: 1.332 ± 0.319
MetGlu: 1.953 ± 0.451
MetPhe: 0.799 ± 0.28
MetGly: 1.687 ± 0.411
MetHis: 0.266 ± 0.145
MetIle: 2.042 ± 0.519
MetLys: 3.285 ± 0.613
MetLeu: 2.131 ± 0.316
MetMet: 0.533 ± 0.199
MetAsn: 2.22 ± 0.31
MetPro: 0.71 ± 0.215
MetGln: 0.71 ± 0.298
MetArg: 0.888 ± 0.32
MetSer: 1.066 ± 0.272
MetThr: 2.753 ± 0.479
MetVal: 1.953 ± 0.465
MetTrp: 0.0 ± 0.0
MetTyr: 1.154 ± 0.318
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.239 ± 1.18
AsnCys: 0.355 ± 0.189
AsnAsp: 2.131 ± 0.406
AsnGlu: 5.061 ± 0.548
AsnPhe: 2.486 ± 0.394
AsnGly: 4.884 ± 1.101
AsnHis: 1.066 ± 0.343
AsnIle: 3.374 ± 0.5
AsnLys: 4.795 ± 0.675
AsnLeu: 4.884 ± 0.667
AsnMet: 1.243 ± 0.35
AsnAsn: 3.818 ± 0.739
AsnPro: 1.953 ± 0.371
AsnGln: 2.22 ± 0.348
AsnArg: 2.575 ± 0.514
AsnSer: 3.552 ± 0.567
AsnThr: 3.641 ± 0.768
AsnVal: 4.351 ± 0.75
AsnTrp: 0.444 ± 0.217
AsnTyr: 2.22 ± 0.458
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.865 ± 0.353
ProCys: 0.178 ± 0.113
ProAsp: 1.154 ± 0.255
ProGlu: 1.776 ± 0.466
ProPhe: 1.776 ± 0.384
ProGly: 1.154 ± 0.263
ProHis: 0.444 ± 0.233
ProIle: 1.865 ± 0.449
ProLys: 2.309 ± 0.457
ProLeu: 3.019 ± 0.573
ProMet: 0.533 ± 0.209
ProAsn: 1.51 ± 0.325
ProPro: 0.977 ± 0.363
ProGln: 1.687 ± 0.427
ProArg: 0.71 ± 0.273
ProSer: 1.953 ± 0.412
ProThr: 2.841 ± 0.716
ProVal: 2.309 ± 0.586
ProTrp: 0.178 ± 0.129
ProTyr: 1.066 ± 0.334
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.397 ± 0.51
GlnCys: 0.0 ± 0.0
GlnAsp: 2.22 ± 0.408
GlnGlu: 3.285 ± 0.722
GlnPhe: 1.421 ± 0.382
GlnGly: 0.977 ± 0.294
GlnHis: 0.622 ± 0.214
GlnIle: 2.753 ± 0.438
GlnLys: 2.753 ± 0.624
GlnLeu: 3.019 ± 0.538
GlnMet: 1.332 ± 0.296
GlnAsn: 1.51 ± 0.573
GlnPro: 1.154 ± 0.308
GlnGln: 1.953 ± 0.403
GlnArg: 1.598 ± 0.367
GlnSer: 2.131 ± 0.592
GlnThr: 2.131 ± 0.598
GlnVal: 1.598 ± 0.34
GlnTrp: 0.266 ± 0.209
GlnTyr: 1.421 ± 0.326
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.131 ± 0.379
ArgCys: 0.355 ± 0.208
ArgAsp: 2.841 ± 0.548
ArgGlu: 2.753 ± 0.634
ArgPhe: 1.776 ± 0.37
ArgGly: 1.865 ± 0.35
ArgHis: 0.888 ± 0.274
ArgIle: 2.841 ± 0.629
ArgLys: 4.085 ± 0.731
ArgLeu: 3.019 ± 0.553
ArgMet: 1.776 ± 0.427
ArgAsn: 1.953 ± 0.413
ArgPro: 1.243 ± 0.311
ArgGln: 0.888 ± 0.277
ArgArg: 1.598 ± 0.471
ArgSer: 2.042 ± 0.492
ArgThr: 2.309 ± 0.51
ArgVal: 2.131 ± 0.469
ArgTrp: 0.355 ± 0.203
ArgTyr: 1.865 ± 0.427
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.641 ± 0.583
SerCys: 0.178 ± 0.124
SerAsp: 3.108 ± 0.454
SerGlu: 4.884 ± 0.824
SerPhe: 2.309 ± 0.56
SerGly: 2.575 ± 0.454
SerHis: 1.066 ± 0.296
SerIle: 6.482 ± 0.858
SerLys: 4.529 ± 0.718
SerLeu: 4.085 ± 0.738
SerMet: 1.687 ± 0.385
SerAsn: 3.996 ± 0.576
SerPro: 1.154 ± 0.375
SerGln: 2.22 ± 0.408
SerArg: 2.753 ± 0.45
SerSer: 4.44 ± 0.639
SerThr: 3.463 ± 0.46
SerVal: 4.795 ± 0.586
SerTrp: 0.799 ± 0.22
SerTyr: 1.953 ± 0.376
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.641 ± 0.739
ThrCys: 0.355 ± 0.16
ThrAsp: 2.93 ± 0.508
ThrGlu: 3.108 ± 0.485
ThrPhe: 3.374 ± 0.699
ThrGly: 4.972 ± 0.74
ThrHis: 0.888 ± 0.318
ThrIle: 5.772 ± 0.891
ThrLys: 4.795 ± 0.689
ThrLeu: 4.884 ± 0.598
ThrMet: 1.243 ± 0.41
ThrAsn: 4.351 ± 0.64
ThrPro: 2.042 ± 0.433
ThrGln: 2.397 ± 0.602
ThrArg: 2.131 ± 0.488
ThrSer: 3.552 ± 0.529
ThrThr: 4.617 ± 0.73
ThrVal: 5.061 ± 1.218
ThrTrp: 0.71 ± 0.246
ThrTyr: 2.042 ± 0.365
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.884 ± 0.942
ValCys: 0.444 ± 0.21
ValAsp: 4.706 ± 0.513
ValGlu: 5.683 ± 0.876
ValPhe: 3.019 ± 0.57
ValGly: 3.729 ± 0.621
ValHis: 1.421 ± 0.37
ValIle: 4.262 ± 0.514
ValLys: 5.239 ± 0.892
ValLeu: 4.706 ± 0.868
ValMet: 1.953 ± 0.378
ValAsn: 3.285 ± 0.526
ValPro: 2.575 ± 0.478
ValGln: 1.687 ± 0.338
ValArg: 3.019 ± 0.599
ValSer: 3.641 ± 0.714
ValThr: 5.86 ± 0.948
ValVal: 3.818 ± 1.062
ValTrp: 0.444 ± 0.279
ValTyr: 2.042 ± 0.473
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.444 ± 0.177
TrpCys: 0.178 ± 0.138
TrpAsp: 0.622 ± 0.245
TrpGlu: 0.622 ± 0.188
TrpPhe: 0.622 ± 0.234
TrpGly: 0.977 ± 0.426
TrpHis: 0.444 ± 0.166
TrpIle: 0.622 ± 0.28
TrpLys: 1.243 ± 0.315
TrpLeu: 1.154 ± 0.234
TrpMet: 0.444 ± 0.237
TrpAsn: 0.71 ± 0.217
TrpPro: 0.089 ± 0.087
TrpGln: 0.178 ± 0.114
TrpArg: 0.533 ± 0.277
TrpSer: 0.622 ± 0.247
TrpThr: 0.266 ± 0.134
TrpVal: 0.71 ± 0.229
TrpTrp: 0.089 ± 0.081
TrpTyr: 0.355 ± 0.167
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.953 ± 0.426
TyrCys: 0.444 ± 0.242
TyrAsp: 2.753 ± 0.517
TyrGlu: 2.664 ± 0.473
TyrPhe: 2.309 ± 0.468
TyrGly: 2.841 ± 0.553
TyrHis: 0.444 ± 0.234
TyrIle: 2.22 ± 0.5
TyrLys: 2.575 ± 0.517
TyrLeu: 2.841 ± 0.494
TyrMet: 0.533 ± 0.166
TyrAsn: 3.019 ± 0.532
TyrPro: 1.154 ± 0.238
TyrGln: 0.71 ± 0.233
TyrArg: 1.776 ± 0.475
TyrSer: 0.977 ± 0.246
TyrThr: 1.865 ± 0.433
TyrVal: 1.865 ± 0.374
TyrTrp: 0.178 ± 0.129
TyrTyr: 1.421 ± 0.46
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (11263 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
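A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's implementation: the function names are hypothetical, sequences are assumed to use one-letter codes, and reporting the standard deviation across the 100 replicates as the error is an assumption, because the note does not say which spread measure is used.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Resample whole proteins with replacement and measure the spread.

    Assumption: the error is the sample standard deviation of the
    frequency across the n_boot resampled proteomes.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(resampled, pair))
    return statistics.stdev(estimates)


# Toy input, not real phage proteins.
toy = ["MKKLLA", "MAAKK", "MLLAA"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # dipeptide never observed
```

Resampling whole proteins, rather than individual dipeptides, keeps the pairs from one protein together, so a dipeptide concentrated in a few proteins gets a larger error than one spread evenly across the proteome. A dipeptide that never occurs gets 0.0 ± 0.0, as in the Xaa rows of the table.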

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski