Amino acid dipeptide frequency for Arthrobacter phage StevieBAY

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
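As a rough illustration of how such a frequency can be computed, here is a minimal Python sketch. It is not the code used to build this page: the function name dipeptide_frequencies, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to X, and the choice of denominator (the total number of counted pairs) are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Dipeptides are overlapping pairs of adjacent residues, counted within
    each protein only (a pair never spans two proteins).
    Assumption: the denominator is the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the phage proteome).
freqs = dipeptide_frequencies(["MAAK", "AAC"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The toy input contains five pairs in total (MA, AA, AK from the first sequence and AA, AC from the second), so AA accounts for two of the five.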

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
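The arithmetic behind the expected value and the unit conversion can be sketched in a few lines of Python. The helper names expected_per_mille, to_fraction and classify are hypothetical, and the sketch assumes that the baseline used for the colouring is the uniform expectation described above; the three observed values are copied from the table.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / n_residue_types ** 2


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return per_mille * 1e-3


def classify(per_mille, expected):
    """Label a dipeptide relative to the expected frequency."""
    if per_mille > expected:
        return "over"
    if per_mille < expected:
        return "under"
    return "as expected"


expected = expected_per_mille(20)  # 400 possible dipeptides

# Values in per mille, taken from the table on this page.
observed = {"AlaAla": 17.328, "CysCys": 0.062, "ProPro": 2.414}
for name, value in observed.items():
    print(name, to_fraction(value), classify(value, expected))
```

Calling expected_per_mille(21) gives the slightly lower baseline for the 441-dipeptide case in which Xaa is counted as well.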

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.328 ± 1.692
AlaCys: 0.619 ± 0.19
AlaAsp: 6.312 ± 1.026
AlaGlu: 8.107 ± 0.835
AlaPhe: 3.094 ± 0.526
AlaGly: 9.963 ± 1.078
AlaHis: 2.352 ± 0.426
AlaIle: 4.27 ± 0.469
AlaLys: 4.703 ± 0.605
AlaLeu: 10.335 ± 0.947
AlaMet: 2.97 ± 0.447
AlaAsn: 3.218 ± 0.482
AlaPro: 5.384 ± 0.835
AlaGln: 4.703 ± 0.557
AlaArg: 7.612 ± 0.907
AlaSer: 5.57 ± 0.654
AlaThr: 6.931 ± 0.804
AlaVal: 7.612 ± 0.775
AlaTrp: 1.052 ± 0.258
AlaTyr: 2.166 ± 0.298
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.557 ± 0.185
CysCys: 0.062 ± 0.061
CysAsp: 0.309 ± 0.13
CysGlu: 0.371 ± 0.142
CysPhe: 0.124 ± 0.088
CysGly: 0.557 ± 0.216
CysHis: 0.124 ± 0.097
CysIle: 0.124 ± 0.093
CysLys: 0.309 ± 0.185
CysLeu: 0.743 ± 0.253
CysMet: 0.124 ± 0.087
CysAsn: 0.186 ± 0.11
CysPro: 0.495 ± 0.178
CysGln: 0.186 ± 0.1
CysArg: 0.371 ± 0.189
CysSer: 0.371 ± 0.17
CysThr: 0.495 ± 0.158
CysVal: 0.681 ± 0.176
CysTrp: 0.186 ± 0.103
CysTyr: 0.248 ± 0.129
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.478 ± 0.828
AspCys: 0.866 ± 0.24
AspAsp: 4.208 ± 0.573
AspGlu: 5.879 ± 0.792
AspPhe: 2.104 ± 0.418
AspGly: 5.57 ± 0.662
AspHis: 0.99 ± 0.2
AspIle: 2.661 ± 0.362
AspLys: 3.342 ± 0.43
AspLeu: 3.28 ± 0.458
AspMet: 1.176 ± 0.262
AspAsn: 1.98 ± 0.34
AspPro: 2.909 ± 0.4
AspGln: 1.98 ± 0.288
AspArg: 2.785 ± 0.38
AspSer: 3.032 ± 0.439
AspThr: 3.218 ± 0.453
AspVal: 6.498 ± 0.632
AspTrp: 1.3 ± 0.282
AspTyr: 1.361 ± 0.219
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.127 ± 0.787
GluCys: 0.309 ± 0.143
GluAsp: 3.589 ± 0.544
GluGlu: 3.837 ± 0.531
GluPhe: 1.795 ± 0.306
GluGly: 3.28 ± 0.528
GluHis: 1.918 ± 0.373
GluIle: 3.404 ± 0.376
GluLys: 3.094 ± 0.443
GluLeu: 7.55 ± 0.985
GluMet: 1.733 ± 0.364
GluAsn: 1.98 ± 0.376
GluPro: 3.837 ± 0.574
GluGln: 3.713 ± 0.672
GluArg: 4.765 ± 0.619
GluSer: 4.023 ± 0.542
GluThr: 3.28 ± 0.472
GluVal: 3.527 ± 0.485
GluTrp: 0.866 ± 0.226
GluTyr: 1.609 ± 0.313
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.909 ± 0.369
PheCys: 0.248 ± 0.114
PheAsp: 2.661 ± 0.424
PheGlu: 2.599 ± 0.359
PhePhe: 0.433 ± 0.199
PheGly: 2.475 ± 0.423
PheHis: 0.371 ± 0.166
PheIle: 1.485 ± 0.233
PheLys: 1.795 ± 0.339
PheLeu: 1.485 ± 0.312
PheMet: 1.052 ± 0.244
PheAsn: 1.176 ± 0.285
PhePro: 1.361 ± 0.259
PheGln: 0.743 ± 0.216
PheArg: 1.238 ± 0.288
PheSer: 1.918 ± 0.385
PheThr: 2.104 ± 0.371
PheVal: 2.537 ± 0.334
PheTrp: 0.557 ± 0.196
PheTyr: 0.495 ± 0.159
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.302 ± 0.575
GlyCys: 0.124 ± 0.09
GlyAsp: 5.446 ± 0.628
GlyGlu: 4.023 ± 0.552
GlyPhe: 1.795 ± 0.378
GlyGly: 6.993 ± 1.365
GlyHis: 1.547 ± 0.26
GlyIle: 3.589 ± 0.58
GlyLys: 4.208 ± 0.533
GlyLeu: 4.889 ± 0.681
GlyMet: 1.423 ± 0.279
GlyAsn: 3.094 ± 0.522
GlyPro: 2.599 ± 0.441
GlyGln: 1.918 ± 0.359
GlyArg: 4.208 ± 0.455
GlySer: 4.765 ± 0.698
GlyThr: 5.136 ± 0.718
GlyVal: 6.065 ± 0.7
GlyTrp: 2.228 ± 0.477
GlyTyr: 2.414 ± 0.391
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.104 ± 0.402
HisCys: 0.124 ± 0.077
HisAsp: 1.114 ± 0.307
HisGlu: 1.238 ± 0.301
HisPhe: 0.805 ± 0.218
HisGly: 0.619 ± 0.229
HisHis: 0.495 ± 0.203
HisIle: 0.99 ± 0.229
HisLys: 0.743 ± 0.237
HisLeu: 1.3 ± 0.318
HisMet: 0.495 ± 0.197
HisAsn: 0.866 ± 0.242
HisPro: 0.557 ± 0.224
HisGln: 0.743 ± 0.197
HisArg: 0.928 ± 0.271
HisSer: 0.681 ± 0.209
HisThr: 1.671 ± 0.362
HisVal: 2.104 ± 0.338
HisTrp: 0.433 ± 0.148
HisTyr: 0.433 ± 0.163
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.879 ± 0.544
IleCys: 0.248 ± 0.116
IleAsp: 3.527 ± 0.426
IleGlu: 4.951 ± 0.747
IlePhe: 1.3 ± 0.205
IleGly: 4.023 ± 0.736
IleHis: 0.248 ± 0.121
IleIle: 2.785 ± 0.42
IleLys: 2.414 ± 0.358
IleLeu: 2.228 ± 0.413
IleMet: 1.361 ± 0.259
IleAsn: 2.042 ± 0.382
IlePro: 1.485 ± 0.36
IleGln: 1.238 ± 0.275
IleArg: 1.918 ± 0.348
IleSer: 3.837 ± 0.467
IleThr: 3.466 ± 0.449
IleVal: 4.579 ± 0.494
IleTrp: 0.557 ± 0.191
IleTyr: 1.176 ± 0.254
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.57 ± 0.586
LysCys: 0.371 ± 0.178
LysAsp: 2.166 ± 0.338
LysGlu: 1.671 ± 0.33
LysPhe: 1.609 ± 0.317
LysGly: 3.342 ± 0.479
LysHis: 1.052 ± 0.265
LysIle: 2.723 ± 0.411
LysLys: 0.928 ± 0.235
LysLeu: 5.322 ± 0.563
LysMet: 1.361 ± 0.308
LysAsn: 1.238 ± 0.303
LysPro: 2.599 ± 0.487
LysGln: 2.661 ± 0.469
LysArg: 3.713 ± 0.577
LysSer: 2.723 ± 0.427
LysThr: 2.909 ± 0.396
LysVal: 2.847 ± 0.402
LysTrp: 0.743 ± 0.219
LysTyr: 1.3 ± 0.295
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.045 ± 0.673
LeuCys: 0.186 ± 0.111
LeuAsp: 4.889 ± 0.48
LeuGlu: 4.208 ± 0.608
LeuPhe: 1.795 ± 0.305
LeuGly: 6.065 ± 0.877
LeuHis: 1.176 ± 0.251
LeuIle: 3.961 ± 0.531
LeuLys: 3.961 ± 0.489
LeuLeu: 7.117 ± 0.9
LeuMet: 1.857 ± 0.419
LeuAsn: 2.847 ± 0.375
LeuPro: 3.961 ± 0.547
LeuGln: 3.342 ± 0.375
LeuArg: 6.065 ± 0.576
LeuSer: 5.384 ± 0.751
LeuThr: 6.745 ± 0.546
LeuVal: 6.127 ± 0.515
LeuTrp: 0.928 ± 0.288
LeuTyr: 1.671 ± 0.283
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.475 ± 0.399
MetCys: 0.186 ± 0.109
MetAsp: 0.866 ± 0.252
MetGlu: 0.495 ± 0.233
MetPhe: 0.805 ± 0.22
MetGly: 1.114 ± 0.251
MetHis: 0.309 ± 0.122
MetIle: 0.866 ± 0.208
MetLys: 1.3 ± 0.324
MetLeu: 1.98 ± 0.335
MetMet: 0.557 ± 0.204
MetAsn: 1.176 ± 0.265
MetPro: 1.547 ± 0.356
MetGln: 1.114 ± 0.261
MetArg: 1.857 ± 0.259
MetSer: 2.599 ± 0.341
MetThr: 2.785 ± 0.358
MetVal: 0.619 ± 0.188
MetTrp: 0.248 ± 0.106
MetTyr: 0.371 ± 0.153
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.146 ± 0.669
AsnCys: 0.557 ± 0.197
AsnAsp: 1.857 ± 0.363
AsnGlu: 2.599 ± 0.377
AsnPhe: 0.805 ± 0.21
AsnGly: 2.909 ± 0.408
AsnHis: 0.495 ± 0.19
AsnIle: 2.104 ± 0.378
AsnLys: 1.547 ± 0.325
AsnLeu: 2.909 ± 0.35
AsnMet: 0.743 ± 0.205
AsnAsn: 1.485 ± 0.391
AsnPro: 2.537 ± 0.406
AsnGln: 0.928 ± 0.288
AsnArg: 1.609 ± 0.279
AsnSer: 1.547 ± 0.274
AsnThr: 2.661 ± 0.415
AsnVal: 3.094 ± 0.362
AsnTrp: 0.866 ± 0.214
AsnTyr: 0.743 ± 0.284
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.179 ± 1.011
ProCys: 0.309 ± 0.12
ProAsp: 3.589 ± 0.564
ProGlu: 3.28 ± 0.389
ProPhe: 1.114 ± 0.229
ProGly: 3.404 ± 0.547
ProHis: 1.3 ± 0.279
ProIle: 2.661 ± 0.544
ProLys: 2.228 ± 0.335
ProLeu: 3.404 ± 0.434
ProMet: 0.99 ± 0.27
ProAsn: 2.166 ± 0.487
ProPro: 2.414 ± 0.67
ProGln: 1.671 ± 0.318
ProArg: 2.042 ± 0.415
ProSer: 2.847 ± 0.522
ProThr: 2.97 ± 0.484
ProVal: 2.847 ± 0.455
ProTrp: 0.557 ± 0.17
ProTyr: 1.176 ± 0.281
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.208 ± 0.487
GlnCys: 0.433 ± 0.169
GlnAsp: 1.238 ± 0.24
GlnGlu: 1.547 ± 0.343
GlnPhe: 1.3 ± 0.294
GlnGly: 2.104 ± 0.42
GlnHis: 0.99 ± 0.245
GlnIle: 2.352 ± 0.466
GlnLys: 1.547 ± 0.316
GlnLeu: 3.466 ± 0.54
GlnMet: 1.3 ± 0.248
GlnAsn: 1.3 ± 0.285
GlnPro: 2.228 ± 0.351
GlnGln: 2.97 ± 0.499
GlnArg: 2.785 ± 0.44
GlnSer: 2.352 ± 0.393
GlnThr: 2.599 ± 0.439
GlnVal: 2.228 ± 0.397
GlnTrp: 0.495 ± 0.149
GlnTyr: 0.805 ± 0.253
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.065 ± 0.493
ArgCys: 0.433 ± 0.194
ArgAsp: 3.961 ± 0.427
ArgGlu: 5.075 ± 0.758
ArgPhe: 2.104 ± 0.404
ArgGly: 4.27 ± 0.555
ArgHis: 0.805 ± 0.231
ArgIle: 2.537 ± 0.469
ArgLys: 3.651 ± 0.461
ArgLeu: 4.889 ± 0.607
ArgMet: 0.99 ± 0.228
ArgAsn: 2.29 ± 0.423
ArgPro: 3.094 ± 0.492
ArgGln: 1.918 ± 0.297
ArgArg: 3.837 ± 0.69
ArgSer: 3.775 ± 0.459
ArgThr: 3.466 ± 0.59
ArgVal: 4.889 ± 0.581
ArgTrp: 1.547 ± 0.307
ArgTyr: 1.733 ± 0.405
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.127 ± 0.846
SerCys: 0.371 ± 0.135
SerAsp: 4.208 ± 0.408
SerGlu: 3.837 ± 0.527
SerPhe: 2.414 ± 0.35
SerGly: 4.084 ± 0.741
SerHis: 0.99 ± 0.235
SerIle: 3.466 ± 0.476
SerLys: 2.599 ± 0.464
SerLeu: 4.332 ± 0.583
SerMet: 1.485 ± 0.269
SerAsn: 2.104 ± 0.353
SerPro: 2.723 ± 0.379
SerGln: 1.671 ± 0.311
SerArg: 3.466 ± 0.518
SerSer: 4.023 ± 0.581
SerThr: 4.889 ± 0.75
SerVal: 5.322 ± 0.582
SerTrp: 1.238 ± 0.282
SerTyr: 1.547 ± 0.329
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.921 ± 0.806
ThrCys: 0.619 ± 0.226
ThrAsp: 4.394 ± 0.518
ThrGlu: 3.032 ± 0.415
ThrPhe: 2.661 ± 0.378
ThrGly: 4.456 ± 0.562
ThrHis: 1.485 ± 0.338
ThrIle: 3.28 ± 0.417
ThrLys: 2.847 ± 0.427
ThrLeu: 7.055 ± 0.565
ThrMet: 0.866 ± 0.252
ThrAsn: 2.166 ± 0.386
ThrPro: 3.218 ± 0.506
ThrGln: 2.537 ± 0.393
ThrArg: 3.837 ± 0.424
ThrSer: 4.703 ± 0.709
ThrThr: 5.693 ± 0.796
ThrVal: 4.703 ± 0.642
ThrTrp: 0.619 ± 0.276
ThrTyr: 1.795 ± 0.341
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.045 ± 0.713
ValCys: 0.248 ± 0.135
ValAsp: 6.745 ± 0.638
ValGlu: 5.322 ± 0.617
ValPhe: 2.228 ± 0.338
ValGly: 4.765 ± 0.763
ValHis: 0.99 ± 0.232
ValIle: 4.456 ± 0.488
ValLys: 3.651 ± 0.447
ValLeu: 4.765 ± 0.592
ValMet: 1.733 ± 0.415
ValAsn: 3.094 ± 0.528
ValPro: 3.466 ± 0.411
ValGln: 3.156 ± 0.449
ValArg: 5.508 ± 0.59
ValSer: 4.27 ± 0.521
ValThr: 4.951 ± 0.566
ValVal: 7.241 ± 0.722
ValTrp: 1.238 ± 0.253
ValTyr: 1.609 ± 0.306
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.671 ± 0.418
TrpCys: 0.124 ± 0.09
TrpAsp: 1.238 ± 0.32
TrpGlu: 0.866 ± 0.223
TrpPhe: 0.805 ± 0.275
TrpGly: 0.866 ± 0.263
TrpHis: 0.619 ± 0.193
TrpIle: 0.681 ± 0.207
TrpLys: 1.052 ± 0.24
TrpLeu: 2.042 ± 0.338
TrpMet: 0.371 ± 0.172
TrpAsn: 0.557 ± 0.173
TrpPro: 0.681 ± 0.229
TrpGln: 0.309 ± 0.125
TrpArg: 1.3 ± 0.26
TrpSer: 0.495 ± 0.159
TrpThr: 0.805 ± 0.24
TrpVal: 1.238 ± 0.227
TrpTrp: 0.186 ± 0.122
TrpTyr: 0.309 ± 0.18
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.166 ± 0.305
TyrCys: 0.186 ± 0.102
TyrAsp: 1.3 ± 0.254
TyrGlu: 1.3 ± 0.37
TyrPhe: 0.557 ± 0.202
TyrGly: 2.909 ± 0.44
TyrHis: 0.186 ± 0.101
TyrIle: 0.866 ± 0.273
TyrLys: 0.928 ± 0.235
TyrLeu: 1.238 ± 0.256
TyrMet: 0.619 ± 0.168
TyrAsn: 1.176 ± 0.232
TyrPro: 1.052 ± 0.3
TyrGln: 0.743 ± 0.2
TyrArg: 1.485 ± 0.343
TyrSer: 2.042 ± 0.394
TyrThr: 1.114 ± 0.266
TyrVal: 2.723 ± 0.43
TyrTrp: 0.371 ± 0.165
TyrTyr: 0.371 ± 0.15
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (16160 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
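For readers unfamiliar with the procedure, a protein-level bootstrap can be sketched as follows in Python. This is only an illustration under stated assumptions: the page says nothing beyond "bootstrapping (×100) at the protein level", so resampling whole proteins with replacement, reporting the standard deviation of the resampled estimates as the error, and the helper names pair_frequency and bootstrap_error are all assumptions of this example.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide among all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement and summarise the estimates.

    Returns (mean, standard deviation) of the dipeptide frequency over
    n_resamples bootstrap replicates. Assumption: the error is the
    standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return mean(estimates), stdev(estimates)


# Toy sequences (hypothetical, one-letter codes), not the phage proteome.
toy = ["MAAK", "AAC", "MKKAA", "ACDAA"]
avg, err = bootstrap_error(toy, "AA")
print(f"AA: {avg:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects variation between proteins.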

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski