Amino acid dipepetide frequency for Microbacterium phage Stormbreaker

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.794AlaAla: 11.794 ± 1.152
0.584AlaCys: 0.584 ± 0.264
5.897AlaAsp: 5.897 ± 0.574
5.196AlaGlu: 5.196 ± 0.596
3.036AlaPhe: 3.036 ± 0.341
6.539AlaGly: 6.539 ± 0.768
1.868AlaHis: 1.868 ± 0.326
4.029AlaIle: 4.029 ± 0.544
3.503AlaLys: 3.503 ± 0.541
7.999AlaLeu: 7.999 ± 0.805
2.627AlaMet: 2.627 ± 0.485
4.496AlaAsn: 4.496 ± 0.558
5.547AlaPro: 5.547 ± 1.031
3.854AlaGln: 3.854 ± 0.532
6.247AlaArg: 6.247 ± 0.681
5.956AlaSer: 5.956 ± 0.738
5.956AlaThr: 5.956 ± 0.776
5.722AlaVal: 5.722 ± 0.469
1.81AlaTrp: 1.81 ± 0.268
3.095AlaTyr: 3.095 ± 0.349
0.0AlaXaa: 0.0 ± 0.0
Cys
0.234CysAla: 0.234 ± 0.127
0.0CysCys: 0.0 ± 0.0
0.058CysAsp: 0.058 ± 0.07
0.642CysGlu: 0.642 ± 0.265
0.292CysPhe: 0.292 ± 0.176
0.467CysGly: 0.467 ± 0.197
0.117CysHis: 0.117 ± 0.094
0.175CysIle: 0.175 ± 0.115
0.234CysLys: 0.234 ± 0.137
0.35CysLeu: 0.35 ± 0.186
0.175CysMet: 0.175 ± 0.094
0.117CysAsn: 0.117 ± 0.103
0.409CysPro: 0.409 ± 0.205
0.234CysGln: 0.234 ± 0.172
0.117CysArg: 0.117 ± 0.088
0.35CysSer: 0.35 ± 0.142
0.292CysThr: 0.292 ± 0.166
0.292CysVal: 0.292 ± 0.161
0.058CysTrp: 0.058 ± 0.07
0.292CysTyr: 0.292 ± 0.152
0.0CysXaa: 0.0 ± 0.0
Asp
7.707AspAla: 7.707 ± 0.641
0.409AspCys: 0.409 ± 0.188
5.021AspAsp: 5.021 ± 0.577
4.729AspGlu: 4.729 ± 0.596
2.16AspPhe: 2.16 ± 0.308
7.123AspGly: 7.123 ± 0.743
1.109AspHis: 1.109 ± 0.222
3.562AspIle: 3.562 ± 0.465
3.036AspLys: 3.036 ± 0.475
4.613AspLeu: 4.613 ± 0.588
1.576AspMet: 1.576 ± 0.292
2.686AspAsn: 2.686 ± 0.431
3.036AspPro: 3.036 ± 0.445
3.095AspGln: 3.095 ± 0.389
3.036AspArg: 3.036 ± 0.505
3.678AspSer: 3.678 ± 0.296
3.562AspThr: 3.562 ± 0.666
4.087AspVal: 4.087 ± 0.63
1.693AspTrp: 1.693 ± 0.272
1.868AspTyr: 1.868 ± 0.338
0.0AspXaa: 0.0 ± 0.0
Glu
5.488GluAla: 5.488 ± 0.498
0.058GluCys: 0.058 ± 0.067
4.321GluAsp: 4.321 ± 0.619
4.087GluGlu: 4.087 ± 0.654
2.861GluPhe: 2.861 ± 0.441
4.846GluGly: 4.846 ± 0.434
1.226GluHis: 1.226 ± 0.384
3.678GluIle: 3.678 ± 0.577
3.795GluLys: 3.795 ± 0.659
6.364GluLeu: 6.364 ± 0.92
1.401GluMet: 1.401 ± 0.312
2.978GluAsn: 2.978 ± 0.544
2.102GluPro: 2.102 ± 0.537
2.919GluGln: 2.919 ± 0.439
4.321GluArg: 4.321 ± 0.619
3.562GluSer: 3.562 ± 0.44
3.211GluThr: 3.211 ± 0.423
4.146GluVal: 4.146 ± 0.527
1.635GluTrp: 1.635 ± 0.465
2.744GluTyr: 2.744 ± 0.319
0.0GluXaa: 0.0 ± 0.0
Phe
1.927PheAla: 1.927 ± 0.392
0.467PheCys: 0.467 ± 0.21
2.569PheAsp: 2.569 ± 0.349
3.153PheGlu: 3.153 ± 0.588
1.109PhePhe: 1.109 ± 0.201
2.511PheGly: 2.511 ± 0.291
0.701PheHis: 0.701 ± 0.232
1.343PheIle: 1.343 ± 0.238
2.102PheLys: 2.102 ± 0.371
2.511PheLeu: 2.511 ± 0.361
1.168PheMet: 1.168 ± 0.238
1.343PheAsn: 1.343 ± 0.354
1.693PhePro: 1.693 ± 0.24
1.343PheGln: 1.343 ± 0.266
2.452PheArg: 2.452 ± 0.47
2.044PheSer: 2.044 ± 0.333
2.044PheThr: 2.044 ± 0.468
2.861PheVal: 2.861 ± 0.521
0.35PheTrp: 0.35 ± 0.134
0.993PheTyr: 0.993 ± 0.177
0.0PheXaa: 0.0 ± 0.0
Gly
7.298GlyAla: 7.298 ± 0.967
0.058GlyCys: 0.058 ± 0.068
4.496GlyAsp: 4.496 ± 0.443
5.372GlyGlu: 5.372 ± 0.43
2.744GlyPhe: 2.744 ± 0.316
7.065GlyGly: 7.065 ± 1.081
1.109GlyHis: 1.109 ± 0.265
4.554GlyIle: 4.554 ± 0.58
3.386GlyLys: 3.386 ± 0.611
5.839GlyLeu: 5.839 ± 0.497
2.335GlyMet: 2.335 ± 0.411
2.452GlyAsn: 2.452 ± 0.54
2.16GlyPro: 2.16 ± 0.356
3.445GlyGln: 3.445 ± 0.377
4.496GlyArg: 4.496 ± 0.577
4.963GlySer: 4.963 ± 0.609
5.313GlyThr: 5.313 ± 0.794
5.488GlyVal: 5.488 ± 0.559
1.051GlyTrp: 1.051 ± 0.245
3.153GlyTyr: 3.153 ± 0.342
0.0GlyXaa: 0.0 ± 0.0
His
1.109HisAla: 1.109 ± 0.346
0.35HisCys: 0.35 ± 0.211
1.109HisAsp: 1.109 ± 0.26
1.343HisGlu: 1.343 ± 0.285
0.35HisPhe: 0.35 ± 0.118
0.876HisGly: 0.876 ± 0.23
0.175HisHis: 0.175 ± 0.117
0.642HisIle: 0.642 ± 0.172
1.343HisLys: 1.343 ± 0.394
1.343HisLeu: 1.343 ± 0.318
0.35HisMet: 0.35 ± 0.151
0.35HisAsn: 0.35 ± 0.131
1.226HisPro: 1.226 ± 0.23
0.876HisGln: 0.876 ± 0.194
1.226HisArg: 1.226 ± 0.28
0.817HisSer: 0.817 ± 0.211
0.701HisThr: 0.701 ± 0.275
1.401HisVal: 1.401 ± 0.343
0.175HisTrp: 0.175 ± 0.084
0.409HisTyr: 0.409 ± 0.158
0.0HisXaa: 0.0 ± 0.0
Ile
4.204IleAla: 4.204 ± 0.513
0.175IleCys: 0.175 ± 0.118
4.729IleAsp: 4.729 ± 0.759
3.27IleGlu: 3.27 ± 0.533
1.051IlePhe: 1.051 ± 0.239
2.335IleGly: 2.335 ± 0.521
0.409IleHis: 0.409 ± 0.158
2.569IleIle: 2.569 ± 0.457
2.861IleLys: 2.861 ± 0.511
3.562IleLeu: 3.562 ± 0.513
0.934IleMet: 0.934 ± 0.244
2.569IleAsn: 2.569 ± 0.533
2.219IlePro: 2.219 ± 0.378
2.16IleGln: 2.16 ± 0.345
2.627IleArg: 2.627 ± 0.479
2.919IleSer: 2.919 ± 0.349
2.686IleThr: 2.686 ± 0.383
3.036IleVal: 3.036 ± 0.624
0.759IleTrp: 0.759 ± 0.169
1.285IleTyr: 1.285 ± 0.216
0.0IleXaa: 0.0 ± 0.0
Lys
4.437LysAla: 4.437 ± 0.672
0.292LysCys: 0.292 ± 0.177
2.919LysAsp: 2.919 ± 0.39
2.978LysGlu: 2.978 ± 0.476
1.226LysPhe: 1.226 ± 0.232
2.686LysGly: 2.686 ± 0.414
0.642LysHis: 0.642 ± 0.196
1.576LysIle: 1.576 ± 0.252
1.985LysLys: 1.985 ± 0.489
4.204LysLeu: 4.204 ± 0.661
0.934LysMet: 0.934 ± 0.295
2.744LysAsn: 2.744 ± 0.361
2.16LysPro: 2.16 ± 0.395
1.518LysGln: 1.518 ± 0.276
2.511LysArg: 2.511 ± 0.584
2.16LysSer: 2.16 ± 0.426
3.503LysThr: 3.503 ± 0.411
3.095LysVal: 3.095 ± 0.46
0.759LysTrp: 0.759 ± 0.272
2.044LysTyr: 2.044 ± 0.385
0.0LysXaa: 0.0 ± 0.0
Leu
7.649LeuAla: 7.649 ± 0.709
0.584LeuCys: 0.584 ± 0.231
5.43LeuAsp: 5.43 ± 0.591
5.08LeuGlu: 5.08 ± 0.593
2.744LeuPhe: 2.744 ± 0.418
6.364LeuGly: 6.364 ± 0.833
1.401LeuHis: 1.401 ± 0.34
3.678LeuIle: 3.678 ± 0.679
3.562LeuLys: 3.562 ± 0.52
6.715LeuLeu: 6.715 ± 0.812
2.919LeuMet: 2.919 ± 0.641
3.27LeuAsn: 3.27 ± 0.434
3.737LeuPro: 3.737 ± 0.498
4.087LeuGln: 4.087 ± 0.868
5.956LeuArg: 5.956 ± 0.639
5.372LeuSer: 5.372 ± 0.888
5.196LeuThr: 5.196 ± 0.554
5.547LeuVal: 5.547 ± 0.778
1.285LeuTrp: 1.285 ± 0.377
2.744LeuTyr: 2.744 ± 0.355
0.0LeuXaa: 0.0 ± 0.0
Met
3.095MetAla: 3.095 ± 0.401
0.175MetCys: 0.175 ± 0.1
1.576MetAsp: 1.576 ± 0.221
1.168MetGlu: 1.168 ± 0.306
1.401MetPhe: 1.401 ± 0.305
1.927MetGly: 1.927 ± 0.625
0.409MetHis: 0.409 ± 0.131
0.934MetIle: 0.934 ± 0.216
1.635MetLys: 1.635 ± 0.333
1.985MetLeu: 1.985 ± 0.324
0.642MetMet: 0.642 ± 0.205
1.168MetAsn: 1.168 ± 0.216
1.518MetPro: 1.518 ± 0.277
1.343MetGln: 1.343 ± 0.271
1.576MetArg: 1.576 ± 0.364
2.394MetSer: 2.394 ± 0.362
1.985MetThr: 1.985 ± 0.399
1.168MetVal: 1.168 ± 0.271
0.292MetTrp: 0.292 ± 0.154
0.817MetTyr: 0.817 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
4.029AsnAla: 4.029 ± 0.836
0.35AsnCys: 0.35 ± 0.163
2.511AsnAsp: 2.511 ± 0.469
2.335AsnGlu: 2.335 ± 0.31
1.285AsnPhe: 1.285 ± 0.239
4.846AsnGly: 4.846 ± 0.492
0.642AsnHis: 0.642 ± 0.252
1.81AsnIle: 1.81 ± 0.334
2.044AsnLys: 2.044 ± 0.417
3.795AsnLeu: 3.795 ± 0.536
0.993AsnMet: 0.993 ± 0.238
1.752AsnAsn: 1.752 ± 0.323
2.861AsnPro: 2.861 ± 0.749
2.277AsnGln: 2.277 ± 0.405
1.635AsnArg: 1.635 ± 0.375
2.277AsnSer: 2.277 ± 0.425
3.445AsnThr: 3.445 ± 0.54
3.503AsnVal: 3.503 ± 0.417
0.467AsnTrp: 0.467 ± 0.16
1.343AsnTyr: 1.343 ± 0.261
0.0AsnXaa: 0.0 ± 0.0
Pro
4.554ProAla: 4.554 ± 0.604
0.117ProCys: 0.117 ± 0.091
3.386ProAsp: 3.386 ± 0.4
4.087ProGlu: 4.087 ± 0.532
1.226ProPhe: 1.226 ± 0.246
3.795ProGly: 3.795 ± 0.601
0.409ProHis: 0.409 ± 0.208
2.569ProIle: 2.569 ± 0.376
1.693ProLys: 1.693 ± 0.481
4.029ProLeu: 4.029 ± 0.531
0.934ProMet: 0.934 ± 0.255
2.919ProAsn: 2.919 ± 0.328
1.927ProPro: 1.927 ± 0.352
1.46ProGln: 1.46 ± 0.312
2.219ProArg: 2.219 ± 0.437
2.627ProSer: 2.627 ± 0.361
3.386ProThr: 3.386 ± 0.555
3.854ProVal: 3.854 ± 0.484
0.759ProTrp: 0.759 ± 0.22
1.285ProTyr: 1.285 ± 0.241
0.0ProXaa: 0.0 ± 0.0
Gln
5.839GlnAla: 5.839 ± 0.539
0.234GlnCys: 0.234 ± 0.133
2.978GlnAsp: 2.978 ± 0.385
2.686GlnGlu: 2.686 ± 0.532
1.518GlnPhe: 1.518 ± 0.24
3.036GlnGly: 3.036 ± 0.415
0.876GlnHis: 0.876 ± 0.214
1.868GlnIle: 1.868 ± 0.298
1.226GlnLys: 1.226 ± 0.262
4.087GlnLeu: 4.087 ± 0.457
1.752GlnMet: 1.752 ± 0.368
1.985GlnAsn: 1.985 ± 0.343
2.102GlnPro: 2.102 ± 0.397
2.744GlnGln: 2.744 ± 0.561
3.153GlnArg: 3.153 ± 0.466
2.219GlnSer: 2.219 ± 0.383
2.803GlnThr: 2.803 ± 0.436
2.335GlnVal: 2.335 ± 0.394
0.934GlnTrp: 0.934 ± 0.244
1.46GlnTyr: 1.46 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
4.788ArgAla: 4.788 ± 0.738
0.058ArgCys: 0.058 ± 0.077
3.854ArgAsp: 3.854 ± 0.513
5.196ArgGlu: 5.196 ± 0.707
2.394ArgPhe: 2.394 ± 0.599
4.379ArgGly: 4.379 ± 0.648
0.817ArgHis: 0.817 ± 0.193
2.452ArgIle: 2.452 ± 0.415
2.277ArgLys: 2.277 ± 0.372
5.255ArgLeu: 5.255 ± 0.707
1.693ArgMet: 1.693 ± 0.351
2.744ArgAsn: 2.744 ± 0.539
2.686ArgPro: 2.686 ± 0.396
2.861ArgGln: 2.861 ± 0.432
4.905ArgArg: 4.905 ± 0.948
3.27ArgSer: 3.27 ± 0.592
3.795ArgThr: 3.795 ± 0.742
4.146ArgVal: 4.146 ± 0.45
0.701ArgTrp: 0.701 ± 0.225
1.518ArgTyr: 1.518 ± 0.313
0.0ArgXaa: 0.0 ± 0.0
Ser
5.43SerAla: 5.43 ± 0.577
0.117SerCys: 0.117 ± 0.091
4.321SerAsp: 4.321 ± 0.621
3.095SerGlu: 3.095 ± 0.461
3.153SerPhe: 3.153 ± 0.458
5.08SerGly: 5.08 ± 0.755
0.993SerHis: 0.993 ± 0.206
2.627SerIle: 2.627 ± 0.366
2.335SerLys: 2.335 ± 0.381
4.262SerLeu: 4.262 ± 0.562
1.81SerMet: 1.81 ± 0.366
2.978SerAsn: 2.978 ± 0.345
2.744SerPro: 2.744 ± 0.415
2.861SerGln: 2.861 ± 0.442
3.036SerArg: 3.036 ± 0.386
3.503SerSer: 3.503 ± 0.561
3.97SerThr: 3.97 ± 0.721
3.854SerVal: 3.854 ± 0.461
1.343SerTrp: 1.343 ± 0.238
1.868SerTyr: 1.868 ± 0.328
0.0SerXaa: 0.0 ± 0.0
Thr
5.605ThrAla: 5.605 ± 0.566
0.117ThrCys: 0.117 ± 0.111
4.321ThrAsp: 4.321 ± 0.421
4.321ThrGlu: 4.321 ± 0.523
2.044ThrPhe: 2.044 ± 0.347
5.78ThrGly: 5.78 ± 0.797
0.876ThrHis: 0.876 ± 0.272
3.211ThrIle: 3.211 ± 0.498
2.102ThrLys: 2.102 ± 0.279
6.306ThrLeu: 6.306 ± 0.681
1.81ThrMet: 1.81 ± 0.34
2.627ThrAsn: 2.627 ± 0.385
3.153ThrPro: 3.153 ± 0.434
2.511ThrGln: 2.511 ± 0.358
3.62ThrArg: 3.62 ± 0.418
4.321ThrSer: 4.321 ± 0.638
5.313ThrThr: 5.313 ± 0.86
3.503ThrVal: 3.503 ± 0.526
1.576ThrTrp: 1.576 ± 0.369
1.635ThrTyr: 1.635 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
6.014ValAla: 6.014 ± 0.727
0.35ValCys: 0.35 ± 0.164
4.379ValAsp: 4.379 ± 0.532
4.204ValGlu: 4.204 ± 0.469
2.277ValPhe: 2.277 ± 0.341
4.379ValGly: 4.379 ± 0.613
1.576ValHis: 1.576 ± 0.329
2.978ValIle: 2.978 ± 0.377
2.803ValLys: 2.803 ± 0.371
6.014ValLeu: 6.014 ± 0.623
1.518ValMet: 1.518 ± 0.323
2.511ValAsn: 2.511 ± 0.346
4.087ValPro: 4.087 ± 0.702
3.562ValGln: 3.562 ± 0.328
4.262ValArg: 4.262 ± 0.578
3.678ValSer: 3.678 ± 0.421
4.437ValThr: 4.437 ± 0.576
3.62ValVal: 3.62 ± 0.5
1.051ValTrp: 1.051 ± 0.294
2.219ValTyr: 2.219 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
1.285TrpAla: 1.285 ± 0.233
0.058TrpCys: 0.058 ± 0.077
1.576TrpAsp: 1.576 ± 0.253
0.934TrpGlu: 0.934 ± 0.228
0.642TrpPhe: 0.642 ± 0.211
1.226TrpGly: 1.226 ± 0.367
0.234TrpHis: 0.234 ± 0.101
0.584TrpIle: 0.584 ± 0.215
0.817TrpLys: 0.817 ± 0.211
1.752TrpLeu: 1.752 ± 0.402
0.525TrpMet: 0.525 ± 0.178
0.701TrpAsn: 0.701 ± 0.216
0.409TrpPro: 0.409 ± 0.149
0.759TrpGln: 0.759 ± 0.21
0.701TrpArg: 0.701 ± 0.197
1.285TrpSer: 1.285 ± 0.351
1.343TrpThr: 1.343 ± 0.299
1.518TrpVal: 1.518 ± 0.254
0.234TrpTrp: 0.234 ± 0.125
0.584TrpTyr: 0.584 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.27TyrAla: 3.27 ± 0.372
0.35TyrCys: 0.35 ± 0.156
2.335TyrAsp: 2.335 ± 0.246
1.868TyrGlu: 1.868 ± 0.325
1.168TyrPhe: 1.168 ± 0.284
1.518TyrGly: 1.518 ± 0.3
0.584TyrHis: 0.584 ± 0.202
1.518TyrIle: 1.518 ± 0.239
1.635TyrLys: 1.635 ± 0.312
2.394TyrLeu: 2.394 ± 0.377
1.051TyrMet: 1.051 ± 0.293
1.81TyrAsn: 1.81 ± 0.462
1.46TyrPro: 1.46 ± 0.374
1.927TyrGln: 1.927 ± 0.221
1.693TyrArg: 1.693 ± 0.365
1.985TyrSer: 1.985 ± 0.339
1.81TyrThr: 1.81 ± 0.372
2.744TyrVal: 2.744 ± 0.393
0.234TyrTrp: 0.234 ± 0.118
1.285TyrTyr: 1.285 ± 0.281
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (17128 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski