Amino acid dipepetide frequency for Mycobacterium phage Misha28

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.076AlaAla: 15.076 ± 1.678
0.871AlaCys: 0.871 ± 0.246
7.674AlaAsp: 7.674 ± 0.653
8.491AlaGlu: 8.491 ± 0.7
3.102AlaPhe: 3.102 ± 0.483
10.722AlaGly: 10.722 ± 1.119
2.395AlaHis: 2.395 ± 0.404
4.898AlaIle: 4.898 ± 0.527
4.3AlaLys: 4.3 ± 0.505
8.872AlaLeu: 8.872 ± 0.77
2.667AlaMet: 2.667 ± 0.386
3.266AlaAsn: 3.266 ± 0.464
4.626AlaPro: 4.626 ± 0.582
4.137AlaGln: 4.137 ± 0.604
6.586AlaArg: 6.586 ± 0.687
6.259AlaSer: 6.259 ± 0.57
6.368AlaThr: 6.368 ± 0.526
7.021AlaVal: 7.021 ± 0.49
2.504AlaTrp: 2.504 ± 0.385
2.558AlaTyr: 2.558 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
0.98CysAla: 0.98 ± 0.292
0.109CysCys: 0.109 ± 0.073
0.925CysAsp: 0.925 ± 0.271
0.599CysGlu: 0.599 ± 0.187
0.109CysPhe: 0.109 ± 0.076
1.633CysGly: 1.633 ± 0.369
0.109CysHis: 0.109 ± 0.077
0.381CysIle: 0.381 ± 0.194
0.218CysLys: 0.218 ± 0.114
0.871CysLeu: 0.871 ± 0.316
0.054CysMet: 0.054 ± 0.048
0.272CysAsn: 0.272 ± 0.112
1.252CysPro: 1.252 ± 0.32
0.544CysGln: 0.544 ± 0.226
0.98CysArg: 0.98 ± 0.282
0.49CysSer: 0.49 ± 0.162
0.599CysThr: 0.599 ± 0.173
0.381CysVal: 0.381 ± 0.116
0.218CysTrp: 0.218 ± 0.118
0.381CysTyr: 0.381 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
7.729AspAla: 7.729 ± 0.652
0.816AspCys: 0.816 ± 0.203
5.062AspAsp: 5.062 ± 0.511
3.919AspGlu: 3.919 ± 0.558
1.415AspPhe: 1.415 ± 0.192
6.477AspGly: 6.477 ± 0.49
1.633AspHis: 1.633 ± 0.275
2.177AspIle: 2.177 ± 0.436
2.232AspLys: 2.232 ± 0.28
5.606AspLeu: 5.606 ± 0.581
1.252AspMet: 1.252 ± 0.249
1.578AspAsn: 1.578 ± 0.35
4.245AspPro: 4.245 ± 0.423
2.667AspGln: 2.667 ± 0.362
5.715AspArg: 5.715 ± 0.683
3.048AspSer: 3.048 ± 0.47
3.647AspThr: 3.647 ± 0.427
5.388AspVal: 5.388 ± 0.534
1.143AspTrp: 1.143 ± 0.257
2.286AspTyr: 2.286 ± 0.358
0.0AspXaa: 0.0 ± 0.0
Glu
5.279GluAla: 5.279 ± 0.642
0.98GluCys: 0.98 ± 0.248
2.776GluAsp: 2.776 ± 0.334
2.286GluGlu: 2.286 ± 0.416
1.796GluPhe: 1.796 ± 0.302
3.32GluGly: 3.32 ± 0.411
1.687GluHis: 1.687 ± 0.399
2.721GluIle: 2.721 ± 0.361
1.578GluLys: 1.578 ± 0.268
5.334GluLeu: 5.334 ± 0.63
1.687GluMet: 1.687 ± 0.276
1.687GluAsn: 1.687 ± 0.222
2.776GluPro: 2.776 ± 0.43
2.776GluGln: 2.776 ± 0.433
5.116GluArg: 5.116 ± 0.524
3.429GluSer: 3.429 ± 0.385
4.245GluThr: 4.245 ± 0.65
4.409GluVal: 4.409 ± 0.534
0.98GluTrp: 0.98 ± 0.232
1.524GluTyr: 1.524 ± 0.297
0.0GluXaa: 0.0 ± 0.0
Phe
2.776PheAla: 2.776 ± 0.43
0.272PheCys: 0.272 ± 0.141
2.667PheAsp: 2.667 ± 0.341
1.47PheGlu: 1.47 ± 0.267
1.143PhePhe: 1.143 ± 0.246
3.375PheGly: 3.375 ± 0.785
0.544PheHis: 0.544 ± 0.173
1.143PheIle: 1.143 ± 0.269
1.306PheLys: 1.306 ± 0.289
1.633PheLeu: 1.633 ± 0.24
0.49PheMet: 0.49 ± 0.156
0.925PheAsn: 0.925 ± 0.322
1.524PhePro: 1.524 ± 0.263
1.034PheGln: 1.034 ± 0.343
1.633PheArg: 1.633 ± 0.257
1.633PheSer: 1.633 ± 0.252
1.851PheThr: 1.851 ± 0.305
1.742PheVal: 1.742 ± 0.255
0.762PheTrp: 0.762 ± 0.183
0.98PheTyr: 0.98 ± 0.259
0.0PheXaa: 0.0 ± 0.0
Gly
10.396GlyAla: 10.396 ± 0.989
0.816GlyCys: 0.816 ± 0.18
5.606GlyAsp: 5.606 ± 0.454
3.375GlyGlu: 3.375 ± 0.495
2.939GlyPhe: 2.939 ± 0.442
10.886GlyGly: 10.886 ± 1.901
2.177GlyHis: 2.177 ± 0.287
4.626GlyIle: 4.626 ± 0.585
2.449GlyLys: 2.449 ± 0.36
6.422GlyLeu: 6.422 ± 0.614
2.123GlyMet: 2.123 ± 0.467
2.613GlyAsn: 2.613 ± 0.42
4.245GlyPro: 4.245 ± 0.562
2.177GlyGln: 2.177 ± 0.401
5.933GlyArg: 5.933 ± 0.71
6.205GlySer: 6.205 ± 0.814
6.695GlyThr: 6.695 ± 0.75
6.368GlyVal: 6.368 ± 0.639
2.667GlyTrp: 2.667 ± 0.399
2.721GlyTyr: 2.721 ± 0.468
0.0GlyXaa: 0.0 ± 0.0
His
2.014HisAla: 2.014 ± 0.376
0.272HisCys: 0.272 ± 0.165
1.143HisAsp: 1.143 ± 0.279
1.47HisGlu: 1.47 ± 0.311
0.49HisPhe: 0.49 ± 0.133
2.34HisGly: 2.34 ± 0.37
0.653HisHis: 0.653 ± 0.233
1.306HisIle: 1.306 ± 0.304
0.925HisLys: 0.925 ± 0.258
1.578HisLeu: 1.578 ± 0.259
0.272HisMet: 0.272 ± 0.105
0.653HisAsn: 0.653 ± 0.175
1.361HisPro: 1.361 ± 0.25
0.653HisGln: 0.653 ± 0.202
2.014HisArg: 2.014 ± 0.373
0.98HisSer: 0.98 ± 0.255
1.578HisThr: 1.578 ± 0.297
1.524HisVal: 1.524 ± 0.316
0.435HisTrp: 0.435 ± 0.132
1.034HisTyr: 1.034 ± 0.238
0.0HisXaa: 0.0 ± 0.0
Ile
6.15IleAla: 6.15 ± 0.58
0.218IleCys: 0.218 ± 0.104
3.756IleAsp: 3.756 ± 0.463
3.375IleGlu: 3.375 ± 0.437
0.816IlePhe: 0.816 ± 0.246
3.81IleGly: 3.81 ± 0.555
1.089IleHis: 1.089 ± 0.247
1.361IleIle: 1.361 ± 0.302
1.197IleLys: 1.197 ± 0.3
2.068IleLeu: 2.068 ± 0.452
0.218IleMet: 0.218 ± 0.088
1.796IleAsn: 1.796 ± 0.317
2.83IlePro: 2.83 ± 0.372
1.742IleGln: 1.742 ± 0.272
1.905IleArg: 1.905 ± 0.31
2.34IleSer: 2.34 ± 0.441
3.375IleThr: 3.375 ± 0.367
3.157IleVal: 3.157 ± 0.34
1.089IleTrp: 1.089 ± 0.264
0.871IleTyr: 0.871 ± 0.223
0.0IleXaa: 0.0 ± 0.0
Lys
3.538LysAla: 3.538 ± 0.414
0.327LysCys: 0.327 ± 0.138
1.47LysAsp: 1.47 ± 0.282
1.143LysGlu: 1.143 ± 0.232
0.925LysPhe: 0.925 ± 0.187
2.667LysGly: 2.667 ± 0.339
0.925LysHis: 0.925 ± 0.214
0.925LysIle: 0.925 ± 0.245
1.089LysLys: 1.089 ± 0.223
2.939LysLeu: 2.939 ± 0.531
0.653LysMet: 0.653 ± 0.165
0.925LysAsn: 0.925 ± 0.218
2.395LysPro: 2.395 ± 0.448
1.361LysGln: 1.361 ± 0.235
2.177LysArg: 2.177 ± 0.393
1.851LysSer: 1.851 ± 0.32
2.014LysThr: 2.014 ± 0.372
2.721LysVal: 2.721 ± 0.388
0.925LysTrp: 0.925 ± 0.248
0.599LysTyr: 0.599 ± 0.224
0.0LysXaa: 0.0 ± 0.0
Leu
8.382LeuAla: 8.382 ± 0.816
0.98LeuCys: 0.98 ± 0.265
5.987LeuAsp: 5.987 ± 0.601
3.429LeuGlu: 3.429 ± 0.433
2.667LeuPhe: 2.667 ± 0.443
5.878LeuGly: 5.878 ± 0.633
0.816LeuHis: 0.816 ± 0.259
2.994LeuIle: 2.994 ± 0.421
2.232LeuLys: 2.232 ± 0.436
4.245LeuLeu: 4.245 ± 0.571
1.252LeuMet: 1.252 ± 0.29
2.395LeuAsn: 2.395 ± 0.325
4.735LeuPro: 4.735 ± 0.498
2.667LeuGln: 2.667 ± 0.472
5.715LeuArg: 5.715 ± 0.713
4.191LeuSer: 4.191 ± 0.49
5.116LeuThr: 5.116 ± 0.488
5.388LeuVal: 5.388 ± 0.52
0.98LeuTrp: 0.98 ± 0.256
1.687LeuTyr: 1.687 ± 0.31
0.0LeuXaa: 0.0 ± 0.0
Met
2.177MetAla: 2.177 ± 0.386
0.109MetCys: 0.109 ± 0.079
1.089MetAsp: 1.089 ± 0.244
0.544MetGlu: 0.544 ± 0.14
0.381MetPhe: 0.381 ± 0.158
1.851MetGly: 1.851 ± 0.307
0.49MetHis: 0.49 ± 0.185
0.653MetIle: 0.653 ± 0.207
0.653MetLys: 0.653 ± 0.185
1.524MetLeu: 1.524 ± 0.234
0.599MetMet: 0.599 ± 0.225
0.762MetAsn: 0.762 ± 0.197
1.252MetPro: 1.252 ± 0.224
0.599MetGln: 0.599 ± 0.173
1.47MetArg: 1.47 ± 0.269
2.286MetSer: 2.286 ± 0.315
2.286MetThr: 2.286 ± 0.313
1.361MetVal: 1.361 ± 0.298
0.272MetTrp: 0.272 ± 0.135
0.272MetTyr: 0.272 ± 0.101
0.0MetXaa: 0.0 ± 0.0
Asn
2.83AsnAla: 2.83 ± 0.416
0.218AsnCys: 0.218 ± 0.12
1.851AsnAsp: 1.851 ± 0.316
1.578AsnGlu: 1.578 ± 0.249
0.816AsnPhe: 0.816 ± 0.264
3.864AsnGly: 3.864 ± 0.529
1.089AsnHis: 1.089 ± 0.271
1.633AsnIle: 1.633 ± 0.422
0.762AsnLys: 0.762 ± 0.181
2.014AsnLeu: 2.014 ± 0.288
0.599AsnMet: 0.599 ± 0.183
1.687AsnAsn: 1.687 ± 0.353
2.613AsnPro: 2.613 ± 0.331
0.98AsnGln: 0.98 ± 0.258
1.905AsnArg: 1.905 ± 0.364
1.687AsnSer: 1.687 ± 0.326
2.068AsnThr: 2.068 ± 0.301
2.068AsnVal: 2.068 ± 0.384
0.653AsnTrp: 0.653 ± 0.201
0.381AsnTyr: 0.381 ± 0.124
0.0AsnXaa: 0.0 ± 0.0
Pro
5.769ProAla: 5.769 ± 0.6
0.708ProCys: 0.708 ± 0.301
4.137ProAsp: 4.137 ± 0.483
3.919ProGlu: 3.919 ± 0.442
1.633ProPhe: 1.633 ± 0.305
6.041ProGly: 6.041 ± 0.683
1.415ProHis: 1.415 ± 0.273
2.286ProIle: 2.286 ± 0.391
1.796ProLys: 1.796 ± 0.31
4.245ProLeu: 4.245 ± 0.59
1.252ProMet: 1.252 ± 0.265
2.177ProAsn: 2.177 ± 0.328
3.211ProPro: 3.211 ± 0.533
2.721ProGln: 2.721 ± 0.452
3.483ProArg: 3.483 ± 0.535
3.375ProSer: 3.375 ± 0.331
3.647ProThr: 3.647 ± 0.429
4.191ProVal: 4.191 ± 0.451
1.143ProTrp: 1.143 ± 0.242
1.796ProTyr: 1.796 ± 0.273
0.0ProXaa: 0.0 ± 0.0
Gln
5.225GlnAla: 5.225 ± 0.588
0.109GlnCys: 0.109 ± 0.077
1.415GlnAsp: 1.415 ± 0.294
1.578GlnGlu: 1.578 ± 0.294
1.252GlnPhe: 1.252 ± 0.241
2.123GlnGly: 2.123 ± 0.435
0.816GlnHis: 0.816 ± 0.216
1.905GlnIle: 1.905 ± 0.42
1.089GlnLys: 1.089 ± 0.207
3.048GlnLeu: 3.048 ± 0.507
0.435GlnMet: 0.435 ± 0.217
1.143GlnAsn: 1.143 ± 0.312
2.504GlnPro: 2.504 ± 0.468
1.742GlnGln: 1.742 ± 0.491
3.102GlnArg: 3.102 ± 0.41
2.558GlnSer: 2.558 ± 0.368
1.742GlnThr: 1.742 ± 0.386
2.776GlnVal: 2.776 ± 0.346
1.034GlnTrp: 1.034 ± 0.238
0.871GlnTyr: 0.871 ± 0.255
0.0GlnXaa: 0.0 ± 0.0
Arg
8.11ArgAla: 8.11 ± 0.844
1.47ArgCys: 1.47 ± 0.387
5.443ArgAsp: 5.443 ± 0.627
4.409ArgGlu: 4.409 ± 0.581
2.232ArgPhe: 2.232 ± 0.344
4.409ArgGly: 4.409 ± 0.398
1.742ArgHis: 1.742 ± 0.352
4.137ArgIle: 4.137 ± 0.56
2.232ArgLys: 2.232 ± 0.344
4.517ArgLeu: 4.517 ± 0.579
2.667ArgMet: 2.667 ± 0.455
2.123ArgAsn: 2.123 ± 0.352
3.81ArgPro: 3.81 ± 0.5
2.068ArgGln: 2.068 ± 0.376
5.171ArgArg: 5.171 ± 0.852
3.592ArgSer: 3.592 ± 0.388
3.647ArgThr: 3.647 ± 0.534
4.463ArgVal: 4.463 ± 0.545
1.306ArgTrp: 1.306 ± 0.289
2.34ArgTyr: 2.34 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
6.64SerAla: 6.64 ± 0.645
0.327SerCys: 0.327 ± 0.126
4.3SerAsp: 4.3 ± 0.492
2.939SerGlu: 2.939 ± 0.391
1.959SerPhe: 1.959 ± 0.328
6.314SerGly: 6.314 ± 0.823
0.816SerHis: 0.816 ± 0.251
2.613SerIle: 2.613 ± 0.356
2.286SerLys: 2.286 ± 0.335
3.211SerLeu: 3.211 ± 0.455
1.361SerMet: 1.361 ± 0.28
2.123SerAsn: 2.123 ± 0.456
3.592SerPro: 3.592 ± 0.405
1.851SerGln: 1.851 ± 0.249
3.973SerArg: 3.973 ± 0.411
4.191SerSer: 4.191 ± 0.649
3.32SerThr: 3.32 ± 0.476
4.681SerVal: 4.681 ± 0.546
1.415SerTrp: 1.415 ± 0.303
1.089SerTyr: 1.089 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
6.368ThrAla: 6.368 ± 0.577
0.653ThrCys: 0.653 ± 0.231
3.973ThrAsp: 3.973 ± 0.488
4.028ThrGlu: 4.028 ± 0.526
1.361ThrPhe: 1.361 ± 0.245
7.076ThrGly: 7.076 ± 0.8
1.578ThrHis: 1.578 ± 0.312
3.102ThrIle: 3.102 ± 0.41
2.177ThrLys: 2.177 ± 0.346
4.409ThrLeu: 4.409 ± 0.473
0.925ThrMet: 0.925 ± 0.298
1.959ThrAsn: 1.959 ± 0.281
4.245ThrPro: 4.245 ± 0.457
1.687ThrGln: 1.687 ± 0.294
3.919ThrArg: 3.919 ± 0.57
3.81ThrSer: 3.81 ± 0.475
5.171ThrThr: 5.171 ± 0.571
5.334ThrVal: 5.334 ± 0.705
1.197ThrTrp: 1.197 ± 0.23
2.014ThrTyr: 2.014 ± 0.353
0.0ThrXaa: 0.0 ± 0.0
Val
8.273ValAla: 8.273 ± 0.649
1.252ValCys: 1.252 ± 0.27
5.606ValAsp: 5.606 ± 0.478
4.844ValGlu: 4.844 ± 0.583
2.232ValPhe: 2.232 ± 0.408
5.388ValGly: 5.388 ± 0.668
1.306ValHis: 1.306 ± 0.278
2.449ValIle: 2.449 ± 0.357
2.014ValLys: 2.014 ± 0.343
6.041ValLeu: 6.041 ± 0.612
0.708ValMet: 0.708 ± 0.17
2.34ValAsn: 2.34 ± 0.379
4.79ValPro: 4.79 ± 0.518
2.83ValGln: 2.83 ± 0.366
4.898ValArg: 4.898 ± 0.675
4.354ValSer: 4.354 ± 0.448
4.626ValThr: 4.626 ± 0.478
6.422ValVal: 6.422 ± 0.724
1.633ValTrp: 1.633 ± 0.353
1.361ValTyr: 1.361 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
1.905TrpAla: 1.905 ± 0.232
0.327TrpCys: 0.327 ± 0.168
1.415TrpAsp: 1.415 ± 0.254
0.98TrpGlu: 0.98 ± 0.254
0.925TrpPhe: 0.925 ± 0.233
1.143TrpGly: 1.143 ± 0.236
0.816TrpHis: 0.816 ± 0.226
0.816TrpIle: 0.816 ± 0.209
0.435TrpLys: 0.435 ± 0.143
1.633TrpLeu: 1.633 ± 0.305
1.143TrpMet: 1.143 ± 0.291
0.435TrpAsn: 0.435 ± 0.198
1.197TrpPro: 1.197 ± 0.27
0.925TrpGln: 0.925 ± 0.247
2.068TrpArg: 2.068 ± 0.352
1.415TrpSer: 1.415 ± 0.291
1.197TrpThr: 1.197 ± 0.222
1.524TrpVal: 1.524 ± 0.328
0.816TrpTrp: 0.816 ± 0.191
0.653TrpTyr: 0.653 ± 0.176
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.776TyrAla: 2.776 ± 0.387
0.272TyrCys: 0.272 ± 0.136
1.742TyrAsp: 1.742 ± 0.371
2.014TyrGlu: 2.014 ± 0.338
0.708TyrPhe: 0.708 ± 0.184
2.123TyrGly: 2.123 ± 0.391
0.599TyrHis: 0.599 ± 0.174
1.034TyrIle: 1.034 ± 0.204
0.653TyrLys: 0.653 ± 0.211
1.524TyrLeu: 1.524 ± 0.329
0.218TyrMet: 0.218 ± 0.109
0.435TyrAsn: 0.435 ± 0.141
1.742TyrPro: 1.742 ± 0.3
1.306TyrGln: 1.306 ± 0.325
2.123TyrArg: 2.123 ± 0.307
1.306TyrSer: 1.306 ± 0.252
1.796TyrThr: 1.796 ± 0.336
2.449TyrVal: 2.449 ± 0.294
0.599TyrTrp: 0.599 ± 0.151
0.816TyrTyr: 0.816 ± 0.145
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 93 proteins (18374 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski