Amino acid dipepetide frequency for Mycobacterium phage PattyP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.096AlaAla: 13.096 ± 1.433
0.845AlaCys: 0.845 ± 0.232
6.337AlaAsp: 6.337 ± 0.677
6.095AlaGlu: 6.095 ± 0.609
3.259AlaPhe: 3.259 ± 0.435
7.725AlaGly: 7.725 ± 0.77
1.569AlaHis: 1.569 ± 0.354
4.406AlaIle: 4.406 ± 0.64
3.862AlaLys: 3.862 ± 0.455
8.147AlaLeu: 8.147 ± 0.713
2.293AlaMet: 2.293 ± 0.349
2.535AlaAsn: 2.535 ± 0.34
5.733AlaPro: 5.733 ± 0.768
2.836AlaGln: 2.836 ± 0.412
6.337AlaArg: 6.337 ± 0.495
5.25AlaSer: 5.25 ± 0.641
5.673AlaThr: 5.673 ± 0.604
7.725AlaVal: 7.725 ± 0.652
2.052AlaTrp: 2.052 ± 0.335
2.535AlaTyr: 2.535 ± 0.4
0.0AlaXaa: 0.0 ± 0.0
Cys
0.905CysAla: 0.905 ± 0.226
0.06CysCys: 0.06 ± 0.056
0.604CysAsp: 0.604 ± 0.168
0.845CysGlu: 0.845 ± 0.231
0.181CysPhe: 0.181 ± 0.1
0.785CysGly: 0.785 ± 0.245
0.241CysHis: 0.241 ± 0.113
0.302CysIle: 0.302 ± 0.145
0.362CysLys: 0.362 ± 0.148
0.483CysLeu: 0.483 ± 0.226
0.241CysMet: 0.241 ± 0.127
0.302CysAsn: 0.302 ± 0.121
0.241CysPro: 0.241 ± 0.116
0.181CysGln: 0.181 ± 0.104
0.483CysArg: 0.483 ± 0.17
0.483CysSer: 0.483 ± 0.159
0.241CysThr: 0.241 ± 0.118
0.302CysVal: 0.302 ± 0.12
0.181CysTrp: 0.181 ± 0.105
0.181CysTyr: 0.181 ± 0.107
0.0CysXaa: 0.0 ± 0.0
Asp
6.095AspAla: 6.095 ± 0.565
0.664AspCys: 0.664 ± 0.187
4.587AspAsp: 4.587 ± 0.472
3.561AspGlu: 3.561 ± 0.386
2.173AspPhe: 2.173 ± 0.319
6.156AspGly: 6.156 ± 0.608
1.267AspHis: 1.267 ± 0.287
3.018AspIle: 3.018 ± 0.45
2.535AspLys: 2.535 ± 0.364
6.578AspLeu: 6.578 ± 0.639
1.147AspMet: 1.147 ± 0.196
1.69AspAsn: 1.69 ± 0.357
4.647AspPro: 4.647 ± 0.546
1.509AspGln: 1.509 ± 0.375
3.923AspArg: 3.923 ± 0.394
2.957AspSer: 2.957 ± 0.453
3.923AspThr: 3.923 ± 0.415
4.164AspVal: 4.164 ± 0.466
1.871AspTrp: 1.871 ± 0.334
2.233AspTyr: 2.233 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
5.733GluAla: 5.733 ± 0.66
0.422GluCys: 0.422 ± 0.183
4.949GluAsp: 4.949 ± 0.62
4.768GluGlu: 4.768 ± 0.584
2.112GluPhe: 2.112 ± 0.383
4.164GluGly: 4.164 ± 0.554
1.75GluHis: 1.75 ± 0.36
3.5GluIle: 3.5 ± 0.588
2.776GluLys: 2.776 ± 0.412
6.639GluLeu: 6.639 ± 0.55
1.448GluMet: 1.448 ± 0.256
1.629GluAsn: 1.629 ± 0.393
2.655GluPro: 2.655 ± 0.421
2.595GluGln: 2.595 ± 0.412
3.742GluArg: 3.742 ± 0.457
3.621GluSer: 3.621 ± 0.419
3.5GluThr: 3.5 ± 0.502
5.13GluVal: 5.13 ± 0.588
1.629GluTrp: 1.629 ± 0.402
2.595GluTyr: 2.595 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
2.474PheAla: 2.474 ± 0.329
0.302PheCys: 0.302 ± 0.166
2.957PheAsp: 2.957 ± 0.401
1.931PheGlu: 1.931 ± 0.304
0.664PhePhe: 0.664 ± 0.167
3.5PheGly: 3.5 ± 0.421
0.664PheHis: 0.664 ± 0.246
1.267PheIle: 1.267 ± 0.3
1.207PheLys: 1.207 ± 0.273
2.354PheLeu: 2.354 ± 0.396
0.845PheMet: 0.845 ± 0.22
0.966PheAsn: 0.966 ± 0.226
1.69PhePro: 1.69 ± 0.303
0.905PheGln: 0.905 ± 0.193
1.871PheArg: 1.871 ± 0.326
1.992PheSer: 1.992 ± 0.382
2.173PheThr: 2.173 ± 0.36
1.992PheVal: 1.992 ± 0.333
0.604PheTrp: 0.604 ± 0.2
0.845PheTyr: 0.845 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
7.483GlyAla: 7.483 ± 0.884
0.664GlyCys: 0.664 ± 0.2
5.613GlyAsp: 5.613 ± 0.532
4.406GlyGlu: 4.406 ± 0.523
2.957GlyPhe: 2.957 ± 0.469
9.173GlyGly: 9.173 ± 1.708
1.871GlyHis: 1.871 ± 0.361
4.406GlyIle: 4.406 ± 0.686
3.862GlyLys: 3.862 ± 0.606
7.906GlyLeu: 7.906 ± 0.902
1.569GlyMet: 1.569 ± 0.244
3.259GlyAsn: 3.259 ± 0.426
3.802GlyPro: 3.802 ± 0.529
2.595GlyGln: 2.595 ± 0.358
5.069GlyArg: 5.069 ± 0.604
6.095GlySer: 6.095 ± 0.752
5.13GlyThr: 5.13 ± 0.588
5.311GlyVal: 5.311 ± 0.557
2.474GlyTrp: 2.474 ± 0.414
2.776GlyTyr: 2.776 ± 0.433
0.0GlyXaa: 0.0 ± 0.0
His
1.992HisAla: 1.992 ± 0.388
0.241HisCys: 0.241 ± 0.141
1.267HisAsp: 1.267 ± 0.245
1.267HisGlu: 1.267 ± 0.312
0.604HisPhe: 0.604 ± 0.204
1.509HisGly: 1.509 ± 0.342
0.785HisHis: 0.785 ± 0.197
0.905HisIle: 0.905 ± 0.196
1.086HisLys: 1.086 ± 0.313
1.992HisLeu: 1.992 ± 0.397
0.06HisMet: 0.06 ± 0.069
0.302HisAsn: 0.302 ± 0.121
1.147HisPro: 1.147 ± 0.247
1.026HisGln: 1.026 ± 0.253
1.75HisArg: 1.75 ± 0.355
0.785HisSer: 0.785 ± 0.19
1.267HisThr: 1.267 ± 0.303
1.569HisVal: 1.569 ± 0.296
0.664HisTrp: 0.664 ± 0.157
0.724HisTyr: 0.724 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
6.397IleAla: 6.397 ± 0.76
0.241IleCys: 0.241 ± 0.126
3.561IleAsp: 3.561 ± 0.325
3.923IleGlu: 3.923 ± 0.427
0.905IlePhe: 0.905 ± 0.238
3.802IleGly: 3.802 ± 0.471
0.845IleHis: 0.845 ± 0.191
1.629IleIle: 1.629 ± 0.317
1.69IleLys: 1.69 ± 0.335
3.44IleLeu: 3.44 ± 0.45
0.664IleMet: 0.664 ± 0.18
1.992IleAsn: 1.992 ± 0.283
2.957IlePro: 2.957 ± 0.36
1.448IleGln: 1.448 ± 0.321
3.5IleArg: 3.5 ± 0.44
3.5IleSer: 3.5 ± 0.429
3.621IleThr: 3.621 ± 0.466
3.199IleVal: 3.199 ± 0.494
0.845IleTrp: 0.845 ± 0.176
1.629IleTyr: 1.629 ± 0.25
0.0IleXaa: 0.0 ± 0.0
Lys
3.983LysAla: 3.983 ± 0.472
0.302LysCys: 0.302 ± 0.128
2.414LysAsp: 2.414 ± 0.376
2.052LysGlu: 2.052 ± 0.37
1.629LysPhe: 1.629 ± 0.285
2.535LysGly: 2.535 ± 0.363
1.086LysHis: 1.086 ± 0.297
2.173LysIle: 2.173 ± 0.426
1.931LysLys: 1.931 ± 0.391
3.44LysLeu: 3.44 ± 0.417
0.966LysMet: 0.966 ± 0.207
1.569LysAsn: 1.569 ± 0.247
3.078LysPro: 3.078 ± 0.467
1.388LysGln: 1.388 ± 0.309
3.621LysArg: 3.621 ± 0.64
2.173LysSer: 2.173 ± 0.371
2.354LysThr: 2.354 ± 0.401
3.259LysVal: 3.259 ± 0.514
0.845LysTrp: 0.845 ± 0.238
0.905LysTyr: 0.905 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
8.992LeuAla: 8.992 ± 0.704
0.483LeuCys: 0.483 ± 0.181
5.613LeuAsp: 5.613 ± 0.568
5.552LeuGlu: 5.552 ± 0.554
1.992LeuPhe: 1.992 ± 0.354
7.725LeuGly: 7.725 ± 0.779
1.509LeuHis: 1.509 ± 0.323
5.13LeuIle: 5.13 ± 0.604
3.923LeuLys: 3.923 ± 0.466
5.975LeuLeu: 5.975 ± 0.576
1.75LeuMet: 1.75 ± 0.245
2.897LeuAsn: 2.897 ± 0.343
5.854LeuPro: 5.854 ± 0.664
2.474LeuGln: 2.474 ± 0.368
5.673LeuArg: 5.673 ± 0.575
5.432LeuSer: 5.432 ± 0.556
6.397LeuThr: 6.397 ± 0.651
4.466LeuVal: 4.466 ± 0.587
1.147LeuTrp: 1.147 ± 0.338
2.716LeuTyr: 2.716 ± 0.415
0.0LeuXaa: 0.0 ± 0.0
Met
2.112MetAla: 2.112 ± 0.35
0.0MetCys: 0.0 ± 0.0
1.207MetAsp: 1.207 ± 0.233
1.388MetGlu: 1.388 ± 0.301
0.664MetPhe: 0.664 ± 0.189
1.448MetGly: 1.448 ± 0.322
0.302MetHis: 0.302 ± 0.134
0.362MetIle: 0.362 ± 0.143
1.147MetLys: 1.147 ± 0.279
1.147MetLeu: 1.147 ± 0.254
0.181MetMet: 0.181 ± 0.1
0.845MetAsn: 0.845 ± 0.187
1.086MetPro: 1.086 ± 0.239
0.604MetGln: 0.604 ± 0.15
1.509MetArg: 1.509 ± 0.291
2.776MetSer: 2.776 ± 0.391
2.052MetThr: 2.052 ± 0.278
0.966MetVal: 0.966 ± 0.266
0.241MetTrp: 0.241 ± 0.118
0.422MetTyr: 0.422 ± 0.144
0.0MetXaa: 0.0 ± 0.0
Asn
2.836AsnAla: 2.836 ± 0.452
0.06AsnCys: 0.06 ± 0.054
1.871AsnAsp: 1.871 ± 0.326
2.052AsnGlu: 2.052 ± 0.37
1.147AsnPhe: 1.147 ± 0.258
3.561AsnGly: 3.561 ± 0.459
0.845AsnHis: 0.845 ± 0.235
1.569AsnIle: 1.569 ± 0.309
0.785AsnLys: 0.785 ± 0.221
2.173AsnLeu: 2.173 ± 0.319
0.543AsnMet: 0.543 ± 0.168
1.026AsnAsn: 1.026 ± 0.227
2.716AsnPro: 2.716 ± 0.404
1.147AsnGln: 1.147 ± 0.253
1.569AsnArg: 1.569 ± 0.384
1.811AsnSer: 1.811 ± 0.363
1.931AsnThr: 1.931 ± 0.353
2.414AsnVal: 2.414 ± 0.403
0.664AsnTrp: 0.664 ± 0.184
1.147AsnTyr: 1.147 ± 0.295
0.0AsnXaa: 0.0 ± 0.0
Pro
5.794ProAla: 5.794 ± 0.633
0.362ProCys: 0.362 ± 0.17
4.164ProAsp: 4.164 ± 0.423
4.526ProGlu: 4.526 ± 0.536
1.871ProPhe: 1.871 ± 0.408
4.888ProGly: 4.888 ± 0.568
1.026ProHis: 1.026 ± 0.243
2.474ProIle: 2.474 ± 0.393
2.112ProLys: 2.112 ± 0.307
4.466ProLeu: 4.466 ± 0.527
0.966ProMet: 0.966 ± 0.261
1.75ProAsn: 1.75 ± 0.296
3.138ProPro: 3.138 ± 0.494
1.569ProGln: 1.569 ± 0.317
2.897ProArg: 2.897 ± 0.414
3.44ProSer: 3.44 ± 0.435
4.043ProThr: 4.043 ± 0.469
3.923ProVal: 3.923 ± 0.485
0.966ProTrp: 0.966 ± 0.278
1.569ProTyr: 1.569 ± 0.375
0.0ProXaa: 0.0 ± 0.0
Gln
2.897GlnAla: 2.897 ± 0.491
0.06GlnCys: 0.06 ± 0.061
1.328GlnAsp: 1.328 ± 0.335
1.811GlnGlu: 1.811 ± 0.319
0.905GlnPhe: 0.905 ± 0.202
2.535GlnGly: 2.535 ± 0.319
0.543GlnHis: 0.543 ± 0.18
2.655GlnIle: 2.655 ± 0.5
1.267GlnLys: 1.267 ± 0.222
3.44GlnLeu: 3.44 ± 0.485
0.966GlnMet: 0.966 ± 0.25
0.422GlnAsn: 0.422 ± 0.154
1.811GlnPro: 1.811 ± 0.351
1.811GlnGln: 1.811 ± 0.415
1.69GlnArg: 1.69 ± 0.34
1.629GlnSer: 1.629 ± 0.261
1.629GlnThr: 1.629 ± 0.287
2.474GlnVal: 2.474 ± 0.286
0.664GlnTrp: 0.664 ± 0.192
0.604GlnTyr: 0.604 ± 0.174
0.0GlnXaa: 0.0 ± 0.0
Arg
5.794ArgAla: 5.794 ± 0.644
0.845ArgCys: 0.845 ± 0.273
3.078ArgAsp: 3.078 ± 0.394
4.888ArgGlu: 4.888 ± 0.562
1.931ArgPhe: 1.931 ± 0.393
4.768ArgGly: 4.768 ± 0.536
1.267ArgHis: 1.267 ± 0.269
3.681ArgIle: 3.681 ± 0.564
3.561ArgLys: 3.561 ± 0.583
5.914ArgLeu: 5.914 ± 0.609
1.811ArgMet: 1.811 ± 0.328
2.535ArgAsn: 2.535 ± 0.484
2.776ArgPro: 2.776 ± 0.44
1.811ArgGln: 1.811 ± 0.298
5.552ArgArg: 5.552 ± 0.614
3.621ArgSer: 3.621 ± 0.395
3.078ArgThr: 3.078 ± 0.542
4.828ArgVal: 4.828 ± 0.49
1.388ArgTrp: 1.388 ± 0.255
1.629ArgTyr: 1.629 ± 0.274
0.0ArgXaa: 0.0 ± 0.0
Ser
5.733SerAla: 5.733 ± 0.683
0.604SerCys: 0.604 ± 0.2
2.897SerAsp: 2.897 ± 0.463
3.742SerGlu: 3.742 ± 0.455
1.931SerPhe: 1.931 ± 0.344
7.001SerGly: 7.001 ± 0.83
1.569SerHis: 1.569 ± 0.331
2.957SerIle: 2.957 ± 0.414
2.535SerLys: 2.535 ± 0.351
5.25SerLeu: 5.25 ± 0.602
1.509SerMet: 1.509 ± 0.252
2.595SerAsn: 2.595 ± 0.44
3.018SerPro: 3.018 ± 0.441
1.569SerGln: 1.569 ± 0.323
2.957SerArg: 2.957 ± 0.361
3.621SerSer: 3.621 ± 0.824
3.138SerThr: 3.138 ± 0.427
3.681SerVal: 3.681 ± 0.446
1.569SerTrp: 1.569 ± 0.359
1.75SerTyr: 1.75 ± 0.363
0.0SerXaa: 0.0 ± 0.0
Thr
6.156ThrAla: 6.156 ± 0.797
0.241ThrCys: 0.241 ± 0.13
4.285ThrAsp: 4.285 ± 0.514
4.345ThrGlu: 4.345 ± 0.495
2.173ThrPhe: 2.173 ± 0.408
6.276ThrGly: 6.276 ± 0.62
1.086ThrHis: 1.086 ± 0.326
3.138ThrIle: 3.138 ± 0.537
2.474ThrLys: 2.474 ± 0.311
5.914ThrLeu: 5.914 ± 0.545
1.086ThrMet: 1.086 ± 0.25
1.69ThrAsn: 1.69 ± 0.283
3.561ThrPro: 3.561 ± 0.431
1.811ThrGln: 1.811 ± 0.319
3.5ThrArg: 3.5 ± 0.503
3.561ThrSer: 3.561 ± 0.476
4.466ThrThr: 4.466 ± 0.516
5.25ThrVal: 5.25 ± 0.513
1.328ThrTrp: 1.328 ± 0.274
1.811ThrTyr: 1.811 ± 0.401
0.0ThrXaa: 0.0 ± 0.0
Val
6.457ValAla: 6.457 ± 0.665
0.604ValCys: 0.604 ± 0.199
5.432ValAsp: 5.432 ± 0.52
4.587ValGlu: 4.587 ± 0.503
2.595ValPhe: 2.595 ± 0.397
4.587ValGly: 4.587 ± 0.637
1.509ValHis: 1.509 ± 0.271
3.38ValIle: 3.38 ± 0.369
2.897ValLys: 2.897 ± 0.385
5.371ValLeu: 5.371 ± 0.515
1.147ValMet: 1.147 ± 0.31
2.354ValAsn: 2.354 ± 0.393
3.681ValPro: 3.681 ± 0.457
2.112ValGln: 2.112 ± 0.352
4.768ValArg: 4.768 ± 0.563
4.466ValSer: 4.466 ± 0.464
5.492ValThr: 5.492 ± 0.589
5.311ValVal: 5.311 ± 0.611
1.267ValTrp: 1.267 ± 0.289
2.052ValTyr: 2.052 ± 0.365
0.0ValXaa: 0.0 ± 0.0
Trp
1.448TrpAla: 1.448 ± 0.286
0.362TrpCys: 0.362 ± 0.131
1.388TrpAsp: 1.388 ± 0.317
1.267TrpGlu: 1.267 ± 0.247
0.845TrpPhe: 0.845 ± 0.249
1.811TrpGly: 1.811 ± 0.287
0.422TrpHis: 0.422 ± 0.145
1.207TrpIle: 1.207 ± 0.228
0.422TrpLys: 0.422 ± 0.167
2.112TrpLeu: 2.112 ± 0.316
0.543TrpMet: 0.543 ± 0.207
0.422TrpAsn: 0.422 ± 0.149
0.785TrpPro: 0.785 ± 0.202
0.785TrpGln: 0.785 ± 0.194
1.69TrpArg: 1.69 ± 0.297
1.147TrpSer: 1.147 ± 0.261
1.69TrpThr: 1.69 ± 0.378
2.173TrpVal: 2.173 ± 0.308
0.483TrpTrp: 0.483 ± 0.162
0.302TrpTyr: 0.302 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.931TyrAla: 1.931 ± 0.379
0.241TyrCys: 0.241 ± 0.138
1.147TyrAsp: 1.147 ± 0.259
2.354TyrGlu: 2.354 ± 0.384
0.664TyrPhe: 0.664 ± 0.195
2.474TyrGly: 2.474 ± 0.366
0.785TyrHis: 0.785 ± 0.242
1.629TyrIle: 1.629 ± 0.348
1.328TyrLys: 1.328 ± 0.301
2.957TyrLeu: 2.957 ± 0.393
0.664TyrMet: 0.664 ± 0.178
1.086TyrAsn: 1.086 ± 0.289
1.569TyrPro: 1.569 ± 0.312
0.966TyrGln: 0.966 ± 0.234
2.655TyrArg: 2.655 ± 0.44
1.267TyrSer: 1.267 ± 0.29
2.354TyrThr: 2.354 ± 0.401
1.931TyrVal: 1.931 ± 0.284
0.422TyrTrp: 0.422 ± 0.164
0.543TyrTyr: 0.543 ± 0.196
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (16571 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski