Amino acid dipeptide frequency for Mycobacterium phage Send513

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and equally often in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
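The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not necessarily how Proteome-pI computes its values: dipeptides are counted as overlapping windows inside each protein (never across protein boundaries), any residue outside the 20 standard letters is mapped to X (Xaa), and frequencies are normalised by the number of dipeptides counted. The names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping windows of length 2, counted within each
    protein only, and normalised by the total number of windows.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation 1000 / n_letters**2."""
    expected = 1000.0 / n_letters ** 2  # 2.5 per mille for 20 letters
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    # Values taken from the table on this page.
    print("AlaAla", classify(11.321))
    print("AlaCys", classify(0.677))
```

With the table values, AlaAla (11.321‰) falls above the 2.5‰ expectation and AlaCys (0.677‰) below it, which corresponds to the red and blue marking described above.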

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
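As a small worked example of the unit conversion, the following sketch converts per mille values to fractions and percentages. The helper names are hypothetical; the AlaAla value is taken from the table on this page.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    alaala = 11.321  # AlaAla frequency in per mille, from the table
    print(f"fraction: {permille_to_fraction(alaala):.6f}")
    print(f"percent:  {permille_to_percent(alaala):.4f}")
    # Uniform expectation for 400 dipeptides, expressed in per mille.
    print(f"expected: {1000.0 / 400:.1f}")
```

In the same units as the table, the uniform expectation of 0.25% corresponds to 2.5‰.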

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 11.321 ± 1.155
AlaCys: 0.677 ± 0.174
AlaAsp: 5.367 ± 0.56
AlaGlu: 5.548 ± 0.522
AlaPhe: 3.608 ± 0.472
AlaGly: 8.209 ± 0.856
AlaHis: 1.353 ± 0.3
AlaIle: 4.285 ± 0.39
AlaLys: 6.224 ± 0.672
AlaLeu: 7.893 ± 0.61
AlaMet: 2.165 ± 0.344
AlaAsn: 3.744 ± 0.584
AlaPro: 3.744 ± 0.372
AlaGln: 4.15 ± 0.518
AlaArg: 5.593 ± 0.579
AlaSer: 4.691 ± 0.553
AlaThr: 6.269 ± 0.599
AlaVal: 5.638 ± 0.45
AlaTrp: 1.398 ± 0.257
AlaTyr: 2.616 ± 0.357
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.677 ± 0.181
CysCys: 0.135 ± 0.074
CysAsp: 0.451 ± 0.128
CysGlu: 0.767 ± 0.179
CysPhe: 0.361 ± 0.134
CysGly: 0.767 ± 0.225
CysHis: 0.18 ± 0.094
CysIle: 0.496 ± 0.165
CysLys: 0.406 ± 0.127
CysLeu: 0.586 ± 0.193
CysMet: 0.09 ± 0.058
CysAsn: 0.451 ± 0.14
CysPro: 0.586 ± 0.166
CysGln: 0.361 ± 0.134
CysArg: 0.361 ± 0.112
CysSer: 0.361 ± 0.127
CysThr: 0.496 ± 0.184
CysVal: 0.316 ± 0.127
CysTrp: 0.18 ± 0.091
CysTyr: 0.316 ± 0.122
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.052 ± 0.626
AspCys: 0.767 ± 0.247
AspAsp: 5.232 ± 0.948
AspGlu: 4.33 ± 0.525
AspPhe: 2.526 ± 0.322
AspGly: 5.503 ± 0.478
AspHis: 1.173 ± 0.243
AspIle: 4.014 ± 0.352
AspLys: 2.977 ± 0.383
AspLeu: 5.322 ± 0.353
AspMet: 1.443 ± 0.237
AspAsn: 2.436 ± 0.337
AspPro: 4.375 ± 0.5
AspGln: 2.255 ± 0.33
AspArg: 3.338 ± 0.321
AspSer: 3.157 ± 0.444
AspThr: 3.699 ± 0.375
AspVal: 4.24 ± 0.461
AspTrp: 1.308 ± 0.268
AspTyr: 1.985 ± 0.285
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.179 ± 0.465
GluCys: 0.496 ± 0.162
GluAsp: 4.781 ± 0.756
GluGlu: 4.014 ± 0.357
GluPhe: 2.03 ± 0.314
GluGly: 5.097 ± 0.415
GluHis: 1.894 ± 0.371
GluIle: 2.796 ± 0.414
GluLys: 3.202 ± 0.383
GluLeu: 5.593 ± 0.5
GluMet: 1.804 ± 0.268
GluAsn: 1.804 ± 0.27
GluPro: 3.293 ± 0.504
GluGln: 2.706 ± 0.332
GluArg: 3.969 ± 0.473
GluSer: 2.842 ± 0.324
GluThr: 4.104 ± 0.434
GluVal: 4.33 ± 0.485
GluTrp: 1.128 ± 0.22
GluTyr: 2.345 ± 0.417
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.473 ± 0.425
PheCys: 0.361 ± 0.126
PheAsp: 2.21 ± 0.333
PheGlu: 2.345 ± 0.372
PhePhe: 1.218 ± 0.235
PheGly: 3.428 ± 0.454
PheHis: 0.902 ± 0.219
PheIle: 1.308 ± 0.3
PheLys: 1.894 ± 0.28
PheLeu: 2.345 ± 0.334
PheMet: 0.631 ± 0.189
PheAsn: 1.443 ± 0.245
PhePro: 1.128 ± 0.174
PheGln: 1.669 ± 0.247
PheArg: 2.12 ± 0.267
PheSer: 1.804 ± 0.253
PheThr: 1.985 ± 0.348
PheVal: 2.03 ± 0.253
PheTrp: 0.361 ± 0.118
PheTyr: 0.677 ± 0.166
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.548 ± 0.667
GlyCys: 0.496 ± 0.152
GlyAsp: 5.322 ± 0.447
GlyGlu: 4.24 ± 0.426
GlyPhe: 2.12 ± 0.287
GlyGly: 7.532 ± 1.286
GlyHis: 1.804 ± 0.324
GlyIle: 4.691 ± 0.61
GlyLys: 4.781 ± 0.637
GlyLeu: 6.134 ± 0.627
GlyMet: 2.345 ± 0.355
GlyAsn: 4.42 ± 0.542
GlyPro: 3.563 ± 0.451
GlyGln: 2.3 ± 0.27
GlyArg: 4.375 ± 0.373
GlySer: 4.871 ± 0.664
GlyThr: 6.811 ± 0.663
GlyVal: 5.458 ± 0.601
GlyTrp: 2.12 ± 0.285
GlyTyr: 3.157 ± 0.337
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.308 ± 0.254
HisCys: 0.226 ± 0.108
HisAsp: 1.173 ± 0.262
HisGlu: 1.714 ± 0.272
HisPhe: 0.451 ± 0.139
HisGly: 1.398 ± 0.248
HisHis: 0.496 ± 0.138
HisIle: 0.722 ± 0.195
HisLys: 1.173 ± 0.244
HisLeu: 2.075 ± 0.262
HisMet: 0.722 ± 0.203
HisAsn: 0.496 ± 0.177
HisPro: 1.759 ± 0.243
HisGln: 0.541 ± 0.164
HisArg: 1.037 ± 0.202
HisSer: 0.812 ± 0.212
HisThr: 1.173 ± 0.273
HisVal: 1.443 ± 0.229
HisTrp: 0.316 ± 0.12
HisTyr: 0.992 ± 0.256
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.465 ± 0.504
IleCys: 0.361 ± 0.126
IleAsp: 3.518 ± 0.39
IleGlu: 3.744 ± 0.375
IlePhe: 1.082 ± 0.229
IleGly: 3.383 ± 0.516
IleHis: 1.353 ± 0.245
IleIle: 2.481 ± 0.337
IleLys: 1.804 ± 0.24
IleLeu: 3.383 ± 0.473
IleMet: 0.947 ± 0.212
IleAsn: 1.669 ± 0.314
IlePro: 2.526 ± 0.316
IleGln: 2.526 ± 0.3
IleArg: 3.022 ± 0.336
IleSer: 2.887 ± 0.327
IleThr: 3.067 ± 0.302
IleVal: 3.293 ± 0.405
IleTrp: 1.263 ± 0.218
IleTyr: 0.947 ± 0.191
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.811 ± 0.768
LysCys: 0.361 ± 0.129
LysAsp: 2.571 ± 0.38
LysGlu: 3.067 ± 0.395
LysPhe: 2.3 ± 0.296
LysGly: 4.195 ± 0.467
LysHis: 0.902 ± 0.187
LysIle: 2.481 ± 0.279
LysLys: 3.293 ± 0.615
LysLeu: 4.375 ± 0.379
LysMet: 1.128 ± 0.291
LysAsn: 1.759 ± 0.237
LysPro: 4.014 ± 0.498
LysGln: 1.939 ± 0.3
LysArg: 3.744 ± 0.525
LysSer: 2.571 ± 0.33
LysThr: 2.796 ± 0.334
LysVal: 3.924 ± 0.398
LysTrp: 0.677 ± 0.18
LysTyr: 1.534 ± 0.289
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.532 ± 0.483
LeuCys: 0.631 ± 0.208
LeuAsp: 5.277 ± 0.598
LeuGlu: 6.269 ± 0.466
LeuPhe: 2.12 ± 0.268
LeuGly: 6.36 ± 0.599
LeuHis: 1.443 ± 0.249
LeuIle: 3.112 ± 0.324
LeuLys: 4.014 ± 0.42
LeuLeu: 6.45 ± 0.627
LeuMet: 1.218 ± 0.245
LeuAsn: 3.157 ± 0.382
LeuPro: 4.285 ± 0.41
LeuGln: 3.608 ± 0.48
LeuArg: 6.36 ± 0.621
LeuSer: 5.232 ± 0.52
LeuThr: 4.15 ± 0.484
LeuVal: 5.232 ± 0.548
LeuTrp: 1.082 ± 0.206
LeuTyr: 1.804 ± 0.31
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.391 ± 0.413
MetCys: 0.316 ± 0.101
MetAsp: 1.669 ± 0.243
MetGlu: 1.308 ± 0.238
MetPhe: 0.902 ± 0.185
MetGly: 1.443 ± 0.243
MetHis: 0.361 ± 0.118
MetIle: 0.947 ± 0.241
MetLys: 0.857 ± 0.171
MetLeu: 1.759 ± 0.195
MetMet: 0.451 ± 0.134
MetAsn: 0.767 ± 0.184
MetPro: 1.443 ± 0.289
MetGln: 0.857 ± 0.172
MetArg: 1.488 ± 0.265
MetSer: 1.173 ± 0.262
MetThr: 2.075 ± 0.33
MetVal: 1.714 ± 0.254
MetTrp: 0.18 ± 0.09
MetTyr: 0.406 ± 0.142
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.42 ± 0.367
AsnCys: 0.271 ± 0.12
AsnAsp: 1.985 ± 0.301
AsnGlu: 1.939 ± 0.311
AsnPhe: 1.218 ± 0.249
AsnGly: 3.293 ± 0.305
AsnHis: 0.631 ± 0.151
AsnIle: 1.849 ± 0.269
AsnLys: 2.03 ± 0.304
AsnLeu: 3.112 ± 0.393
AsnMet: 0.812 ± 0.157
AsnAsn: 1.759 ± 0.313
AsnPro: 3.067 ± 0.345
AsnGln: 1.624 ± 0.256
AsnArg: 2.12 ± 0.305
AsnSer: 2.165 ± 0.287
AsnThr: 2.436 ± 0.456
AsnVal: 3.383 ± 0.334
AsnTrp: 1.037 ± 0.205
AsnTyr: 1.308 ± 0.204
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 3.744 ± 0.364
ProCys: 0.631 ± 0.202
ProAsp: 4.014 ± 0.523
ProGlu: 4.104 ± 0.511
ProPhe: 1.939 ± 0.299
ProGly: 4.104 ± 0.462
ProHis: 0.812 ± 0.183
ProIle: 2.436 ± 0.387
ProLys: 3.157 ± 0.353
ProLeu: 4.375 ± 0.435
ProMet: 1.534 ± 0.284
ProAsn: 2.391 ± 0.344
ProPro: 2.526 ± 0.372
ProGln: 1.488 ± 0.248
ProArg: 2.526 ± 0.382
ProSer: 3.563 ± 0.397
ProThr: 3.293 ± 0.382
ProVal: 3.293 ± 0.345
ProTrp: 0.902 ± 0.23
ProTyr: 1.443 ± 0.237
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.097 ± 0.637
GlnCys: 0.677 ± 0.203
GlnAsp: 1.939 ± 0.358
GlnGlu: 2.661 ± 0.348
GlnPhe: 1.488 ± 0.315
GlnGly: 3.157 ± 0.638
GlnHis: 0.451 ± 0.127
GlnIle: 1.804 ± 0.284
GlnLys: 1.579 ± 0.25
GlnLeu: 3.157 ± 0.341
GlnMet: 1.218 ± 0.25
GlnAsn: 1.443 ± 0.357
GlnPro: 1.714 ± 0.259
GlnGln: 1.759 ± 0.357
GlnArg: 3.247 ± 0.372
GlnSer: 2.165 ± 0.318
GlnThr: 1.804 ± 0.228
GlnVal: 2.842 ± 0.368
GlnTrp: 0.631 ± 0.181
GlnTyr: 1.308 ± 0.211
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.728 ± 0.529
ArgCys: 0.631 ± 0.194
ArgAsp: 3.789 ± 0.361
ArgGlu: 3.879 ± 0.405
ArgPhe: 2.436 ± 0.334
ArgGly: 3.744 ± 0.408
ArgHis: 1.353 ± 0.29
ArgIle: 3.518 ± 0.432
ArgLys: 4.646 ± 0.566
ArgLeu: 5.052 ± 0.414
ArgMet: 1.353 ± 0.22
ArgAsn: 3.022 ± 0.341
ArgPro: 2.481 ± 0.37
ArgGln: 2.661 ± 0.323
ArgArg: 4.465 ± 0.63
ArgSer: 2.526 ± 0.317
ArgThr: 2.391 ± 0.377
ArgVal: 4.24 ± 0.431
ArgTrp: 1.534 ± 0.428
ArgTyr: 1.939 ± 0.296
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.187 ± 0.714
SerCys: 0.226 ± 0.09
SerAsp: 3.699 ± 0.418
SerGlu: 3.383 ± 0.366
SerPhe: 1.534 ± 0.26
SerGly: 5.097 ± 0.553
SerHis: 0.902 ± 0.202
SerIle: 2.345 ± 0.353
SerLys: 2.751 ± 0.526
SerLeu: 4.556 ± 0.411
SerMet: 1.173 ± 0.221
SerAsn: 2.751 ± 0.315
SerPro: 1.894 ± 0.263
SerGln: 2.255 ± 0.406
SerArg: 2.616 ± 0.299
SerSer: 3.518 ± 0.464
SerThr: 3.744 ± 0.48
SerVal: 3.473 ± 0.369
SerTrp: 1.398 ± 0.238
SerTyr: 2.3 ± 0.328
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.503 ± 0.517
ThrCys: 0.361 ± 0.162
ThrAsp: 3.518 ± 0.462
ThrGlu: 2.796 ± 0.319
ThrPhe: 2.345 ± 0.355
ThrGly: 6.315 ± 0.71
ThrHis: 1.308 ± 0.268
ThrIle: 3.383 ± 0.276
ThrLys: 3.202 ± 0.372
ThrLeu: 4.916 ± 0.442
ThrMet: 0.992 ± 0.195
ThrAsn: 2.255 ± 0.333
ThrPro: 3.699 ± 0.525
ThrGln: 2.932 ± 0.329
ThrArg: 3.653 ± 0.433
ThrSer: 3.428 ± 0.57
ThrThr: 4.104 ± 0.465
ThrVal: 4.556 ± 0.458
ThrTrp: 1.037 ± 0.202
ThrTyr: 1.488 ± 0.268
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.179 ± 0.584
ValCys: 0.406 ± 0.165
ValAsp: 5.097 ± 0.482
ValGlu: 5.007 ± 0.46
ValPhe: 1.985 ± 0.302
ValGly: 4.691 ± 0.425
ValHis: 1.398 ± 0.252
ValIle: 2.796 ± 0.381
ValLys: 3.608 ± 0.447
ValLeu: 5.007 ± 0.519
ValMet: 1.398 ± 0.275
ValAsn: 2.751 ± 0.233
ValPro: 3.744 ± 0.408
ValGln: 2.751 ± 0.336
ValArg: 4.375 ± 0.318
ValSer: 4.104 ± 0.356
ValThr: 4.781 ± 0.473
ValVal: 5.007 ± 0.539
ValTrp: 0.947 ± 0.272
ValTyr: 2.345 ± 0.363
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.534 ± 0.217
TrpCys: 0.226 ± 0.096
TrpAsp: 1.308 ± 0.304
TrpGlu: 1.353 ± 0.216
TrpPhe: 0.406 ± 0.121
TrpGly: 1.353 ± 0.206
TrpHis: 0.677 ± 0.137
TrpIle: 0.722 ± 0.165
TrpLys: 1.128 ± 0.23
TrpLeu: 1.488 ± 0.274
TrpMet: 0.361 ± 0.131
TrpAsn: 0.677 ± 0.16
TrpPro: 0.631 ± 0.13
TrpGln: 0.631 ± 0.143
TrpArg: 0.767 ± 0.157
TrpSer: 1.128 ± 0.223
TrpThr: 1.488 ± 0.206
TrpVal: 1.353 ± 0.291
TrpTrp: 0.541 ± 0.173
TrpTyr: 0.722 ± 0.193
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.345 ± 0.393
TyrCys: 0.09 ± 0.072
TyrAsp: 2.255 ± 0.326
TyrGlu: 1.939 ± 0.29
TyrPhe: 1.263 ± 0.186
TyrGly: 3.022 ± 0.34
TyrHis: 0.631 ± 0.198
TyrIle: 1.534 ± 0.216
TyrLys: 1.714 ± 0.268
TyrLeu: 1.759 ± 0.228
TyrMet: 0.677 ± 0.168
TyrAsn: 1.263 ± 0.262
TyrPro: 1.804 ± 0.308
TyrGln: 1.128 ± 0.252
TyrArg: 2.21 ± 0.378
TyrSer: 1.804 ± 0.263
TyrThr: 1.128 ± 0.195
TyrVal: 2.616 ± 0.375
TyrTrp: 0.451 ± 0.152
TyrTyr: 0.812 ± 0.232
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 98 proteins (22172 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
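A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates. The function names and the `seed` parameter are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) proteins with
    replacement, so a protein is always kept or dropped as a whole.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAGLK", "MKAAAV", "MGGSAA", "MLLKRA"]  # made-up sequences
    mean, err = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps the dipeptides of one protein together, so the spread reflects how much the estimate depends on which proteins are in the set.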

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski