Amino acid dipeptide frequency for Mycobacterium phage Argie

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
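The calculation described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the use of one-letter residue codes, the mapping of any non-standard character to "X" (Xaa), and the normalisation by the total number of counted pairs are all assumptions, and the toy sequences are made up.

```python
from collections import Counter

# The 20 standard residues (one-letter codes); anything else is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED = 1000.0 / (len(STANDARD) ** 2)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


toy = ["MAAAK", "MAGL"]  # hypothetical sequences, not from the Argie proteome
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(19.674)` (AlaAla) gives "over" and `classify(0.153)` (CysCys) gives "under", which corresponds to the red and blue marking described above.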

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
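As a small worked example of the unit conversion, the following sketch converts the AlaAla value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰ (0.25%). The helper name and variable names are illustrative only.

```python
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 19.674                           # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.0197 as a fraction
percent = fraction * 100                   # roughly 1.97 percent
fold = ala_ala / UNIFORM_EXPECTATION       # roughly 7.9 times the uniform expectation
```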

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
19.674AlaAla: 19.674 ± 1.546
0.971AlaCys: 0.971 ± 0.258
6.95AlaAsp: 6.95 ± 0.557
9.505AlaGlu: 9.505 ± 0.801
2.402AlaPhe: 2.402 ± 0.351
8.738AlaGly: 8.738 ± 1.167
1.38AlaHis: 1.38 ± 0.243
5.008AlaIle: 5.008 ± 0.623
3.73AlaLys: 3.73 ± 0.493
12.111AlaLeu: 12.111 ± 1.244
2.759AlaMet: 2.759 ± 0.365
3.168AlaAsn: 3.168 ± 0.438
5.315AlaPro: 5.315 ± 0.59
4.037AlaGln: 4.037 ± 0.499
7.972AlaArg: 7.972 ± 0.75
5.723AlaSer: 5.723 ± 0.562
5.928AlaThr: 5.928 ± 0.538
9.914AlaVal: 9.914 ± 0.71
2.248AlaTrp: 2.248 ± 0.388
1.942AlaTyr: 1.942 ± 0.327
0.0AlaXaa: 0.0 ± 0.0
Cys
1.073CysAla: 1.073 ± 0.276
0.153CysCys: 0.153 ± 0.08
0.92CysAsp: 0.92 ± 0.255
0.869CysGlu: 0.869 ± 0.275
0.307CysPhe: 0.307 ± 0.152
1.686CysGly: 1.686 ± 0.41
0.204CysHis: 0.204 ± 0.129
0.46CysIle: 0.46 ± 0.146
0.358CysLys: 0.358 ± 0.134
0.613CysLeu: 0.613 ± 0.188
0.307CysMet: 0.307 ± 0.106
0.358CysAsn: 0.358 ± 0.145
1.278CysPro: 1.278 ± 0.292
0.307CysGln: 0.307 ± 0.121
1.175CysArg: 1.175 ± 0.307
0.818CysSer: 0.818 ± 0.184
0.767CysThr: 0.767 ± 0.21
0.767CysVal: 0.767 ± 0.21
0.358CysTrp: 0.358 ± 0.163
0.102CysTyr: 0.102 ± 0.086
0.0CysXaa: 0.0 ± 0.0
Asp
7.154AspAla: 7.154 ± 0.511
0.715AspCys: 0.715 ± 0.174
4.804AspAsp: 4.804 ± 0.666
6.643AspGlu: 6.643 ± 0.647
1.329AspPhe: 1.329 ± 0.237
6.796AspGly: 6.796 ± 0.636
0.971AspHis: 0.971 ± 0.252
1.942AspIle: 1.942 ± 0.312
2.095AspLys: 2.095 ± 0.403
5.059AspLeu: 5.059 ± 0.428
1.329AspMet: 1.329 ± 0.27
0.869AspAsn: 0.869 ± 0.187
4.752AspPro: 4.752 ± 0.672
2.248AspGln: 2.248 ± 0.343
4.497AspArg: 4.497 ± 0.548
2.708AspSer: 2.708 ± 0.354
2.504AspThr: 2.504 ± 0.376
4.241AspVal: 4.241 ± 0.527
1.175AspTrp: 1.175 ± 0.241
1.38AspTyr: 1.38 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
7.972GluAla: 7.972 ± 0.74
1.022GluCys: 1.022 ± 0.297
3.73GluAsp: 3.73 ± 0.517
3.219GluGlu: 3.219 ± 0.458
1.993GluPhe: 1.993 ± 0.315
4.701GluGly: 4.701 ± 0.498
1.84GluHis: 1.84 ± 0.301
2.811GluIle: 2.811 ± 0.315
2.913GluLys: 2.913 ± 0.449
6.49GluLeu: 6.49 ± 0.544
1.942GluMet: 1.942 ± 0.368
1.84GluAsn: 1.84 ± 0.296
3.833GluPro: 3.833 ± 0.528
2.862GluGln: 2.862 ± 0.336
5.417GluArg: 5.417 ± 0.553
4.395GluSer: 4.395 ± 0.432
3.884GluThr: 3.884 ± 0.442
5.928GluVal: 5.928 ± 0.674
2.146GluTrp: 2.146 ± 0.364
1.789GluTyr: 1.789 ± 0.312
0.0GluXaa: 0.0 ± 0.0
Phe
2.248PheAla: 2.248 ± 0.277
0.153PheCys: 0.153 ± 0.079
2.913PheAsp: 2.913 ± 0.432
1.482PheGlu: 1.482 ± 0.27
0.358PhePhe: 0.358 ± 0.149
2.351PheGly: 2.351 ± 0.352
0.409PheHis: 0.409 ± 0.144
0.818PheIle: 0.818 ± 0.173
0.664PheLys: 0.664 ± 0.222
1.686PheLeu: 1.686 ± 0.312
0.358PheMet: 0.358 ± 0.137
0.869PheAsn: 0.869 ± 0.19
1.482PhePro: 1.482 ± 0.213
0.971PheGln: 0.971 ± 0.242
1.993PheArg: 1.993 ± 0.304
1.226PheSer: 1.226 ± 0.188
1.737PheThr: 1.737 ± 0.355
1.891PheVal: 1.891 ± 0.321
0.46PheTrp: 0.46 ± 0.129
0.46PheTyr: 0.46 ± 0.157
0.0PheXaa: 0.0 ± 0.0
Gly
8.687GlyAla: 8.687 ± 1.278
1.175GlyCys: 1.175 ± 0.271
5.366GlyAsp: 5.366 ± 0.476
5.263GlyGlu: 5.263 ± 0.59
2.095GlyPhe: 2.095 ± 0.33
9.965GlyGly: 9.965 ± 2.114
1.584GlyHis: 1.584 ± 0.283
2.862GlyIle: 2.862 ± 0.318
3.066GlyLys: 3.066 ± 0.376
7.307GlyLeu: 7.307 ± 0.88
2.351GlyMet: 2.351 ± 0.274
3.066GlyAsn: 3.066 ± 0.415
3.168GlyPro: 3.168 ± 0.589
2.913GlyGln: 2.913 ± 0.505
5.366GlyArg: 5.366 ± 0.515
5.723GlySer: 5.723 ± 0.57
5.621GlyThr: 5.621 ± 0.587
8.023GlyVal: 8.023 ± 0.9
2.095GlyTrp: 2.095 ± 0.345
1.942GlyTyr: 1.942 ± 0.327
0.0GlyXaa: 0.0 ± 0.0
His
2.197HisAla: 2.197 ± 0.336
0.307HisCys: 0.307 ± 0.123
0.92HisAsp: 0.92 ± 0.212
0.818HisGlu: 0.818 ± 0.168
0.46HisPhe: 0.46 ± 0.147
1.635HisGly: 1.635 ± 0.287
0.358HisHis: 0.358 ± 0.14
0.715HisIle: 0.715 ± 0.237
0.256HisLys: 0.256 ± 0.094
1.635HisLeu: 1.635 ± 0.238
0.46HisMet: 0.46 ± 0.139
0.307HisAsn: 0.307 ± 0.116
1.278HisPro: 1.278 ± 0.221
0.715HisGln: 0.715 ± 0.212
1.891HisArg: 1.891 ± 0.33
0.869HisSer: 0.869 ± 0.212
0.613HisThr: 0.613 ± 0.177
1.073HisVal: 1.073 ± 0.194
0.46HisTrp: 0.46 ± 0.131
0.358HisTyr: 0.358 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
5.723IleAla: 5.723 ± 0.488
0.409IleCys: 0.409 ± 0.108
3.628IleAsp: 3.628 ± 0.454
3.628IleGlu: 3.628 ± 0.444
0.409IlePhe: 0.409 ± 0.144
3.628IleGly: 3.628 ± 0.539
0.409IleHis: 0.409 ± 0.158
1.278IleIle: 1.278 ± 0.266
1.073IleLys: 1.073 ± 0.191
2.862IleLeu: 2.862 ± 0.445
0.767IleMet: 0.767 ± 0.245
1.022IleAsn: 1.022 ± 0.243
1.635IlePro: 1.635 ± 0.271
1.329IleGln: 1.329 ± 0.238
3.168IleArg: 3.168 ± 0.379
1.431IleSer: 1.431 ± 0.287
2.606IleThr: 2.606 ± 0.352
2.913IleVal: 2.913 ± 0.366
0.511IleTrp: 0.511 ± 0.151
1.124IleTyr: 1.124 ± 0.3
0.0IleXaa: 0.0 ± 0.0
Lys
4.395LysAla: 4.395 ± 0.706
0.613LysCys: 0.613 ± 0.192
1.533LysAsp: 1.533 ± 0.276
1.84LysGlu: 1.84 ± 0.34
0.767LysPhe: 0.767 ± 0.194
1.891LysGly: 1.891 ± 0.314
0.562LysHis: 0.562 ± 0.193
1.635LysIle: 1.635 ± 0.317
1.482LysLys: 1.482 ± 0.47
2.3LysLeu: 2.3 ± 0.288
0.92LysMet: 0.92 ± 0.187
0.46LysAsn: 0.46 ± 0.129
2.351LysPro: 2.351 ± 0.355
0.92LysGln: 0.92 ± 0.182
2.095LysArg: 2.095 ± 0.417
1.789LysSer: 1.789 ± 0.227
1.737LysThr: 1.737 ± 0.28
2.606LysVal: 2.606 ± 0.404
0.511LysTrp: 0.511 ± 0.155
0.767LysTyr: 0.767 ± 0.182
0.0LysXaa: 0.0 ± 0.0
Leu
9.607LeuAla: 9.607 ± 0.753
1.073LeuCys: 1.073 ± 0.256
4.804LeuAsp: 4.804 ± 0.563
6.388LeuGlu: 6.388 ± 0.497
1.84LeuPhe: 1.84 ± 0.359
7.103LeuGly: 7.103 ± 0.839
1.482LeuHis: 1.482 ± 0.285
3.475LeuIle: 3.475 ± 0.385
2.044LeuLys: 2.044 ± 0.368
4.599LeuLeu: 4.599 ± 0.526
1.482LeuMet: 1.482 ± 0.273
2.657LeuAsn: 2.657 ± 0.439
5.468LeuPro: 5.468 ± 0.56
3.015LeuGln: 3.015 ± 0.398
5.417LeuArg: 5.417 ± 0.521
6.183LeuSer: 6.183 ± 0.608
5.979LeuThr: 5.979 ± 0.573
6.388LeuVal: 6.388 ± 0.523
1.226LeuTrp: 1.226 ± 0.253
1.635LeuTyr: 1.635 ± 0.309
0.0LeuXaa: 0.0 ± 0.0
Met
3.117MetAla: 3.117 ± 0.371
0.102MetCys: 0.102 ± 0.074
0.818MetAsp: 0.818 ± 0.214
1.073MetGlu: 1.073 ± 0.206
0.715MetPhe: 0.715 ± 0.159
1.686MetGly: 1.686 ± 0.33
0.358MetHis: 0.358 ± 0.161
0.92MetIle: 0.92 ± 0.261
0.92MetLys: 0.92 ± 0.177
1.993MetLeu: 1.993 ± 0.364
0.358MetMet: 0.358 ± 0.119
0.46MetAsn: 0.46 ± 0.151
1.789MetPro: 1.789 ± 0.244
0.613MetGln: 0.613 ± 0.232
1.431MetArg: 1.431 ± 0.242
2.197MetSer: 2.197 ± 0.312
2.248MetThr: 2.248 ± 0.351
1.686MetVal: 1.686 ± 0.263
0.664MetTrp: 0.664 ± 0.207
0.358MetTyr: 0.358 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.884AsnAla: 3.884 ± 0.424
0.358AsnCys: 0.358 ± 0.139
1.329AsnAsp: 1.329 ± 0.287
1.329AsnGlu: 1.329 ± 0.251
0.409AsnPhe: 0.409 ± 0.145
3.015AsnGly: 3.015 ± 0.421
0.409AsnHis: 0.409 ± 0.153
0.818AsnIle: 0.818 ± 0.261
0.613AsnLys: 0.613 ± 0.161
2.402AsnLeu: 2.402 ± 0.338
0.204AsnMet: 0.204 ± 0.091
0.664AsnAsn: 0.664 ± 0.253
1.84AsnPro: 1.84 ± 0.266
0.818AsnGln: 0.818 ± 0.229
2.146AsnArg: 2.146 ± 0.277
1.789AsnSer: 1.789 ± 0.38
1.482AsnThr: 1.482 ± 0.283
1.84AsnVal: 1.84 ± 0.288
0.46AsnTrp: 0.46 ± 0.171
1.022AsnTyr: 1.022 ± 0.229
0.0AsnXaa: 0.0 ± 0.0
Pro
6.132ProAla: 6.132 ± 0.68
0.767ProCys: 0.767 ± 0.253
4.139ProAsp: 4.139 ± 0.472
5.212ProGlu: 5.212 ± 0.568
1.38ProPhe: 1.38 ± 0.218
5.672ProGly: 5.672 ± 0.565
1.022ProHis: 1.022 ± 0.208
2.351ProIle: 2.351 ± 0.348
1.789ProLys: 1.789 ± 0.272
3.73ProLeu: 3.73 ± 0.398
1.022ProMet: 1.022 ± 0.213
1.84ProAsn: 1.84 ± 0.323
3.884ProPro: 3.884 ± 0.589
1.686ProGln: 1.686 ± 0.261
2.351ProArg: 2.351 ± 0.394
3.015ProSer: 3.015 ± 0.432
4.293ProThr: 4.293 ± 0.524
4.855ProVal: 4.855 ± 0.501
1.022ProTrp: 1.022 ± 0.251
0.767ProTyr: 0.767 ± 0.21
0.0ProXaa: 0.0 ± 0.0
Gln
4.548GlnAla: 4.548 ± 0.494
0.307GlnCys: 0.307 ± 0.149
1.38GlnAsp: 1.38 ± 0.246
2.095GlnGlu: 2.095 ± 0.311
0.971GlnPhe: 0.971 ± 0.318
2.555GlnGly: 2.555 ± 0.411
1.022GlnHis: 1.022 ± 0.225
1.993GlnIle: 1.993 ± 0.483
1.278GlnLys: 1.278 ± 0.257
3.833GlnLeu: 3.833 ± 0.396
1.278GlnMet: 1.278 ± 0.264
0.971GlnAsn: 0.971 ± 0.196
1.942GlnPro: 1.942 ± 0.256
1.891GlnGln: 1.891 ± 0.345
2.248GlnArg: 2.248 ± 0.357
1.737GlnSer: 1.737 ± 0.376
1.635GlnThr: 1.635 ± 0.304
2.759GlnVal: 2.759 ± 0.471
0.715GlnTrp: 0.715 ± 0.204
0.767GlnTyr: 0.767 ± 0.201
0.0GlnXaa: 0.0 ± 0.0
Arg
6.95ArgAla: 6.95 ± 0.605
1.175ArgCys: 1.175 ± 0.23
4.19ArgAsp: 4.19 ± 0.515
4.548ArgGlu: 4.548 ± 0.553
1.891ArgPhe: 1.891 ± 0.293
4.701ArgGly: 4.701 ± 0.394
1.431ArgHis: 1.431 ± 0.254
2.811ArgIle: 2.811 ± 0.306
2.3ArgLys: 2.3 ± 0.511
6.234ArgLeu: 6.234 ± 0.737
2.044ArgMet: 2.044 ± 0.363
1.737ArgAsn: 1.737 ± 0.288
2.913ArgPro: 2.913 ± 0.309
2.657ArgGln: 2.657 ± 0.326
6.694ArgArg: 6.694 ± 0.863
3.781ArgSer: 3.781 ± 0.489
3.833ArgThr: 3.833 ± 0.467
5.263ArgVal: 5.263 ± 0.381
1.891ArgTrp: 1.891 ± 0.334
2.146ArgTyr: 2.146 ± 0.376
0.0ArgXaa: 0.0 ± 0.0
Ser
5.877SerAla: 5.877 ± 0.762
0.715SerCys: 0.715 ± 0.208
3.322SerAsp: 3.322 ± 0.313
3.833SerGlu: 3.833 ± 0.462
1.686SerPhe: 1.686 ± 0.32
6.03SerGly: 6.03 ± 0.475
0.92SerHis: 0.92 ± 0.218
2.555SerIle: 2.555 ± 0.354
1.635SerLys: 1.635 ± 0.229
4.752SerLeu: 4.752 ± 0.512
1.482SerMet: 1.482 ± 0.27
1.482SerAsn: 1.482 ± 0.269
3.219SerPro: 3.219 ± 0.398
2.657SerGln: 2.657 ± 0.343
3.73SerArg: 3.73 ± 0.556
3.066SerSer: 3.066 ± 0.39
3.679SerThr: 3.679 ± 0.425
4.088SerVal: 4.088 ± 0.423
1.278SerTrp: 1.278 ± 0.259
1.482SerTyr: 1.482 ± 0.261
0.0SerXaa: 0.0 ± 0.0
Thr
6.745ThrAla: 6.745 ± 0.805
0.971ThrCys: 0.971 ± 0.253
3.833ThrAsp: 3.833 ± 0.523
3.986ThrGlu: 3.986 ± 0.461
1.84ThrPhe: 1.84 ± 0.278
5.826ThrGly: 5.826 ± 0.75
0.562ThrHis: 0.562 ± 0.144
3.577ThrIle: 3.577 ± 0.428
1.686ThrLys: 1.686 ± 0.304
4.548ThrLeu: 4.548 ± 0.505
1.635ThrMet: 1.635 ± 0.26
1.789ThrAsn: 1.789 ± 0.381
4.293ThrPro: 4.293 ± 0.486
1.635ThrGln: 1.635 ± 0.28
3.424ThrArg: 3.424 ± 0.49
3.475ThrSer: 3.475 ± 0.383
3.219ThrThr: 3.219 ± 0.52
5.161ThrVal: 5.161 ± 0.683
1.124ThrTrp: 1.124 ± 0.238
1.124ThrTyr: 1.124 ± 0.186
0.0ThrXaa: 0.0 ± 0.0
Val
9.505ValAla: 9.505 ± 0.747
1.022ValCys: 1.022 ± 0.273
5.468ValAsp: 5.468 ± 0.491
6.285ValGlu: 6.285 ± 0.586
2.095ValPhe: 2.095 ± 0.34
5.877ValGly: 5.877 ± 0.589
1.482ValHis: 1.482 ± 0.314
2.402ValIle: 2.402 ± 0.336
2.402ValLys: 2.402 ± 0.355
6.03ValLeu: 6.03 ± 0.47
1.533ValMet: 1.533 ± 0.286
2.146ValAsn: 2.146 ± 0.354
4.804ValPro: 4.804 ± 0.599
3.117ValGln: 3.117 ± 0.363
4.446ValArg: 4.446 ± 0.679
5.263ValSer: 5.263 ± 0.654
5.928ValThr: 5.928 ± 0.598
6.899ValVal: 6.899 ± 0.574
1.124ValTrp: 1.124 ± 0.247
1.686ValTyr: 1.686 ± 0.272
0.0ValXaa: 0.0 ± 0.0
Trp
2.3TrpAla: 2.3 ± 0.375
0.46TrpCys: 0.46 ± 0.163
1.431TrpAsp: 1.431 ± 0.275
1.38TrpGlu: 1.38 ± 0.301
0.971TrpPhe: 0.971 ± 0.254
1.329TrpGly: 1.329 ± 0.206
0.613TrpHis: 0.613 ± 0.167
0.92TrpIle: 0.92 ± 0.189
0.613TrpLys: 0.613 ± 0.2
1.737TrpLeu: 1.737 ± 0.31
0.46TrpMet: 0.46 ± 0.16
0.613TrpAsn: 0.613 ± 0.158
0.613TrpPro: 0.613 ± 0.199
0.562TrpGln: 0.562 ± 0.119
1.584TrpArg: 1.584 ± 0.317
1.278TrpSer: 1.278 ± 0.284
1.175TrpThr: 1.175 ± 0.226
1.635TrpVal: 1.635 ± 0.37
0.256TrpTrp: 0.256 ± 0.113
0.307TrpTyr: 0.307 ± 0.115
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.686TyrAla: 1.686 ± 0.325
0.409TyrCys: 0.409 ± 0.148
1.789TyrAsp: 1.789 ± 0.269
1.329TyrGlu: 1.329 ± 0.233
0.613TyrPhe: 0.613 ± 0.162
2.453TyrGly: 2.453 ± 0.436
0.307TyrHis: 0.307 ± 0.141
0.204TyrIle: 0.204 ± 0.091
0.256TyrLys: 0.256 ± 0.108
1.942TyrLeu: 1.942 ± 0.306
0.767TyrMet: 0.767 ± 0.169
0.511TyrAsn: 0.511 ± 0.217
1.073TyrPro: 1.073 ± 0.239
0.971TyrGln: 0.971 ± 0.258
2.146TyrArg: 2.146 ± 0.369
1.022TyrSer: 1.022 ± 0.252
1.584TyrThr: 1.584 ± 0.295
1.584TyrVal: 1.584 ± 0.348
0.511TyrTrp: 0.511 ± 0.194
0.409TyrTyr: 0.409 ± 0.148
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 92 proteins (19570 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
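A protein-level bootstrap of this kind can be sketched as follows. The exact procedure used for the table is not described here, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation of the replicate values is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are assumptions.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAAK", "MAGL", "MKVLAA"]  # hypothetical sequences for illustration
error = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.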

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski