Amino acid dipeptide frequency for Mycobacterium phage Sandalphon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 13.953‰ = 0.013953).
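The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used to build the database: the function names, the one-letter residue alphabet, the use of overlapping residue pairs, and the normalisation by the total number of dipeptides are all assumptions made here, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids in one-letter code (assumed input alphabet).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    pairs = (a + b for a, b in product(AMINO_ACIDS, repeat=2))
    if total == 0:
        return {pair: 0.0 for pair in pairs}
    # Assumption: normalise by the total number of dipeptides observed.
    return {pair: 1000.0 * counts[pair] / total for pair in pairs}


def colour(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Mimic the table colouring: red above expectation, blue below."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "none"


if __name__ == "__main__":
    toy_proteome = ["AAC", "CA"]  # hypothetical toy sequences
    freqs = dipeptide_per_mille(toy_proteome)
    print(freqs["AA"], colour(freqs["AA"]))
```

With this colouring rule, AlaAla at 13.953‰ would be marked red and AlaCys at 0.948‰ blue, since the uniform expectation is 2.5‰.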

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 13.953 ± 1.73 | 0.948 ± 0.244 | 7.319 ± 0.635 | 6.582 ± 0.582 | 3.265 ± 0.398 | 10.162 ± 1.345 | 2.685 ± 0.408 | 4.16 ± 0.62 | 4.528 ± 0.485 | 7.687 ± 0.802 | 2.633 ± 0.401 | 3.054 ± 0.438 | 4.634 ± 0.571 | 3.054 ± 0.354 | 6.634 ± 0.594 | 5.423 ± 0.64 | 6.371 ± 0.482 | 6.634 ± 0.568 | 2.527 ± 0.374 | 2.159 ± 0.374 | 0.0 ± 0.0 |
| Cys | 0.737 ± 0.252 | 0.0 ± 0.0 | 1.422 ± 0.322 | 0.79 ± 0.202 | 0.158 ± 0.089 | 1.738 ± 0.435 | 0.369 ± 0.13 | 0.316 ± 0.128 | 0.579 ± 0.178 | 0.632 ± 0.226 | 0.316 ± 0.117 | 0.369 ± 0.153 | 1.211 ± 0.275 | 0.316 ± 0.126 | 0.684 ± 0.179 | 0.737 ± 0.212 | 0.579 ± 0.169 | 0.895 ± 0.218 | 0.527 ± 0.155 | 0.158 ± 0.083 | 0.0 ± 0.0 |
| Asp | 6.898 ± 0.679 | 0.842 ± 0.192 | 4.054 ± 0.509 | 3.949 ± 0.546 | 1.896 ± 0.265 | 6.424 ± 0.611 | 1.422 ± 0.271 | 2.211 ± 0.343 | 1.58 ± 0.289 | 5.792 ± 0.574 | 1.158 ± 0.31 | 1.738 ± 0.348 | 5.16 ± 0.572 | 2.001 ± 0.293 | 5.529 ± 0.594 | 3.528 ± 0.466 | 4.16 ± 0.541 | 4.054 ± 0.542 | 1.474 ± 0.306 | 2.369 ± 0.361 | 0.0 ± 0.0 |
| Glu | 6.266 ± 0.713 | 0.737 ± 0.183 | 2.685 ± 0.347 | 2.949 ± 0.42 | 2.053 ± 0.357 | 3.686 ± 0.481 | 1.474 ± 0.304 | 2.791 ± 0.369 | 2.106 ± 0.338 | 6.003 ± 0.697 | 1.738 ± 0.314 | 2.264 ± 0.314 | 2.843 ± 0.516 | 3.107 ± 0.358 | 4.686 ± 0.594 | 3.107 ± 0.434 | 4.476 ± 0.64 | 3.949 ± 0.451 | 1.316 ± 0.277 | 1.896 ± 0.342 | 0.0 ± 0.0 |
| Phe | 3.265 ± 0.418 | 0.316 ± 0.122 | 2.685 ± 0.446 | 1.896 ± 0.355 | 0.79 ± 0.251 | 3.001 ± 0.635 | 0.527 ± 0.189 | 1.106 ± 0.326 | 1.106 ± 0.251 | 1.896 ± 0.272 | 0.579 ± 0.152 | 1.316 ± 0.352 | 1.527 ± 0.322 | 1.211 ± 0.331 | 1.264 ± 0.23 | 1.685 ± 0.285 | 2.633 ± 0.416 | 2.369 ± 0.304 | 0.737 ± 0.21 | 0.895 ± 0.256 | 0.0 ± 0.0 |
| Gly | 9.53 ± 1.332 | 1.211 ± 0.255 | 5.371 ± 0.562 | 5.107 ± 0.645 | 2.738 ± 0.438 | 10.899 ± 2.711 | 2.001 ± 0.314 | 4.002 ± 0.61 | 2.738 ± 0.411 | 6.266 ± 0.558 | 2.159 ± 0.44 | 2.738 ± 0.388 | 3.686 ± 0.575 | 2.369 ± 0.601 | 5.476 ± 0.623 | 6.003 ± 0.823 | 5.897 ± 0.794 | 5.897 ± 0.581 | 2.317 ± 0.468 | 2.211 ± 0.415 | 0.0 ± 0.0 |
| His | 1.896 ± 0.351 | 0.316 ± 0.159 | 1.316 ± 0.225 | 1.264 ± 0.263 | 0.527 ± 0.132 | 2.053 ± 0.39 | 0.684 ± 0.212 | 1.527 ± 0.327 | 1.158 ± 0.27 | 1.632 ± 0.394 | 0.579 ± 0.161 | 0.895 ± 0.202 | 1.474 ± 0.305 | 0.632 ± 0.174 | 1.79 ± 0.312 | 1.158 ± 0.218 | 1.369 ± 0.312 | 1.474 ± 0.329 | 0.579 ± 0.185 | 0.79 ± 0.164 | 0.0 ± 0.0 |
| Ile | 5.107 ± 0.548 | 0.684 ± 0.211 | 3.896 ± 0.474 | 3.528 ± 0.373 | 0.737 ± 0.235 | 3.949 ± 0.503 | 1.79 ± 0.358 | 1.264 ± 0.251 | 0.895 ± 0.2 | 2.527 ± 0.374 | 0.316 ± 0.124 | 1.58 ± 0.282 | 3.001 ± 0.342 | 1.527 ± 0.25 | 2.106 ± 0.366 | 2.369 ± 0.474 | 3.422 ± 0.458 | 3.212 ± 0.373 | 0.842 ± 0.193 | 0.895 ± 0.21 | 0.0 ± 0.0 |
| Lys | 3.738 ± 0.523 | 0.474 ± 0.192 | 1.632 ± 0.267 | 1.527 ± 0.262 | 1.211 ± 0.189 | 2.264 ± 0.349 | 1.053 ± 0.254 | 1.053 ± 0.277 | 1.632 ± 0.452 | 2.633 ± 0.501 | 0.79 ± 0.191 | 1.0 ± 0.204 | 2.527 ± 0.406 | 1.422 ± 0.241 | 2.633 ± 0.475 | 2.053 ± 0.291 | 2.159 ± 0.315 | 2.369 ± 0.398 | 1.211 ± 0.333 | 0.79 ± 0.24 | 0.0 ± 0.0 |
| Leu | 7.898 ± 0.96 | 1.0 ± 0.266 | 4.528 ± 0.524 | 4.318 ± 0.511 | 2.949 ± 0.375 | 5.107 ± 0.574 | 0.684 ± 0.206 | 3.054 ± 0.455 | 2.264 ± 0.316 | 4.634 ± 0.565 | 1.896 ± 0.321 | 2.949 ± 0.426 | 5.16 ± 0.685 | 2.949 ± 0.432 | 5.16 ± 0.638 | 4.476 ± 0.496 | 5.318 ± 0.554 | 4.739 ± 0.635 | 1.527 ± 0.302 | 2.211 ± 0.392 | 0.0 ± 0.0 |
| Met | 2.053 ± 0.375 | 0.211 ± 0.12 | 1.264 ± 0.279 | 1.0 ± 0.22 | 0.579 ± 0.203 | 1.738 ± 0.261 | 0.158 ± 0.096 | 1.053 ± 0.238 | 0.948 ± 0.249 | 1.843 ± 0.295 | 0.474 ± 0.227 | 1.0 ± 0.226 | 1.264 ± 0.277 | 0.684 ± 0.195 | 1.632 ± 0.323 | 3.001 ± 0.479 | 2.422 ± 0.36 | 1.316 ± 0.315 | 0.263 ± 0.107 | 0.211 ± 0.111 | 0.0 ± 0.0 |
| Asn | 3.212 ± 0.448 | 0.421 ± 0.123 | 1.738 ± 0.301 | 1.632 ± 0.297 | 0.79 ± 0.28 | 4.107 ± 0.503 | 0.895 ± 0.161 | 1.527 ± 0.444 | 1.053 ± 0.287 | 2.475 ± 0.364 | 0.684 ± 0.17 | 2.001 ± 0.371 | 2.896 ± 0.317 | 1.053 ± 0.348 | 2.527 ± 0.38 | 1.474 ± 0.304 | 2.053 ± 0.316 | 1.79 ± 0.351 | 0.632 ± 0.18 | 0.737 ± 0.184 | 0.0 ± 0.0 |
| Pro | 5.002 ± 0.602 | 0.79 ± 0.231 | 4.949 ± 0.635 | 4.37 ± 0.447 | 1.632 ± 0.272 | 6.371 ± 0.779 | 1.632 ± 0.342 | 1.843 ± 0.305 | 2.106 ± 0.365 | 4.476 ± 0.57 | 1.316 ± 0.296 | 2.159 ± 0.303 | 3.475 ± 0.535 | 2.211 ± 0.418 | 3.738 ± 0.63 | 2.896 ± 0.36 | 3.317 ± 0.442 | 4.318 ± 0.45 | 1.211 ± 0.266 | 1.527 ± 0.268 | 0.0 ± 0.0 |
| Gln | 4.265 ± 0.591 | 0.316 ± 0.131 | 1.527 ± 0.249 | 1.527 ± 0.3 | 1.0 ± 0.233 | 2.369 ± 0.457 | 1.211 ± 0.3 | 1.527 ± 0.293 | 1.264 ± 0.257 | 2.791 ± 0.436 | 0.895 ± 0.216 | 1.0 ± 0.286 | 2.369 ± 0.364 | 1.369 ± 0.313 | 2.685 ± 0.31 | 2.264 ± 0.357 | 2.106 ± 0.321 | 2.475 ± 0.34 | 0.527 ± 0.155 | 0.737 ± 0.27 | 0.0 ± 0.0 |
| Arg | 6.792 ± 0.711 | 1.422 ± 0.403 | 4.37 ± 0.515 | 4.844 ± 0.64 | 2.58 ± 0.434 | 4.581 ± 0.421 | 0.948 ± 0.272 | 4.107 ± 0.566 | 2.369 ± 0.382 | 4.844 ± 0.499 | 2.159 ± 0.328 | 2.211 ± 0.407 | 3.844 ± 0.513 | 2.159 ± 0.359 | 6.529 ± 0.888 | 3.475 ± 0.426 | 3.37 ± 0.441 | 5.371 ± 0.608 | 1.843 ± 0.358 | 2.159 ± 0.311 | 0.0 ± 0.0 |
| Ser | 5.213 ± 0.785 | 0.421 ± 0.142 | 4.581 ± 0.558 | 3.001 ± 0.397 | 2.106 ± 0.486 | 5.634 ± 0.86 | 1.474 ± 0.285 | 2.896 ± 0.405 | 2.106 ± 0.353 | 3.896 ± 0.428 | 1.632 ± 0.285 | 1.896 ± 0.305 | 3.265 ± 0.38 | 1.527 ± 0.245 | 3.317 ± 0.441 | 4.002 ± 0.641 | 3.422 ± 0.476 | 4.686 ± 0.485 | 1.264 ± 0.25 | 1.211 ± 0.284 | 0.0 ± 0.0 |
| Thr | 6.74 ± 0.517 | 0.632 ± 0.17 | 4.265 ± 0.583 | 3.738 ± 0.397 | 2.053 ± 0.364 | 6.213 ± 0.633 | 1.685 ± 0.309 | 3.58 ± 0.425 | 1.843 ± 0.322 | 4.107 ± 0.424 | 1.264 ± 0.261 | 2.159 ± 0.403 | 4.265 ± 0.436 | 2.159 ± 0.35 | 4.423 ± 0.5 | 3.738 ± 0.425 | 4.791 ± 0.582 | 5.16 ± 0.651 | 1.211 ± 0.269 | 1.738 ± 0.328 | 0.0 ± 0.0 |
| Val | 7.003 ± 0.527 | 1.0 ± 0.224 | 5.581 ± 0.613 | 4.949 ± 0.575 | 2.053 ± 0.382 | 5.16 ± 0.679 | 1.422 ± 0.263 | 3.317 ± 0.389 | 2.633 ± 0.35 | 5.055 ± 0.659 | 1.474 ± 0.272 | 2.106 ± 0.348 | 4.054 ± 0.417 | 2.685 ± 0.335 | 4.791 ± 0.58 | 3.896 ± 0.51 | 4.897 ± 0.559 | 6.318 ± 0.704 | 1.58 ± 0.32 | 1.474 ± 0.323 | 0.0 ± 0.0 |
| Trp | 2.053 ± 0.348 | 0.316 ± 0.147 | 1.422 ± 0.262 | 1.158 ± 0.288 | 0.842 ± 0.226 | 1.158 ± 0.282 | 0.737 ± 0.188 | 1.158 ± 0.27 | 0.737 ± 0.169 | 1.738 ± 0.344 | 0.842 ± 0.268 | 0.527 ± 0.204 | 1.0 ± 0.295 | 1.0 ± 0.245 | 2.422 ± 0.46 | 1.369 ± 0.316 | 1.422 ± 0.275 | 1.843 ± 0.427 | 1.158 ± 0.242 | 0.369 ± 0.144 | 0.0 ± 0.0 |
| Tyr | 2.843 ± 0.428 | 0.474 ± 0.137 | 1.632 ± 0.33 | 2.001 ± 0.291 | 0.842 ± 0.236 | 2.001 ± 0.438 | 0.263 ± 0.109 | 1.158 ± 0.196 | 0.527 ± 0.195 | 1.896 ± 0.321 | 0.158 ± 0.087 | 0.737 ± 0.18 | 1.685 ± 0.271 | 0.79 ± 0.206 | 1.948 ± 0.313 | 0.895 ± 0.245 | 1.58 ± 0.36 | 2.58 ± 0.338 | 0.527 ± 0.158 | 0.579 ± 0.181 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 104 proteins (18993 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
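Protein-level bootstrapping can be illustrated with the following minimal Python sketch. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each resample, and that the reported error is the standard deviation over the resamples; the source states only that bootstrapping (x100) at the protein level was used, so these details, along with the function names and the fixed seed, are assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the proteome, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    # Assumption: the error is the sample standard deviation of the estimates.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAC", "CCCA", "ACAC", "AAAA"]  # hypothetical sequences
    value = dipeptide_per_mille(toy_proteome, "AA")
    error = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.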

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski