Amino acid dipepetide frequency for Microbacterium phage Sucha

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.183AlaAla: 11.183 ± 1.561
0.453AlaCys: 0.453 ± 0.24
5.138AlaAsp: 5.138 ± 0.591
6.423AlaGlu: 6.423 ± 0.805
2.72AlaPhe: 2.72 ± 0.513
7.632AlaGly: 7.632 ± 0.817
2.494AlaHis: 2.494 ± 0.471
4.987AlaIle: 4.987 ± 0.627
5.667AlaLys: 5.667 ± 0.878
8.387AlaLeu: 8.387 ± 0.898
2.267AlaMet: 2.267 ± 0.337
3.854AlaAsn: 3.854 ± 0.571
6.121AlaPro: 6.121 ± 0.966
3.778AlaGln: 3.778 ± 0.627
7.556AlaArg: 7.556 ± 0.709
5.894AlaSer: 5.894 ± 0.679
6.423AlaThr: 6.423 ± 1.033
7.33AlaVal: 7.33 ± 0.885
3.249AlaTrp: 3.249 ± 0.566
2.72AlaTyr: 2.72 ± 0.459
0.0AlaXaa: 0.0 ± 0.0
Cys
0.378CysAla: 0.378 ± 0.195
0.151CysCys: 0.151 ± 0.107
0.453CysAsp: 0.453 ± 0.179
0.378CysGlu: 0.378 ± 0.205
0.076CysPhe: 0.076 ± 0.073
0.982CysGly: 0.982 ± 0.344
0.151CysHis: 0.151 ± 0.094
0.227CysIle: 0.227 ± 0.131
0.227CysLys: 0.227 ± 0.129
0.68CysLeu: 0.68 ± 0.198
0.0CysMet: 0.0 ± 0.0
0.227CysAsn: 0.227 ± 0.128
0.302CysPro: 0.302 ± 0.154
0.227CysGln: 0.227 ± 0.131
0.605CysArg: 0.605 ± 0.248
0.151CysSer: 0.151 ± 0.123
0.529CysThr: 0.529 ± 0.229
0.378CysVal: 0.378 ± 0.185
0.151CysTrp: 0.151 ± 0.126
0.076CysTyr: 0.076 ± 0.062
0.0CysXaa: 0.0 ± 0.0
Asp
7.178AspAla: 7.178 ± 0.633
0.151AspCys: 0.151 ± 0.101
4.458AspAsp: 4.458 ± 0.456
4.232AspGlu: 4.232 ± 0.538
2.494AspPhe: 2.494 ± 0.362
5.214AspGly: 5.214 ± 0.676
0.907AspHis: 0.907 ± 0.258
1.511AspIle: 1.511 ± 0.321
1.738AspLys: 1.738 ± 0.312
6.423AspLeu: 6.423 ± 0.877
1.209AspMet: 1.209 ± 0.264
1.36AspAsn: 1.36 ± 0.288
5.138AspPro: 5.138 ± 0.721
2.796AspGln: 2.796 ± 0.435
4.005AspArg: 4.005 ± 0.577
2.796AspSer: 2.796 ± 0.402
4.534AspThr: 4.534 ± 0.65
3.249AspVal: 3.249 ± 0.45
1.738AspTrp: 1.738 ± 0.351
1.587AspTyr: 1.587 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
7.783GluAla: 7.783 ± 0.646
0.529GluCys: 0.529 ± 0.267
4.156GluAsp: 4.156 ± 0.564
4.685GluGlu: 4.685 ± 0.637
2.418GluPhe: 2.418 ± 0.406
5.441GluGly: 5.441 ± 0.619
1.511GluHis: 1.511 ± 0.344
2.947GluIle: 2.947 ± 0.369
2.947GluLys: 2.947 ± 0.433
6.423GluLeu: 6.423 ± 0.635
1.587GluMet: 1.587 ± 0.311
1.36GluAsn: 1.36 ± 0.29
3.4GluPro: 3.4 ± 0.671
2.267GluGln: 2.267 ± 0.384
4.08GluArg: 4.08 ± 0.692
2.494GluSer: 2.494 ± 0.473
3.627GluThr: 3.627 ± 0.518
5.138GluVal: 5.138 ± 0.602
1.662GluTrp: 1.662 ± 0.372
1.587GluTyr: 1.587 ± 0.385
0.0GluXaa: 0.0 ± 0.0
Phe
2.267PheAla: 2.267 ± 0.413
0.076PheCys: 0.076 ± 0.074
2.267PheAsp: 2.267 ± 0.375
1.285PheGlu: 1.285 ± 0.395
0.982PhePhe: 0.982 ± 0.493
3.098PheGly: 3.098 ± 0.499
0.605PheHis: 0.605 ± 0.227
0.982PheIle: 0.982 ± 0.322
1.058PheLys: 1.058 ± 0.313
2.267PheLeu: 2.267 ± 0.46
0.151PheMet: 0.151 ± 0.136
1.209PheAsn: 1.209 ± 0.358
1.36PhePro: 1.36 ± 0.376
1.587PheGln: 1.587 ± 0.379
2.116PheArg: 2.116 ± 0.351
1.738PheSer: 1.738 ± 0.447
2.267PheThr: 2.267 ± 0.438
2.191PheVal: 2.191 ± 0.492
0.605PheTrp: 0.605 ± 0.207
0.605PheTyr: 0.605 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
8.539GlyAla: 8.539 ± 0.833
0.605GlyCys: 0.605 ± 0.199
4.383GlyAsp: 4.383 ± 0.725
5.516GlyGlu: 5.516 ± 0.655
2.796GlyPhe: 2.796 ± 0.489
8.916GlyGly: 8.916 ± 0.948
1.587GlyHis: 1.587 ± 0.465
4.987GlyIle: 4.987 ± 0.891
4.08GlyLys: 4.08 ± 0.508
6.952GlyLeu: 6.952 ± 0.758
1.36GlyMet: 1.36 ± 0.292
3.174GlyAsn: 3.174 ± 0.572
3.854GlyPro: 3.854 ± 0.543
3.249GlyGln: 3.249 ± 0.62
5.441GlyArg: 5.441 ± 0.509
5.365GlySer: 5.365 ± 0.728
6.347GlyThr: 6.347 ± 0.88
7.632GlyVal: 7.632 ± 0.836
2.04GlyTrp: 2.04 ± 0.471
2.796GlyTyr: 2.796 ± 0.497
0.0GlyXaa: 0.0 ± 0.0
His
0.605HisAla: 0.605 ± 0.195
0.076HisCys: 0.076 ± 0.068
1.36HisAsp: 1.36 ± 0.357
1.285HisGlu: 1.285 ± 0.414
0.605HisPhe: 0.605 ± 0.268
1.738HisGly: 1.738 ± 0.404
0.151HisHis: 0.151 ± 0.096
0.378HisIle: 0.378 ± 0.225
0.756HisLys: 0.756 ± 0.21
1.738HisLeu: 1.738 ± 0.373
0.076HisMet: 0.076 ± 0.072
0.529HisAsn: 0.529 ± 0.171
2.191HisPro: 2.191 ± 0.345
0.529HisGln: 0.529 ± 0.174
1.285HisArg: 1.285 ± 0.286
0.68HisSer: 0.68 ± 0.262
1.133HisThr: 1.133 ± 0.381
1.133HisVal: 1.133 ± 0.341
0.529HisTrp: 0.529 ± 0.184
1.209HisTyr: 1.209 ± 0.316
0.0HisXaa: 0.0 ± 0.0
Ile
4.08IleAla: 4.08 ± 0.541
0.151IleCys: 0.151 ± 0.109
3.703IleAsp: 3.703 ± 0.603
2.871IleGlu: 2.871 ± 0.572
0.907IlePhe: 0.907 ± 0.263
3.703IleGly: 3.703 ± 0.632
0.529IleHis: 0.529 ± 0.194
1.36IleIle: 1.36 ± 0.432
2.04IleLys: 2.04 ± 0.424
3.174IleLeu: 3.174 ± 0.438
0.529IleMet: 0.529 ± 0.219
1.209IleAsn: 1.209 ± 0.337
2.267IlePro: 2.267 ± 0.522
1.889IleGln: 1.889 ± 0.341
2.871IleArg: 2.871 ± 0.423
1.511IleSer: 1.511 ± 0.409
2.72IleThr: 2.72 ± 0.502
2.645IleVal: 2.645 ± 0.651
1.511IleTrp: 1.511 ± 0.632
0.831IleTyr: 0.831 ± 0.255
0.0IleXaa: 0.0 ± 0.0
Lys
4.307LysAla: 4.307 ± 0.805
0.227LysCys: 0.227 ± 0.109
1.36LysAsp: 1.36 ± 0.267
1.587LysGlu: 1.587 ± 0.344
1.285LysPhe: 1.285 ± 0.297
3.4LysGly: 3.4 ± 0.529
0.831LysHis: 0.831 ± 0.337
1.965LysIle: 1.965 ± 0.369
2.342LysLys: 2.342 ± 0.536
4.76LysLeu: 4.76 ± 0.54
1.209LysMet: 1.209 ± 0.27
0.831LysAsn: 0.831 ± 0.255
2.191LysPro: 2.191 ± 0.446
0.756LysGln: 0.756 ± 0.268
3.098LysArg: 3.098 ± 0.441
2.418LysSer: 2.418 ± 0.373
3.023LysThr: 3.023 ± 0.525
3.325LysVal: 3.325 ± 0.381
1.36LysTrp: 1.36 ± 0.313
1.587LysTyr: 1.587 ± 0.412
0.0LysXaa: 0.0 ± 0.0
Leu
8.236LeuAla: 8.236 ± 0.683
0.453LeuCys: 0.453 ± 0.2
5.138LeuAsp: 5.138 ± 0.667
5.214LeuGlu: 5.214 ± 0.654
2.72LeuPhe: 2.72 ± 0.413
6.876LeuGly: 6.876 ± 0.633
1.133LeuHis: 1.133 ± 0.333
3.703LeuIle: 3.703 ± 0.404
3.174LeuLys: 3.174 ± 0.389
6.347LeuLeu: 6.347 ± 0.799
1.965LeuMet: 1.965 ± 0.313
2.191LeuAsn: 2.191 ± 0.36
4.685LeuPro: 4.685 ± 0.663
3.249LeuGln: 3.249 ± 0.52
5.667LeuArg: 5.667 ± 0.696
4.534LeuSer: 4.534 ± 0.672
6.498LeuThr: 6.498 ± 0.778
6.876LeuVal: 6.876 ± 0.7
1.662LeuTrp: 1.662 ± 0.581
2.116LeuTyr: 2.116 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
2.116MetAla: 2.116 ± 0.5
0.0MetCys: 0.0 ± 0.0
1.285MetAsp: 1.285 ± 0.291
0.907MetGlu: 0.907 ± 0.265
0.0MetPhe: 0.0 ± 0.0
0.982MetGly: 0.982 ± 0.241
0.302MetHis: 0.302 ± 0.143
0.68MetIle: 0.68 ± 0.206
0.605MetLys: 0.605 ± 0.195
1.662MetLeu: 1.662 ± 0.29
0.0MetMet: 0.0 ± 0.0
1.133MetAsn: 1.133 ± 0.353
1.587MetPro: 1.587 ± 0.323
0.605MetGln: 0.605 ± 0.226
1.587MetArg: 1.587 ± 0.333
1.662MetSer: 1.662 ± 0.357
1.285MetThr: 1.285 ± 0.25
2.04MetVal: 2.04 ± 0.338
0.227MetTrp: 0.227 ± 0.16
0.302MetTyr: 0.302 ± 0.147
0.0MetXaa: 0.0 ± 0.0
Asn
4.232AsnAla: 4.232 ± 0.624
0.378AsnCys: 0.378 ± 0.19
1.36AsnAsp: 1.36 ± 0.274
1.738AsnGlu: 1.738 ± 0.39
0.982AsnPhe: 0.982 ± 0.235
4.458AsnGly: 4.458 ± 0.621
0.378AsnHis: 0.378 ± 0.241
0.453AsnIle: 0.453 ± 0.241
0.982AsnLys: 0.982 ± 0.303
2.72AsnLeu: 2.72 ± 0.34
0.076AsnMet: 0.076 ± 0.08
0.831AsnAsn: 0.831 ± 0.295
2.645AsnPro: 2.645 ± 0.314
1.133AsnGln: 1.133 ± 0.283
2.04AsnArg: 2.04 ± 0.357
1.209AsnSer: 1.209 ± 0.412
1.738AsnThr: 1.738 ± 0.384
1.285AsnVal: 1.285 ± 0.316
1.058AsnTrp: 1.058 ± 0.377
0.756AsnTyr: 0.756 ± 0.267
0.0AsnXaa: 0.0 ± 0.0
Pro
6.272ProAla: 6.272 ± 0.888
0.378ProCys: 0.378 ± 0.192
3.929ProAsp: 3.929 ± 0.518
5.667ProGlu: 5.667 ± 0.877
1.285ProPhe: 1.285 ± 0.281
5.743ProGly: 5.743 ± 0.755
0.68ProHis: 0.68 ± 0.233
2.796ProIle: 2.796 ± 0.67
1.889ProLys: 1.889 ± 0.497
2.947ProLeu: 2.947 ± 0.538
1.285ProMet: 1.285 ± 0.334
2.04ProAsn: 2.04 ± 0.395
3.098ProPro: 3.098 ± 0.515
1.209ProGln: 1.209 ± 0.388
3.627ProArg: 3.627 ± 0.467
3.854ProSer: 3.854 ± 0.631
3.778ProThr: 3.778 ± 0.58
3.627ProVal: 3.627 ± 0.559
0.529ProTrp: 0.529 ± 0.218
1.436ProTyr: 1.436 ± 0.282
0.0ProXaa: 0.0 ± 0.0
Gln
5.063GlnAla: 5.063 ± 0.587
0.529GlnCys: 0.529 ± 0.204
1.889GlnAsp: 1.889 ± 0.333
3.249GlnGlu: 3.249 ± 0.467
1.058GlnPhe: 1.058 ± 0.279
3.551GlnGly: 3.551 ± 0.511
0.453GlnHis: 0.453 ± 0.187
2.04GlnIle: 2.04 ± 0.364
1.965GlnLys: 1.965 ± 0.36
3.551GlnLeu: 3.551 ± 0.439
0.529GlnMet: 0.529 ± 0.17
1.058GlnAsn: 1.058 ± 0.225
1.133GlnPro: 1.133 ± 0.265
0.982GlnGln: 0.982 ± 0.281
2.342GlnArg: 2.342 ± 0.465
1.662GlnSer: 1.662 ± 0.279
2.418GlnThr: 2.418 ± 0.372
2.418GlnVal: 2.418 ± 0.331
0.907GlnTrp: 0.907 ± 0.268
0.756GlnTyr: 0.756 ± 0.225
0.0GlnXaa: 0.0 ± 0.0
Arg
5.743ArgAla: 5.743 ± 0.801
0.68ArgCys: 0.68 ± 0.244
4.232ArgAsp: 4.232 ± 0.566
4.156ArgGlu: 4.156 ± 0.507
1.889ArgPhe: 1.889 ± 0.462
4.987ArgGly: 4.987 ± 0.508
1.814ArgHis: 1.814 ± 0.4
2.342ArgIle: 2.342 ± 0.36
3.174ArgLys: 3.174 ± 0.608
4.836ArgLeu: 4.836 ± 0.788
2.116ArgMet: 2.116 ± 0.296
2.04ArgAsn: 2.04 ± 0.442
2.569ArgPro: 2.569 ± 0.543
2.871ArgGln: 2.871 ± 0.403
5.289ArgArg: 5.289 ± 0.746
2.796ArgSer: 2.796 ± 0.413
3.098ArgThr: 3.098 ± 0.54
6.801ArgVal: 6.801 ± 0.714
2.191ArgTrp: 2.191 ± 0.385
2.418ArgTyr: 2.418 ± 0.501
0.0ArgXaa: 0.0 ± 0.0
Ser
5.214SerAla: 5.214 ± 0.654
0.076SerCys: 0.076 ± 0.074
3.929SerAsp: 3.929 ± 0.503
4.458SerGlu: 4.458 ± 0.449
1.889SerPhe: 1.889 ± 0.484
6.272SerGly: 6.272 ± 0.939
0.68SerHis: 0.68 ± 0.331
1.662SerIle: 1.662 ± 0.364
2.04SerLys: 2.04 ± 0.337
3.476SerLeu: 3.476 ± 0.363
1.209SerMet: 1.209 ± 0.276
1.587SerAsn: 1.587 ± 0.425
2.947SerPro: 2.947 ± 0.483
2.04SerGln: 2.04 ± 0.321
2.796SerArg: 2.796 ± 0.523
3.174SerSer: 3.174 ± 0.496
3.174SerThr: 3.174 ± 0.508
4.383SerVal: 4.383 ± 0.58
0.907SerTrp: 0.907 ± 0.242
1.662SerTyr: 1.662 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
6.423ThrAla: 6.423 ± 0.868
0.529ThrCys: 0.529 ± 0.197
4.156ThrAsp: 4.156 ± 0.481
3.929ThrGlu: 3.929 ± 0.575
1.285ThrPhe: 1.285 ± 0.334
6.347ThrGly: 6.347 ± 0.729
0.831ThrHis: 0.831 ± 0.28
3.098ThrIle: 3.098 ± 0.448
2.267ThrLys: 2.267 ± 0.395
5.289ThrLeu: 5.289 ± 0.497
1.662ThrMet: 1.662 ± 0.318
2.116ThrAsn: 2.116 ± 0.331
4.458ThrPro: 4.458 ± 0.595
2.569ThrGln: 2.569 ± 0.438
3.098ThrArg: 3.098 ± 0.575
4.383ThrSer: 4.383 ± 0.61
4.383ThrThr: 4.383 ± 0.64
5.289ThrVal: 5.289 ± 0.652
1.662ThrTrp: 1.662 ± 0.323
1.511ThrTyr: 1.511 ± 0.354
0.0ThrXaa: 0.0 ± 0.0
Val
9.143ValAla: 9.143 ± 0.735
0.302ValCys: 0.302 ± 0.159
5.818ValAsp: 5.818 ± 0.646
4.912ValGlu: 4.912 ± 0.585
1.814ValPhe: 1.814 ± 0.323
5.592ValGly: 5.592 ± 0.697
1.889ValHis: 1.889 ± 0.452
2.645ValIle: 2.645 ± 0.423
2.947ValLys: 2.947 ± 0.373
6.65ValLeu: 6.65 ± 0.563
1.058ValMet: 1.058 ± 0.27
2.116ValAsn: 2.116 ± 0.498
3.854ValPro: 3.854 ± 0.662
3.325ValGln: 3.325 ± 0.434
4.307ValArg: 4.307 ± 0.619
4.987ValSer: 4.987 ± 0.728
4.836ValThr: 4.836 ± 0.635
7.027ValVal: 7.027 ± 0.619
2.569ValTrp: 2.569 ± 0.686
2.04ValTyr: 2.04 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
3.023TrpAla: 3.023 ± 0.501
0.378TrpCys: 0.378 ± 0.182
1.965TrpAsp: 1.965 ± 0.454
1.889TrpGlu: 1.889 ± 0.374
0.831TrpPhe: 0.831 ± 0.227
1.814TrpGly: 1.814 ± 0.312
0.453TrpHis: 0.453 ± 0.173
0.982TrpIle: 0.982 ± 0.321
0.982TrpLys: 0.982 ± 0.233
1.738TrpLeu: 1.738 ± 0.291
0.453TrpMet: 0.453 ± 0.167
0.756TrpAsn: 0.756 ± 0.341
0.756TrpPro: 0.756 ± 0.212
1.209TrpGln: 1.209 ± 0.342
1.965TrpArg: 1.965 ± 0.474
1.133TrpSer: 1.133 ± 0.318
1.814TrpThr: 1.814 ± 0.394
2.342TrpVal: 2.342 ± 0.451
0.605TrpTrp: 0.605 ± 0.211
0.982TrpTyr: 0.982 ± 0.285
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.494TyrAla: 2.494 ± 0.366
0.227TyrCys: 0.227 ± 0.115
1.738TyrAsp: 1.738 ± 0.374
1.662TyrGlu: 1.662 ± 0.396
0.756TyrPhe: 0.756 ± 0.197
2.871TyrGly: 2.871 ± 0.464
0.605TyrHis: 0.605 ± 0.167
0.756TyrIle: 0.756 ± 0.211
0.982TyrLys: 0.982 ± 0.339
2.418TyrLeu: 2.418 ± 0.456
0.151TyrMet: 0.151 ± 0.095
0.831TyrAsn: 0.831 ± 0.232
1.511TyrPro: 1.511 ± 0.306
1.285TyrGln: 1.285 ± 0.291
2.267TyrArg: 2.267 ± 0.386
1.133TyrSer: 1.133 ± 0.394
1.662TyrThr: 1.662 ± 0.307
2.72TyrVal: 2.72 ± 0.494
0.907TyrTrp: 0.907 ± 0.259
0.68TyrTyr: 0.68 ± 0.272
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (13235 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski