Amino acid dipeptide frequency for Mycobacterium phage KristaRAM

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
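As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_permille`, the use of one-letter codes, the pooling of unknown residues as `X`, and the toy sequences are all assumptions for illustration; the page does not state how Proteome-pI performs the counting.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (assumption: the input
# uses one-letter codes; any other symbol is pooled as "X", i.e. Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return the frequency of each overlapping dipeptide in per mille."""
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Slide a window of width 2 along the sequence; pairs never span
        # the boundary between two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the KristaRAM proteome.
toy = ["MAAGLK", "MAGXA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(f"{pair}: {freqs[pair]:.3f}")
```

Counting within each protein separately avoids creating artificial pairs from the last residue of one protein and the first residue of the next.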

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
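The unit conversion can be illustrated with a short sketch. It converts two values from the table (AlaAla and CysCys) to fractions and compares them with the uniform expectation of 0.25%, which is 2.5‰. The helper names and the assumption that the red/blue marking uses exactly this uniform threshold are illustrative and not confirmed by the page.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400  # 0.25% == 2.5 per mille


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per mille value relative to the expected value (assumed threshold)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


ala_ala = 15.233  # AlaAla, per mille, from the table
cys_cys = 0.106   # CysCys, per mille, from the table

for name, value in (("AlaAla", ala_ala), ("CysCys", cys_cys)):
    print(name, permille_to_fraction(value), classify(value))
```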

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.233 ± 1.749
AlaCys: 1.058 ± 0.274
AlaAsp: 6.929 ± 0.71
AlaGlu: 7.828 ± 0.747
AlaPhe: 3.121 ± 0.417
AlaGly: 10.314 ± 1.233
AlaHis: 2.433 ± 0.467
AlaIle: 3.861 ± 0.497
AlaLys: 4.02 ± 0.427
AlaLeu: 8.357 ± 0.75
AlaMet: 2.645 ± 0.337
AlaAsn: 2.274 ± 0.364
AlaPro: 5.501 ± 0.573
AlaGln: 3.544 ± 0.477
AlaArg: 6.929 ± 0.753
AlaSer: 4.972 ± 0.593
AlaThr: 5.765 ± 0.582
AlaVal: 7.564 ± 0.564
AlaTrp: 2.75 ± 0.534
AlaTyr: 2.222 ± 0.296
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.428 ± 0.398
CysCys: 0.106 ± 0.082
CysAsp: 1.428 ± 0.322
CysGlu: 0.635 ± 0.171
CysPhe: 0.317 ± 0.133
CysGly: 1.798 ± 0.326
CysHis: 0.37 ± 0.129
CysIle: 0.212 ± 0.109
CysLys: 0.423 ± 0.121
CysLeu: 1.058 ± 0.289
CysMet: 0.264 ± 0.125
CysAsn: 0.529 ± 0.163
CysPro: 1.058 ± 0.243
CysGln: 0.264 ± 0.113
CysArg: 0.952 ± 0.282
CysSer: 0.793 ± 0.222
CysThr: 0.899 ± 0.224
CysVal: 0.741 ± 0.196
CysTrp: 0.37 ± 0.131
CysTyr: 0.37 ± 0.116
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.294 ± 0.526
AspCys: 1.322 ± 0.236
AspAsp: 4.337 ± 0.574
AspGlu: 3.121 ± 0.398
AspPhe: 1.957 ± 0.277
AspGly: 6.982 ± 0.651
AspHis: 1.322 ± 0.268
AspIle: 2.645 ± 0.34
AspLys: 1.64 ± 0.291
AspLeu: 5.871 ± 0.584
AspMet: 0.899 ± 0.251
AspAsn: 1.745 ± 0.415
AspPro: 4.813 ± 0.563
AspGln: 2.274 ± 0.303
AspArg: 4.972 ± 0.564
AspSer: 3.755 ± 0.604
AspThr: 3.967 ± 0.467
AspVal: 4.284 ± 0.601
AspTrp: 1.428 ± 0.299
AspTyr: 1.798 ± 0.351
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.823 ± 0.677
GluCys: 1.005 ± 0.262
GluAsp: 2.75 ± 0.346
GluGlu: 3.226 ± 0.578
GluPhe: 2.38 ± 0.334
GluGly: 3.121 ± 0.369
GluHis: 1.64 ± 0.389
GluIle: 2.698 ± 0.407
GluLys: 1.904 ± 0.273
GluLeu: 5.395 ± 0.629
GluMet: 1.428 ± 0.271
GluAsn: 2.169 ± 0.271
GluPro: 2.803 ± 0.488
GluGln: 2.592 ± 0.325
GluArg: 5.131 ± 0.644
GluSer: 3.121 ± 0.543
GluThr: 4.126 ± 0.647
GluVal: 3.914 ± 0.637
GluTrp: 1.534 ± 0.282
GluTyr: 1.745 ± 0.34
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.491 ± 0.42
PheCys: 0.264 ± 0.097
PheAsp: 2.433 ± 0.447
PheGlu: 1.64 ± 0.302
PhePhe: 0.793 ± 0.28
PheGly: 3.438 ± 0.641
PheHis: 0.37 ± 0.131
PheIle: 1.587 ± 0.35
PheLys: 0.899 ± 0.202
PheLeu: 1.957 ± 0.306
PheMet: 0.899 ± 0.258
PheAsn: 1.111 ± 0.305
PhePro: 1.375 ± 0.296
PheGln: 1.111 ± 0.301
PheArg: 1.587 ± 0.286
PheSer: 1.269 ± 0.256
PheThr: 2.116 ± 0.406
PheVal: 2.327 ± 0.28
PheTrp: 0.635 ± 0.179
PheTyr: 1.058 ± 0.283
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.151 ± 1.12
GlyCys: 1.111 ± 0.283
GlyAsp: 6.506 ± 0.618
GlyGlu: 4.284 ± 0.628
GlyPhe: 2.698 ± 0.42
GlyGly: 11.319 ± 2.348
GlyHis: 2.169 ± 0.28
GlyIle: 3.967 ± 0.627
GlyLys: 2.698 ± 0.376
GlyLeu: 6.083 ± 0.589
GlyMet: 2.486 ± 0.425
GlyAsn: 2.645 ± 0.388
GlyPro: 4.126 ± 0.519
GlyGln: 2.327 ± 0.498
GlyArg: 5.554 ± 0.696
GlySer: 5.977 ± 0.906
GlyThr: 6.136 ± 0.619
GlyVal: 5.607 ± 0.597
GlyTrp: 2.274 ± 0.368
GlyTyr: 2.274 ± 0.343
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.798 ± 0.333
HisCys: 0.37 ± 0.164
HisAsp: 0.952 ± 0.22
HisGlu: 1.481 ± 0.28
HisPhe: 0.317 ± 0.133
HisGly: 1.587 ± 0.278
HisHis: 1.005 ± 0.267
HisIle: 1.269 ± 0.302
HisLys: 0.846 ± 0.298
HisLeu: 1.534 ± 0.327
HisMet: 0.635 ± 0.179
HisAsn: 0.793 ± 0.183
HisPro: 1.851 ± 0.346
HisGln: 0.741 ± 0.185
HisArg: 2.169 ± 0.38
HisSer: 0.741 ± 0.198
HisThr: 1.481 ± 0.338
HisVal: 1.428 ± 0.389
HisTrp: 0.476 ± 0.16
HisTyr: 0.741 ± 0.167
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.184 ± 0.533
IleCys: 0.899 ± 0.237
IleAsp: 4.073 ± 0.564
IleGlu: 3.332 ± 0.413
IlePhe: 0.899 ± 0.258
IleGly: 3.491 ± 0.478
IleHis: 1.217 ± 0.286
IleIle: 1.322 ± 0.284
IleLys: 1.111 ± 0.23
IleLeu: 1.851 ± 0.42
IleMet: 0.37 ± 0.167
IleAsn: 2.01 ± 0.304
IlePro: 3.332 ± 0.36
IleGln: 1.375 ± 0.241
IleArg: 2.698 ± 0.442
IleSer: 2.063 ± 0.356
IleThr: 3.597 ± 0.415
IleVal: 2.539 ± 0.395
IleTrp: 1.111 ± 0.227
IleTyr: 0.846 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.491 ± 0.471
LysCys: 0.264 ± 0.118
LysAsp: 2.01 ± 0.323
LysGlu: 1.217 ± 0.253
LysPhe: 1.481 ± 0.211
LysGly: 2.909 ± 0.288
LysHis: 0.846 ± 0.195
LysIle: 1.217 ± 0.301
LysLys: 1.322 ± 0.337
LysLeu: 2.274 ± 0.413
LysMet: 0.793 ± 0.195
LysAsn: 0.741 ± 0.206
LysPro: 2.116 ± 0.353
LysGln: 1.587 ± 0.255
LysArg: 2.592 ± 0.41
LysSer: 1.745 ± 0.299
LysThr: 1.957 ± 0.345
LysVal: 2.327 ± 0.389
LysTrp: 0.529 ± 0.177
LysTyr: 1.005 ± 0.279
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.987 ± 0.753
LeuCys: 0.846 ± 0.213
LeuAsp: 5.131 ± 0.505
LeuGlu: 4.126 ± 0.567
LeuPhe: 2.539 ± 0.288
LeuGly: 5.184 ± 0.556
LeuHis: 1.111 ± 0.213
LeuIle: 3.279 ± 0.467
LeuLys: 1.851 ± 0.314
LeuLeu: 4.655 ± 0.499
LeuMet: 1.534 ± 0.266
LeuAsn: 2.698 ± 0.373
LeuPro: 5.078 ± 0.713
LeuGln: 2.645 ± 0.42
LeuArg: 5.342 ± 0.653
LeuSer: 5.342 ± 0.527
LeuThr: 5.184 ± 0.559
LeuVal: 4.76 ± 0.522
LeuTrp: 1.798 ± 0.36
LeuTyr: 2.116 ± 0.365
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.01 ± 0.357
MetCys: 0.317 ± 0.117
MetAsp: 1.164 ± 0.3
MetGlu: 1.322 ± 0.237
MetPhe: 0.635 ± 0.166
MetGly: 1.693 ± 0.282
MetHis: 0.159 ± 0.109
MetIle: 1.005 ± 0.22
MetLys: 0.635 ± 0.21
MetLeu: 1.693 ± 0.272
MetMet: 0.476 ± 0.2
MetAsn: 1.005 ± 0.237
MetPro: 1.269 ± 0.235
MetGln: 0.582 ± 0.164
MetArg: 1.481 ± 0.308
MetSer: 2.75 ± 0.402
MetThr: 2.38 ± 0.322
MetVal: 1.481 ± 0.32
MetTrp: 0.159 ± 0.08
MetTyr: 0.317 ± 0.114
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.226 ± 0.357
AsnCys: 0.212 ± 0.136
AsnAsp: 1.64 ± 0.3
AsnGlu: 1.798 ± 0.35
AsnPhe: 0.899 ± 0.289
AsnGly: 3.914 ± 0.502
AsnHis: 1.164 ± 0.243
AsnIle: 1.693 ± 0.395
AsnLys: 0.793 ± 0.221
AsnLeu: 2.327 ± 0.312
AsnMet: 0.582 ± 0.18
AsnAsn: 1.745 ± 0.356
AsnPro: 2.592 ± 0.372
AsnGln: 1.217 ± 0.33
AsnArg: 2.169 ± 0.359
AsnSer: 1.587 ± 0.327
AsnThr: 2.38 ± 0.327
AsnVal: 1.693 ± 0.298
AsnTrp: 0.846 ± 0.176
AsnTyr: 0.635 ± 0.148
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.607 ± 0.651
ProCys: 1.005 ± 0.256
ProAsp: 3.967 ± 0.533
ProGlu: 4.179 ± 0.469
ProPhe: 1.64 ± 0.329
ProGly: 6.717 ± 0.675
ProHis: 1.322 ± 0.294
ProIle: 2.433 ± 0.364
ProLys: 2.433 ± 0.44
ProLeu: 4.337 ± 0.516
ProMet: 1.534 ± 0.295
ProAsn: 2.274 ± 0.351
ProPro: 3.861 ± 0.621
ProGln: 1.64 ± 0.321
ProArg: 3.226 ± 0.548
ProSer: 3.121 ± 0.364
ProThr: 3.226 ± 0.472
ProVal: 4.549 ± 0.566
ProTrp: 0.952 ± 0.243
ProTyr: 1.745 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.39 ± 0.632
GlnCys: 0.37 ± 0.144
GlnAsp: 1.745 ± 0.25
GlnGlu: 1.798 ± 0.382
GlnPhe: 1.005 ± 0.184
GlnGly: 2.222 ± 0.377
GlnHis: 0.582 ± 0.183
GlnIle: 1.481 ± 0.305
GlnLys: 1.322 ± 0.251
GlnLeu: 3.174 ± 0.444
GlnMet: 0.582 ± 0.193
GlnAsn: 0.952 ± 0.249
GlnPro: 2.803 ± 0.36
GlnGln: 1.058 ± 0.238
GlnArg: 2.274 ± 0.372
GlnSer: 2.063 ± 0.371
GlnThr: 1.798 ± 0.355
GlnVal: 2.433 ± 0.38
GlnTrp: 0.582 ± 0.17
GlnTyr: 1.005 ± 0.261
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.189 ± 0.544
ArgCys: 1.534 ± 0.305
ArgAsp: 4.179 ± 0.598
ArgGlu: 5.025 ± 0.702
ArgPhe: 2.327 ± 0.424
ArgGly: 3.967 ± 0.451
ArgHis: 1.375 ± 0.334
ArgIle: 4.073 ± 0.491
ArgLys: 2.327 ± 0.421
ArgLeu: 5.289 ± 0.679
ArgMet: 2.539 ± 0.4
ArgAsn: 2.433 ± 0.393
ArgPro: 3.279 ± 0.489
ArgGln: 2.222 ± 0.363
ArgArg: 5.977 ± 0.856
ArgSer: 3.914 ± 0.392
ArgThr: 3.438 ± 0.508
ArgVal: 5.184 ± 0.563
ArgTrp: 2.01 ± 0.334
ArgTyr: 2.063 ± 0.292
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.453 ± 1.046
SerCys: 0.476 ± 0.155
SerAsp: 3.861 ± 0.417
SerGlu: 3.438 ± 0.411
SerPhe: 1.957 ± 0.455
SerGly: 6.189 ± 0.735
SerHis: 1.164 ± 0.22
SerIle: 2.592 ± 0.448
SerLys: 2.539 ± 0.464
SerLeu: 3.914 ± 0.429
SerMet: 1.111 ± 0.233
SerAsn: 1.957 ± 0.373
SerPro: 3.226 ± 0.378
SerGln: 1.745 ± 0.24
SerArg: 3.65 ± 0.423
SerSer: 3.967 ± 0.644
SerThr: 3.174 ± 0.431
SerVal: 4.655 ± 0.495
SerTrp: 1.375 ± 0.242
SerTyr: 1.164 ± 0.208
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.4 ± 0.653
ThrCys: 0.741 ± 0.22
ThrAsp: 4.39 ± 0.622
ThrGlu: 3.438 ± 0.37
ThrPhe: 1.798 ± 0.335
ThrGly: 5.977 ± 0.621
ThrHis: 1.534 ± 0.328
ThrIle: 3.385 ± 0.44
ThrLys: 1.798 ± 0.331
ThrLeu: 4.443 ± 0.522
ThrMet: 1.269 ± 0.252
ThrAsn: 2.486 ± 0.391
ThrPro: 4.39 ± 0.583
ThrGln: 1.904 ± 0.274
ThrArg: 3.861 ± 0.434
ThrSer: 3.755 ± 0.378
ThrThr: 5.448 ± 0.664
ThrVal: 5.712 ± 0.637
ThrTrp: 1.164 ± 0.268
ThrTyr: 1.851 ± 0.283
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.775 ± 0.679
ValCys: 1.481 ± 0.314
ValAsp: 5.078 ± 0.54
ValGlu: 4.866 ± 0.61
ValPhe: 2.222 ± 0.402
ValGly: 5.289 ± 0.688
ValHis: 1.269 ± 0.3
ValIle: 2.803 ± 0.441
ValLys: 2.222 ± 0.325
ValLeu: 5.184 ± 0.59
ValMet: 1.217 ± 0.213
ValAsn: 2.116 ± 0.344
ValPro: 3.861 ± 0.41
ValGln: 2.803 ± 0.385
ValArg: 4.708 ± 0.658
ValSer: 4.972 ± 0.545
ValThr: 4.76 ± 0.454
ValVal: 5.818 ± 0.701
ValTrp: 1.693 ± 0.332
ValTyr: 1.322 ± 0.246
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.063 ± 0.332
TrpCys: 0.212 ± 0.116
TrpAsp: 1.481 ± 0.278
TrpGlu: 1.058 ± 0.295
TrpPhe: 0.635 ± 0.155
TrpGly: 1.111 ± 0.255
TrpHis: 0.635 ± 0.202
TrpIle: 0.899 ± 0.193
TrpLys: 0.846 ± 0.197
TrpLeu: 1.745 ± 0.308
TrpMet: 0.793 ± 0.201
TrpAsn: 0.582 ± 0.199
TrpPro: 0.952 ± 0.292
TrpGln: 1.217 ± 0.32
TrpArg: 1.957 ± 0.384
TrpSer: 1.693 ± 0.513
TrpThr: 2.01 ± 0.319
TrpVal: 2.01 ± 0.436
TrpTrp: 0.899 ± 0.196
TrpTyr: 0.423 ± 0.16
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.327 ± 0.369
TyrCys: 0.423 ± 0.157
TyrAsp: 1.375 ± 0.328
TyrGlu: 1.693 ± 0.308
TyrPhe: 0.741 ± 0.232
TyrGly: 2.169 ± 0.375
TyrHis: 0.529 ± 0.169
TyrIle: 0.952 ± 0.207
TyrLys: 0.793 ± 0.22
TyrLeu: 2.063 ± 0.308
TyrMet: 0.423 ± 0.147
TyrAsn: 0.846 ± 0.249
TyrPro: 1.481 ± 0.22
TyrGln: 0.793 ± 0.21
TyrArg: 2.116 ± 0.386
TyrSer: 1.005 ± 0.218
TyrThr: 1.904 ± 0.366
TyrVal: 2.433 ± 0.325
TyrTrp: 0.635 ± 0.191
TyrTyr: 0.688 ± 0.181
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 113 proteins (18907 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
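A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported. Taking the sample standard deviation as the "±" value, the function names, the fixed seed, and the toy sequences are assumptions; the note only states that 100 bootstrap rounds at the protein level were used.

```python
import random
from collections import Counter
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (e.g. "AA") in per mille over all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the reported error is the standard deviation of the replicates.
    return stdev(estimates)


# Hypothetical toy proteome, not the 113 KristaRAM proteins.
toy = ["MAAGLKAA", "MGGSTV", "MAAPLR", "MKKLLA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.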

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski