Amino acid dipeptide frequency for Arthrobacter phage Mudcat

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
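As an illustration, the counting step can be sketched in Python as below. This is a minimal sketch, not the code used by Proteome-pI: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing in for Xaa), and the normalization by the total number of overlapping pairs are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: the normalization (total number of pairs) is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; pairs never span two different proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (made up for the example): 'U' is non-standard and becomes 'X'.
toy = ["MAAK", "MAUK"]
freqs = dipeptide_permille(toy)
```

On the toy input the pair `MA` occurs in two of the six counted pairs, so its frequency is one third of 1000‰.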

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
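The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the names `EXPECTED_PERMILLE`, `permille_to_fraction` and `classify` are hypothetical, and using exactly 1/400 as the cut-off between red and blue is an assumption, since the page does not state the threshold it applies.

```python
# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400

def permille_to_fraction(value_permille):
    """Convert a per mille value from the table into a plain fraction."""
    return value_permille * 1e-3

def classify(observed_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide relative to the (assumed) uniform expectation."""
    if observed_permille > expected:
        return "over"    # shown in red on the page
    if observed_permille < expected:
        return "under"   # shown in blue on the page
    return "as expected"

# Two values taken from the table: AlaAla = 7.786, AlaCys = 0.053 (both in per mille).
ala_ala = classify(7.786)
ala_cys = classify(0.053)
ala_ala_fraction = permille_to_fraction(7.786)
```

With these inputs AlaAla is labelled as overrepresented and AlaCys as underrepresented, and 7.786‰ corresponds to a fraction of about 0.0078.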

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.786 ± 1.41
AlaCys: 0.053 ± 0.043
AlaAsp: 5.19 ± 0.533
AlaGlu: 5.508 ± 0.59
AlaPhe: 3.549 ± 0.422
AlaGly: 4.661 ± 0.884
AlaHis: 1.748 ± 0.32
AlaIle: 5.032 ± 0.654
AlaLys: 6.356 ± 0.718
AlaLeu: 7.256 ± 0.761
AlaMet: 3.231 ± 0.583
AlaAsn: 3.549 ± 0.59
AlaPro: 2.383 ± 0.451
AlaGln: 3.231 ± 0.419
AlaArg: 3.866 ± 0.495
AlaSer: 4.396 ± 0.595
AlaThr: 4.82 ± 0.59
AlaVal: 6.091 ± 0.699
AlaTrp: 1.112 ± 0.257
AlaTyr: 2.595 ± 0.331
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.371 ± 0.164
CysCys: 0.0 ± 0.0
CysAsp: 0.583 ± 0.161
CysGlu: 0.477 ± 0.149
CysPhe: 0.053 ± 0.054
CysGly: 0.583 ± 0.25
CysHis: 0.212 ± 0.134
CysIle: 0.265 ± 0.116
CysLys: 0.371 ± 0.166
CysLeu: 0.477 ± 0.15
CysMet: 0.106 ± 0.085
CysAsn: 0.318 ± 0.132
CysPro: 0.265 ± 0.123
CysGln: 0.318 ± 0.158
CysArg: 0.159 ± 0.109
CysSer: 0.371 ± 0.134
CysThr: 0.424 ± 0.138
CysVal: 0.106 ± 0.073
CysTrp: 0.159 ± 0.101
CysTyr: 0.212 ± 0.106
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.773 ± 0.649
AspCys: 0.371 ± 0.145
AspAsp: 3.337 ± 0.563
AspGlu: 5.614 ± 0.623
AspPhe: 3.866 ± 0.518
AspGly: 4.979 ± 0.545
AspHis: 1.218 ± 0.261
AspIle: 3.707 ± 0.501
AspLys: 3.072 ± 0.409
AspLeu: 5.137 ± 0.658
AspMet: 2.013 ± 0.361
AspAsn: 2.595 ± 0.376
AspPro: 2.754 ± 0.422
AspGln: 1.801 ± 0.256
AspArg: 3.602 ± 0.526
AspSer: 3.337 ± 0.41
AspThr: 2.754 ± 0.333
AspVal: 4.131 ± 0.408
AspTrp: 0.9 ± 0.256
AspTyr: 2.013 ± 0.293
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.243 ± 0.65
GluCys: 0.371 ± 0.206
GluAsp: 4.237 ± 0.478
GluGlu: 6.62 ± 0.813
GluPhe: 2.224 ± 0.354
GluGly: 3.866 ± 0.382
GluHis: 1.43 ± 0.333
GluIle: 4.343 ± 0.432
GluLys: 4.502 ± 0.516
GluLeu: 4.926 ± 0.708
GluMet: 1.96 ± 0.289
GluAsn: 3.443 ± 0.349
GluPro: 1.748 ± 0.326
GluGln: 2.913 ± 0.358
GluArg: 4.502 ± 0.533
GluSer: 3.284 ± 0.571
GluThr: 4.343 ± 0.535
GluVal: 4.767 ± 0.657
GluTrp: 1.536 ± 0.366
GluTyr: 3.231 ± 0.464
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.383 ± 0.389
PheCys: 0.424 ± 0.181
PheAsp: 3.019 ± 0.336
PheGlu: 2.807 ± 0.381
PhePhe: 1.642 ± 0.335
PheGly: 2.913 ± 0.397
PheHis: 0.583 ± 0.134
PheIle: 2.224 ± 0.401
PheLys: 3.125 ± 0.469
PheLeu: 2.33 ± 0.385
PheMet: 0.689 ± 0.154
PheAsn: 2.489 ± 0.322
PhePro: 1.695 ± 0.401
PheGln: 1.324 ± 0.27
PheArg: 1.801 ± 0.339
PheSer: 2.383 ± 0.402
PheThr: 2.171 ± 0.358
PheVal: 2.913 ± 0.457
PheTrp: 0.741 ± 0.271
PheTyr: 1.536 ± 0.286
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.985 ± 1.018
GlyCys: 0.583 ± 0.256
GlyAsp: 4.502 ± 0.444
GlyGlu: 4.29 ± 0.394
GlyPhe: 3.072 ± 0.585
GlyGly: 3.866 ± 0.653
GlyHis: 1.112 ± 0.283
GlyIle: 4.979 ± 0.704
GlyLys: 3.231 ± 0.578
GlyLeu: 6.303 ± 0.99
GlyMet: 1.907 ± 0.451
GlyAsn: 2.701 ± 0.398
GlyPro: 1.589 ± 0.22
GlyGln: 1.854 ± 0.357
GlyArg: 3.284 ± 0.442
GlySer: 3.76 ± 0.585
GlyThr: 5.402 ± 0.657
GlyVal: 5.985 ± 0.621
GlyTrp: 1.112 ± 0.214
GlyTyr: 2.807 ± 0.436
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.218 ± 0.307
HisCys: 0.053 ± 0.043
HisAsp: 0.953 ± 0.197
HisGlu: 1.059 ± 0.242
HisPhe: 0.477 ± 0.163
HisGly: 1.271 ± 0.249
HisHis: 0.53 ± 0.153
HisIle: 1.642 ± 0.318
HisLys: 1.165 ± 0.237
HisLeu: 1.695 ± 0.288
HisMet: 0.371 ± 0.162
HisAsn: 0.794 ± 0.253
HisPro: 0.953 ± 0.247
HisGln: 0.583 ± 0.195
HisArg: 1.006 ± 0.278
HisSer: 0.9 ± 0.232
HisThr: 0.953 ± 0.232
HisVal: 1.43 ± 0.378
HisTrp: 0.477 ± 0.154
HisTyr: 1.271 ± 0.281
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.402 ± 0.742
IleCys: 0.212 ± 0.108
IleAsp: 4.926 ± 0.443
IleGlu: 4.396 ± 0.616
IlePhe: 1.536 ± 0.294
IleGly: 4.926 ± 0.953
IleHis: 1.218 ± 0.288
IleIle: 3.602 ± 0.524
IleLys: 3.866 ± 0.527
IleLeu: 4.873 ± 0.546
IleMet: 1.642 ± 0.314
IleAsn: 2.754 ± 0.361
IlePro: 2.966 ± 0.416
IleGln: 1.907 ± 0.309
IleArg: 3.337 ± 0.408
IleSer: 3.284 ± 0.503
IleThr: 3.972 ± 0.444
IleVal: 4.767 ± 0.444
IleTrp: 0.847 ± 0.229
IleTyr: 1.96 ± 0.3
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.296 ± 0.562
LysCys: 0.424 ± 0.163
LysAsp: 3.284 ± 0.456
LysGlu: 4.555 ± 0.55
LysPhe: 2.701 ± 0.447
LysGly: 3.284 ± 0.585
LysHis: 1.695 ± 0.403
LysIle: 3.919 ± 0.428
LysLys: 4.29 ± 0.645
LysLeu: 5.932 ± 0.678
LysMet: 2.013 ± 0.264
LysAsn: 3.072 ± 0.491
LysPro: 2.224 ± 0.382
LysGln: 2.542 ± 0.417
LysArg: 3.919 ± 0.569
LysSer: 3.654 ± 0.396
LysThr: 3.707 ± 0.411
LysVal: 4.873 ± 0.537
LysTrp: 1.112 ± 0.247
LysTyr: 2.542 ± 0.512
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.362 ± 0.736
LeuCys: 0.371 ± 0.131
LeuAsp: 5.19 ± 0.459
LeuGlu: 5.826 ± 0.577
LeuPhe: 3.019 ± 0.676
LeuGly: 5.985 ± 0.822
LeuHis: 1.536 ± 0.289
LeuIle: 5.137 ± 0.63
LeuLys: 5.243 ± 0.58
LeuLeu: 6.25 ± 0.723
LeuMet: 1.907 ± 0.269
LeuAsn: 3.813 ± 0.374
LeuPro: 3.337 ± 0.519
LeuGln: 1.642 ± 0.321
LeuArg: 4.025 ± 0.568
LeuSer: 5.773 ± 0.519
LeuThr: 4.82 ± 0.628
LeuVal: 5.402 ± 0.656
LeuTrp: 1.324 ± 0.296
LeuTyr: 2.542 ± 0.426
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.648 ± 0.465
MetCys: 0.212 ± 0.11
MetAsp: 1.801 ± 0.287
MetGlu: 1.589 ± 0.371
MetPhe: 1.271 ± 0.245
MetGly: 1.96 ± 0.383
MetHis: 0.371 ± 0.107
MetIle: 1.324 ± 0.276
MetLys: 1.96 ± 0.399
MetLeu: 1.854 ± 0.353
MetMet: 0.477 ± 0.149
MetAsn: 1.483 ± 0.277
MetPro: 0.953 ± 0.197
MetGln: 0.953 ± 0.207
MetArg: 1.324 ± 0.248
MetSer: 2.595 ± 0.418
MetThr: 2.119 ± 0.312
MetVal: 1.43 ± 0.298
MetTrp: 0.159 ± 0.092
MetTyr: 0.794 ± 0.205
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.29 ± 0.427
AsnCys: 0.212 ± 0.133
AsnAsp: 2.489 ± 0.434
AsnGlu: 2.542 ± 0.434
AsnPhe: 1.854 ± 0.275
AsnGly: 3.654 ± 0.384
AsnHis: 0.741 ± 0.191
AsnIle: 3.019 ± 0.364
AsnLys: 2.277 ± 0.266
AsnLeu: 4.661 ± 0.429
AsnMet: 1.642 ± 0.3
AsnAsn: 2.066 ± 0.393
AsnPro: 2.966 ± 0.474
AsnGln: 1.96 ± 0.293
AsnArg: 2.171 ± 0.399
AsnSer: 2.966 ± 0.364
AsnThr: 2.754 ± 0.36
AsnVal: 2.913 ± 0.339
AsnTrp: 0.53 ± 0.229
AsnTyr: 1.96 ± 0.32
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.489 ± 0.298
ProCys: 0.212 ± 0.103
ProAsp: 2.489 ± 0.379
ProGlu: 3.707 ± 0.549
ProPhe: 1.642 ± 0.267
ProGly: 2.754 ± 0.336
ProHis: 0.636 ± 0.207
ProIle: 2.066 ± 0.287
ProLys: 2.86 ± 0.382
ProLeu: 2.224 ± 0.305
ProMet: 0.794 ± 0.187
ProAsn: 1.907 ± 0.286
ProPro: 1.854 ± 0.312
ProGln: 1.059 ± 0.245
ProArg: 1.801 ± 0.32
ProSer: 3.072 ± 0.638
ProThr: 2.754 ± 0.422
ProVal: 2.807 ± 0.374
ProTrp: 0.477 ± 0.175
ProTyr: 1.218 ± 0.339
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.072 ± 0.52
GlnCys: 0.318 ± 0.172
GlnAsp: 1.642 ± 0.325
GlnGlu: 1.748 ± 0.339
GlnPhe: 1.324 ± 0.283
GlnGly: 2.436 ± 0.398
GlnHis: 0.477 ± 0.141
GlnIle: 2.436 ± 0.31
GlnLys: 2.33 ± 0.437
GlnLeu: 3.443 ± 0.396
GlnMet: 0.9 ± 0.2
GlnAsn: 1.377 ± 0.26
GlnPro: 1.218 ± 0.248
GlnGln: 1.483 ± 0.368
GlnArg: 1.483 ± 0.283
GlnSer: 1.43 ± 0.264
GlnThr: 1.748 ± 0.287
GlnVal: 1.854 ± 0.28
GlnTrp: 0.477 ± 0.153
GlnTyr: 1.218 ± 0.259
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.966 ± 0.47
ArgCys: 0.371 ± 0.193
ArgAsp: 3.072 ± 0.325
ArgGlu: 3.39 ± 0.475
ArgPhe: 1.589 ± 0.26
ArgGly: 2.542 ± 0.343
ArgHis: 1.43 ± 0.365
ArgIle: 3.284 ± 0.432
ArgLys: 4.714 ± 0.808
ArgLeu: 4.131 ± 0.428
ArgMet: 1.218 ± 0.255
ArgAsn: 2.701 ± 0.4
ArgPro: 1.96 ± 0.292
ArgGln: 1.748 ± 0.363
ArgArg: 4.025 ± 0.533
ArgSer: 3.813 ± 0.583
ArgThr: 3.284 ± 0.449
ArgVal: 3.602 ± 0.455
ArgTrp: 0.477 ± 0.168
ArgTyr: 1.801 ± 0.296
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.078 ± 0.504
SerCys: 0.212 ± 0.109
SerAsp: 3.813 ± 0.538
SerGlu: 4.237 ± 0.471
SerPhe: 2.33 ± 0.423
SerGly: 5.137 ± 0.636
SerHis: 1.059 ± 0.245
SerIle: 3.972 ± 0.481
SerLys: 4.078 ± 0.377
SerLeu: 4.502 ± 0.438
SerMet: 1.748 ± 0.24
SerAsn: 3.602 ± 0.469
SerPro: 2.33 ± 0.307
SerGln: 1.801 ± 0.323
SerArg: 2.754 ± 0.409
SerSer: 4.396 ± 0.761
SerThr: 3.813 ± 0.694
SerVal: 3.39 ± 0.344
SerTrp: 1.377 ± 0.242
SerTyr: 2.33 ± 0.309
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.826 ± 0.987
ThrCys: 0.318 ± 0.154
ThrAsp: 3.972 ± 0.451
ThrGlu: 4.078 ± 0.616
ThrPhe: 2.489 ± 0.388
ThrGly: 4.29 ± 0.458
ThrHis: 1.271 ± 0.294
ThrIle: 4.025 ± 0.478
ThrLys: 3.496 ± 0.443
ThrLeu: 5.243 ± 0.721
ThrMet: 1.059 ± 0.301
ThrAsn: 3.019 ± 0.474
ThrPro: 3.707 ± 0.512
ThrGln: 2.013 ± 0.36
ThrArg: 2.648 ± 0.422
ThrSer: 3.654 ± 0.474
ThrThr: 4.714 ± 0.702
ThrVal: 4.078 ± 0.452
ThrTrp: 1.006 ± 0.251
ThrTyr: 2.701 ± 0.399
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.879 ± 0.67
ValCys: 0.477 ± 0.238
ValAsp: 4.608 ± 0.417
ValGlu: 4.025 ± 0.572
ValPhe: 2.33 ± 0.44
ValGly: 5.296 ± 0.63
ValHis: 0.794 ± 0.257
ValIle: 4.608 ± 0.515
ValLys: 4.449 ± 0.514
ValLeu: 5.455 ± 0.537
ValMet: 2.277 ± 0.298
ValAsn: 3.284 ± 0.345
ValPro: 2.33 ± 0.405
ValGln: 1.854 ± 0.271
ValArg: 3.178 ± 0.461
ValSer: 4.343 ± 0.449
ValThr: 5.508 ± 0.758
ValVal: 5.137 ± 0.619
ValTrp: 0.741 ± 0.239
ValTyr: 2.807 ± 0.372
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.059 ± 0.318
TrpCys: 0.212 ± 0.115
TrpAsp: 0.953 ± 0.285
TrpGlu: 0.9 ± 0.206
TrpPhe: 0.477 ± 0.167
TrpGly: 0.847 ± 0.22
TrpHis: 0.265 ± 0.117
TrpIle: 0.583 ± 0.225
TrpLys: 1.218 ± 0.23
TrpLeu: 1.165 ± 0.251
TrpMet: 0.265 ± 0.111
TrpAsn: 0.953 ± 0.272
TrpPro: 0.53 ± 0.167
TrpGln: 0.424 ± 0.135
TrpArg: 1.165 ± 0.337
TrpSer: 0.953 ± 0.206
TrpThr: 1.165 ± 0.268
TrpVal: 1.218 ± 0.288
TrpTrp: 0.212 ± 0.11
TrpTyr: 0.583 ± 0.174
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.284 ± 0.357
TyrCys: 0.424 ± 0.199
TyrAsp: 2.913 ± 0.571
TyrGlu: 2.119 ± 0.361
TyrPhe: 1.748 ± 0.358
TyrGly: 3.072 ± 0.46
TyrHis: 0.53 ± 0.167
TyrIle: 2.224 ± 0.422
TyrLys: 2.436 ± 0.401
TyrLeu: 2.648 ± 0.426
TyrMet: 1.006 ± 0.243
TyrAsn: 1.748 ± 0.294
TyrPro: 1.059 ± 0.227
TyrGln: 1.006 ± 0.223
TyrArg: 2.066 ± 0.34
TyrSer: 2.595 ± 0.37
TyrThr: 2.383 ± 0.335
TyrVal: 2.489 ± 0.365
TyrTrp: 0.371 ± 0.154
TyrTyr: 1.536 ± 0.384
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (18882 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
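A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration rather than the database's actual procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the spread of the replicates is reported. The helper names, the use of the population standard deviation as the error, and the fixed random seed are assumptions made for the example.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is taken as the standard deviation of the replicates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

# Toy proteome in one-letter code (made up for the example).
toy = ["MAAKL", "MKKLA", "AAAAA", "MLLKA"]
err_aa = bootstrap_error(toy, "AA")
```

Resampling whole proteins, rather than individual residues, keeps the pairs inside each protein intact, so the estimate reflects variation between proteins.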

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski