Amino acid dipepetide frequency for Arthrobacter phage Tatanka

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.242AlaAla: 8.242 ± 1.211
0.339AlaCys: 0.339 ± 0.153
3.556AlaAsp: 3.556 ± 0.523
6.153AlaGlu: 6.153 ± 0.66
3.556AlaPhe: 3.556 ± 0.466
7.113AlaGly: 7.113 ± 1.095
1.411AlaHis: 1.411 ± 0.237
5.193AlaIle: 5.193 ± 0.626
5.588AlaLys: 5.588 ± 0.985
6.774AlaLeu: 6.774 ± 0.948
2.992AlaMet: 2.992 ± 0.685
4.121AlaAsn: 4.121 ± 0.522
2.879AlaPro: 2.879 ± 0.465
2.935AlaGln: 2.935 ± 0.469
4.911AlaArg: 4.911 ± 0.668
4.742AlaSer: 4.742 ± 0.63
5.363AlaThr: 5.363 ± 0.534
6.83AlaVal: 6.83 ± 0.939
1.75AlaTrp: 1.75 ± 0.346
2.992AlaTyr: 2.992 ± 0.358
0.0AlaXaa: 0.0 ± 0.0
Cys
0.339CysAla: 0.339 ± 0.165
0.113CysCys: 0.113 ± 0.088
0.621CysAsp: 0.621 ± 0.174
0.226CysGlu: 0.226 ± 0.108
0.226CysPhe: 0.226 ± 0.137
0.508CysGly: 0.508 ± 0.216
0.226CysHis: 0.226 ± 0.12
0.452CysIle: 0.452 ± 0.157
0.226CysLys: 0.226 ± 0.15
0.339CysLeu: 0.339 ± 0.131
0.113CysMet: 0.113 ± 0.069
0.056CysAsn: 0.056 ± 0.054
0.169CysPro: 0.169 ± 0.1
0.113CysGln: 0.113 ± 0.073
0.226CysArg: 0.226 ± 0.128
0.339CysSer: 0.339 ± 0.133
0.282CysThr: 0.282 ± 0.158
0.621CysVal: 0.621 ± 0.189
0.0CysTrp: 0.0 ± 0.0
0.113CysTyr: 0.113 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
6.04AspAla: 6.04 ± 0.59
0.226AspCys: 0.226 ± 0.124
4.347AspAsp: 4.347 ± 0.665
4.177AspGlu: 4.177 ± 0.466
2.822AspPhe: 2.822 ± 0.47
4.911AspGly: 4.911 ± 0.624
0.903AspHis: 0.903 ± 0.27
4.008AspIle: 4.008 ± 0.464
3.161AspLys: 3.161 ± 0.45
4.911AspLeu: 4.911 ± 0.56
1.806AspMet: 1.806 ± 0.293
3.274AspAsn: 3.274 ± 0.538
2.427AspPro: 2.427 ± 0.399
2.427AspGln: 2.427 ± 0.437
2.822AspArg: 2.822 ± 0.549
2.597AspSer: 2.597 ± 0.434
2.54AspThr: 2.54 ± 0.354
4.121AspVal: 4.121 ± 0.574
0.96AspTrp: 0.96 ± 0.265
1.355AspTyr: 1.355 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
4.572GluAla: 4.572 ± 0.596
0.452GluCys: 0.452 ± 0.181
3.443GluAsp: 3.443 ± 0.393
4.629GluGlu: 4.629 ± 0.547
2.71GluPhe: 2.71 ± 0.454
4.29GluGly: 4.29 ± 0.486
1.976GluHis: 1.976 ± 0.324
3.951GluIle: 3.951 ± 0.584
3.726GluLys: 3.726 ± 0.512
5.306GluLeu: 5.306 ± 0.533
2.032GluMet: 2.032 ± 0.337
2.314GluAsn: 2.314 ± 0.37
2.427GluPro: 2.427 ± 0.369
2.484GluGln: 2.484 ± 0.275
3.443GluArg: 3.443 ± 0.46
3.218GluSer: 3.218 ± 0.425
3.613GluThr: 3.613 ± 0.515
5.701GluVal: 5.701 ± 0.77
1.524GluTrp: 1.524 ± 0.336
1.693GluTyr: 1.693 ± 0.429
0.0GluXaa: 0.0 ± 0.0
Phe
2.935PheAla: 2.935 ± 0.455
0.282PheCys: 0.282 ± 0.133
2.879PheAsp: 2.879 ± 0.438
2.202PheGlu: 2.202 ± 0.344
1.016PhePhe: 1.016 ± 0.247
2.597PheGly: 2.597 ± 0.407
0.395PheHis: 0.395 ± 0.16
1.75PheIle: 1.75 ± 0.4
1.976PheLys: 1.976 ± 0.332
3.274PheLeu: 3.274 ± 0.446
0.79PheMet: 0.79 ± 0.212
2.766PheAsn: 2.766 ± 0.419
1.693PhePro: 1.693 ± 0.304
1.298PheGln: 1.298 ± 0.266
1.863PheArg: 1.863 ± 0.384
2.484PheSer: 2.484 ± 0.411
2.371PheThr: 2.371 ± 0.354
2.258PheVal: 2.258 ± 0.397
0.508PheTrp: 0.508 ± 0.19
1.693PheTyr: 1.693 ± 0.385
0.0PheXaa: 0.0 ± 0.0
Gly
7.226GlyAla: 7.226 ± 0.793
0.395GlyCys: 0.395 ± 0.217
4.685GlyAsp: 4.685 ± 0.524
4.121GlyGlu: 4.121 ± 0.414
3.105GlyPhe: 3.105 ± 0.445
5.476GlyGly: 5.476 ± 0.841
1.298GlyHis: 1.298 ± 0.265
4.234GlyIle: 4.234 ± 0.641
4.685GlyLys: 4.685 ± 0.605
6.153GlyLeu: 6.153 ± 0.903
2.935GlyMet: 2.935 ± 0.642
3.218GlyAsn: 3.218 ± 0.511
1.863GlyPro: 1.863 ± 0.448
2.202GlyGln: 2.202 ± 0.33
3.105GlyArg: 3.105 ± 0.435
5.306GlySer: 5.306 ± 0.579
6.266GlyThr: 6.266 ± 0.786
6.209GlyVal: 6.209 ± 0.673
1.637GlyTrp: 1.637 ± 0.357
2.427GlyTyr: 2.427 ± 0.339
0.0GlyXaa: 0.0 ± 0.0
His
1.637HisAla: 1.637 ± 0.44
0.339HisCys: 0.339 ± 0.122
0.79HisAsp: 0.79 ± 0.263
1.016HisGlu: 1.016 ± 0.243
0.564HisPhe: 0.564 ± 0.189
1.298HisGly: 1.298 ± 0.275
0.564HisHis: 0.564 ± 0.176
1.524HisIle: 1.524 ± 0.366
1.185HisLys: 1.185 ± 0.286
1.355HisLeu: 1.355 ± 0.255
0.508HisMet: 0.508 ± 0.188
0.903HisAsn: 0.903 ± 0.281
0.96HisPro: 0.96 ± 0.257
0.734HisGln: 0.734 ± 0.194
1.185HisArg: 1.185 ± 0.325
1.298HisSer: 1.298 ± 0.344
0.79HisThr: 0.79 ± 0.226
1.411HisVal: 1.411 ± 0.282
0.169HisTrp: 0.169 ± 0.119
0.903HisTyr: 0.903 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
5.532IleAla: 5.532 ± 0.878
0.169IleCys: 0.169 ± 0.138
4.121IleAsp: 4.121 ± 0.575
3.556IleGlu: 3.556 ± 0.485
1.411IlePhe: 1.411 ± 0.296
3.5IleGly: 3.5 ± 0.771
1.185IleHis: 1.185 ± 0.288
2.71IleIle: 2.71 ± 0.354
4.064IleLys: 4.064 ± 0.575
4.911IleLeu: 4.911 ± 0.628
2.032IleMet: 2.032 ± 0.351
3.218IleAsn: 3.218 ± 0.301
3.048IlePro: 3.048 ± 0.502
2.145IleGln: 2.145 ± 0.315
1.976IleArg: 1.976 ± 0.414
3.782IleSer: 3.782 ± 0.481
3.839IleThr: 3.839 ± 0.426
4.742IleVal: 4.742 ± 0.505
1.016IleTrp: 1.016 ± 0.24
1.976IleTyr: 1.976 ± 0.332
0.0IleXaa: 0.0 ± 0.0
Lys
5.758LysAla: 5.758 ± 0.725
0.395LysCys: 0.395 ± 0.141
3.161LysAsp: 3.161 ± 0.558
4.459LysGlu: 4.459 ± 0.693
2.314LysPhe: 2.314 ± 0.355
5.08LysGly: 5.08 ± 0.574
1.468LysHis: 1.468 ± 0.352
3.669LysIle: 3.669 ± 0.621
5.25LysLys: 5.25 ± 0.884
4.685LysLeu: 4.685 ± 0.618
2.484LysMet: 2.484 ± 0.459
2.089LysAsn: 2.089 ± 0.358
2.822LysPro: 2.822 ± 0.469
2.032LysGln: 2.032 ± 0.364
3.161LysArg: 3.161 ± 0.511
2.935LysSer: 2.935 ± 0.473
3.726LysThr: 3.726 ± 0.489
3.951LysVal: 3.951 ± 0.463
0.847LysTrp: 0.847 ± 0.245
2.202LysTyr: 2.202 ± 0.407
0.0LysXaa: 0.0 ± 0.0
Leu
7.226LeuAla: 7.226 ± 0.714
0.282LeuCys: 0.282 ± 0.141
5.08LeuAsp: 5.08 ± 0.497
4.742LeuGlu: 4.742 ± 0.605
2.427LeuPhe: 2.427 ± 0.397
5.927LeuGly: 5.927 ± 1.174
1.411LeuHis: 1.411 ± 0.284
5.08LeuIle: 5.08 ± 0.565
5.137LeuLys: 5.137 ± 0.694
4.459LeuLeu: 4.459 ± 0.785
1.806LeuMet: 1.806 ± 0.268
3.895LeuAsn: 3.895 ± 0.457
3.161LeuPro: 3.161 ± 0.425
2.258LeuGln: 2.258 ± 0.309
4.459LeuArg: 4.459 ± 0.657
4.572LeuSer: 4.572 ± 0.706
5.306LeuThr: 5.306 ± 0.658
6.209LeuVal: 6.209 ± 0.727
0.847LeuTrp: 0.847 ± 0.206
2.992LeuTyr: 2.992 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
3.105MetAla: 3.105 ± 0.402
0.169MetCys: 0.169 ± 0.106
2.089MetAsp: 2.089 ± 0.345
1.016MetGlu: 1.016 ± 0.298
1.185MetPhe: 1.185 ± 0.243
1.976MetGly: 1.976 ± 0.534
0.564MetHis: 0.564 ± 0.212
1.581MetIle: 1.581 ± 0.446
1.637MetLys: 1.637 ± 0.33
2.54MetLeu: 2.54 ± 0.443
0.677MetMet: 0.677 ± 0.263
1.016MetAsn: 1.016 ± 0.22
1.129MetPro: 1.129 ± 0.232
0.903MetGln: 0.903 ± 0.349
1.355MetArg: 1.355 ± 0.328
2.371MetSer: 2.371 ± 0.386
2.371MetThr: 2.371 ± 0.419
1.75MetVal: 1.75 ± 0.293
0.113MetTrp: 0.113 ± 0.071
0.847MetTyr: 0.847 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
3.443AsnAla: 3.443 ± 0.489
0.282AsnCys: 0.282 ± 0.133
3.105AsnAsp: 3.105 ± 0.541
3.274AsnGlu: 3.274 ± 0.499
1.75AsnPhe: 1.75 ± 0.296
4.855AsnGly: 4.855 ± 0.647
0.677AsnHis: 0.677 ± 0.188
2.822AsnIle: 2.822 ± 0.565
2.71AsnLys: 2.71 ± 0.34
3.5AsnLeu: 3.5 ± 0.615
1.242AsnMet: 1.242 ± 0.307
2.935AsnAsn: 2.935 ± 0.39
2.597AsnPro: 2.597 ± 0.372
1.581AsnGln: 1.581 ± 0.318
2.314AsnArg: 2.314 ± 0.48
3.105AsnSer: 3.105 ± 0.505
3.105AsnThr: 3.105 ± 0.399
2.314AsnVal: 2.314 ± 0.339
0.903AsnTrp: 0.903 ± 0.31
1.75AsnTyr: 1.75 ± 0.347
0.0AsnXaa: 0.0 ± 0.0
Pro
3.161ProAla: 3.161 ± 0.427
0.282ProCys: 0.282 ± 0.12
2.484ProAsp: 2.484 ± 0.438
3.782ProGlu: 3.782 ± 0.549
0.96ProPhe: 0.96 ± 0.233
3.274ProGly: 3.274 ± 0.39
1.016ProHis: 1.016 ± 0.234
2.371ProIle: 2.371 ± 0.422
2.427ProLys: 2.427 ± 0.349
2.371ProLeu: 2.371 ± 0.335
1.129ProMet: 1.129 ± 0.267
1.976ProAsn: 1.976 ± 0.445
1.693ProPro: 1.693 ± 0.38
1.016ProGln: 1.016 ± 0.253
2.089ProArg: 2.089 ± 0.348
2.484ProSer: 2.484 ± 0.443
3.443ProThr: 3.443 ± 0.49
2.766ProVal: 2.766 ± 0.374
0.564ProTrp: 0.564 ± 0.185
1.073ProTyr: 1.073 ± 0.228
0.0ProXaa: 0.0 ± 0.0
Gln
3.274GlnAla: 3.274 ± 0.546
0.169GlnCys: 0.169 ± 0.118
1.75GlnAsp: 1.75 ± 0.31
2.202GlnGlu: 2.202 ± 0.495
1.016GlnPhe: 1.016 ± 0.245
2.71GlnGly: 2.71 ± 0.396
0.734GlnHis: 0.734 ± 0.196
2.427GlnIle: 2.427 ± 0.285
2.314GlnLys: 2.314 ± 0.436
3.556GlnLeu: 3.556 ± 0.453
0.79GlnMet: 0.79 ± 0.211
1.411GlnAsn: 1.411 ± 0.258
1.468GlnPro: 1.468 ± 0.261
1.976GlnGln: 1.976 ± 0.29
1.976GlnArg: 1.976 ± 0.384
1.468GlnSer: 1.468 ± 0.287
1.863GlnThr: 1.863 ± 0.369
2.427GlnVal: 2.427 ± 0.416
0.395GlnTrp: 0.395 ± 0.151
1.524GlnTyr: 1.524 ± 0.262
0.0GlnXaa: 0.0 ± 0.0
Arg
3.5ArgAla: 3.5 ± 0.476
0.169ArgCys: 0.169 ± 0.103
2.371ArgAsp: 2.371 ± 0.348
4.234ArgGlu: 4.234 ± 0.58
1.524ArgPhe: 1.524 ± 0.283
2.653ArgGly: 2.653 ± 0.38
1.016ArgHis: 1.016 ± 0.303
2.653ArgIle: 2.653 ± 0.359
3.839ArgLys: 3.839 ± 0.56
4.403ArgLeu: 4.403 ± 0.694
0.79ArgMet: 0.79 ± 0.213
3.556ArgAsn: 3.556 ± 0.472
2.089ArgPro: 2.089 ± 0.407
2.145ArgGln: 2.145 ± 0.454
3.331ArgArg: 3.331 ± 0.593
2.935ArgSer: 2.935 ± 0.433
2.879ArgThr: 2.879 ± 0.425
4.459ArgVal: 4.459 ± 0.436
0.452ArgTrp: 0.452 ± 0.188
1.75ArgTyr: 1.75 ± 0.366
0.0ArgXaa: 0.0 ± 0.0
Ser
5.927SerAla: 5.927 ± 0.726
0.056SerCys: 0.056 ± 0.063
2.879SerAsp: 2.879 ± 0.427
3.274SerGlu: 3.274 ± 0.504
2.484SerPhe: 2.484 ± 0.463
5.645SerGly: 5.645 ± 0.625
1.355SerHis: 1.355 ± 0.285
3.782SerIle: 3.782 ± 0.437
3.839SerLys: 3.839 ± 0.651
4.742SerLeu: 4.742 ± 0.506
1.919SerMet: 1.919 ± 0.366
2.71SerAsn: 2.71 ± 0.375
2.145SerPro: 2.145 ± 0.327
2.145SerGln: 2.145 ± 0.286
2.484SerArg: 2.484 ± 0.35
4.234SerSer: 4.234 ± 0.812
3.556SerThr: 3.556 ± 0.431
3.951SerVal: 3.951 ± 0.513
1.129SerTrp: 1.129 ± 0.197
2.032SerTyr: 2.032 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
5.476ThrAla: 5.476 ± 0.535
0.339ThrCys: 0.339 ± 0.165
3.726ThrAsp: 3.726 ± 0.644
3.5ThrGlu: 3.5 ± 0.534
2.879ThrPhe: 2.879 ± 0.425
5.758ThrGly: 5.758 ± 0.718
0.79ThrHis: 0.79 ± 0.302
3.782ThrIle: 3.782 ± 0.42
3.782ThrLys: 3.782 ± 0.479
5.927ThrLeu: 5.927 ± 0.553
1.298ThrMet: 1.298 ± 0.215
3.274ThrAsn: 3.274 ± 0.318
3.274ThrPro: 3.274 ± 0.458
1.863ThrGln: 1.863 ± 0.362
2.653ThrArg: 2.653 ± 0.385
2.935ThrSer: 2.935 ± 0.45
6.153ThrThr: 6.153 ± 0.822
4.685ThrVal: 4.685 ± 0.599
0.847ThrTrp: 0.847 ± 0.204
2.258ThrTyr: 2.258 ± 0.362
0.0ThrXaa: 0.0 ± 0.0
Val
6.04ValAla: 6.04 ± 0.601
0.508ValCys: 0.508 ± 0.198
4.403ValAsp: 4.403 ± 0.551
4.629ValGlu: 4.629 ± 0.629
2.653ValPhe: 2.653 ± 0.429
4.798ValGly: 4.798 ± 0.586
1.185ValHis: 1.185 ± 0.288
4.629ValIle: 4.629 ± 0.38
4.177ValLys: 4.177 ± 0.497
5.08ValLeu: 5.08 ± 0.726
2.032ValMet: 2.032 ± 0.345
3.218ValAsn: 3.218 ± 0.4
2.766ValPro: 2.766 ± 0.416
2.822ValGln: 2.822 ± 0.427
4.572ValArg: 4.572 ± 0.661
6.266ValSer: 6.266 ± 0.693
4.121ValThr: 4.121 ± 0.488
5.193ValVal: 5.193 ± 0.58
1.355ValTrp: 1.355 ± 0.288
2.992ValTyr: 2.992 ± 0.628
0.0ValXaa: 0.0 ± 0.0
Trp
1.016TrpAla: 1.016 ± 0.221
0.056TrpCys: 0.056 ± 0.052
1.581TrpAsp: 1.581 ± 0.34
0.621TrpGlu: 0.621 ± 0.175
0.734TrpPhe: 0.734 ± 0.227
1.129TrpGly: 1.129 ± 0.251
0.339TrpHis: 0.339 ± 0.122
0.621TrpIle: 0.621 ± 0.191
0.734TrpLys: 0.734 ± 0.222
1.073TrpLeu: 1.073 ± 0.207
0.564TrpMet: 0.564 ± 0.19
0.677TrpAsn: 0.677 ± 0.212
0.339TrpPro: 0.339 ± 0.114
0.847TrpGln: 0.847 ± 0.219
1.016TrpArg: 1.016 ± 0.304
0.903TrpSer: 0.903 ± 0.246
1.016TrpThr: 1.016 ± 0.189
1.185TrpVal: 1.185 ± 0.292
0.282TrpTrp: 0.282 ± 0.133
0.903TrpTyr: 0.903 ± 0.286
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.992TyrAla: 2.992 ± 0.441
0.282TyrCys: 0.282 ± 0.145
2.766TyrAsp: 2.766 ± 0.533
1.693TyrGlu: 1.693 ± 0.362
1.693TyrPhe: 1.693 ± 0.336
3.048TyrGly: 3.048 ± 0.356
0.621TyrHis: 0.621 ± 0.212
1.693TyrIle: 1.693 ± 0.283
2.032TyrLys: 2.032 ± 0.37
1.919TyrLeu: 1.919 ± 0.364
0.452TyrMet: 0.452 ± 0.188
1.637TyrAsn: 1.637 ± 0.268
1.411TyrPro: 1.411 ± 0.27
1.524TyrGln: 1.524 ± 0.325
1.806TyrArg: 1.806 ± 0.292
2.314TyrSer: 2.314 ± 0.374
2.597TyrThr: 2.597 ± 0.441
2.653TyrVal: 2.653 ± 0.355
0.339TyrTrp: 0.339 ± 0.185
1.242TyrTyr: 1.242 ± 0.299
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (17716 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski