Amino acid dipeptide frequency for Microbacterium phage GaeCeo

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
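A minimal Python sketch of such a calculation is given here for illustration. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that frequencies are normalised by the total number of pairs; the exact counting and normalisation used by the database may differ. The function name and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, taken within one protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalising by the number of pairs is an assumption of this sketch.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome: pairs are MA, AA, AL and MA, AG (5 in total).
toy_proteome = ["MAAL", "MAG"]
freqs = dipeptide_per_mille(toy_proteome)
print(freqs["MA"])  # 2 of 5 pairs
```

With real data, the input would be the protein sequences of the proteome, for example read from a FASTA file.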

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
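The comparison against the uniform expectation can be sketched in Python as follows. The threshold of 1000/400 = 2.5‰ follows from the text above; that the table's colouring uses exactly this cut-off, with no tolerance band, is an assumption of this sketch. The function name is hypothetical, and the four observed values are taken from the table on this page.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # would be marked red
    if freq_per_mille < expected:
        return "under"   # would be marked blue
    return "expected"


# Values copied from the table (per mille).
observed = {"AlaAla": 9.505, "CysCys": 0.0, "AspPhe": 2.493, "LeuLeu": 7.869}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```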

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
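As a small worked example of this conversion, the Python snippet below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille * 0.1


ala_ala = 9.505  # AlaAla frequency from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{ala_ala} per mille = {fraction:.6f} = {percent:.4f} %")
```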

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± bootstrap error), grouped by first residue. Second residues are listed in the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.505 ± 1.213
AlaCys: 0.545 ± 0.226
AlaAsp: 5.921 ± 0.548
AlaGlu: 6.389 ± 0.605
AlaPhe: 2.883 ± 0.565
AlaGly: 8.259 ± 1.128
AlaHis: 2.337 ± 0.453
AlaIle: 4.207 ± 0.853
AlaLys: 6.155 ± 0.997
AlaLeu: 10.129 ± 1.339
AlaMet: 2.961 ± 0.512
AlaAsn: 2.571 ± 0.576
AlaPro: 3.662 ± 0.611
AlaGln: 3.818 ± 0.597
AlaArg: 6.155 ± 0.795
AlaSer: 5.142 ± 0.914
AlaThr: 7.168 ± 0.731
AlaVal: 8.492 ± 0.783
AlaTrp: 1.714 ± 0.364
AlaTyr: 2.571 ± 0.545
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.545 ± 0.191
CysCys: 0.0 ± 0.0
CysAsp: 0.39 ± 0.157
CysGlu: 0.156 ± 0.11
CysPhe: 0.234 ± 0.147
CysGly: 0.779 ± 0.293
CysHis: 0.078 ± 0.083
CysIle: 0.156 ± 0.125
CysLys: 0.467 ± 0.159
CysLeu: 0.545 ± 0.332
CysMet: 0.0 ± 0.0
CysAsn: 0.234 ± 0.132
CysPro: 0.467 ± 0.195
CysGln: 0.156 ± 0.108
CysArg: 0.312 ± 0.141
CysSer: 0.312 ± 0.144
CysThr: 0.701 ± 0.251
CysVal: 0.39 ± 0.175
CysTrp: 0.234 ± 0.145
CysTyr: 0.234 ± 0.135
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.155 ± 0.758
AspCys: 0.701 ± 0.176
AspAsp: 5.376 ± 1.035
AspGlu: 5.22 ± 1.518
AspPhe: 2.493 ± 0.388
AspGly: 3.662 ± 0.548
AspHis: 1.87 ± 0.322
AspIle: 2.805 ± 0.502
AspLys: 2.727 ± 0.342
AspLeu: 6.077 ± 0.624
AspMet: 1.87 ± 0.404
AspAsn: 1.948 ± 0.444
AspPro: 4.519 ± 0.598
AspGln: 2.961 ± 0.639
AspArg: 3.272 ± 0.538
AspSer: 2.727 ± 0.486
AspThr: 3.35 ± 0.422
AspVal: 4.207 ± 0.554
AspTrp: 1.48 ± 0.347
AspTyr: 2.727 ± 0.493
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.025 ± 1.115
GluCys: 0.39 ± 0.232
GluAsp: 6.233 ± 1.282
GluGlu: 5.064 ± 0.989
GluPhe: 1.87 ± 0.421
GluGly: 4.597 ± 0.673
GluHis: 1.48 ± 0.425
GluIle: 2.337 ± 0.322
GluLys: 2.337 ± 0.439
GluLeu: 5.921 ± 0.775
GluMet: 1.87 ± 0.446
GluAsn: 2.104 ± 0.415
GluPro: 2.415 ± 0.536
GluGln: 2.493 ± 0.479
GluArg: 4.051 ± 0.639
GluSer: 2.727 ± 0.471
GluThr: 3.896 ± 0.566
GluVal: 4.597 ± 0.491
GluTrp: 1.402 ± 0.366
GluTyr: 1.48 ± 0.439
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.415 ± 0.496
PheCys: 0.078 ± 0.064
PheAsp: 2.182 ± 0.378
PheGlu: 1.948 ± 0.371
PhePhe: 1.169 ± 0.305
PheGly: 2.883 ± 0.531
PheHis: 0.857 ± 0.29
PheIle: 1.013 ± 0.284
PheLys: 1.792 ± 0.346
PheLeu: 2.259 ± 0.488
PheMet: 0.701 ± 0.264
PheAsn: 1.558 ± 0.294
PhePro: 1.169 ± 0.336
PheGln: 1.48 ± 0.285
PheArg: 2.415 ± 0.43
PheSer: 2.182 ± 0.391
PheThr: 1.714 ± 0.336
PheVal: 1.87 ± 0.397
PheTrp: 0.467 ± 0.233
PheTyr: 0.623 ± 0.284
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.843 ± 1.05
GlyCys: 0.701 ± 0.24
GlyAsp: 5.532 ± 0.513
GlyGlu: 4.363 ± 0.708
GlyPhe: 3.584 ± 0.588
GlyGly: 5.921 ± 1.134
GlyHis: 1.247 ± 0.28
GlyIle: 4.986 ± 0.828
GlyLys: 4.519 ± 0.651
GlyLeu: 6.233 ± 0.97
GlyMet: 2.026 ± 0.365
GlyAsn: 2.805 ± 0.454
GlyPro: 2.493 ± 0.424
GlyGln: 4.675 ± 0.784
GlyArg: 4.908 ± 0.592
GlySer: 4.753 ± 0.761
GlyThr: 6.467 ± 0.94
GlyVal: 5.999 ± 0.639
GlyTrp: 1.48 ± 0.298
GlyTyr: 1.87 ± 0.371
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.558 ± 0.344
HisCys: 0.0 ± 0.0
HisAsp: 1.169 ± 0.327
HisGlu: 1.091 ± 0.345
HisPhe: 0.623 ± 0.178
HisGly: 2.182 ± 0.406
HisHis: 0.467 ± 0.193
HisIle: 1.48 ± 0.367
HisLys: 1.013 ± 0.261
HisLeu: 1.558 ± 0.336
HisMet: 0.312 ± 0.155
HisAsn: 0.779 ± 0.204
HisPro: 1.169 ± 0.317
HisGln: 0.467 ± 0.163
HisArg: 0.701 ± 0.201
HisSer: 0.701 ± 0.254
HisThr: 1.013 ± 0.338
HisVal: 1.402 ± 0.331
HisTrp: 0.545 ± 0.186
HisTyr: 0.701 ± 0.213
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.519 ± 0.474
IleCys: 0.312 ± 0.144
IleAsp: 4.597 ± 0.479
IleGlu: 3.272 ± 0.509
IlePhe: 0.935 ± 0.312
IleGly: 3.662 ± 1.182
IleHis: 0.701 ± 0.219
IleIle: 2.727 ± 0.735
IleLys: 1.714 ± 0.419
IleLeu: 2.883 ± 0.522
IleMet: 1.247 ± 0.339
IleAsn: 1.792 ± 0.324
IlePro: 2.104 ± 0.532
IleGln: 1.792 ± 0.435
IleArg: 2.337 ± 0.43
IleSer: 2.493 ± 0.479
IleThr: 4.207 ± 0.95
IleVal: 2.961 ± 0.688
IleTrp: 0.857 ± 0.267
IleTyr: 1.48 ± 0.259
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.454 ± 0.796
LysCys: 0.078 ± 0.072
LysAsp: 2.727 ± 0.392
LysGlu: 2.961 ± 0.64
LysPhe: 1.402 ± 0.288
LysGly: 4.207 ± 0.644
LysHis: 0.545 ± 0.169
LysIle: 2.104 ± 0.363
LysLys: 3.428 ± 0.733
LysLeu: 4.207 ± 0.662
LysMet: 1.169 ± 0.29
LysAsn: 1.792 ± 0.334
LysPro: 3.039 ± 0.491
LysGln: 1.636 ± 0.402
LysArg: 2.805 ± 0.522
LysSer: 2.493 ± 0.46
LysThr: 3.584 ± 0.57
LysVal: 4.051 ± 0.562
LysTrp: 0.857 ± 0.287
LysTyr: 0.545 ± 0.208
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.194 ± 0.811
LeuCys: 0.623 ± 0.226
LeuAsp: 5.22 ± 0.621
LeuGlu: 5.064 ± 0.669
LeuPhe: 2.649 ± 0.383
LeuGly: 6.467 ± 0.687
LeuHis: 1.87 ± 0.371
LeuIle: 4.908 ± 1.098
LeuLys: 3.896 ± 0.562
LeuLeu: 7.869 ± 0.824
LeuMet: 1.636 ± 0.272
LeuAsn: 2.961 ± 0.549
LeuPro: 4.831 ± 0.648
LeuGln: 2.493 ± 0.439
LeuArg: 4.986 ± 0.729
LeuSer: 5.999 ± 0.588
LeuThr: 5.454 ± 0.534
LeuVal: 6.7 ± 0.827
LeuTrp: 1.325 ± 0.274
LeuTyr: 1.87 ± 0.372
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.974 ± 0.41
MetCys: 0.156 ± 0.118
MetAsp: 1.402 ± 0.395
MetGlu: 1.247 ± 0.242
MetPhe: 0.545 ± 0.168
MetGly: 1.87 ± 0.354
MetHis: 0.312 ± 0.137
MetIle: 0.779 ± 0.28
MetLys: 0.39 ± 0.182
MetLeu: 2.493 ± 0.458
MetMet: 0.467 ± 0.151
MetAsn: 1.013 ± 0.257
MetPro: 0.935 ± 0.287
MetGln: 1.325 ± 0.302
MetArg: 1.402 ± 0.295
MetSer: 2.337 ± 0.42
MetThr: 2.104 ± 0.353
MetVal: 1.169 ± 0.24
MetTrp: 0.312 ± 0.15
MetTyr: 0.467 ± 0.25
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.896 ± 0.521
AsnCys: 0.0 ± 0.0
AsnAsp: 2.104 ± 0.457
AsnGlu: 2.259 ± 0.352
AsnPhe: 0.39 ± 0.151
AsnGly: 3.506 ± 0.463
AsnHis: 0.39 ± 0.148
AsnIle: 1.714 ± 0.438
AsnLys: 1.792 ± 0.404
AsnLeu: 2.805 ± 0.491
AsnMet: 0.779 ± 0.253
AsnAsn: 1.091 ± 0.284
AsnPro: 2.104 ± 0.413
AsnGln: 1.48 ± 0.347
AsnArg: 1.325 ± 0.313
AsnSer: 2.259 ± 0.403
AsnThr: 1.714 ± 0.369
AsnVal: 2.259 ± 0.332
AsnTrp: 0.545 ± 0.232
AsnTyr: 1.325 ± 0.362
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.22 ± 0.815
ProCys: 0.156 ± 0.103
ProAsp: 2.571 ± 0.432
ProGlu: 3.428 ± 0.579
ProPhe: 1.091 ± 0.268
ProGly: 3.896 ± 0.768
ProHis: 0.467 ± 0.175
ProIle: 1.714 ± 0.294
ProLys: 1.792 ± 0.347
ProLeu: 3.74 ± 0.501
ProMet: 0.545 ± 0.19
ProAsn: 1.792 ± 0.477
ProPro: 1.247 ± 0.342
ProGln: 1.792 ± 0.36
ProArg: 2.727 ± 0.6
ProSer: 2.961 ± 0.559
ProThr: 3.74 ± 0.543
ProVal: 3.74 ± 0.587
ProTrp: 1.169 ± 0.258
ProTyr: 1.247 ± 0.307
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.986 ± 0.535
GlnCys: 0.156 ± 0.094
GlnAsp: 2.104 ± 0.39
GlnGlu: 4.051 ± 0.653
GlnPhe: 0.467 ± 0.171
GlnGly: 2.883 ± 0.577
GlnHis: 0.779 ± 0.241
GlnIle: 1.636 ± 0.303
GlnLys: 2.259 ± 0.443
GlnLeu: 2.961 ± 0.441
GlnMet: 1.169 ± 0.296
GlnAsn: 1.792 ± 0.493
GlnPro: 1.48 ± 0.34
GlnGln: 1.247 ± 0.312
GlnArg: 2.026 ± 0.353
GlnSer: 2.026 ± 0.372
GlnThr: 1.948 ± 0.331
GlnVal: 2.961 ± 0.445
GlnTrp: 1.091 ± 0.305
GlnTyr: 1.636 ± 0.31
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.22 ± 0.635
ArgCys: 0.701 ± 0.29
ArgAsp: 3.506 ± 0.568
ArgGlu: 3.896 ± 0.552
ArgPhe: 2.259 ± 0.415
ArgGly: 3.506 ± 0.503
ArgHis: 1.013 ± 0.33
ArgIle: 2.649 ± 0.399
ArgLys: 3.35 ± 0.555
ArgLeu: 5.843 ± 0.875
ArgMet: 2.182 ± 0.4
ArgAsn: 1.948 ± 0.294
ArgPro: 2.026 ± 0.377
ArgGln: 2.026 ± 0.373
ArgArg: 3.039 ± 0.482
ArgSer: 3.35 ± 0.444
ArgThr: 3.116 ± 0.472
ArgVal: 4.908 ± 0.689
ArgTrp: 1.247 ± 0.315
ArgTyr: 1.48 ± 0.37
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.298 ± 0.62
SerCys: 0.312 ± 0.166
SerAsp: 3.662 ± 0.521
SerGlu: 3.194 ± 0.458
SerPhe: 2.415 ± 0.363
SerGly: 5.843 ± 1.059
SerHis: 1.013 ± 0.279
SerIle: 2.571 ± 0.524
SerLys: 3.428 ± 0.48
SerLeu: 4.908 ± 0.596
SerMet: 1.325 ± 0.375
SerAsn: 1.714 ± 0.357
SerPro: 2.259 ± 0.36
SerGln: 2.337 ± 0.344
SerArg: 2.883 ± 0.54
SerSer: 4.051 ± 0.586
SerThr: 4.129 ± 0.591
SerVal: 4.441 ± 0.616
SerTrp: 0.935 ± 0.257
SerTyr: 2.337 ± 0.445
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.155 ± 0.928
ThrCys: 0.39 ± 0.181
ThrAsp: 3.818 ± 0.541
ThrGlu: 3.116 ± 0.471
ThrPhe: 2.805 ± 0.41
ThrGly: 6.623 ± 0.698
ThrHis: 0.779 ± 0.234
ThrIle: 3.74 ± 0.514
ThrLys: 2.415 ± 0.518
ThrLeu: 5.298 ± 0.794
ThrMet: 1.792 ± 0.292
ThrAsn: 1.558 ± 0.281
ThrPro: 3.428 ± 0.43
ThrGln: 2.415 ± 0.594
ThrArg: 4.207 ± 0.566
ThrSer: 4.285 ± 0.447
ThrThr: 4.519 ± 0.565
ThrVal: 5.765 ± 0.608
ThrTrp: 0.935 ± 0.225
ThrTyr: 2.571 ± 0.379
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.414 ± 0.721
ValCys: 0.312 ± 0.164
ValAsp: 4.363 ± 0.629
ValGlu: 5.22 ± 0.687
ValPhe: 1.558 ± 0.365
ValGly: 5.532 ± 0.83
ValHis: 1.792 ± 0.417
ValIle: 3.116 ± 0.587
ValLys: 3.194 ± 0.65
ValLeu: 5.843 ± 0.698
ValMet: 1.87 ± 0.403
ValAsn: 2.493 ± 0.372
ValPro: 3.194 ± 0.514
ValGln: 4.051 ± 0.404
ValArg: 4.986 ± 0.661
ValSer: 4.597 ± 0.472
ValThr: 4.831 ± 0.525
ValVal: 5.376 ± 0.776
ValTrp: 1.87 ± 0.354
ValTyr: 2.337 ± 0.468
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.935 ± 0.296
TrpCys: 0.156 ± 0.115
TrpAsp: 1.402 ± 0.37
TrpGlu: 1.402 ± 0.38
TrpPhe: 0.857 ± 0.34
TrpGly: 1.169 ± 0.383
TrpHis: 0.623 ± 0.228
TrpIle: 0.857 ± 0.25
TrpLys: 1.091 ± 0.244
TrpLeu: 2.259 ± 0.375
TrpMet: 0.156 ± 0.095
TrpAsn: 0.701 ± 0.227
TrpPro: 0.857 ± 0.266
TrpGln: 0.545 ± 0.197
TrpArg: 0.779 ± 0.2
TrpSer: 1.247 ± 0.331
TrpThr: 1.558 ± 0.36
TrpVal: 1.948 ± 0.335
TrpTrp: 0.701 ± 0.303
TrpTyr: 0.623 ± 0.22
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.272 ± 0.452
TyrCys: 0.623 ± 0.207
TyrAsp: 1.87 ± 0.285
TyrGlu: 1.948 ± 0.486
TyrPhe: 0.545 ± 0.175
TyrGly: 2.961 ± 0.425
TyrHis: 0.312 ± 0.173
TyrIle: 1.091 ± 0.269
TyrLys: 1.325 ± 0.333
TyrLeu: 2.259 ± 0.376
TyrMet: 0.545 ± 0.181
TyrAsn: 1.091 ± 0.304
TyrPro: 1.636 ± 0.515
TyrGln: 0.467 ± 0.158
TyrArg: 1.948 ± 0.42
TyrSer: 2.415 ± 0.46
TyrThr: 1.402 ± 0.309
TyrVal: 1.714 ± 0.379
TyrTrp: 0.701 ± 0.215
TyrTyr: 0.779 ± 0.304
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (12836 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
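A protein-level bootstrap of this kind can be sketched in Python as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the standard deviation over replicates. The function names, the fixed seed, and the toy sequences (in one-letter code) are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome.
toy_proteome = ["MAAL", "MAG", "MKKLA", "MAAAV"]
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {pair_per_mille(toy_proteome, 'AA'):.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.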

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski