Amino acid dipeptide frequency for Xanthomonas phage Suba

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each of the 400 would be present at a frequency of 1/400, i.e. 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
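The calculation described above can be sketched in Python. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter input format, the mapping of every non-standard letter to `X` (Xaa), and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption)

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille = 0.25 %
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(residues, residues[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_UNIFORM:
        return "over"
    if freq_per_mille < EXPECTED_UNIFORM:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
freqs = dipeptide_per_mille(["MAAK", "AAG"])
print(freqs["AA"], classify(freqs["AA"]))
```

With the table values, `classify(12.753)` labels AlaAla as overrepresented and `classify(0.845)` labels AlaCys as underrepresented, which matches the red/blue marking rule described above.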

All values are given in per mille (‰); multiply by 10⁻³ to obtain fractions.
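As a small worked example of the unit conversion, the sketch below converts a per mille value to a fraction and to a percentage. The helper names are hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


ala_ala = 12.753  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # same value in percent
```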

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.753 ± 3.235
AlaCys: 0.845 ± 0.31
AlaAsp: 6.081 ± 0.73
AlaGlu: 9.122 ± 0.987
AlaPhe: 3.885 ± 0.503
AlaGly: 7.01 ± 0.866
AlaHis: 1.351 ± 0.356
AlaIle: 6.588 ± 0.819
AlaLys: 8.024 ± 1.071
AlaLeu: 7.01 ± 0.939
AlaMet: 2.956 ± 0.461
AlaAsn: 5.068 ± 0.465
AlaPro: 3.801 ± 0.723
AlaGln: 3.716 ± 1.054
AlaArg: 5.236 ± 0.797
AlaSer: 5.405 ± 0.712
AlaThr: 6.25 ± 1.137
AlaVal: 5.912 ± 0.598
AlaTrp: 1.098 ± 0.289
AlaTyr: 2.449 ± 0.366
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.676 ± 0.2
CysCys: 0.422 ± 0.193
CysAsp: 0.591 ± 0.243
CysGlu: 0.845 ± 0.314
CysPhe: 0.253 ± 0.131
CysGly: 1.014 ± 0.346
CysHis: 0.253 ± 0.14
CysIle: 0.422 ± 0.159
CysLys: 0.591 ± 0.22
CysLeu: 0.845 ± 0.291
CysMet: 0.084 ± 0.089
CysAsn: 0.422 ± 0.233
CysPro: 0.591 ± 0.234
CysGln: 0.253 ± 0.171
CysArg: 0.929 ± 0.312
CysSer: 0.169 ± 0.123
CysThr: 0.422 ± 0.204
CysVal: 1.351 ± 0.316
CysTrp: 0.084 ± 0.083
CysTyr: 0.338 ± 0.183
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.419 ± 0.84
AspCys: 0.422 ± 0.207
AspAsp: 3.041 ± 0.557
AspGlu: 3.97 ± 0.599
AspPhe: 2.027 ± 0.519
AspGly: 5.152 ± 0.746
AspHis: 0.253 ± 0.189
AspIle: 2.365 ± 0.44
AspLys: 3.885 ± 0.617
AspLeu: 4.645 ± 0.584
AspMet: 0.676 ± 0.268
AspAsn: 2.872 ± 0.492
AspPro: 3.716 ± 0.482
AspGln: 1.014 ± 0.368
AspArg: 2.703 ± 0.528
AspSer: 3.294 ± 0.458
AspThr: 2.196 ± 0.473
AspVal: 4.054 ± 0.542
AspTrp: 0.76 ± 0.285
AspTyr: 1.52 ± 0.337
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.601 ± 0.853
GluCys: 0.676 ± 0.211
GluAsp: 3.125 ± 0.555
GluGlu: 5.068 ± 0.603
GluPhe: 3.716 ± 0.484
GluGly: 4.307 ± 0.542
GluHis: 1.098 ± 0.267
GluIle: 5.321 ± 0.564
GluLys: 4.983 ± 0.711
GluLeu: 6.503 ± 0.792
GluMet: 1.943 ± 0.532
GluAsn: 3.294 ± 0.407
GluPro: 1.689 ± 0.425
GluGln: 2.618 ± 0.667
GluArg: 4.476 ± 0.595
GluSer: 3.378 ± 0.527
GluThr: 3.463 ± 0.5
GluVal: 5.405 ± 0.632
GluTrp: 1.014 ± 0.244
GluTyr: 2.28 ± 0.31
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.956 ± 0.611
PheCys: 0.507 ± 0.174
PheAsp: 3.97 ± 0.556
PheGlu: 2.956 ± 0.671
PhePhe: 1.689 ± 0.412
PheGly: 3.632 ± 0.536
PheHis: 0.76 ± 0.342
PheIle: 1.943 ± 0.404
PheLys: 2.365 ± 0.449
PheLeu: 2.365 ± 0.454
PheMet: 1.182 ± 0.304
PheAsn: 2.787 ± 0.447
PhePro: 1.943 ± 0.411
PheGln: 0.676 ± 0.217
PheArg: 2.027 ± 0.383
PheSer: 2.703 ± 0.537
PheThr: 2.111 ± 0.456
PheVal: 2.28 ± 0.424
PheTrp: 0.169 ± 0.126
PheTyr: 1.774 ± 0.456
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.166 ± 0.727
GlyCys: 0.422 ± 0.177
GlyAsp: 3.716 ± 0.693
GlyGlu: 6.166 ± 0.708
GlyPhe: 3.97 ± 0.707
GlyGly: 7.686 ± 2.051
GlyHis: 1.014 ± 0.369
GlyIle: 4.983 ± 0.661
GlyLys: 4.476 ± 0.663
GlyLeu: 6.334 ± 0.65
GlyMet: 2.703 ± 0.391
GlyAsn: 5.152 ± 1.14
GlyPro: 2.703 ± 0.442
GlyGln: 3.885 ± 0.814
GlyArg: 3.632 ± 0.495
GlySer: 4.899 ± 0.757
GlyThr: 4.223 ± 0.619
GlyVal: 5.49 ± 0.819
GlyTrp: 2.196 ± 0.641
GlyTyr: 2.027 ± 0.411
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.929 ± 0.35
HisCys: 0.084 ± 0.077
HisAsp: 0.676 ± 0.238
HisGlu: 0.591 ± 0.202
HisPhe: 0.676 ± 0.245
HisGly: 1.351 ± 0.334
HisHis: 0.338 ± 0.141
HisIle: 1.182 ± 0.33
HisLys: 0.676 ± 0.201
HisLeu: 0.929 ± 0.29
HisMet: 0.169 ± 0.12
HisAsn: 0.845 ± 0.213
HisPro: 1.098 ± 0.359
HisGln: 0.169 ± 0.117
HisArg: 0.845 ± 0.345
HisSer: 1.267 ± 0.402
HisThr: 1.098 ± 0.374
HisVal: 1.182 ± 0.324
HisTrp: 0.253 ± 0.138
HisTyr: 0.929 ± 0.27
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.01 ± 0.809
IleCys: 1.098 ± 0.289
IleAsp: 4.392 ± 0.514
IleGlu: 6.081 ± 0.643
IlePhe: 1.351 ± 0.324
IleGly: 4.307 ± 0.556
IleHis: 0.422 ± 0.215
IleIle: 2.111 ± 0.505
IleLys: 2.956 ± 0.36
IleLeu: 3.209 ± 0.384
IleMet: 0.591 ± 0.255
IleAsn: 2.027 ± 0.305
IlePro: 3.209 ± 0.527
IleGln: 1.689 ± 0.325
IleArg: 2.956 ± 0.494
IleSer: 3.209 ± 0.517
IleThr: 2.872 ± 0.512
IleVal: 3.125 ± 0.436
IleTrp: 0.76 ± 0.232
IleTyr: 0.929 ± 0.232
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.361 ± 1.019
LysCys: 0.253 ± 0.139
LysAsp: 2.365 ± 0.437
LysGlu: 3.801 ± 0.566
LysPhe: 3.378 ± 0.414
LysGly: 4.054 ± 0.57
LysHis: 1.689 ± 0.455
LysIle: 3.801 ± 0.552
LysLys: 2.703 ± 0.544
LysLeu: 5.659 ± 0.631
LysMet: 1.605 ± 0.35
LysAsn: 1.52 ± 0.323
LysPro: 2.703 ± 0.451
LysGln: 2.449 ± 0.436
LysArg: 3.209 ± 0.506
LysSer: 3.463 ± 0.523
LysThr: 3.209 ± 0.449
LysVal: 4.392 ± 0.566
LysTrp: 1.014 ± 0.297
LysTyr: 2.28 ± 0.454
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.672 ± 1.055
LeuCys: 0.676 ± 0.232
LeuAsp: 3.801 ± 0.461
LeuGlu: 3.716 ± 0.481
LeuPhe: 3.716 ± 0.626
LeuGly: 6.419 ± 0.69
LeuHis: 1.351 ± 0.363
LeuIle: 3.547 ± 0.488
LeuLys: 4.307 ± 0.619
LeuLeu: 4.983 ± 0.822
LeuMet: 1.436 ± 0.304
LeuAsn: 3.547 ± 0.535
LeuPro: 3.041 ± 0.571
LeuGln: 3.209 ± 0.628
LeuArg: 3.97 ± 0.462
LeuSer: 4.899 ± 0.492
LeuThr: 4.054 ± 0.742
LeuVal: 5.405 ± 0.762
LeuTrp: 0.845 ± 0.224
LeuTyr: 2.111 ± 0.345
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.463 ± 0.583
MetCys: 0.253 ± 0.15
MetAsp: 1.182 ± 0.364
MetGlu: 1.098 ± 0.335
MetPhe: 0.676 ± 0.241
MetGly: 1.605 ± 0.293
MetHis: 0.422 ± 0.233
MetIle: 1.436 ± 0.29
MetLys: 1.605 ± 0.46
MetLeu: 1.774 ± 0.402
MetMet: 0.422 ± 0.168
MetAsn: 1.267 ± 0.313
MetPro: 1.014 ± 0.347
MetGln: 1.52 ± 0.28
MetArg: 1.182 ± 0.347
MetSer: 1.52 ± 0.294
MetThr: 1.436 ± 0.387
MetVal: 1.098 ± 0.362
MetTrp: 0.253 ± 0.15
MetTyr: 0.253 ± 0.131
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.152 ± 0.607
AsnCys: 0.929 ± 0.315
AsnAsp: 2.196 ± 0.389
AsnGlu: 3.801 ± 0.546
AsnPhe: 1.943 ± 0.353
AsnGly: 5.574 ± 0.956
AsnHis: 0.676 ± 0.245
AsnIle: 2.703 ± 0.489
AsnLys: 2.365 ± 0.365
AsnLeu: 2.956 ± 0.495
AsnMet: 1.351 ± 0.293
AsnAsn: 3.378 ± 0.987
AsnPro: 2.28 ± 0.584
AsnGln: 1.182 ± 0.453
AsnArg: 2.365 ± 0.382
AsnSer: 3.125 ± 0.541
AsnThr: 2.28 ± 0.367
AsnVal: 3.801 ± 0.486
AsnTrp: 0.676 ± 0.203
AsnTyr: 0.676 ± 0.224
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.561 ± 0.665
ProCys: 0.338 ± 0.143
ProAsp: 2.111 ± 0.507
ProGlu: 2.618 ± 0.504
ProPhe: 1.267 ± 0.271
ProGly: 3.885 ± 0.713
ProHis: 0.676 ± 0.237
ProIle: 2.111 ± 0.451
ProLys: 2.787 ± 0.595
ProLeu: 2.618 ± 0.453
ProMet: 0.845 ± 0.261
ProAsn: 1.774 ± 0.507
ProPro: 2.365 ± 0.542
ProGln: 1.858 ± 0.33
ProArg: 2.111 ± 0.506
ProSer: 2.365 ± 0.507
ProThr: 3.97 ± 0.695
ProVal: 3.209 ± 0.389
ProTrp: 0.084 ± 0.087
ProTyr: 1.858 ± 0.446
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.645 ± 1.052
GlnCys: 0.422 ± 0.256
GlnAsp: 1.267 ± 0.281
GlnGlu: 1.943 ± 0.441
GlnPhe: 1.52 ± 0.407
GlnGly: 3.294 ± 1.273
GlnHis: 0.591 ± 0.268
GlnIle: 1.774 ± 0.518
GlnLys: 2.449 ± 0.572
GlnLeu: 2.956 ± 0.504
GlnMet: 1.182 ± 0.372
GlnAsn: 2.449 ± 0.455
GlnPro: 1.098 ± 0.241
GlnGln: 2.872 ± 1.113
GlnArg: 2.534 ± 0.59
GlnSer: 1.774 ± 0.354
GlnThr: 1.943 ± 0.436
GlnVal: 2.196 ± 0.41
GlnTrp: 0.422 ± 0.141
GlnTyr: 1.436 ± 0.277
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.054 ± 0.591
ArgCys: 0.76 ± 0.278
ArgAsp: 3.041 ± 0.441
ArgGlu: 3.632 ± 0.478
ArgPhe: 2.534 ± 0.477
ArgGly: 3.632 ± 0.507
ArgHis: 0.507 ± 0.178
ArgIle: 4.561 ± 0.523
ArgLys: 3.125 ± 0.49
ArgLeu: 3.463 ± 0.578
ArgMet: 1.858 ± 0.429
ArgAsn: 2.872 ± 0.456
ArgPro: 1.351 ± 0.34
ArgGln: 1.774 ± 0.356
ArgArg: 3.463 ± 0.44
ArgSer: 2.872 ± 0.605
ArgThr: 2.534 ± 0.406
ArgVal: 3.885 ± 0.706
ArgTrp: 0.845 ± 0.268
ArgTyr: 2.027 ± 0.405
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.672 ± 0.807
SerCys: 0.591 ± 0.218
SerAsp: 3.378 ± 0.473
SerGlu: 4.73 ± 0.659
SerPhe: 2.111 ± 0.466
SerGly: 5.49 ± 0.651
SerHis: 1.014 ± 0.261
SerIle: 2.365 ± 0.427
SerLys: 3.378 ± 0.521
SerLeu: 3.97 ± 0.562
SerMet: 0.929 ± 0.346
SerAsn: 3.547 ± 0.674
SerPro: 2.449 ± 0.532
SerGln: 2.618 ± 0.534
SerArg: 3.041 ± 0.562
SerSer: 3.294 ± 0.632
SerThr: 2.703 ± 0.441
SerVal: 3.716 ± 0.89
SerTrp: 0.76 ± 0.286
SerTyr: 2.365 ± 0.36
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.997 ± 0.731
ThrCys: 0.338 ± 0.172
ThrAsp: 3.378 ± 0.49
ThrGlu: 4.476 ± 0.643
ThrPhe: 2.28 ± 0.581
ThrGly: 4.899 ± 0.49
ThrHis: 0.591 ± 0.226
ThrIle: 2.449 ± 0.447
ThrLys: 2.872 ± 0.546
ThrLeu: 3.294 ± 0.446
ThrMet: 1.182 ± 0.302
ThrAsn: 2.111 ± 0.318
ThrPro: 3.209 ± 0.546
ThrGln: 2.534 ± 0.722
ThrArg: 2.449 ± 0.442
ThrSer: 2.787 ± 0.59
ThrThr: 3.463 ± 0.652
ThrVal: 3.209 ± 0.538
ThrTrp: 1.098 ± 0.313
ThrTyr: 1.689 ± 0.355
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.348 ± 0.692
ValCys: 1.098 ± 0.276
ValAsp: 3.463 ± 0.451
ValGlu: 4.73 ± 0.539
ValPhe: 2.365 ± 0.505
ValGly: 5.659 ± 0.715
ValHis: 0.845 ± 0.243
ValIle: 3.294 ± 0.473
ValLys: 5.49 ± 0.709
ValLeu: 4.814 ± 0.652
ValMet: 1.52 ± 0.366
ValAsn: 2.365 ± 0.378
ValPro: 2.365 ± 0.507
ValGln: 2.196 ± 0.418
ValArg: 2.956 ± 0.529
ValSer: 5.405 ± 0.958
ValThr: 3.716 ± 0.631
ValVal: 4.392 ± 0.486
ValTrp: 0.929 ± 0.313
ValTyr: 1.774 ± 0.398
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.929 ± 0.224
TrpCys: 0.169 ± 0.111
TrpAsp: 0.591 ± 0.256
TrpGlu: 0.845 ± 0.316
TrpPhe: 0.422 ± 0.23
TrpGly: 0.929 ± 0.345
TrpHis: 0.507 ± 0.221
TrpIle: 0.507 ± 0.264
TrpLys: 0.591 ± 0.237
TrpLeu: 1.351 ± 0.324
TrpMet: 0.169 ± 0.102
TrpAsn: 0.845 ± 0.301
TrpPro: 0.845 ± 0.232
TrpGln: 1.182 ± 0.237
TrpArg: 1.098 ± 0.298
TrpSer: 0.507 ± 0.199
TrpThr: 0.507 ± 0.184
TrpVal: 1.436 ± 0.397
TrpTrp: 0.169 ± 0.12
TrpTyr: 0.507 ± 0.205
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.28 ± 0.361
TyrCys: 0.253 ± 0.179
TyrAsp: 2.618 ± 0.409
TyrGlu: 1.774 ± 0.327
TyrPhe: 1.014 ± 0.251
TyrGly: 1.943 ± 0.416
TyrHis: 0.929 ± 0.238
TyrIle: 0.845 ± 0.208
TyrLys: 2.28 ± 0.456
TyrLeu: 2.027 ± 0.314
TyrMet: 0.591 ± 0.246
TyrAsn: 1.267 ± 0.314
TyrPro: 1.943 ± 0.362
TyrGln: 1.351 ± 0.371
TyrArg: 1.605 ± 0.393
TyrSer: 2.787 ± 0.615
TyrThr: 1.858 ± 0.584
TyrVal: 1.182 ± 0.248
TyrTrp: 0.676 ± 0.212
TyrTyr: 0.76 ± 0.277
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (11841 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
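A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative assumption of how such an estimate might be computed, not the Proteome-pI implementation: the function names are hypothetical, and the source does not state whether the reported error is a standard deviation or how exactly the resamples are drawn. Here whole proteins are resampled with replacement 100 times, and the sample standard deviation of the recomputed frequency is returned.

```python
import random
import statistics


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide (two one-letter codes)."""
    hits = total = 0
    for seq in proteins:
        # Overlapping pairs within each protein.
        for a, b in zip(seq, seq[1:]):
            total += 1
            hits += (a + b == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAK", "AAG", "MKV", "GAAL"]
print(dipeptide_frequency(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each dipeptide's sequence context intact, which is presumably why the estimate is made at the protein level.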

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski