Amino acid dipepetide frequency for Xanthomonas phage Xp15

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.464AlaAla: 7.464 ± 1.126
0.529AlaCys: 0.529 ± 0.166
4.114AlaAsp: 4.114 ± 0.577
4.702AlaGlu: 4.702 ± 0.531
2.88AlaPhe: 2.88 ± 0.499
5.76AlaGly: 5.76 ± 0.643
1.528AlaHis: 1.528 ± 0.246
5.231AlaIle: 5.231 ± 0.546
4.937AlaLys: 4.937 ± 0.625
7.288AlaLeu: 7.288 ± 0.615
2.88AlaMet: 2.88 ± 0.491
4.761AlaAsn: 4.761 ± 0.585
3.703AlaPro: 3.703 ± 0.477
2.703AlaGln: 2.703 ± 0.478
4.349AlaArg: 4.349 ± 0.582
6.347AlaSer: 6.347 ± 0.755
5.407AlaThr: 5.407 ± 0.554
4.761AlaVal: 4.761 ± 0.52
1.411AlaTrp: 1.411 ± 0.282
3.232AlaTyr: 3.232 ± 0.485
0.0AlaXaa: 0.0 ± 0.0
Cys
0.646CysAla: 0.646 ± 0.206
0.059CysCys: 0.059 ± 0.065
0.353CysAsp: 0.353 ± 0.148
0.47CysGlu: 0.47 ± 0.222
0.411CysPhe: 0.411 ± 0.126
0.823CysGly: 0.823 ± 0.229
0.529CysHis: 0.529 ± 0.197
0.764CysIle: 0.764 ± 0.244
0.764CysLys: 0.764 ± 0.232
0.882CysLeu: 0.882 ± 0.207
0.47CysMet: 0.47 ± 0.178
0.411CysAsn: 0.411 ± 0.183
0.529CysPro: 0.529 ± 0.226
0.411CysGln: 0.411 ± 0.193
0.823CysArg: 0.823 ± 0.186
0.47CysSer: 0.47 ± 0.171
0.529CysThr: 0.529 ± 0.229
0.588CysVal: 0.588 ± 0.178
0.176CysTrp: 0.176 ± 0.099
0.353CysTyr: 0.353 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
5.348AspAla: 5.348 ± 0.664
0.705AspCys: 0.705 ± 0.215
3.056AspAsp: 3.056 ± 0.395
2.88AspGlu: 2.88 ± 0.412
2.41AspPhe: 2.41 ± 0.387
4.29AspGly: 4.29 ± 0.639
0.999AspHis: 0.999 ± 0.227
2.762AspIle: 2.762 ± 0.398
2.527AspLys: 2.527 ± 0.404
5.348AspLeu: 5.348 ± 0.523
1.352AspMet: 1.352 ± 0.212
2.468AspAsn: 2.468 ± 0.322
2.527AspPro: 2.527 ± 0.479
2.41AspGln: 2.41 ± 0.349
2.997AspArg: 2.997 ± 0.429
3.409AspSer: 3.409 ± 0.409
2.88AspThr: 2.88 ± 0.498
3.938AspVal: 3.938 ± 0.475
0.764AspTrp: 0.764 ± 0.199
2.292AspTyr: 2.292 ± 0.439
0.0AspXaa: 0.0 ± 0.0
Glu
4.349GluAla: 4.349 ± 0.554
0.411GluCys: 0.411 ± 0.155
3.291GluAsp: 3.291 ± 0.461
3.526GluGlu: 3.526 ± 0.603
3.174GluPhe: 3.174 ± 0.503
4.055GluGly: 4.055 ± 0.767
1.469GluHis: 1.469 ± 0.346
4.173GluIle: 4.173 ± 0.499
3.056GluLys: 3.056 ± 0.53
5.407GluLeu: 5.407 ± 0.53
1.998GluMet: 1.998 ± 0.349
1.763GluAsn: 1.763 ± 0.332
1.646GluPro: 1.646 ± 0.362
1.881GluGln: 1.881 ± 0.318
2.233GluArg: 2.233 ± 0.424
3.468GluSer: 3.468 ± 0.393
2.88GluThr: 2.88 ± 0.39
4.761GluVal: 4.761 ± 0.661
1.469GluTrp: 1.469 ± 0.262
2.175GluTyr: 2.175 ± 0.309
0.118GluXaa: 0.118 ± 0.099
Phe
3.056PheAla: 3.056 ± 0.366
0.235PheCys: 0.235 ± 0.116
2.703PheAsp: 2.703 ± 0.381
2.586PheGlu: 2.586 ± 0.376
1.293PhePhe: 1.293 ± 0.335
2.762PheGly: 2.762 ± 0.536
0.764PheHis: 0.764 ± 0.175
1.175PheIle: 1.175 ± 0.244
2.233PheLys: 2.233 ± 0.36
2.586PheLeu: 2.586 ± 0.445
1.175PheMet: 1.175 ± 0.26
2.057PheAsn: 2.057 ± 0.394
1.411PhePro: 1.411 ± 0.306
1.998PheGln: 1.998 ± 0.392
2.175PheArg: 2.175 ± 0.413
2.527PheSer: 2.527 ± 0.315
2.762PheThr: 2.762 ± 0.377
2.468PheVal: 2.468 ± 0.354
0.823PheTrp: 0.823 ± 0.225
1.881PheTyr: 1.881 ± 0.354
0.059PheXaa: 0.059 ± 0.049
Gly
6.582GlyAla: 6.582 ± 0.83
1.175GlyCys: 1.175 ± 0.336
4.349GlyAsp: 4.349 ± 0.387
4.114GlyGlu: 4.114 ± 0.487
2.762GlyPhe: 2.762 ± 0.331
5.583GlyGly: 5.583 ± 0.805
1.293GlyHis: 1.293 ± 0.348
3.879GlyIle: 3.879 ± 0.566
4.232GlyLys: 4.232 ± 0.442
6.347GlyLeu: 6.347 ± 0.64
2.292GlyMet: 2.292 ± 0.386
3.409GlyAsn: 3.409 ± 0.558
2.351GlyPro: 2.351 ± 0.346
2.645GlyGln: 2.645 ± 0.349
3.703GlyArg: 3.703 ± 0.394
5.525GlySer: 5.525 ± 0.594
4.996GlyThr: 4.996 ± 0.619
5.642GlyVal: 5.642 ± 0.673
1.117GlyTrp: 1.117 ± 0.241
2.057GlyTyr: 2.057 ± 0.393
0.0GlyXaa: 0.0 ± 0.0
His
1.763HisAla: 1.763 ± 0.346
0.529HisCys: 0.529 ± 0.214
0.999HisAsp: 0.999 ± 0.186
0.823HisGlu: 0.823 ± 0.212
0.764HisPhe: 0.764 ± 0.236
1.411HisGly: 1.411 ± 0.368
0.353HisHis: 0.353 ± 0.158
1.175HisIle: 1.175 ± 0.285
1.175HisLys: 1.175 ± 0.305
1.998HisLeu: 1.998 ± 0.377
0.529HisMet: 0.529 ± 0.171
0.882HisAsn: 0.882 ± 0.236
1.293HisPro: 1.293 ± 0.259
0.764HisGln: 0.764 ± 0.207
1.528HisArg: 1.528 ± 0.274
0.882HisSer: 0.882 ± 0.282
0.999HisThr: 0.999 ± 0.217
1.411HisVal: 1.411 ± 0.293
0.294HisTrp: 0.294 ± 0.159
0.823HisTyr: 0.823 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
5.172IleAla: 5.172 ± 0.733
0.646IleCys: 0.646 ± 0.205
3.761IleAsp: 3.761 ± 0.458
2.997IleGlu: 2.997 ± 0.391
1.998IlePhe: 1.998 ± 0.385
4.702IleGly: 4.702 ± 0.62
1.175IleHis: 1.175 ± 0.264
2.88IleIle: 2.88 ± 0.413
3.291IleLys: 3.291 ± 0.417
4.232IleLeu: 4.232 ± 0.526
1.469IleMet: 1.469 ± 0.317
2.939IleAsn: 2.939 ± 0.519
3.174IlePro: 3.174 ± 0.428
2.527IleGln: 2.527 ± 0.382
3.938IleArg: 3.938 ± 0.465
3.996IleSer: 3.996 ± 0.568
4.643IleThr: 4.643 ± 0.58
3.703IleVal: 3.703 ± 0.439
0.529IleTrp: 0.529 ± 0.183
1.822IleTyr: 1.822 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
4.996LysAla: 4.996 ± 0.668
0.353LysCys: 0.353 ± 0.155
2.41LysAsp: 2.41 ± 0.352
2.703LysGlu: 2.703 ± 0.417
2.41LysPhe: 2.41 ± 0.411
3.115LysGly: 3.115 ± 0.611
0.999LysHis: 0.999 ± 0.301
3.232LysIle: 3.232 ± 0.465
2.645LysLys: 2.645 ± 0.404
5.701LysLeu: 5.701 ± 0.676
2.116LysMet: 2.116 ± 0.367
2.41LysAsn: 2.41 ± 0.435
2.116LysPro: 2.116 ± 0.38
2.175LysGln: 2.175 ± 0.29
2.762LysArg: 2.762 ± 0.439
3.468LysSer: 3.468 ± 0.543
3.115LysThr: 3.115 ± 0.376
4.055LysVal: 4.055 ± 0.58
0.882LysTrp: 0.882 ± 0.263
2.233LysTyr: 2.233 ± 0.355
0.0LysXaa: 0.0 ± 0.0
Leu
7.229LeuAla: 7.229 ± 0.658
0.94LeuCys: 0.94 ± 0.257
4.761LeuAsp: 4.761 ± 0.564
5.231LeuGlu: 5.231 ± 0.532
2.703LeuPhe: 2.703 ± 0.423
5.76LeuGly: 5.76 ± 0.553
1.352LeuHis: 1.352 ± 0.277
5.407LeuIle: 5.407 ± 0.64
4.349LeuLys: 4.349 ± 0.543
5.995LeuLeu: 5.995 ± 0.818
2.645LeuMet: 2.645 ± 0.528
3.35LeuAsn: 3.35 ± 0.466
3.174LeuPro: 3.174 ± 0.435
3.644LeuGln: 3.644 ± 0.434
5.054LeuArg: 5.054 ± 0.548
5.995LeuSer: 5.995 ± 0.558
4.937LeuThr: 4.937 ± 0.54
4.643LeuVal: 4.643 ± 0.607
1.234LeuTrp: 1.234 ± 0.35
2.88LeuTyr: 2.88 ± 0.483
0.0LeuXaa: 0.0 ± 0.0
Met
3.703MetAla: 3.703 ± 0.607
0.411MetCys: 0.411 ± 0.159
2.116MetAsp: 2.116 ± 0.369
1.881MetGlu: 1.881 ± 0.319
0.823MetPhe: 0.823 ± 0.207
1.763MetGly: 1.763 ± 0.435
0.411MetHis: 0.411 ± 0.163
2.351MetIle: 2.351 ± 0.358
2.175MetLys: 2.175 ± 0.388
2.116MetLeu: 2.116 ± 0.39
1.175MetMet: 1.175 ± 0.256
1.058MetAsn: 1.058 ± 0.259
1.175MetPro: 1.175 ± 0.234
0.823MetGln: 0.823 ± 0.198
1.939MetArg: 1.939 ± 0.361
2.351MetSer: 2.351 ± 0.402
2.292MetThr: 2.292 ± 0.42
2.175MetVal: 2.175 ± 0.379
0.529MetTrp: 0.529 ± 0.191
0.823MetTyr: 0.823 ± 0.255
0.0MetXaa: 0.0 ± 0.0
Asn
3.35AsnAla: 3.35 ± 0.479
0.294AsnCys: 0.294 ± 0.116
1.998AsnAsp: 1.998 ± 0.371
1.998AsnGlu: 1.998 ± 0.321
2.116AsnPhe: 2.116 ± 0.391
4.643AsnGly: 4.643 ± 0.691
1.293AsnHis: 1.293 ± 0.295
2.292AsnIle: 2.292 ± 0.507
1.939AsnLys: 1.939 ± 0.316
3.115AsnLeu: 3.115 ± 0.372
1.293AsnMet: 1.293 ± 0.251
1.293AsnAsn: 1.293 ± 0.284
3.115AsnPro: 3.115 ± 0.406
1.763AsnGln: 1.763 ± 0.417
1.939AsnArg: 1.939 ± 0.361
3.468AsnSer: 3.468 ± 0.48
2.586AsnThr: 2.586 ± 0.409
2.997AsnVal: 2.997 ± 0.359
0.823AsnTrp: 0.823 ± 0.228
1.763AsnTyr: 1.763 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
3.409ProAla: 3.409 ± 0.474
0.529ProCys: 0.529 ± 0.185
2.703ProAsp: 2.703 ± 0.43
3.409ProGlu: 3.409 ± 0.567
1.704ProPhe: 1.704 ± 0.394
3.703ProGly: 3.703 ± 0.613
1.175ProHis: 1.175 ± 0.296
2.233ProIle: 2.233 ± 0.392
1.881ProLys: 1.881 ± 0.301
2.41ProLeu: 2.41 ± 0.354
1.469ProMet: 1.469 ± 0.282
2.057ProAsn: 2.057 ± 0.369
2.175ProPro: 2.175 ± 0.566
1.117ProGln: 1.117 ± 0.222
2.468ProArg: 2.468 ± 0.369
3.644ProSer: 3.644 ± 0.513
2.292ProThr: 2.292 ± 0.383
3.526ProVal: 3.526 ± 0.404
0.764ProTrp: 0.764 ± 0.211
1.352ProTyr: 1.352 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.82GlnAla: 3.82 ± 0.478
0.47GlnCys: 0.47 ± 0.18
2.175GlnAsp: 2.175 ± 0.445
2.116GlnGlu: 2.116 ± 0.35
1.469GlnPhe: 1.469 ± 0.237
3.115GlnGly: 3.115 ± 0.448
0.705GlnHis: 0.705 ± 0.189
3.115GlnIle: 3.115 ± 0.308
1.998GlnLys: 1.998 ± 0.363
2.997GlnLeu: 2.997 ± 0.387
1.528GlnMet: 1.528 ± 0.296
1.822GlnAsn: 1.822 ± 0.284
1.469GlnPro: 1.469 ± 0.313
1.175GlnGln: 1.175 ± 0.271
1.704GlnArg: 1.704 ± 0.258
2.527GlnSer: 2.527 ± 0.404
2.351GlnThr: 2.351 ± 0.351
2.468GlnVal: 2.468 ± 0.424
0.529GlnTrp: 0.529 ± 0.211
1.352GlnTyr: 1.352 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
3.468ArgAla: 3.468 ± 0.525
0.705ArgCys: 0.705 ± 0.254
2.645ArgAsp: 2.645 ± 0.401
2.821ArgGlu: 2.821 ± 0.452
2.586ArgPhe: 2.586 ± 0.307
3.703ArgGly: 3.703 ± 0.43
1.058ArgHis: 1.058 ± 0.249
3.82ArgIle: 3.82 ± 0.405
3.232ArgLys: 3.232 ± 0.514
4.29ArgLeu: 4.29 ± 0.54
1.763ArgMet: 1.763 ± 0.295
2.468ArgAsn: 2.468 ± 0.407
2.997ArgPro: 2.997 ± 0.47
1.704ArgGln: 1.704 ± 0.29
2.939ArgArg: 2.939 ± 0.461
4.349ArgSer: 4.349 ± 0.557
3.35ArgThr: 3.35 ± 0.395
3.938ArgVal: 3.938 ± 0.587
0.705ArgTrp: 0.705 ± 0.23
1.881ArgTyr: 1.881 ± 0.291
0.0ArgXaa: 0.0 ± 0.0
Ser
5.642SerAla: 5.642 ± 0.583
0.529SerCys: 0.529 ± 0.168
3.996SerAsp: 3.996 ± 0.55
4.055SerGlu: 4.055 ± 0.558
1.763SerPhe: 1.763 ± 0.302
5.936SerGly: 5.936 ± 0.609
1.293SerHis: 1.293 ± 0.305
3.938SerIle: 3.938 ± 0.395
4.055SerLys: 4.055 ± 0.485
6.053SerLeu: 6.053 ± 0.518
2.821SerMet: 2.821 ± 0.307
3.174SerAsn: 3.174 ± 0.404
2.586SerPro: 2.586 ± 0.422
3.115SerGln: 3.115 ± 0.344
3.526SerArg: 3.526 ± 0.382
4.525SerSer: 4.525 ± 0.521
3.703SerThr: 3.703 ± 0.475
3.703SerVal: 3.703 ± 0.468
1.528SerTrp: 1.528 ± 0.28
2.586SerTyr: 2.586 ± 0.474
0.059SerXaa: 0.059 ± 0.049
Thr
5.642ThrAla: 5.642 ± 0.55
0.411ThrCys: 0.411 ± 0.179
2.939ThrAsp: 2.939 ± 0.382
3.409ThrGlu: 3.409 ± 0.386
2.821ThrPhe: 2.821 ± 0.561
4.702ThrGly: 4.702 ± 0.423
1.469ThrHis: 1.469 ± 0.294
3.879ThrIle: 3.879 ± 0.505
3.056ThrLys: 3.056 ± 0.46
5.054ThrLeu: 5.054 ± 0.524
1.469ThrMet: 1.469 ± 0.236
1.704ThrAsn: 1.704 ± 0.375
3.585ThrPro: 3.585 ± 0.366
2.939ThrGln: 2.939 ± 0.436
2.997ThrArg: 2.997 ± 0.409
3.703ThrSer: 3.703 ± 0.548
3.938ThrThr: 3.938 ± 0.636
4.819ThrVal: 4.819 ± 0.502
0.94ThrTrp: 0.94 ± 0.239
2.233ThrTyr: 2.233 ± 0.371
0.059ThrXaa: 0.059 ± 0.066
Val
5.054ValAla: 5.054 ± 0.541
0.823ValCys: 0.823 ± 0.182
4.349ValAsp: 4.349 ± 0.454
4.173ValGlu: 4.173 ± 0.594
2.233ValPhe: 2.233 ± 0.341
4.467ValGly: 4.467 ± 0.464
1.528ValHis: 1.528 ± 0.375
4.173ValIle: 4.173 ± 0.475
4.29ValLys: 4.29 ± 0.522
5.172ValLeu: 5.172 ± 0.546
2.175ValMet: 2.175 ± 0.289
3.938ValAsn: 3.938 ± 0.449
3.056ValPro: 3.056 ± 0.383
3.115ValGln: 3.115 ± 0.407
3.35ValArg: 3.35 ± 0.554
4.29ValSer: 4.29 ± 0.498
4.525ValThr: 4.525 ± 0.605
5.054ValVal: 5.054 ± 0.539
1.058ValTrp: 1.058 ± 0.23
2.057ValTyr: 2.057 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
0.823TrpAla: 0.823 ± 0.253
0.118TrpCys: 0.118 ± 0.084
0.94TrpAsp: 0.94 ± 0.179
0.764TrpGlu: 0.764 ± 0.226
0.646TrpPhe: 0.646 ± 0.195
1.058TrpGly: 1.058 ± 0.227
0.529TrpHis: 0.529 ± 0.179
1.234TrpIle: 1.234 ± 0.245
0.646TrpLys: 0.646 ± 0.199
1.234TrpLeu: 1.234 ± 0.315
0.529TrpMet: 0.529 ± 0.186
0.411TrpAsn: 0.411 ± 0.137
0.823TrpPro: 0.823 ± 0.179
0.176TrpGln: 0.176 ± 0.098
1.822TrpArg: 1.822 ± 0.428
1.117TrpSer: 1.117 ± 0.252
0.94TrpThr: 0.94 ± 0.303
1.469TrpVal: 1.469 ± 0.305
0.294TrpTrp: 0.294 ± 0.108
0.705TrpTyr: 0.705 ± 0.181
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.468TyrAla: 2.468 ± 0.392
0.588TyrCys: 0.588 ± 0.183
1.704TyrAsp: 1.704 ± 0.338
2.586TyrGlu: 2.586 ± 0.366
1.763TyrPhe: 1.763 ± 0.349
2.586TyrGly: 2.586 ± 0.367
0.47TyrHis: 0.47 ± 0.152
1.881TyrIle: 1.881 ± 0.382
1.352TyrLys: 1.352 ± 0.268
3.056TyrLeu: 3.056 ± 0.439
0.882TyrMet: 0.882 ± 0.227
1.704TyrAsn: 1.704 ± 0.318
1.117TyrPro: 1.117 ± 0.241
1.763TyrGln: 1.763 ± 0.322
2.233TyrArg: 2.233 ± 0.449
2.527TyrSer: 2.527 ± 0.423
2.586TyrThr: 2.586 ± 0.368
2.762TyrVal: 2.762 ± 0.35
0.411TyrTrp: 0.411 ± 0.158
1.058TyrTyr: 1.058 ± 0.233
0.059TyrXaa: 0.059 ± 0.066
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.059XaaGlu: 0.059 ± 0.049
0.0XaaPhe: 0.0 ± 0.0
0.118XaaGly: 0.118 ± 0.099
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.059XaaLys: 0.059 ± 0.066
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.059XaaGln: 0.059 ± 0.066
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.059XaaThr: 0.059 ± 0.049
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (17016 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski