Amino acid dipepetide frequency for Xanthomonas phage XAJ24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.532AlaAla: 14.532 ± 2.099
0.363AlaCys: 0.363 ± 0.172
6.321AlaAsp: 6.321 ± 0.859
6.394AlaGlu: 6.394 ± 0.695
2.398AlaPhe: 2.398 ± 0.494
7.847AlaGly: 7.847 ± 1.179
1.598AlaHis: 1.598 ± 0.291
4.795AlaIle: 4.795 ± 0.743
5.086AlaLys: 5.086 ± 0.651
7.847AlaLeu: 7.847 ± 0.845
3.197AlaMet: 3.197 ± 0.695
4.069AlaAsn: 4.069 ± 0.513
4.432AlaPro: 4.432 ± 0.786
5.522AlaGln: 5.522 ± 0.749
5.958AlaArg: 5.958 ± 0.681
5.885AlaSer: 5.885 ± 0.725
6.031AlaThr: 6.031 ± 0.627
7.992AlaVal: 7.992 ± 0.723
2.252AlaTrp: 2.252 ± 0.431
3.415AlaTyr: 3.415 ± 0.567
0.0AlaXaa: 0.0 ± 0.0
Cys
0.727CysAla: 0.727 ± 0.276
0.0CysCys: 0.0 ± 0.0
0.509CysAsp: 0.509 ± 0.202
0.218CysGlu: 0.218 ± 0.125
0.436CysPhe: 0.436 ± 0.19
0.727CysGly: 0.727 ± 0.311
0.145CysHis: 0.145 ± 0.124
0.436CysIle: 0.436 ± 0.175
0.436CysLys: 0.436 ± 0.169
0.509CysLeu: 0.509 ± 0.194
0.218CysMet: 0.218 ± 0.171
0.509CysAsn: 0.509 ± 0.25
0.145CysPro: 0.145 ± 0.099
0.509CysGln: 0.509 ± 0.23
0.436CysArg: 0.436 ± 0.17
0.436CysSer: 0.436 ± 0.2
0.872CysThr: 0.872 ± 0.276
0.727CysVal: 0.727 ± 0.211
0.073CysTrp: 0.073 ± 0.063
0.509CysTyr: 0.509 ± 0.19
0.0CysXaa: 0.0 ± 0.0
Asp
6.249AspAla: 6.249 ± 0.654
0.581AspCys: 0.581 ± 0.21
4.069AspAsp: 4.069 ± 0.603
3.706AspGlu: 3.706 ± 0.537
3.124AspPhe: 3.124 ± 0.458
4.795AspGly: 4.795 ± 0.57
0.945AspHis: 0.945 ± 0.222
3.124AspIle: 3.124 ± 0.447
3.052AspLys: 3.052 ± 0.443
3.633AspLeu: 3.633 ± 0.526
1.598AspMet: 1.598 ± 0.282
2.616AspAsn: 2.616 ± 0.422
2.834AspPro: 2.834 ± 0.348
2.398AspGln: 2.398 ± 0.427
3.924AspArg: 3.924 ± 0.506
2.616AspSer: 2.616 ± 0.449
3.488AspThr: 3.488 ± 0.582
4.65AspVal: 4.65 ± 0.539
0.945AspTrp: 0.945 ± 0.248
1.744AspTyr: 1.744 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
6.176GluAla: 6.176 ± 0.649
0.509GluCys: 0.509 ± 0.189
2.906GluAsp: 2.906 ± 0.371
3.996GluGlu: 3.996 ± 0.498
2.47GluPhe: 2.47 ± 0.434
4.723GluGly: 4.723 ± 0.46
1.453GluHis: 1.453 ± 0.299
2.47GluIle: 2.47 ± 0.415
2.979GluLys: 2.979 ± 0.444
6.103GluLeu: 6.103 ± 0.568
1.889GluMet: 1.889 ± 0.383
2.325GluAsn: 2.325 ± 0.472
1.235GluPro: 1.235 ± 0.316
2.834GluGln: 2.834 ± 0.411
2.979GluArg: 2.979 ± 0.56
3.27GluSer: 3.27 ± 0.466
2.979GluThr: 2.979 ± 0.486
3.778GluVal: 3.778 ± 0.577
1.163GluTrp: 1.163 ± 0.319
2.398GluTyr: 2.398 ± 0.402
0.0GluXaa: 0.0 ± 0.0
Phe
2.906PheAla: 2.906 ± 0.435
0.363PheCys: 0.363 ± 0.173
2.107PheAsp: 2.107 ± 0.503
1.308PheGlu: 1.308 ± 0.379
0.727PhePhe: 0.727 ± 0.259
3.124PheGly: 3.124 ± 0.427
1.09PheHis: 1.09 ± 0.279
1.453PheIle: 1.453 ± 0.331
1.816PheLys: 1.816 ± 0.398
2.034PheLeu: 2.034 ± 0.475
0.654PheMet: 0.654 ± 0.204
1.381PheAsn: 1.381 ± 0.382
1.453PhePro: 1.453 ± 0.317
1.09PheGln: 1.09 ± 0.266
1.526PheArg: 1.526 ± 0.294
2.325PheSer: 2.325 ± 0.361
2.616PheThr: 2.616 ± 0.423
2.47PheVal: 2.47 ± 0.677
0.145PheTrp: 0.145 ± 0.1
1.381PheTyr: 1.381 ± 0.336
0.0PheXaa: 0.0 ± 0.0
Gly
8.21GlyAla: 8.21 ± 1.08
0.727GlyCys: 0.727 ± 0.262
5.304GlyAsp: 5.304 ± 0.555
4.577GlyGlu: 4.577 ± 0.699
3.488GlyPhe: 3.488 ± 0.379
9.082GlyGly: 9.082 ± 1.331
0.727GlyHis: 0.727 ± 0.241
3.778GlyIle: 3.778 ± 0.59
4.941GlyLys: 4.941 ± 0.706
5.885GlyLeu: 5.885 ± 0.563
2.616GlyMet: 2.616 ± 0.368
3.415GlyAsn: 3.415 ± 0.46
2.252GlyPro: 2.252 ± 0.442
3.342GlyGln: 3.342 ± 0.486
4.505GlyArg: 4.505 ± 0.512
5.449GlySer: 5.449 ± 0.724
6.103GlyThr: 6.103 ± 0.746
6.903GlyVal: 6.903 ± 0.562
1.09GlyTrp: 1.09 ± 0.265
2.761GlyTyr: 2.761 ± 0.41
0.0GlyXaa: 0.0 ± 0.0
His
1.381HisAla: 1.381 ± 0.294
0.218HisCys: 0.218 ± 0.139
1.09HisAsp: 1.09 ± 0.302
1.598HisGlu: 1.598 ± 0.324
0.509HisPhe: 0.509 ± 0.185
1.526HisGly: 1.526 ± 0.286
0.509HisHis: 0.509 ± 0.2
1.308HisIle: 1.308 ± 0.389
1.017HisLys: 1.017 ± 0.35
1.017HisLeu: 1.017 ± 0.267
0.654HisMet: 0.654 ± 0.217
0.799HisAsn: 0.799 ± 0.23
1.453HisPro: 1.453 ± 0.305
0.509HisGln: 0.509 ± 0.245
0.799HisArg: 0.799 ± 0.252
1.308HisSer: 1.308 ± 0.2
1.163HisThr: 1.163 ± 0.311
1.09HisVal: 1.09 ± 0.402
0.291HisTrp: 0.291 ± 0.122
0.509HisTyr: 0.509 ± 0.22
0.0HisXaa: 0.0 ± 0.0
Ile
4.941IleAla: 4.941 ± 0.541
0.509IleCys: 0.509 ± 0.163
3.052IleAsp: 3.052 ± 0.432
2.906IleGlu: 2.906 ± 0.444
1.017IlePhe: 1.017 ± 0.311
3.56IleGly: 3.56 ± 0.412
1.09IleHis: 1.09 ± 0.319
1.381IleIle: 1.381 ± 0.377
2.834IleLys: 2.834 ± 0.573
3.124IleLeu: 3.124 ± 0.505
0.872IleMet: 0.872 ± 0.313
2.398IleAsn: 2.398 ± 0.425
2.906IlePro: 2.906 ± 0.394
1.598IleGln: 1.598 ± 0.297
2.47IleArg: 2.47 ± 0.424
2.398IleSer: 2.398 ± 0.341
3.488IleThr: 3.488 ± 0.577
2.616IleVal: 2.616 ± 0.452
0.363IleTrp: 0.363 ± 0.142
1.308IleTyr: 1.308 ± 0.254
0.0IleXaa: 0.0 ± 0.0
Lys
5.013LysAla: 5.013 ± 0.724
0.291LysCys: 0.291 ± 0.141
2.906LysAsp: 2.906 ± 0.488
3.706LysGlu: 3.706 ± 0.562
1.526LysPhe: 1.526 ± 0.328
3.342LysGly: 3.342 ± 0.462
1.235LysHis: 1.235 ± 0.296
1.962LysIle: 1.962 ± 0.536
1.889LysLys: 1.889 ± 0.499
4.941LysLeu: 4.941 ± 0.708
1.235LysMet: 1.235 ± 0.325
1.889LysAsn: 1.889 ± 0.354
2.18LysPro: 2.18 ± 0.483
2.034LysGln: 2.034 ± 0.449
3.706LysArg: 3.706 ± 0.648
2.325LysSer: 2.325 ± 0.316
2.979LysThr: 2.979 ± 0.501
3.851LysVal: 3.851 ± 0.635
0.945LysTrp: 0.945 ± 0.206
2.325LysTyr: 2.325 ± 0.419
0.0LysXaa: 0.0 ± 0.0
Leu
7.992LeuAla: 7.992 ± 0.892
1.017LeuCys: 1.017 ± 0.313
4.723LeuAsp: 4.723 ± 0.488
4.795LeuGlu: 4.795 ± 0.538
1.453LeuPhe: 1.453 ± 0.334
5.595LeuGly: 5.595 ± 0.559
1.816LeuHis: 1.816 ± 0.318
3.27LeuIle: 3.27 ± 0.458
3.996LeuLys: 3.996 ± 0.473
6.83LeuLeu: 6.83 ± 0.772
1.744LeuMet: 1.744 ± 0.324
3.706LeuAsn: 3.706 ± 0.471
3.27LeuPro: 3.27 ± 0.662
3.924LeuGln: 3.924 ± 0.574
6.321LeuArg: 6.321 ± 0.703
6.249LeuSer: 6.249 ± 0.492
4.577LeuThr: 4.577 ± 0.435
4.723LeuVal: 4.723 ± 0.671
0.799LeuTrp: 0.799 ± 0.231
2.252LeuTyr: 2.252 ± 0.452
0.0LeuXaa: 0.0 ± 0.0
Met
2.979MetAla: 2.979 ± 0.449
0.073MetCys: 0.073 ± 0.065
2.398MetAsp: 2.398 ± 0.452
1.381MetGlu: 1.381 ± 0.314
0.799MetPhe: 0.799 ± 0.248
2.18MetGly: 2.18 ± 0.427
0.581MetHis: 0.581 ± 0.2
1.09MetIle: 1.09 ± 0.27
1.09MetLys: 1.09 ± 0.295
2.107MetLeu: 2.107 ± 0.219
1.09MetMet: 1.09 ± 0.244
0.581MetAsn: 0.581 ± 0.218
0.799MetPro: 0.799 ± 0.265
1.017MetGln: 1.017 ± 0.263
2.325MetArg: 2.325 ± 0.402
2.616MetSer: 2.616 ± 0.359
1.962MetThr: 1.962 ± 0.363
1.235MetVal: 1.235 ± 0.396
0.218MetTrp: 0.218 ± 0.122
0.799MetTyr: 0.799 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
4.287AsnAla: 4.287 ± 0.697
0.291AsnCys: 0.291 ± 0.153
2.252AsnAsp: 2.252 ± 0.442
2.325AsnGlu: 2.325 ± 0.402
1.017AsnPhe: 1.017 ± 0.234
4.36AsnGly: 4.36 ± 0.503
0.727AsnHis: 0.727 ± 0.239
1.598AsnIle: 1.598 ± 0.324
2.761AsnLys: 2.761 ± 0.488
2.906AsnLeu: 2.906 ± 0.404
1.09AsnMet: 1.09 ± 0.283
1.889AsnAsn: 1.889 ± 0.437
2.761AsnPro: 2.761 ± 0.332
1.235AsnGln: 1.235 ± 0.261
2.761AsnArg: 2.761 ± 0.67
2.616AsnSer: 2.616 ± 0.403
2.761AsnThr: 2.761 ± 0.391
2.034AsnVal: 2.034 ± 0.362
0.945AsnTrp: 0.945 ± 0.231
1.381AsnTyr: 1.381 ± 0.325
0.0AsnXaa: 0.0 ± 0.0
Pro
4.795ProAla: 4.795 ± 0.87
0.363ProCys: 0.363 ± 0.16
2.979ProAsp: 2.979 ± 0.453
2.834ProGlu: 2.834 ± 0.499
0.945ProPhe: 0.945 ± 0.334
3.415ProGly: 3.415 ± 0.599
0.799ProHis: 0.799 ± 0.285
1.235ProIle: 1.235 ± 0.314
1.744ProLys: 1.744 ± 0.316
3.56ProLeu: 3.56 ± 0.494
0.945ProMet: 0.945 ± 0.301
1.526ProAsn: 1.526 ± 0.226
1.598ProPro: 1.598 ± 0.433
1.816ProGln: 1.816 ± 0.258
1.526ProArg: 1.526 ± 0.283
2.398ProSer: 2.398 ± 0.446
3.124ProThr: 3.124 ± 0.493
3.778ProVal: 3.778 ± 0.411
0.799ProTrp: 0.799 ± 0.201
0.872ProTyr: 0.872 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
5.231GlnAla: 5.231 ± 0.77
0.363GlnCys: 0.363 ± 0.139
2.398GlnAsp: 2.398 ± 0.375
1.453GlnGlu: 1.453 ± 0.368
1.671GlnPhe: 1.671 ± 0.293
3.706GlnGly: 3.706 ± 0.422
0.581GlnHis: 0.581 ± 0.211
2.18GlnIle: 2.18 ± 0.344
1.235GlnLys: 1.235 ± 0.257
4.505GlnLeu: 4.505 ± 0.68
1.308GlnMet: 1.308 ± 0.297
1.308GlnAsn: 1.308 ± 0.375
1.235GlnPro: 1.235 ± 0.33
3.488GlnGln: 3.488 ± 0.7
3.052GlnArg: 3.052 ± 0.693
2.543GlnSer: 2.543 ± 0.475
3.197GlnThr: 3.197 ± 0.548
2.543GlnVal: 2.543 ± 0.5
0.727GlnTrp: 0.727 ± 0.222
1.598GlnTyr: 1.598 ± 0.355
0.0GlnXaa: 0.0 ± 0.0
Arg
6.612ArgAla: 6.612 ± 0.914
0.654ArgCys: 0.654 ± 0.229
3.56ArgAsp: 3.56 ± 0.397
3.197ArgGlu: 3.197 ± 0.52
2.18ArgPhe: 2.18 ± 0.344
4.723ArgGly: 4.723 ± 0.821
1.308ArgHis: 1.308 ± 0.281
3.27ArgIle: 3.27 ± 0.577
3.851ArgLys: 3.851 ± 0.547
4.287ArgLeu: 4.287 ± 0.604
2.47ArgMet: 2.47 ± 0.4
2.47ArgAsn: 2.47 ± 0.327
2.034ArgPro: 2.034 ± 0.353
2.543ArgGln: 2.543 ± 0.416
3.706ArgArg: 3.706 ± 0.822
2.47ArgSer: 2.47 ± 0.453
3.052ArgThr: 3.052 ± 0.333
4.577ArgVal: 4.577 ± 0.576
0.581ArgTrp: 0.581 ± 0.19
1.816ArgTyr: 1.816 ± 0.3
0.0ArgXaa: 0.0 ± 0.0
Ser
5.958SerAla: 5.958 ± 0.6
0.799SerCys: 0.799 ± 0.271
3.052SerAsp: 3.052 ± 0.393
3.488SerGlu: 3.488 ± 0.365
2.398SerPhe: 2.398 ± 0.563
5.377SerGly: 5.377 ± 0.835
0.799SerHis: 0.799 ± 0.234
3.197SerIle: 3.197 ± 0.486
3.052SerLys: 3.052 ± 0.436
4.723SerLeu: 4.723 ± 0.481
1.163SerMet: 1.163 ± 0.286
2.18SerAsn: 2.18 ± 0.405
2.616SerPro: 2.616 ± 0.368
2.543SerGln: 2.543 ± 0.385
3.488SerArg: 3.488 ± 0.463
3.778SerSer: 3.778 ± 0.61
3.706SerThr: 3.706 ± 0.463
5.013SerVal: 5.013 ± 0.709
1.163SerTrp: 1.163 ± 0.299
1.308SerTyr: 1.308 ± 0.291
0.0SerXaa: 0.0 ± 0.0
Thr
6.685ThrAla: 6.685 ± 0.707
0.363ThrCys: 0.363 ± 0.135
3.27ThrAsp: 3.27 ± 0.373
3.124ThrGlu: 3.124 ± 0.532
2.107ThrPhe: 2.107 ± 0.316
6.903ThrGly: 6.903 ± 0.621
1.017ThrHis: 1.017 ± 0.293
2.18ThrIle: 2.18 ± 0.322
3.27ThrLys: 3.27 ± 0.481
4.36ThrLeu: 4.36 ± 0.552
1.526ThrMet: 1.526 ± 0.29
2.834ThrAsn: 2.834 ± 0.4
2.834ThrPro: 2.834 ± 0.61
3.197ThrGln: 3.197 ± 0.39
2.834ThrArg: 2.834 ± 0.524
4.36ThrSer: 4.36 ± 0.702
4.432ThrThr: 4.432 ± 0.699
4.214ThrVal: 4.214 ± 0.628
1.453ThrTrp: 1.453 ± 0.332
2.906ThrTyr: 2.906 ± 0.528
0.0ThrXaa: 0.0 ± 0.0
Val
6.975ValAla: 6.975 ± 0.846
0.436ValCys: 0.436 ± 0.176
3.924ValAsp: 3.924 ± 0.599
4.287ValGlu: 4.287 ± 0.402
1.744ValPhe: 1.744 ± 0.439
6.685ValGly: 6.685 ± 0.738
1.308ValHis: 1.308 ± 0.313
3.27ValIle: 3.27 ± 0.482
2.979ValLys: 2.979 ± 0.443
6.103ValLeu: 6.103 ± 0.606
1.526ValMet: 1.526 ± 0.366
3.56ValAsn: 3.56 ± 0.667
3.633ValPro: 3.633 ± 0.373
2.47ValGln: 2.47 ± 0.359
4.214ValArg: 4.214 ± 0.61
3.488ValSer: 3.488 ± 0.641
4.868ValThr: 4.868 ± 0.783
3.924ValVal: 3.924 ± 0.59
1.235ValTrp: 1.235 ± 0.264
2.543ValTyr: 2.543 ± 0.453
0.0ValXaa: 0.0 ± 0.0
Trp
1.598TrpAla: 1.598 ± 0.283
0.363TrpCys: 0.363 ± 0.159
0.945TrpAsp: 0.945 ± 0.312
1.163TrpGlu: 1.163 ± 0.359
0.727TrpPhe: 0.727 ± 0.175
0.727TrpGly: 0.727 ± 0.25
0.363TrpHis: 0.363 ± 0.165
1.163TrpIle: 1.163 ± 0.296
1.017TrpLys: 1.017 ± 0.276
1.453TrpLeu: 1.453 ± 0.332
0.727TrpMet: 0.727 ± 0.329
0.799TrpAsn: 0.799 ± 0.235
0.509TrpPro: 0.509 ± 0.193
0.654TrpGln: 0.654 ± 0.166
0.654TrpArg: 0.654 ± 0.172
0.799TrpSer: 0.799 ± 0.245
0.727TrpThr: 0.727 ± 0.189
0.872TrpVal: 0.872 ± 0.195
0.291TrpTrp: 0.291 ± 0.123
0.727TrpTyr: 0.727 ± 0.175
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.761TyrAla: 2.761 ± 0.393
0.291TyrCys: 0.291 ± 0.166
2.325TyrAsp: 2.325 ± 0.326
2.18TyrGlu: 2.18 ± 0.401
1.235TyrPhe: 1.235 ± 0.309
2.761TyrGly: 2.761 ± 0.363
0.654TyrHis: 0.654 ± 0.21
1.671TyrIle: 1.671 ± 0.503
1.235TyrLys: 1.235 ± 0.353
3.124TyrLeu: 3.124 ± 0.5
0.727TyrMet: 0.727 ± 0.234
1.744TyrAsn: 1.744 ± 0.297
0.872TyrPro: 0.872 ± 0.28
1.598TyrGln: 1.598 ± 0.393
2.18TyrArg: 2.18 ± 0.299
2.47TyrSer: 2.47 ± 0.396
1.744TyrThr: 1.744 ± 0.407
2.18TyrVal: 2.18 ± 0.324
0.799TyrTrp: 0.799 ± 0.303
1.017TyrTyr: 1.017 ± 0.318
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (13764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski