Amino acid dipepetide frequency for Xanthomonas phage PBR31

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.864AlaAla: 12.864 ± 1.108
1.375AlaCys: 1.375 ± 0.349
6.311AlaAsp: 6.311 ± 0.692
6.553AlaGlu: 6.553 ± 1.093
3.155AlaPhe: 3.155 ± 0.527
7.362AlaGly: 7.362 ± 1.054
1.294AlaHis: 1.294 ± 0.273
6.715AlaIle: 6.715 ± 0.909
5.178AlaLys: 5.178 ± 0.729
9.709AlaLeu: 9.709 ± 0.821
3.641AlaMet: 3.641 ± 0.529
4.288AlaAsn: 4.288 ± 0.628
4.693AlaPro: 4.693 ± 0.533
5.583AlaGln: 5.583 ± 0.864
5.663AlaArg: 5.663 ± 0.671
6.392AlaSer: 6.392 ± 0.931
5.987AlaThr: 5.987 ± 0.929
6.877AlaVal: 6.877 ± 0.585
1.942AlaTrp: 1.942 ± 0.515
3.155AlaTyr: 3.155 ± 0.344
0.0AlaXaa: 0.0 ± 0.0
Cys
0.89CysAla: 0.89 ± 0.258
0.081CysCys: 0.081 ± 0.084
1.133CysAsp: 1.133 ± 0.295
0.728CysGlu: 0.728 ± 0.259
0.162CysPhe: 0.162 ± 0.112
1.618CysGly: 1.618 ± 0.338
0.324CysHis: 0.324 ± 0.152
1.214CysIle: 1.214 ± 0.29
0.566CysLys: 0.566 ± 0.23
0.485CysLeu: 0.485 ± 0.219
0.647CysMet: 0.647 ± 0.231
0.243CysAsn: 0.243 ± 0.133
0.405CysPro: 0.405 ± 0.187
0.243CysGln: 0.243 ± 0.122
0.809CysArg: 0.809 ± 0.286
0.566CysSer: 0.566 ± 0.211
0.485CysThr: 0.485 ± 0.191
0.809CysVal: 0.809 ± 0.231
0.324CysTrp: 0.324 ± 0.13
0.405CysTyr: 0.405 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
6.472AspAla: 6.472 ± 0.581
0.89AspCys: 0.89 ± 0.264
2.994AspAsp: 2.994 ± 0.568
3.56AspGlu: 3.56 ± 0.588
2.913AspPhe: 2.913 ± 0.512
4.612AspGly: 4.612 ± 0.697
0.89AspHis: 0.89 ± 0.267
2.346AspIle: 2.346 ± 0.441
1.78AspLys: 1.78 ± 0.406
3.964AspLeu: 3.964 ± 0.567
2.023AspMet: 2.023 ± 0.464
1.942AspAsn: 1.942 ± 0.41
3.155AspPro: 3.155 ± 0.554
3.155AspGln: 3.155 ± 0.543
3.317AspArg: 3.317 ± 0.528
3.398AspSer: 3.398 ± 0.627
3.074AspThr: 3.074 ± 0.604
3.479AspVal: 3.479 ± 0.478
1.537AspTrp: 1.537 ± 0.312
1.133AspTyr: 1.133 ± 0.351
0.0AspXaa: 0.0 ± 0.0
Glu
5.906GluAla: 5.906 ± 0.823
0.809GluCys: 0.809 ± 0.235
3.479GluAsp: 3.479 ± 0.635
4.126GluGlu: 4.126 ± 0.779
1.942GluPhe: 1.942 ± 0.429
2.589GluGly: 2.589 ± 0.474
1.942GluHis: 1.942 ± 0.398
2.913GluIle: 2.913 ± 0.499
2.913GluLys: 2.913 ± 0.482
5.502GluLeu: 5.502 ± 0.843
1.78GluMet: 1.78 ± 0.408
1.618GluAsn: 1.618 ± 0.449
2.104GluPro: 2.104 ± 0.461
2.67GluGln: 2.67 ± 0.56
4.854GluArg: 4.854 ± 0.528
2.427GluSer: 2.427 ± 0.351
3.479GluThr: 3.479 ± 0.632
3.883GluVal: 3.883 ± 0.66
1.214GluTrp: 1.214 ± 0.311
1.618GluTyr: 1.618 ± 0.363
0.0GluXaa: 0.0 ± 0.0
Phe
3.479PheAla: 3.479 ± 0.594
0.485PheCys: 0.485 ± 0.204
2.67PheAsp: 2.67 ± 0.577
1.456PheGlu: 1.456 ± 0.398
0.728PhePhe: 0.728 ± 0.207
2.751PheGly: 2.751 ± 0.559
0.728PheHis: 0.728 ± 0.254
2.023PheIle: 2.023 ± 0.47
0.971PheLys: 0.971 ± 0.258
2.265PheLeu: 2.265 ± 0.459
0.566PheMet: 0.566 ± 0.215
1.375PheAsn: 1.375 ± 0.31
1.942PhePro: 1.942 ± 0.416
1.537PheGln: 1.537 ± 0.368
1.375PheArg: 1.375 ± 0.319
1.942PheSer: 1.942 ± 0.52
2.346PheThr: 2.346 ± 0.442
2.265PheVal: 2.265 ± 0.397
0.405PheTrp: 0.405 ± 0.172
1.133PheTyr: 1.133 ± 0.41
0.0PheXaa: 0.0 ± 0.0
Gly
7.767GlyAla: 7.767 ± 0.96
1.294GlyCys: 1.294 ± 0.317
3.074GlyAsp: 3.074 ± 0.43
4.693GlyGlu: 4.693 ± 0.577
3.56GlyPhe: 3.56 ± 0.556
6.634GlyGly: 6.634 ± 1.137
1.294GlyHis: 1.294 ± 0.374
3.074GlyIle: 3.074 ± 0.543
4.369GlyLys: 4.369 ± 0.603
5.744GlyLeu: 5.744 ± 0.673
2.023GlyMet: 2.023 ± 0.491
3.803GlyAsn: 3.803 ± 0.555
2.913GlyPro: 2.913 ± 0.455
3.398GlyGln: 3.398 ± 0.635
3.722GlyArg: 3.722 ± 0.517
4.693GlySer: 4.693 ± 0.782
6.553GlyThr: 6.553 ± 1.026
5.178GlyVal: 5.178 ± 0.676
2.265GlyTrp: 2.265 ± 0.436
2.67GlyTyr: 2.67 ± 0.418
0.0GlyXaa: 0.0 ± 0.0
His
1.78HisAla: 1.78 ± 0.489
0.485HisCys: 0.485 ± 0.173
0.728HisAsp: 0.728 ± 0.282
1.618HisGlu: 1.618 ± 0.333
0.89HisPhe: 0.89 ± 0.218
1.861HisGly: 1.861 ± 0.394
0.405HisHis: 0.405 ± 0.214
0.89HisIle: 0.89 ± 0.282
1.052HisLys: 1.052 ± 0.301
1.699HisLeu: 1.699 ± 0.301
0.728HisMet: 0.728 ± 0.217
0.485HisAsn: 0.485 ± 0.185
1.133HisPro: 1.133 ± 0.292
0.647HisGln: 0.647 ± 0.222
0.809HisArg: 0.809 ± 0.266
1.052HisSer: 1.052 ± 0.305
0.566HisThr: 0.566 ± 0.203
1.456HisVal: 1.456 ± 0.371
0.647HisTrp: 0.647 ± 0.263
0.405HisTyr: 0.405 ± 0.151
0.0HisXaa: 0.0 ± 0.0
Ile
6.149IleAla: 6.149 ± 0.811
0.566IleCys: 0.566 ± 0.213
3.479IleAsp: 3.479 ± 0.473
3.964IleGlu: 3.964 ± 0.718
1.375IlePhe: 1.375 ± 0.362
4.693IleGly: 4.693 ± 0.614
0.647IleHis: 0.647 ± 0.223
1.78IleIle: 1.78 ± 0.34
2.508IleLys: 2.508 ± 0.489
3.155IleLeu: 3.155 ± 0.536
0.971IleMet: 0.971 ± 0.272
1.456IleAsn: 1.456 ± 0.417
2.751IlePro: 2.751 ± 0.472
2.184IleGln: 2.184 ± 0.405
2.67IleArg: 2.67 ± 0.456
2.994IleSer: 2.994 ± 0.55
2.994IleThr: 2.994 ± 0.511
3.883IleVal: 3.883 ± 0.589
0.647IleTrp: 0.647 ± 0.214
1.133IleTyr: 1.133 ± 0.296
0.0IleXaa: 0.0 ± 0.0
Lys
5.259LysAla: 5.259 ± 0.808
0.566LysCys: 0.566 ± 0.205
2.589LysAsp: 2.589 ± 0.451
3.155LysGlu: 3.155 ± 0.534
1.456LysPhe: 1.456 ± 0.357
2.913LysGly: 2.913 ± 0.518
1.375LysHis: 1.375 ± 0.443
2.104LysIle: 2.104 ± 0.387
2.913LysLys: 2.913 ± 0.613
3.398LysLeu: 3.398 ± 0.541
1.133LysMet: 1.133 ± 0.281
1.699LysAsn: 1.699 ± 0.401
2.994LysPro: 2.994 ± 0.515
2.184LysGln: 2.184 ± 0.398
3.641LysArg: 3.641 ± 0.679
2.508LysSer: 2.508 ± 0.58
2.994LysThr: 2.994 ± 0.536
3.398LysVal: 3.398 ± 0.451
0.647LysTrp: 0.647 ± 0.219
0.809LysTyr: 0.809 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
8.576LeuAla: 8.576 ± 0.865
1.052LeuCys: 1.052 ± 0.294
4.207LeuAsp: 4.207 ± 0.576
3.803LeuGlu: 3.803 ± 0.574
2.346LeuPhe: 2.346 ± 0.477
6.392LeuGly: 6.392 ± 0.782
1.294LeuHis: 1.294 ± 0.328
3.641LeuIle: 3.641 ± 0.559
3.964LeuLys: 3.964 ± 0.518
6.392LeuLeu: 6.392 ± 0.767
2.346LeuMet: 2.346 ± 0.476
4.045LeuAsn: 4.045 ± 0.635
3.883LeuPro: 3.883 ± 0.551
3.641LeuGln: 3.641 ± 0.693
5.016LeuArg: 5.016 ± 0.917
6.149LeuSer: 6.149 ± 0.619
4.288LeuThr: 4.288 ± 0.496
4.773LeuVal: 4.773 ± 0.53
1.618LeuTrp: 1.618 ± 0.366
2.104LeuTyr: 2.104 ± 0.418
0.0LeuXaa: 0.0 ± 0.0
Met
4.126MetAla: 4.126 ± 0.626
0.162MetCys: 0.162 ± 0.102
1.78MetAsp: 1.78 ± 0.458
1.456MetGlu: 1.456 ± 0.403
0.485MetPhe: 0.485 ± 0.216
1.861MetGly: 1.861 ± 0.46
0.485MetHis: 0.485 ± 0.227
0.971MetIle: 0.971 ± 0.341
1.699MetLys: 1.699 ± 0.42
2.104MetLeu: 2.104 ± 0.396
0.324MetMet: 0.324 ± 0.163
1.294MetAsn: 1.294 ± 0.301
1.942MetPro: 1.942 ± 0.458
2.751MetGln: 2.751 ± 0.548
1.942MetArg: 1.942 ± 0.325
1.942MetSer: 1.942 ± 0.394
1.699MetThr: 1.699 ± 0.419
1.214MetVal: 1.214 ± 0.318
0.405MetTrp: 0.405 ± 0.154
0.324MetTyr: 0.324 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
4.369AsnAla: 4.369 ± 0.544
0.405AsnCys: 0.405 ± 0.179
2.104AsnAsp: 2.104 ± 0.375
1.942AsnGlu: 1.942 ± 0.371
1.214AsnPhe: 1.214 ± 0.32
3.883AsnGly: 3.883 ± 0.627
0.89AsnHis: 0.89 ± 0.28
1.861AsnIle: 1.861 ± 0.516
1.942AsnLys: 1.942 ± 0.413
3.074AsnLeu: 3.074 ± 0.489
1.294AsnMet: 1.294 ± 0.323
2.913AsnAsn: 2.913 ± 0.656
3.641AsnPro: 3.641 ± 0.518
1.699AsnGln: 1.699 ± 0.396
2.184AsnArg: 2.184 ± 0.429
2.184AsnSer: 2.184 ± 0.395
2.832AsnThr: 2.832 ± 0.595
1.375AsnVal: 1.375 ± 0.509
0.566AsnTrp: 0.566 ± 0.189
1.133AsnTyr: 1.133 ± 0.335
0.0AsnXaa: 0.0 ± 0.0
Pro
5.502ProAla: 5.502 ± 0.675
0.324ProCys: 0.324 ± 0.14
3.317ProAsp: 3.317 ± 0.476
2.832ProGlu: 2.832 ± 0.617
1.861ProPhe: 1.861 ± 0.309
4.207ProGly: 4.207 ± 0.669
0.971ProHis: 0.971 ± 0.32
2.346ProIle: 2.346 ± 0.307
2.265ProLys: 2.265 ± 0.52
4.126ProLeu: 4.126 ± 0.472
1.294ProMet: 1.294 ± 0.302
1.537ProAsn: 1.537 ± 0.393
2.67ProPro: 2.67 ± 0.445
2.346ProGln: 2.346 ± 0.358
2.589ProArg: 2.589 ± 0.524
3.155ProSer: 3.155 ± 0.56
2.589ProThr: 2.589 ± 0.392
4.207ProVal: 4.207 ± 0.632
0.971ProTrp: 0.971 ± 0.257
1.052ProTyr: 1.052 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
5.825GlnAla: 5.825 ± 1.018
0.324GlnCys: 0.324 ± 0.171
1.942GlnAsp: 1.942 ± 0.378
3.317GlnGlu: 3.317 ± 0.552
1.942GlnPhe: 1.942 ± 0.474
3.074GlnGly: 3.074 ± 0.619
0.89GlnHis: 0.89 ± 0.252
2.913GlnIle: 2.913 ± 0.582
2.913GlnLys: 2.913 ± 0.607
3.479GlnLeu: 3.479 ± 0.537
1.861GlnMet: 1.861 ± 0.382
2.832GlnAsn: 2.832 ± 0.661
2.023GlnPro: 2.023 ± 0.423
3.641GlnGln: 3.641 ± 0.843
2.427GlnArg: 2.427 ± 0.457
2.346GlnSer: 2.346 ± 0.455
2.346GlnThr: 2.346 ± 0.505
2.751GlnVal: 2.751 ± 0.499
1.133GlnTrp: 1.133 ± 0.254
1.699GlnTyr: 1.699 ± 0.378
0.0GlnXaa: 0.0 ± 0.0
Arg
4.935ArgAla: 4.935 ± 0.6
0.971ArgCys: 0.971 ± 0.254
2.913ArgAsp: 2.913 ± 0.4
3.803ArgGlu: 3.803 ± 0.565
1.942ArgPhe: 1.942 ± 0.393
3.56ArgGly: 3.56 ± 0.655
2.104ArgHis: 2.104 ± 0.433
3.56ArgIle: 3.56 ± 0.579
2.346ArgLys: 2.346 ± 0.384
5.502ArgLeu: 5.502 ± 0.778
1.618ArgMet: 1.618 ± 0.465
1.456ArgAsn: 1.456 ± 0.422
2.104ArgPro: 2.104 ± 0.342
2.751ArgGln: 2.751 ± 0.597
3.317ArgArg: 3.317 ± 0.513
3.155ArgSer: 3.155 ± 0.495
3.155ArgThr: 3.155 ± 0.493
4.045ArgVal: 4.045 ± 0.542
1.052ArgTrp: 1.052 ± 0.278
2.104ArgTyr: 2.104 ± 0.429
0.0ArgXaa: 0.0 ± 0.0
Ser
6.311SerAla: 6.311 ± 1.014
0.243SerCys: 0.243 ± 0.142
3.641SerAsp: 3.641 ± 0.55
2.67SerGlu: 2.67 ± 0.451
1.618SerPhe: 1.618 ± 0.345
5.583SerGly: 5.583 ± 0.823
1.133SerHis: 1.133 ± 0.378
2.751SerIle: 2.751 ± 0.506
2.67SerLys: 2.67 ± 0.419
4.773SerLeu: 4.773 ± 0.668
1.861SerMet: 1.861 ± 0.459
3.479SerAsn: 3.479 ± 0.547
2.427SerPro: 2.427 ± 0.331
3.641SerGln: 3.641 ± 0.539
2.751SerArg: 2.751 ± 0.348
3.398SerSer: 3.398 ± 0.606
3.883SerThr: 3.883 ± 0.752
4.126SerVal: 4.126 ± 0.645
0.405SerTrp: 0.405 ± 0.189
1.456SerTyr: 1.456 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
8.172ThrAla: 8.172 ± 0.977
0.324ThrCys: 0.324 ± 0.236
3.883ThrAsp: 3.883 ± 0.567
2.265ThrGlu: 2.265 ± 0.46
1.618ThrPhe: 1.618 ± 0.451
6.068ThrGly: 6.068 ± 1.023
1.052ThrHis: 1.052 ± 0.322
3.155ThrIle: 3.155 ± 0.589
2.104ThrLys: 2.104 ± 0.423
5.097ThrLeu: 5.097 ± 0.72
1.214ThrMet: 1.214 ± 0.307
1.942ThrAsn: 1.942 ± 0.417
4.369ThrPro: 4.369 ± 0.721
2.751ThrGln: 2.751 ± 0.492
1.537ThrArg: 1.537 ± 0.312
4.045ThrSer: 4.045 ± 0.707
4.207ThrThr: 4.207 ± 0.683
3.641ThrVal: 3.641 ± 0.6
0.89ThrTrp: 0.89 ± 0.251
1.942ThrTyr: 1.942 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
6.715ValAla: 6.715 ± 0.882
0.971ValCys: 0.971 ± 0.267
3.56ValAsp: 3.56 ± 0.388
3.479ValGlu: 3.479 ± 0.488
1.133ValPhe: 1.133 ± 0.313
5.744ValGly: 5.744 ± 0.709
0.971ValHis: 0.971 ± 0.242
3.641ValIle: 3.641 ± 0.604
3.722ValLys: 3.722 ± 0.499
5.34ValLeu: 5.34 ± 0.602
2.184ValMet: 2.184 ± 0.395
3.398ValAsn: 3.398 ± 0.593
2.589ValPro: 2.589 ± 0.459
2.104ValGln: 2.104 ± 0.431
4.126ValArg: 4.126 ± 0.764
3.803ValSer: 3.803 ± 0.51
4.126ValThr: 4.126 ± 0.777
4.45ValVal: 4.45 ± 0.545
0.647ValTrp: 0.647 ± 0.231
1.942ValTyr: 1.942 ± 0.429
0.0ValXaa: 0.0 ± 0.0
Trp
1.214TrpAla: 1.214 ± 0.267
0.566TrpCys: 0.566 ± 0.205
0.647TrpAsp: 0.647 ± 0.226
0.647TrpGlu: 0.647 ± 0.22
0.728TrpPhe: 0.728 ± 0.242
1.618TrpGly: 1.618 ± 0.318
0.405TrpHis: 0.405 ± 0.182
1.214TrpIle: 1.214 ± 0.278
0.485TrpLys: 0.485 ± 0.166
1.861TrpLeu: 1.861 ± 0.392
0.485TrpMet: 0.485 ± 0.202
0.728TrpAsn: 0.728 ± 0.212
1.375TrpPro: 1.375 ± 0.402
0.809TrpGln: 0.809 ± 0.226
1.537TrpArg: 1.537 ± 0.305
1.375TrpSer: 1.375 ± 0.369
0.89TrpThr: 0.89 ± 0.276
0.89TrpVal: 0.89 ± 0.264
0.324TrpTrp: 0.324 ± 0.163
0.809TrpTyr: 0.809 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.508TyrAla: 2.508 ± 0.556
0.243TyrCys: 0.243 ± 0.164
2.184TyrAsp: 2.184 ± 0.454
1.699TyrGlu: 1.699 ± 0.312
1.375TyrPhe: 1.375 ± 0.365
1.699TyrGly: 1.699 ± 0.382
0.243TyrHis: 0.243 ± 0.142
0.89TyrIle: 0.89 ± 0.319
1.375TyrLys: 1.375 ± 0.366
1.861TyrLeu: 1.861 ± 0.361
0.89TyrMet: 0.89 ± 0.256
0.971TyrAsn: 0.971 ± 0.3
1.052TyrPro: 1.052 ± 0.29
1.942TyrGln: 1.942 ± 0.401
2.104TyrArg: 2.104 ± 0.398
1.294TyrSer: 1.294 ± 0.305
1.78TyrThr: 1.78 ± 0.515
1.942TyrVal: 1.942 ± 0.346
0.971TyrTrp: 0.971 ± 0.29
0.89TyrTyr: 0.89 ± 0.212
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12361 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski