Amino acid dipepetide frequency for Xanthomonas phage PPDBI

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.864AlaAla: 12.864 ± 1.152
1.375AlaCys: 1.375 ± 0.371
6.311AlaAsp: 6.311 ± 0.763
6.553AlaGlu: 6.553 ± 1.065
3.155AlaPhe: 3.155 ± 0.441
7.362AlaGly: 7.362 ± 1.186
1.294AlaHis: 1.294 ± 0.278
6.715AlaIle: 6.715 ± 0.845
5.178AlaLys: 5.178 ± 0.714
9.709AlaLeu: 9.709 ± 0.863
3.722AlaMet: 3.722 ± 0.538
4.288AlaAsn: 4.288 ± 0.754
4.693AlaPro: 4.693 ± 0.619
5.583AlaGln: 5.583 ± 0.794
5.663AlaArg: 5.663 ± 0.533
6.392AlaSer: 6.392 ± 1.05
5.987AlaThr: 5.987 ± 0.984
6.877AlaVal: 6.877 ± 0.569
1.942AlaTrp: 1.942 ± 0.455
3.155AlaTyr: 3.155 ± 0.372
0.0AlaXaa: 0.0 ± 0.0
Cys
0.89CysAla: 0.89 ± 0.232
0.081CysCys: 0.081 ± 0.069
1.133CysAsp: 1.133 ± 0.329
0.728CysGlu: 0.728 ± 0.265
0.162CysPhe: 0.162 ± 0.112
1.618CysGly: 1.618 ± 0.361
0.324CysHis: 0.324 ± 0.16
1.214CysIle: 1.214 ± 0.275
0.566CysLys: 0.566 ± 0.207
0.485CysLeu: 0.485 ± 0.17
0.566CysMet: 0.566 ± 0.228
0.243CysAsn: 0.243 ± 0.127
0.405CysPro: 0.405 ± 0.169
0.243CysGln: 0.243 ± 0.126
0.809CysArg: 0.809 ± 0.237
0.566CysSer: 0.566 ± 0.211
0.485CysThr: 0.485 ± 0.174
0.809CysVal: 0.809 ± 0.236
0.324CysTrp: 0.324 ± 0.154
0.405CysTyr: 0.405 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
6.472AspAla: 6.472 ± 0.666
0.89AspCys: 0.89 ± 0.244
2.994AspAsp: 2.994 ± 0.623
3.56AspGlu: 3.56 ± 0.504
2.913AspPhe: 2.913 ± 0.436
4.612AspGly: 4.612 ± 0.68
0.89AspHis: 0.89 ± 0.283
2.346AspIle: 2.346 ± 0.454
1.78AspLys: 1.78 ± 0.387
3.964AspLeu: 3.964 ± 0.593
2.023AspMet: 2.023 ± 0.47
1.942AspAsn: 1.942 ± 0.399
3.155AspPro: 3.155 ± 0.6
3.155AspGln: 3.155 ± 0.492
3.317AspArg: 3.317 ± 0.536
3.398AspSer: 3.398 ± 0.564
3.074AspThr: 3.074 ± 0.594
3.479AspVal: 3.479 ± 0.518
1.537AspTrp: 1.537 ± 0.325
1.133AspTyr: 1.133 ± 0.335
0.0AspXaa: 0.0 ± 0.0
Glu
5.906GluAla: 5.906 ± 0.728
0.809GluCys: 0.809 ± 0.233
3.479GluAsp: 3.479 ± 0.645
4.126GluGlu: 4.126 ± 0.778
1.942GluPhe: 1.942 ± 0.424
2.589GluGly: 2.589 ± 0.538
1.942GluHis: 1.942 ± 0.302
2.913GluIle: 2.913 ± 0.532
2.913GluLys: 2.913 ± 0.457
5.502GluLeu: 5.502 ± 0.774
1.78GluMet: 1.78 ± 0.461
1.618GluAsn: 1.618 ± 0.419
2.104GluPro: 2.104 ± 0.385
2.67GluGln: 2.67 ± 0.557
4.854GluArg: 4.854 ± 0.534
2.427GluSer: 2.427 ± 0.365
3.479GluThr: 3.479 ± 0.652
3.883GluVal: 3.883 ± 0.596
1.214GluTrp: 1.214 ± 0.351
1.618GluTyr: 1.618 ± 0.333
0.0GluXaa: 0.0 ± 0.0
Phe
3.479PheAla: 3.479 ± 0.57
0.485PheCys: 0.485 ± 0.165
2.67PheAsp: 2.67 ± 0.547
1.456PheGlu: 1.456 ± 0.35
0.728PhePhe: 0.728 ± 0.211
2.751PheGly: 2.751 ± 0.558
0.728PheHis: 0.728 ± 0.255
2.023PheIle: 2.023 ± 0.503
0.971PheLys: 0.971 ± 0.263
2.265PheLeu: 2.265 ± 0.447
0.566PheMet: 0.566 ± 0.231
1.375PheAsn: 1.375 ± 0.386
1.942PhePro: 1.942 ± 0.474
1.537PheGln: 1.537 ± 0.378
1.375PheArg: 1.375 ± 0.318
1.942PheSer: 1.942 ± 0.59
2.346PheThr: 2.346 ± 0.472
2.265PheVal: 2.265 ± 0.468
0.405PheTrp: 0.405 ± 0.155
1.133PheTyr: 1.133 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
7.767GlyAla: 7.767 ± 0.947
1.294GlyCys: 1.294 ± 0.304
3.074GlyAsp: 3.074 ± 0.458
4.693GlyGlu: 4.693 ± 0.499
3.56GlyPhe: 3.56 ± 0.571
6.634GlyGly: 6.634 ± 1.179
1.294GlyHis: 1.294 ± 0.373
3.074GlyIle: 3.074 ± 0.459
4.369GlyLys: 4.369 ± 0.58
5.744GlyLeu: 5.744 ± 0.683
2.023GlyMet: 2.023 ± 0.538
3.803GlyAsn: 3.803 ± 0.547
2.913GlyPro: 2.913 ± 0.458
3.398GlyGln: 3.398 ± 0.609
3.722GlyArg: 3.722 ± 0.524
4.693GlySer: 4.693 ± 0.657
6.553GlyThr: 6.553 ± 0.976
5.178GlyVal: 5.178 ± 0.77
2.265GlyTrp: 2.265 ± 0.417
2.67GlyTyr: 2.67 ± 0.386
0.0GlyXaa: 0.0 ± 0.0
His
1.78HisAla: 1.78 ± 0.548
0.485HisCys: 0.485 ± 0.192
0.728HisAsp: 0.728 ± 0.259
1.618HisGlu: 1.618 ± 0.348
0.89HisPhe: 0.89 ± 0.24
1.861HisGly: 1.861 ± 0.413
0.405HisHis: 0.405 ± 0.186
0.89HisIle: 0.89 ± 0.288
1.052HisLys: 1.052 ± 0.292
1.699HisLeu: 1.699 ± 0.311
0.728HisMet: 0.728 ± 0.262
0.485HisAsn: 0.485 ± 0.199
1.133HisPro: 1.133 ± 0.338
0.647HisGln: 0.647 ± 0.218
0.809HisArg: 0.809 ± 0.271
1.052HisSer: 1.052 ± 0.332
0.566HisThr: 0.566 ± 0.201
1.456HisVal: 1.456 ± 0.412
0.647HisTrp: 0.647 ± 0.261
0.405HisTyr: 0.405 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
6.149IleAla: 6.149 ± 0.844
0.566IleCys: 0.566 ± 0.237
3.479IleAsp: 3.479 ± 0.476
3.964IleGlu: 3.964 ± 0.812
1.375IlePhe: 1.375 ± 0.296
4.693IleGly: 4.693 ± 0.623
0.647IleHis: 0.647 ± 0.218
1.78IleIle: 1.78 ± 0.358
2.508IleLys: 2.508 ± 0.483
3.155IleLeu: 3.155 ± 0.608
0.971IleMet: 0.971 ± 0.3
1.456IleAsn: 1.456 ± 0.417
2.751IlePro: 2.751 ± 0.473
2.184IleGln: 2.184 ± 0.453
2.67IleArg: 2.67 ± 0.506
2.994IleSer: 2.994 ± 0.576
2.994IleThr: 2.994 ± 0.489
3.883IleVal: 3.883 ± 0.586
0.647IleTrp: 0.647 ± 0.2
1.133IleTyr: 1.133 ± 0.337
0.0IleXaa: 0.0 ± 0.0
Lys
5.259LysAla: 5.259 ± 0.775
0.566LysCys: 0.566 ± 0.204
2.589LysAsp: 2.589 ± 0.393
3.155LysGlu: 3.155 ± 0.522
1.456LysPhe: 1.456 ± 0.327
2.913LysGly: 2.913 ± 0.539
1.375LysHis: 1.375 ± 0.439
2.104LysIle: 2.104 ± 0.396
2.913LysLys: 2.913 ± 0.605
3.398LysLeu: 3.398 ± 0.515
1.133LysMet: 1.133 ± 0.241
1.699LysAsn: 1.699 ± 0.396
2.994LysPro: 2.994 ± 0.556
2.184LysGln: 2.184 ± 0.413
3.641LysArg: 3.641 ± 0.783
2.508LysSer: 2.508 ± 0.563
2.994LysThr: 2.994 ± 0.497
3.398LysVal: 3.398 ± 0.422
0.647LysTrp: 0.647 ± 0.243
0.809LysTyr: 0.809 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
8.576LeuAla: 8.576 ± 0.844
1.052LeuCys: 1.052 ± 0.253
4.207LeuAsp: 4.207 ± 0.501
3.803LeuGlu: 3.803 ± 0.582
2.346LeuPhe: 2.346 ± 0.517
6.392LeuGly: 6.392 ± 0.768
1.294LeuHis: 1.294 ± 0.417
3.641LeuIle: 3.641 ± 0.515
3.964LeuLys: 3.964 ± 0.529
6.392LeuLeu: 6.392 ± 0.769
2.346LeuMet: 2.346 ± 0.465
4.045LeuAsn: 4.045 ± 0.638
3.883LeuPro: 3.883 ± 0.584
3.641LeuGln: 3.641 ± 0.688
5.016LeuArg: 5.016 ± 0.844
6.149LeuSer: 6.149 ± 0.656
4.288LeuThr: 4.288 ± 0.64
4.773LeuVal: 4.773 ± 0.608
1.618LeuTrp: 1.618 ± 0.389
2.104LeuTyr: 2.104 ± 0.426
0.0LeuXaa: 0.0 ± 0.0
Met
4.126MetAla: 4.126 ± 0.601
0.162MetCys: 0.162 ± 0.12
1.78MetAsp: 1.78 ± 0.452
1.456MetGlu: 1.456 ± 0.356
0.485MetPhe: 0.485 ± 0.205
1.861MetGly: 1.861 ± 0.401
0.485MetHis: 0.485 ± 0.175
0.971MetIle: 0.971 ± 0.304
1.699MetLys: 1.699 ± 0.353
2.104MetLeu: 2.104 ± 0.391
0.324MetMet: 0.324 ± 0.163
1.294MetAsn: 1.294 ± 0.303
1.942MetPro: 1.942 ± 0.399
2.751MetGln: 2.751 ± 0.565
1.942MetArg: 1.942 ± 0.381
1.942MetSer: 1.942 ± 0.375
1.699MetThr: 1.699 ± 0.385
1.214MetVal: 1.214 ± 0.305
0.405MetTrp: 0.405 ± 0.15
0.324MetTyr: 0.324 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
4.369AsnAla: 4.369 ± 0.687
0.405AsnCys: 0.405 ± 0.165
2.104AsnAsp: 2.104 ± 0.397
1.942AsnGlu: 1.942 ± 0.383
1.214AsnPhe: 1.214 ± 0.302
3.883AsnGly: 3.883 ± 0.671
0.89AsnHis: 0.89 ± 0.277
1.861AsnIle: 1.861 ± 0.506
1.942AsnLys: 1.942 ± 0.387
3.074AsnLeu: 3.074 ± 0.561
1.294AsnMet: 1.294 ± 0.277
2.913AsnAsn: 2.913 ± 0.536
3.641AsnPro: 3.641 ± 0.517
1.699AsnGln: 1.699 ± 0.469
2.184AsnArg: 2.184 ± 0.381
2.184AsnSer: 2.184 ± 0.495
2.832AsnThr: 2.832 ± 0.534
1.375AsnVal: 1.375 ± 0.489
0.566AsnTrp: 0.566 ± 0.2
1.133AsnTyr: 1.133 ± 0.293
0.0AsnXaa: 0.0 ± 0.0
Pro
5.502ProAla: 5.502 ± 0.755
0.324ProCys: 0.324 ± 0.139
3.317ProAsp: 3.317 ± 0.554
2.832ProGlu: 2.832 ± 0.599
1.861ProPhe: 1.861 ± 0.344
4.207ProGly: 4.207 ± 0.609
0.971ProHis: 0.971 ± 0.31
2.346ProIle: 2.346 ± 0.339
2.265ProLys: 2.265 ± 0.457
4.126ProLeu: 4.126 ± 0.492
1.294ProMet: 1.294 ± 0.351
1.537ProAsn: 1.537 ± 0.409
2.67ProPro: 2.67 ± 0.479
2.346ProGln: 2.346 ± 0.307
2.589ProArg: 2.589 ± 0.638
3.155ProSer: 3.155 ± 0.557
2.589ProThr: 2.589 ± 0.428
4.207ProVal: 4.207 ± 0.577
0.971ProTrp: 0.971 ± 0.265
1.052ProTyr: 1.052 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
5.825GlnAla: 5.825 ± 0.951
0.324GlnCys: 0.324 ± 0.158
1.942GlnAsp: 1.942 ± 0.36
3.317GlnGlu: 3.317 ± 0.556
1.942GlnPhe: 1.942 ± 0.483
3.074GlnGly: 3.074 ± 0.631
0.89GlnHis: 0.89 ± 0.261
2.913GlnIle: 2.913 ± 0.494
2.913GlnLys: 2.913 ± 0.68
3.479GlnLeu: 3.479 ± 0.489
1.861GlnMet: 1.861 ± 0.338
2.832GlnAsn: 2.832 ± 0.583
2.023GlnPro: 2.023 ± 0.41
3.641GlnGln: 3.641 ± 0.847
2.427GlnArg: 2.427 ± 0.42
2.346GlnSer: 2.346 ± 0.419
2.346GlnThr: 2.346 ± 0.531
2.751GlnVal: 2.751 ± 0.574
1.133GlnTrp: 1.133 ± 0.284
1.699GlnTyr: 1.699 ± 0.386
0.0GlnXaa: 0.0 ± 0.0
Arg
4.935ArgAla: 4.935 ± 0.517
0.971ArgCys: 0.971 ± 0.295
2.913ArgAsp: 2.913 ± 0.407
3.803ArgGlu: 3.803 ± 0.578
1.942ArgPhe: 1.942 ± 0.41
3.56ArgGly: 3.56 ± 0.546
2.104ArgHis: 2.104 ± 0.5
3.56ArgIle: 3.56 ± 0.578
2.346ArgLys: 2.346 ± 0.417
5.502ArgLeu: 5.502 ± 0.748
1.618ArgMet: 1.618 ± 0.445
1.456ArgAsn: 1.456 ± 0.386
2.104ArgPro: 2.104 ± 0.347
2.751ArgGln: 2.751 ± 0.551
3.317ArgArg: 3.317 ± 0.491
3.155ArgSer: 3.155 ± 0.482
3.155ArgThr: 3.155 ± 0.575
4.045ArgVal: 4.045 ± 0.568
1.052ArgTrp: 1.052 ± 0.277
2.104ArgTyr: 2.104 ± 0.472
0.0ArgXaa: 0.0 ± 0.0
Ser
6.311SerAla: 6.311 ± 1.025
0.243SerCys: 0.243 ± 0.139
3.641SerAsp: 3.641 ± 0.627
2.67SerGlu: 2.67 ± 0.484
1.618SerPhe: 1.618 ± 0.338
5.583SerGly: 5.583 ± 0.895
1.133SerHis: 1.133 ± 0.427
2.751SerIle: 2.751 ± 0.541
2.67SerLys: 2.67 ± 0.392
4.773SerLeu: 4.773 ± 0.637
1.861SerMet: 1.861 ± 0.459
3.479SerAsn: 3.479 ± 0.533
2.427SerPro: 2.427 ± 0.392
3.641SerGln: 3.641 ± 0.563
2.751SerArg: 2.751 ± 0.434
3.398SerSer: 3.398 ± 0.732
3.883SerThr: 3.883 ± 0.797
4.126SerVal: 4.126 ± 0.661
0.405SerTrp: 0.405 ± 0.173
1.456SerTyr: 1.456 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
8.172ThrAla: 8.172 ± 0.977
0.324ThrCys: 0.324 ± 0.184
3.883ThrAsp: 3.883 ± 0.606
2.265ThrGlu: 2.265 ± 0.384
1.618ThrPhe: 1.618 ± 0.411
6.068ThrGly: 6.068 ± 0.916
1.052ThrHis: 1.052 ± 0.294
3.155ThrIle: 3.155 ± 0.555
2.104ThrLys: 2.104 ± 0.355
5.097ThrLeu: 5.097 ± 0.627
1.214ThrMet: 1.214 ± 0.277
1.942ThrAsn: 1.942 ± 0.399
4.369ThrPro: 4.369 ± 0.761
2.751ThrGln: 2.751 ± 0.476
1.537ThrArg: 1.537 ± 0.329
4.045ThrSer: 4.045 ± 0.791
4.207ThrThr: 4.207 ± 0.9
3.641ThrVal: 3.641 ± 0.686
0.89ThrTrp: 0.89 ± 0.229
1.942ThrTyr: 1.942 ± 0.51
0.0ThrXaa: 0.0 ± 0.0
Val
6.715ValAla: 6.715 ± 0.831
0.971ValCys: 0.971 ± 0.291
3.56ValAsp: 3.56 ± 0.358
3.479ValGlu: 3.479 ± 0.565
1.133ValPhe: 1.133 ± 0.328
5.744ValGly: 5.744 ± 0.747
0.971ValHis: 0.971 ± 0.249
3.641ValIle: 3.641 ± 0.534
3.722ValLys: 3.722 ± 0.495
5.34ValLeu: 5.34 ± 0.612
2.184ValMet: 2.184 ± 0.451
3.398ValAsn: 3.398 ± 0.566
2.589ValPro: 2.589 ± 0.438
2.104ValGln: 2.104 ± 0.452
4.126ValArg: 4.126 ± 0.675
3.803ValSer: 3.803 ± 0.586
4.126ValThr: 4.126 ± 0.779
4.45ValVal: 4.45 ± 0.532
0.647ValTrp: 0.647 ± 0.209
1.942ValTyr: 1.942 ± 0.368
0.0ValXaa: 0.0 ± 0.0
Trp
1.214TrpAla: 1.214 ± 0.269
0.566TrpCys: 0.566 ± 0.204
0.647TrpAsp: 0.647 ± 0.238
0.647TrpGlu: 0.647 ± 0.255
0.728TrpPhe: 0.728 ± 0.227
1.618TrpGly: 1.618 ± 0.332
0.405TrpHis: 0.405 ± 0.164
1.214TrpIle: 1.214 ± 0.305
0.485TrpLys: 0.485 ± 0.192
1.861TrpLeu: 1.861 ± 0.41
0.485TrpMet: 0.485 ± 0.204
0.728TrpAsn: 0.728 ± 0.245
1.375TrpPro: 1.375 ± 0.406
0.809TrpGln: 0.809 ± 0.214
1.537TrpArg: 1.537 ± 0.333
1.375TrpSer: 1.375 ± 0.391
0.89TrpThr: 0.89 ± 0.273
0.89TrpVal: 0.89 ± 0.222
0.324TrpTrp: 0.324 ± 0.123
0.809TrpTyr: 0.809 ± 0.234
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.508TyrAla: 2.508 ± 0.475
0.243TyrCys: 0.243 ± 0.169
2.184TyrAsp: 2.184 ± 0.409
1.699TyrGlu: 1.699 ± 0.335
1.375TyrPhe: 1.375 ± 0.375
1.699TyrGly: 1.699 ± 0.4
0.243TyrHis: 0.243 ± 0.131
0.89TyrIle: 0.89 ± 0.32
1.375TyrLys: 1.375 ± 0.381
1.861TyrLeu: 1.861 ± 0.383
0.89TyrMet: 0.89 ± 0.285
0.971TyrAsn: 0.971 ± 0.335
1.052TyrPro: 1.052 ± 0.347
1.942TyrGln: 1.942 ± 0.389
2.104TyrArg: 2.104 ± 0.445
1.294TyrSer: 1.294 ± 0.303
1.78TyrThr: 1.78 ± 0.49
1.942TyrVal: 1.942 ± 0.372
0.971TyrTrp: 0.971 ± 0.266
0.89TyrTyr: 0.89 ± 0.242
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (12361 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski