Amino acid dipeptide frequency for Xanthomonas phage XPV3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
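The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names `dipeptide_permille` and `colour`, the toy sequences, the use of overlapping residue pairs within each protein, and the mapping of every non-standard letter to `X` (Xaa) are all choices made for this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Assumption: anything outside the 20 standard residues is Xaa.
            pair = "".join(c if c in AMINO_ACIDS else "X" for c in pair)
            counts[pair] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def colour(freq_permille, expected=1000.0 / 400):
    """Red above the uniform expectation (2.5 per mille), blue below it."""
    if freq_permille > expected:
        return "red"
    if freq_permille < expected:
        return "blue"
    return "none"


toy = ["MAAG", "AAL"]  # made-up sequences, not from the phage proteome
freqs = dipeptide_permille(toy)
print(len(freqs), freqs["AA"], colour(freqs["AA"]))
```

With the two toy sequences there are five overlapping pairs, two of which are AlaAla, so that dipeptide comes out at 400‰ and would be marked red.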

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions; for example, 17.1‰ corresponds to 0.0171 (1.71%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 17.1 ± 1.088
AlaCys: 0.961 ± 0.246
AlaAsp: 7.486 ± 0.838
AlaGlu: 5.288 ± 0.822
AlaPhe: 2.884 ± 0.492
AlaGly: 9.889 ± 1.075
AlaHis: 2.266 ± 0.409
AlaIle: 4.945 ± 0.559
AlaLys: 4.67 ± 0.628
AlaLeu: 12.43 ± 0.92
AlaMet: 3.159 ± 0.415
AlaAsn: 4.464 ± 0.826
AlaPro: 6.73 ± 0.758
AlaGln: 6.112 ± 0.685
AlaArg: 7.692 ± 1.003
AlaSer: 6.593 ± 0.786
AlaThr: 8.379 ± 0.866
AlaVal: 7.829 ± 0.741
AlaTrp: 1.854 ± 0.352
AlaTyr: 3.434 ± 0.488
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.755 ± 0.231
CysCys: 0.069 ± 0.07
CysAsp: 0.618 ± 0.213
CysGlu: 0.412 ± 0.164
CysPhe: 0.137 ± 0.1
CysGly: 0.549 ± 0.177
CysHis: 0.069 ± 0.075
CysIle: 0.343 ± 0.124
CysLys: 0.275 ± 0.155
CysLeu: 0.687 ± 0.199
CysMet: 0.206 ± 0.147
CysAsn: 0.0 ± 0.0
CysPro: 0.618 ± 0.221
CysGln: 0.275 ± 0.137
CysArg: 0.824 ± 0.216
CysSer: 0.755 ± 0.218
CysThr: 0.412 ± 0.16
CysVal: 0.481 ± 0.154
CysTrp: 0.069 ± 0.063
CysTyr: 0.275 ± 0.134
CysXaa: 0.0 ± 0.0

Asp
AspAla: 8.653 ± 0.773
AspCys: 0.755 ± 0.241
AspAsp: 4.395 ± 0.767
AspGlu: 2.541 ± 0.442
AspPhe: 1.992 ± 0.356
AspGly: 4.807 ± 0.429
AspHis: 0.206 ± 0.103
AspIle: 2.884 ± 0.538
AspLys: 2.06 ± 0.366
AspLeu: 6.936 ± 0.847
AspMet: 1.305 ± 0.335
AspAsn: 2.198 ± 0.312
AspPro: 3.777 ± 0.64
AspGln: 2.335 ± 0.425
AspArg: 3.228 ± 0.387
AspSer: 3.296 ± 0.55
AspThr: 3.503 ± 0.593
AspVal: 5.425 ± 0.553
AspTrp: 1.786 ± 0.421
AspTyr: 2.61 ± 0.402
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.662 ± 0.857
GluCys: 0.343 ± 0.144
GluAsp: 2.404 ± 0.431
GluGlu: 1.58 ± 0.352
GluPhe: 1.305 ± 0.314
GluGly: 3.159 ± 0.519
GluHis: 0.755 ± 0.249
GluIle: 2.129 ± 0.484
GluLys: 0.893 ± 0.244
GluLeu: 5.631 ± 0.793
GluMet: 0.961 ± 0.263
GluAsn: 0.687 ± 0.261
GluPro: 2.335 ± 0.477
GluGln: 1.58 ± 0.364
GluArg: 3.709 ± 0.522
GluSer: 2.884 ± 0.362
GluThr: 2.472 ± 0.423
GluVal: 3.228 ± 0.499
GluTrp: 0.824 ± 0.23
GluTyr: 1.854 ± 0.279
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.915 ± 0.456
PheCys: 0.481 ± 0.183
PheAsp: 3.022 ± 0.463
PheGlu: 1.511 ± 0.342
PhePhe: 0.824 ± 0.252
PheGly: 3.296 ± 0.501
PheHis: 0.206 ± 0.105
PheIle: 0.893 ± 0.258
PheLys: 0.755 ± 0.248
PheLeu: 1.374 ± 0.241
PheMet: 0.549 ± 0.175
PheAsn: 1.236 ± 0.336
PhePro: 1.03 ± 0.254
PheGln: 0.549 ± 0.198
PheArg: 1.58 ± 0.383
PheSer: 1.511 ± 0.328
PheThr: 1.511 ± 0.259
PheVal: 3.228 ± 0.412
PheTrp: 0.481 ± 0.142
PheTyr: 0.343 ± 0.187
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.173 ± 1.063
GlyCys: 0.824 ± 0.24
GlyAsp: 3.846 ± 0.465
GlyGlu: 3.571 ± 0.506
GlyPhe: 3.434 ± 0.485
GlyGly: 7.417 ± 1.532
GlyHis: 0.824 ± 0.264
GlyIle: 3.846 ± 0.474
GlyLys: 3.709 ± 0.526
GlyLeu: 6.456 ± 0.838
GlyMet: 1.58 ± 0.369
GlyAsn: 2.06 ± 0.4
GlyPro: 2.747 ± 0.351
GlyGln: 3.365 ± 0.524
GlyArg: 5.975 ± 0.583
GlySer: 3.915 ± 0.863
GlyThr: 5.425 ± 0.716
GlyVal: 7.005 ± 0.697
GlyTrp: 1.374 ± 0.305
GlyTyr: 2.747 ± 0.413
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.236 ± 0.285
HisCys: 0.206 ± 0.109
HisAsp: 0.824 ± 0.204
HisGlu: 0.549 ± 0.231
HisPhe: 0.343 ± 0.172
HisGly: 1.305 ± 0.304
HisHis: 0.206 ± 0.131
HisIle: 0.893 ± 0.224
HisLys: 0.343 ± 0.175
HisLeu: 0.824 ± 0.242
HisMet: 0.618 ± 0.193
HisAsn: 0.961 ± 0.269
HisPro: 1.168 ± 0.276
HisGln: 0.343 ± 0.133
HisArg: 0.755 ± 0.252
HisSer: 0.961 ± 0.252
HisThr: 0.961 ± 0.268
HisVal: 1.511 ± 0.33
HisTrp: 0.137 ± 0.092
HisTyr: 0.549 ± 0.205
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.219 ± 0.481
IleCys: 0.069 ± 0.063
IleAsp: 3.983 ± 0.491
IleGlu: 2.61 ± 0.417
IlePhe: 0.893 ± 0.23
IleGly: 3.228 ± 0.485
IleHis: 0.755 ± 0.214
IleIle: 1.717 ± 0.323
IleLys: 1.717 ± 0.406
IleLeu: 2.747 ± 0.458
IleMet: 0.961 ± 0.277
IleAsn: 1.648 ± 0.264
IlePro: 2.06 ± 0.28
IleGln: 1.511 ± 0.31
IleArg: 3.022 ± 0.397
IleSer: 2.678 ± 0.393
IleThr: 3.09 ± 0.463
IleVal: 4.258 ± 0.599
IleTrp: 0.618 ± 0.176
IleTyr: 1.236 ± 0.338
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.503 ± 0.513
LysCys: 0.343 ± 0.164
LysAsp: 1.511 ± 0.381
LysGlu: 1.442 ± 0.26
LysPhe: 0.824 ± 0.241
LysGly: 2.541 ± 0.4
LysHis: 0.824 ± 0.192
LysIle: 1.923 ± 0.34
LysLys: 0.618 ± 0.183
LysLeu: 3.296 ± 0.517
LysMet: 1.099 ± 0.298
LysAsn: 0.687 ± 0.195
LysPro: 1.442 ± 0.324
LysGln: 1.442 ± 0.409
LysArg: 2.335 ± 0.513
LysSer: 1.854 ± 0.452
LysThr: 1.786 ± 0.408
LysVal: 2.335 ± 0.36
LysTrp: 0.549 ± 0.179
LysTyr: 0.893 ± 0.215
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 10.714 ± 0.994
LeuCys: 0.755 ± 0.216
LeuAsp: 6.799 ± 0.778
LeuGlu: 4.464 ± 0.53
LeuPhe: 2.816 ± 0.437
LeuGly: 5.906 ± 0.477
LeuHis: 1.236 ± 0.275
LeuIle: 2.953 ± 0.466
LeuLys: 2.266 ± 0.43
LeuLeu: 6.387 ± 0.676
LeuMet: 2.404 ± 0.435
LeuAsn: 2.678 ± 0.429
LeuPro: 6.25 ± 0.826
LeuGln: 3.709 ± 0.523
LeuArg: 6.73 ± 0.918
LeuSer: 6.593 ± 0.505
LeuThr: 6.868 ± 0.583
LeuVal: 4.739 ± 0.605
LeuTrp: 1.03 ± 0.31
LeuTyr: 2.06 ± 0.438
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.709 ± 0.645
MetCys: 0.206 ± 0.119
MetAsp: 1.648 ± 0.293
MetGlu: 0.549 ± 0.243
MetPhe: 0.549 ± 0.176
MetGly: 0.824 ± 0.22
MetHis: 0.481 ± 0.149
MetIle: 1.03 ± 0.248
MetLys: 1.236 ± 0.279
MetLeu: 1.717 ± 0.362
MetMet: 0.481 ± 0.159
MetAsn: 0.961 ± 0.294
MetPro: 1.099 ± 0.27
MetGln: 0.755 ± 0.24
MetArg: 1.786 ± 0.317
MetSer: 1.992 ± 0.34
MetThr: 2.472 ± 0.29
MetVal: 1.03 ± 0.226
MetTrp: 0.275 ± 0.108
MetTyr: 0.412 ± 0.145
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.082 ± 0.607
AsnCys: 0.275 ± 0.112
AsnAsp: 2.06 ± 0.433
AsnGlu: 1.442 ± 0.301
AsnPhe: 0.824 ± 0.26
AsnGly: 3.022 ± 0.467
AsnHis: 0.481 ± 0.171
AsnIle: 1.511 ± 0.367
AsnLys: 0.687 ± 0.259
AsnLeu: 2.678 ± 0.379
AsnMet: 0.618 ± 0.269
AsnAsn: 1.236 ± 0.402
AsnPro: 2.816 ± 0.4
AsnGln: 0.618 ± 0.237
AsnArg: 1.648 ± 0.32
AsnSer: 1.648 ± 0.296
AsnThr: 2.472 ± 0.469
AsnVal: 2.404 ± 0.627
AsnTrp: 0.687 ± 0.232
AsnTyr: 0.893 ± 0.276
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.25 ± 0.486
ProCys: 0.0 ± 0.0
ProAsp: 4.533 ± 0.684
ProGlu: 3.64 ± 0.591
ProPhe: 1.442 ± 0.345
ProGly: 4.876 ± 0.673
ProHis: 0.961 ± 0.314
ProIle: 2.266 ± 0.35
ProLys: 1.442 ± 0.371
ProLeu: 3.571 ± 0.563
ProMet: 1.374 ± 0.276
ProAsn: 1.854 ± 0.32
ProPro: 2.266 ± 0.537
ProGln: 2.953 ± 0.338
ProArg: 3.296 ± 0.462
ProSer: 3.915 ± 0.574
ProThr: 4.121 ± 0.613
ProVal: 4.739 ± 0.705
ProTrp: 0.549 ± 0.167
ProTyr: 1.923 ± 0.285
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.082 ± 0.682
GlnCys: 0.343 ± 0.134
GlnAsp: 2.06 ± 0.432
GlnGlu: 0.961 ± 0.226
GlnPhe: 1.374 ± 0.211
GlnGly: 2.541 ± 0.36
GlnHis: 0.481 ± 0.202
GlnIle: 2.266 ± 0.41
GlnLys: 1.168 ± 0.299
GlnLeu: 5.288 ± 0.631
GlnMet: 0.618 ± 0.211
GlnAsn: 0.961 ± 0.276
GlnPro: 2.61 ± 0.353
GlnGln: 2.472 ± 0.655
GlnArg: 3.709 ± 0.507
GlnSer: 2.678 ± 0.373
GlnThr: 3.296 ± 0.63
GlnVal: 2.747 ± 0.519
GlnTrp: 0.687 ± 0.24
GlnTyr: 1.236 ± 0.308
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 8.173 ± 0.891
ArgCys: 0.343 ± 0.162
ArgAsp: 3.64 ± 0.49
ArgGlu: 3.915 ± 0.634
ArgPhe: 1.854 ± 0.44
ArgGly: 3.64 ± 0.442
ArgHis: 1.442 ± 0.305
ArgIle: 3.365 ± 0.422
ArgLys: 2.198 ± 0.481
ArgLeu: 6.181 ± 0.524
ArgMet: 2.335 ± 0.323
ArgAsn: 2.198 ± 0.446
ArgPro: 2.884 ± 0.428
ArgGln: 3.022 ± 0.487
ArgArg: 3.846 ± 0.504
ArgSer: 3.296 ± 0.424
ArgThr: 4.601 ± 0.521
ArgVal: 4.327 ± 0.462
ArgTrp: 1.374 ± 0.328
ArgTyr: 2.884 ± 0.462
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.829 ± 0.881
SerCys: 0.481 ± 0.17
SerAsp: 3.709 ± 0.457
SerGlu: 2.198 ± 0.422
SerPhe: 1.58 ± 0.341
SerGly: 7.005 ± 0.866
SerHis: 0.755 ± 0.164
SerIle: 2.747 ± 0.437
SerLys: 1.854 ± 0.333
SerLeu: 4.395 ± 0.522
SerMet: 1.168 ± 0.237
SerAsn: 1.786 ± 0.314
SerPro: 3.64 ± 0.479
SerGln: 2.198 ± 0.43
SerArg: 3.022 ± 0.388
SerSer: 4.464 ± 0.561
SerThr: 5.013 ± 0.74
SerVal: 4.739 ± 0.636
SerTrp: 0.893 ± 0.25
SerTyr: 1.511 ± 0.337
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.035 ± 0.976
ThrCys: 0.206 ± 0.112
ThrAsp: 4.464 ± 0.64
ThrGlu: 2.06 ± 0.39
ThrPhe: 2.06 ± 0.362
ThrGly: 5.494 ± 0.794
ThrHis: 0.824 ± 0.189
ThrIle: 3.09 ± 0.46
ThrLys: 1.854 ± 0.382
ThrLeu: 6.868 ± 0.813
ThrMet: 0.687 ± 0.203
ThrAsn: 2.335 ± 0.42
ThrPro: 5.357 ± 0.644
ThrGln: 3.571 ± 0.485
ThrArg: 3.296 ± 0.489
ThrSer: 4.052 ± 0.605
ThrThr: 5.838 ± 0.776
ThrVal: 5.631 ± 0.726
ThrTrp: 1.168 ± 0.249
ThrTyr: 2.953 ± 0.548
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.928 ± 1.006
ValCys: 0.412 ± 0.166
ValAsp: 4.258 ± 0.439
ValGlu: 4.601 ± 0.578
ValPhe: 1.786 ± 0.35
ValGly: 5.357 ± 0.639
ValHis: 0.961 ± 0.268
ValIle: 3.365 ± 0.394
ValLys: 2.335 ± 0.461
ValLeu: 6.112 ± 0.762
ValMet: 2.198 ± 0.38
ValAsn: 3.434 ± 0.681
ValPro: 4.189 ± 0.623
ValGln: 2.678 ± 0.379
ValArg: 4.876 ± 0.557
ValSer: 4.876 ± 0.563
ValThr: 4.67 ± 0.673
ValVal: 4.945 ± 0.511
ValTrp: 1.442 ± 0.28
ValTyr: 2.472 ± 0.443
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.992 ± 0.378
TrpCys: 0.275 ± 0.156
TrpAsp: 1.099 ± 0.236
TrpGlu: 1.168 ± 0.306
TrpPhe: 0.481 ± 0.182
TrpGly: 0.687 ± 0.216
TrpHis: 0.343 ± 0.162
TrpIle: 0.481 ± 0.209
TrpLys: 0.343 ± 0.13
TrpLeu: 1.305 ± 0.3
TrpMet: 0.412 ± 0.153
TrpAsn: 0.206 ± 0.116
TrpPro: 0.893 ± 0.253
TrpGln: 1.168 ± 0.297
TrpArg: 1.305 ± 0.3
TrpSer: 1.03 ± 0.238
TrpThr: 1.374 ± 0.298
TrpVal: 0.961 ± 0.21
TrpTrp: 0.481 ± 0.191
TrpTyr: 1.099 ± 0.306
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.228 ± 0.455
TyrCys: 0.481 ± 0.168
TyrAsp: 2.335 ± 0.361
TyrGlu: 0.893 ± 0.257
TyrPhe: 0.893 ± 0.247
TyrGly: 2.747 ± 0.498
TyrHis: 0.618 ± 0.226
TyrIle: 1.374 ± 0.332
TyrLys: 0.824 ± 0.225
TyrLeu: 2.472 ± 0.426
TyrMet: 0.412 ± 0.193
TyrAsn: 1.511 ± 0.328
TyrPro: 2.06 ± 0.402
TyrGln: 1.923 ± 0.331
TyrArg: 2.884 ± 0.42
TyrSer: 1.992 ± 0.388
TyrThr: 1.58 ± 0.348
TyrVal: 2.335 ± 0.476
TyrTrp: 0.824 ± 0.243
TyrTyr: 0.412 ± 0.198
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 76 proteins (14562 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
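As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement and recomputes one dipeptide frequency for each replicate. It is not the database's implementation: the helper names `pair_permille` and `bootstrap_error`, the fixed random seed, the toy sequences, and the choice of the sample standard deviation of the replicates as the reported error are all assumptions made for this example.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):  # overlapping pairs within a protein
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency over n_boot protein-level resamples."""
    rng = random.Random(seed)  # seeded only to make the sketch repeatable
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation of the replicates.
    return stdev(estimates)


toy = ["MAAG", "AAL", "MKV", "GAAAL"]  # made-up sequences
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.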

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski