Amino acid dipeptide frequency for Pectobacterium phage Arno160

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possibilities (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
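The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the one-letter alphabet with "X" for Xaa, the toy sequences, and the choice to normalise by the total number of adjacent pairs are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs are counted with overlap (positions i and i+1) and never span
    two different proteins. Non-standard letters are mapped to "X".
    Assumption: frequencies are normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAGK", "AAL"]
freqs = dipeptide_per_mille(toy)
print(len(freqs))     # number of ordered pairs, including those with Xaa
print(freqs["AA"])    # AA makes up 2 of the 6 pairs in the toy input
print(1000.0 / 400)   # uniform expectation over 400 pairs, in per mille
```

With real data, `proteins` would be the list of protein sequences of the proteome, for example parsed from a FASTA file.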

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
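As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysCys) into fractions and compares them with the uniform expectation of 1/400. The helper names and the simple above/below rule are assumptions for illustration; the page does not state the exact threshold used for its red and blue marking.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation.

    Assumption: a plain greater/less comparison, ignoring the error bars.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Values copied from the table, in per mille.
for name, value in [("AlaAla", 13.229), ("CysCys", 0.077)]:
    fraction = per_mille_to_fraction(value)
    print(name, fraction, f"{fraction * 100:.4f} %", classify(value))
```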

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.229 ± 1.671
AlaCys: 1.16 ± 0.334
AlaAsp: 5.648 ± 0.883
AlaGlu: 6.344 ± 0.738
AlaPhe: 3.095 ± 0.464
AlaGly: 7.427 ± 1.227
AlaHis: 2.089 ± 0.353
AlaIle: 4.332 ± 0.605
AlaLys: 5.648 ± 0.788
AlaLeu: 8.355 ± 0.826
AlaMet: 3.172 ± 0.459
AlaAsn: 4.1 ± 0.777
AlaPro: 2.553 ± 0.404
AlaGln: 5.106 ± 0.887
AlaArg: 4.41 ± 0.595
AlaSer: 6.266 ± 0.781
AlaThr: 5.183 ± 0.807
AlaVal: 6.808 ± 0.696
AlaTrp: 1.625 ± 0.398
AlaTyr: 3.172 ± 0.477
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.619 ± 0.191
CysCys: 0.077 ± 0.073
CysAsp: 1.006 ± 0.308
CysGlu: 0.387 ± 0.169
CysPhe: 0.387 ± 0.175
CysGly: 0.851 ± 0.233
CysHis: 0.155 ± 0.169
CysIle: 0.232 ± 0.143
CysLys: 0.232 ± 0.153
CysLeu: 0.851 ± 0.21
CysMet: 0.619 ± 0.248
CysAsn: 0.309 ± 0.141
CysPro: 0.774 ± 0.306
CysGln: 0.387 ± 0.165
CysArg: 0.774 ± 0.236
CysSer: 0.309 ± 0.116
CysThr: 0.387 ± 0.171
CysVal: 1.315 ± 0.33
CysTrp: 0.155 ± 0.113
CysTyr: 0.464 ± 0.16
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.725 ± 0.592
AspCys: 0.542 ± 0.224
AspAsp: 2.398 ± 0.297
AspGlu: 3.172 ± 0.396
AspPhe: 3.017 ± 0.516
AspGly: 4.642 ± 0.624
AspHis: 1.006 ± 0.326
AspIle: 3.172 ± 0.443
AspLys: 2.862 ± 0.488
AspLeu: 4.178 ± 0.61
AspMet: 1.702 ± 0.366
AspAsn: 3.095 ± 0.508
AspPro: 2.785 ± 0.414
AspGln: 0.851 ± 0.193
AspArg: 3.017 ± 0.511
AspSer: 4.255 ± 0.601
AspThr: 3.868 ± 0.707
AspVal: 4.255 ± 0.507
AspTrp: 0.774 ± 0.266
AspTyr: 3.017 ± 0.506
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.499 ± 0.891
GluCys: 0.542 ± 0.19
GluAsp: 4.564 ± 0.764
GluGlu: 4.41 ± 0.54
GluPhe: 2.166 ± 0.534
GluGly: 5.802 ± 0.404
GluHis: 1.083 ± 0.268
GluIle: 2.244 ± 0.522
GluLys: 2.553 ± 0.539
GluLeu: 6.576 ± 0.647
GluMet: 1.47 ± 0.261
GluAsn: 1.547 ± 0.416
GluPro: 1.393 ± 0.261
GluGln: 3.017 ± 0.476
GluArg: 3.946 ± 0.572
GluSer: 2.63 ± 0.336
GluThr: 2.785 ± 0.425
GluVal: 3.481 ± 0.571
GluTrp: 1.083 ± 0.297
GluTyr: 2.011 ± 0.452
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.94 ± 0.368
PheCys: 0.619 ± 0.226
PheAsp: 2.862 ± 0.515
PheGlu: 2.166 ± 0.376
PhePhe: 1.16 ± 0.336
PheGly: 2.708 ± 0.497
PheHis: 0.464 ± 0.195
PheIle: 2.089 ± 0.439
PheLys: 2.244 ± 0.48
PheLeu: 3.017 ± 0.381
PheMet: 1.16 ± 0.255
PheAsn: 1.47 ± 0.34
PhePro: 1.625 ± 0.315
PheGln: 1.47 ± 0.262
PheArg: 2.244 ± 0.349
PheSer: 2.244 ± 0.413
PheThr: 2.476 ± 0.433
PheVal: 1.779 ± 0.403
PheTrp: 0.387 ± 0.154
PheTyr: 1.47 ± 0.36
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.885 ± 1.199
GlyCys: 1.083 ± 0.301
GlyAsp: 4.178 ± 0.44
GlyGlu: 3.713 ± 0.452
GlyPhe: 2.63 ± 0.428
GlyGly: 6.653 ± 0.862
GlyHis: 1.393 ± 0.282
GlyIle: 4.41 ± 0.747
GlyLys: 4.874 ± 0.76
GlyLeu: 6.653 ± 0.549
GlyMet: 1.934 ± 0.362
GlyAsn: 2.785 ± 0.341
GlyPro: 1.47 ± 0.31
GlyGln: 2.862 ± 0.456
GlyArg: 4.719 ± 0.506
GlySer: 5.802 ± 0.746
GlyThr: 6.034 ± 0.934
GlyVal: 5.57 ± 0.775
GlyTrp: 1.315 ± 0.232
GlyTyr: 2.398 ± 0.431
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.857 ± 0.382
HisCys: 0.232 ± 0.124
HisAsp: 1.238 ± 0.346
HisGlu: 0.928 ± 0.348
HisPhe: 0.696 ± 0.168
HisGly: 1.702 ± 0.423
HisHis: 0.464 ± 0.208
HisIle: 0.619 ± 0.161
HisLys: 1.006 ± 0.331
HisLeu: 1.315 ± 0.281
HisMet: 0.387 ± 0.154
HisAsn: 1.083 ± 0.336
HisPro: 1.006 ± 0.282
HisGln: 0.774 ± 0.198
HisArg: 1.006 ± 0.29
HisSer: 1.702 ± 0.404
HisThr: 1.393 ± 0.357
HisVal: 1.315 ± 0.328
HisTrp: 0.232 ± 0.114
HisTyr: 0.851 ± 0.256
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.481 ± 0.474
IleCys: 0.155 ± 0.124
IleAsp: 2.321 ± 0.343
IleGlu: 2.398 ± 0.414
IlePhe: 1.779 ± 0.41
IleGly: 3.636 ± 0.608
IleHis: 1.547 ± 0.336
IleIle: 1.857 ± 0.516
IleLys: 2.166 ± 0.526
IleLeu: 2.94 ± 0.481
IleMet: 1.547 ± 0.295
IleAsn: 1.934 ± 0.285
IlePro: 1.702 ± 0.309
IleGln: 2.244 ± 0.339
IleArg: 3.249 ± 0.475
IleSer: 3.481 ± 0.606
IleThr: 3.172 ± 0.511
IleVal: 2.708 ± 0.507
IleTrp: 0.542 ± 0.218
IleTyr: 1.47 ± 0.282
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.725 ± 0.971
LysCys: 0.387 ± 0.16
LysAsp: 2.94 ± 0.442
LysGlu: 3.095 ± 0.478
LysPhe: 1.857 ± 0.377
LysGly: 4.178 ± 0.426
LysHis: 1.779 ± 0.403
LysIle: 1.547 ± 0.401
LysLys: 2.862 ± 0.681
LysLeu: 5.106 ± 0.733
LysMet: 1.16 ± 0.252
LysAsn: 1.083 ± 0.411
LysPro: 2.553 ± 0.492
LysGln: 2.244 ± 0.326
LysArg: 2.63 ± 0.457
LysSer: 3.017 ± 0.429
LysThr: 2.94 ± 0.382
LysVal: 5.106 ± 0.524
LysTrp: 0.696 ± 0.218
LysTyr: 1.934 ± 0.396
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.974 ± 0.86
LeuCys: 0.851 ± 0.33
LeuAsp: 5.415 ± 0.644
LeuGlu: 4.719 ± 0.679
LeuPhe: 2.63 ± 0.387
LeuGly: 5.725 ± 0.66
LeuHis: 1.934 ± 0.338
LeuIle: 3.481 ± 0.534
LeuLys: 4.642 ± 0.662
LeuLeu: 6.266 ± 0.599
LeuMet: 2.785 ± 0.453
LeuAsn: 3.946 ± 0.526
LeuPro: 4.178 ± 0.454
LeuGln: 4.797 ± 0.729
LeuArg: 5.029 ± 0.563
LeuSer: 5.88 ± 0.545
LeuThr: 4.951 ± 0.455
LeuVal: 6.266 ± 0.735
LeuTrp: 0.774 ± 0.266
LeuTyr: 2.398 ± 0.393
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.791 ± 0.506
MetCys: 0.077 ± 0.075
MetAsp: 1.857 ± 0.415
MetGlu: 1.702 ± 0.33
MetPhe: 1.083 ± 0.258
MetGly: 1.857 ± 0.412
MetHis: 0.542 ± 0.177
MetIle: 1.006 ± 0.206
MetLys: 1.47 ± 0.407
MetLeu: 2.166 ± 0.351
MetMet: 0.851 ± 0.409
MetAsn: 1.315 ± 0.341
MetPro: 1.315 ± 0.322
MetGln: 2.089 ± 0.418
MetArg: 1.779 ± 0.335
MetSer: 2.244 ± 0.441
MetThr: 1.315 ± 0.289
MetVal: 1.934 ± 0.36
MetTrp: 0.387 ± 0.137
MetTyr: 1.16 ± 0.246
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.862 ± 0.453
AsnCys: 0.696 ± 0.316
AsnAsp: 2.089 ± 0.326
AsnGlu: 1.625 ± 0.35
AsnPhe: 1.238 ± 0.237
AsnGly: 3.404 ± 0.516
AsnHis: 0.928 ± 0.235
AsnIle: 2.862 ± 0.447
AsnLys: 2.089 ± 0.377
AsnLeu: 2.94 ± 0.483
AsnMet: 1.779 ± 0.318
AsnAsn: 1.547 ± 0.463
AsnPro: 2.011 ± 0.317
AsnGln: 1.16 ± 0.294
AsnArg: 3.017 ± 0.388
AsnSer: 3.095 ± 0.576
AsnThr: 2.862 ± 0.505
AsnVal: 2.785 ± 0.449
AsnTrp: 0.387 ± 0.161
AsnTyr: 1.393 ± 0.313
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.095 ± 0.377
ProCys: 0.542 ± 0.211
ProAsp: 2.476 ± 0.443
ProGlu: 3.017 ± 0.426
ProPhe: 1.238 ± 0.292
ProGly: 2.94 ± 0.45
ProHis: 0.542 ± 0.195
ProIle: 1.934 ± 0.399
ProLys: 1.547 ± 0.417
ProLeu: 3.636 ± 0.42
ProMet: 0.928 ± 0.254
ProAsn: 1.934 ± 0.42
ProPro: 1.625 ± 0.559
ProGln: 1.702 ± 0.368
ProArg: 1.238 ± 0.291
ProSer: 1.934 ± 0.384
ProThr: 2.862 ± 0.504
ProVal: 2.862 ± 0.437
ProTrp: 0.464 ± 0.183
ProTyr: 1.934 ± 0.415
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.183 ± 0.594
GlnCys: 0.232 ± 0.139
GlnAsp: 1.934 ± 0.432
GlnGlu: 3.249 ± 0.461
GlnPhe: 1.393 ± 0.332
GlnGly: 2.94 ± 0.528
GlnHis: 1.393 ± 0.364
GlnIle: 1.857 ± 0.459
GlnLys: 2.011 ± 0.433
GlnLeu: 4.255 ± 0.437
GlnMet: 1.625 ± 0.305
GlnAsn: 1.547 ± 0.356
GlnPro: 0.851 ± 0.234
GlnGln: 2.476 ± 0.566
GlnArg: 2.089 ± 0.403
GlnSer: 2.553 ± 0.46
GlnThr: 2.011 ± 0.389
GlnVal: 3.172 ± 0.695
GlnTrp: 0.619 ± 0.242
GlnTyr: 1.934 ± 0.431
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.951 ± 0.668
ArgCys: 0.464 ± 0.214
ArgAsp: 3.017 ± 0.374
ArgGlu: 4.332 ± 0.59
ArgPhe: 2.785 ± 0.391
ArgGly: 3.946 ± 0.421
ArgHis: 1.083 ± 0.291
ArgIle: 2.785 ± 0.569
ArgLys: 2.94 ± 0.415
ArgLeu: 5.415 ± 0.574
ArgMet: 1.934 ± 0.412
ArgAsn: 2.321 ± 0.402
ArgPro: 2.244 ± 0.457
ArgGln: 2.321 ± 0.484
ArgArg: 4.023 ± 0.611
ArgSer: 2.94 ± 0.445
ArgThr: 3.172 ± 0.522
ArgVal: 3.404 ± 0.46
ArgTrp: 0.696 ± 0.233
ArgTyr: 2.398 ± 0.392
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.504 ± 0.663
SerCys: 0.309 ± 0.137
SerAsp: 3.481 ± 0.637
SerGlu: 3.946 ± 0.574
SerPhe: 2.476 ± 0.494
SerGly: 5.029 ± 0.642
SerHis: 0.851 ± 0.206
SerIle: 1.934 ± 0.493
SerLys: 4.874 ± 0.648
SerLeu: 5.415 ± 0.607
SerMet: 1.702 ± 0.391
SerAsn: 2.862 ± 0.751
SerPro: 2.94 ± 0.512
SerGln: 2.785 ± 0.525
SerArg: 2.708 ± 0.444
SerSer: 4.255 ± 0.566
SerThr: 4.178 ± 0.481
SerVal: 5.648 ± 0.6
SerTrp: 0.851 ± 0.233
SerTyr: 1.625 ± 0.378
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.034 ± 0.977
ThrCys: 0.774 ± 0.243
ThrAsp: 3.559 ± 0.484
ThrGlu: 3.946 ± 0.496
ThrPhe: 2.398 ± 0.527
ThrGly: 5.338 ± 0.621
ThrHis: 1.315 ± 0.261
ThrIle: 2.476 ± 0.323
ThrLys: 2.94 ± 0.502
ThrLeu: 5.57 ± 0.727
ThrMet: 2.011 ± 0.432
ThrAsn: 1.702 ± 0.455
ThrPro: 2.785 ± 0.479
ThrGln: 1.934 ± 0.419
ThrArg: 3.791 ± 0.444
ThrSer: 4.564 ± 0.679
ThrThr: 3.095 ± 0.905
ThrVal: 3.481 ± 0.669
ThrTrp: 0.387 ± 0.169
ThrTyr: 2.63 ± 0.551
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.189 ± 0.633
ValCys: 0.774 ± 0.25
ValAsp: 3.636 ± 0.486
ValGlu: 3.868 ± 0.625
ValPhe: 2.011 ± 0.281
ValGly: 5.338 ± 0.682
ValHis: 0.774 ± 0.193
ValIle: 3.327 ± 0.613
ValLys: 3.636 ± 0.567
ValLeu: 6.112 ± 0.666
ValMet: 2.166 ± 0.331
ValAsn: 3.868 ± 0.485
ValPro: 2.708 ± 0.498
ValGln: 2.862 ± 0.491
ValArg: 4.642 ± 0.658
ValSer: 4.874 ± 0.591
ValThr: 5.338 ± 0.865
ValVal: 5.183 ± 0.742
ValTrp: 0.851 ± 0.217
ValTyr: 2.398 ± 0.7
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.083 ± 0.318
TrpCys: 0.077 ± 0.085
TrpAsp: 0.928 ± 0.336
TrpGlu: 1.083 ± 0.276
TrpPhe: 1.238 ± 0.203
TrpGly: 0.387 ± 0.182
TrpHis: 0.077 ± 0.08
TrpIle: 0.309 ± 0.132
TrpLys: 0.774 ± 0.282
TrpLeu: 2.244 ± 0.451
TrpMet: 0.077 ± 0.071
TrpAsn: 0.155 ± 0.099
TrpPro: 0.387 ± 0.143
TrpGln: 0.232 ± 0.114
TrpArg: 0.696 ± 0.232
TrpSer: 0.851 ± 0.239
TrpThr: 0.464 ± 0.206
TrpVal: 1.083 ± 0.254
TrpTrp: 0.619 ± 0.205
TrpTyr: 0.464 ± 0.166
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.095 ± 0.366
TyrCys: 0.696 ± 0.248
TyrAsp: 2.785 ± 0.523
TyrGlu: 1.779 ± 0.415
TyrPhe: 1.315 ± 0.325
TyrGly: 2.862 ± 0.372
TyrHis: 0.387 ± 0.153
TyrIle: 1.857 ± 0.362
TyrLys: 1.47 ± 0.311
TyrLeu: 2.63 ± 0.422
TyrMet: 0.851 ± 0.199
TyrAsn: 2.166 ± 0.445
TyrPro: 1.547 ± 0.31
TyrGln: 2.011 ± 0.365
TyrArg: 2.166 ± 0.316
TyrSer: 2.476 ± 0.276
TyrThr: 2.244 ± 0.603
TyrVal: 2.398 ± 0.444
TyrTrp: 0.387 ± 0.145
TyrTyr: 0.619 ± 0.252
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (12927 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
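Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicates is taken as the error. This is only an illustration under stated assumptions: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are hypothetical, since the page does not specify these details.

```python
import random
import statistics


def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement.
    Assumption: the error is reported as the sample standard deviation.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MAAGK", "AAL", "GKLV", "MKAAV"]
print(pair_frequency(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.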

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski