Amino acid dipepetide frequency for Gordonia phage Cleo

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.975AlaAla: 9.975 ± 1.392
0.482AlaCys: 0.482 ± 0.16
6.192AlaAsp: 6.192 ± 0.714
5.916AlaGlu: 5.916 ± 0.558
3.784AlaPhe: 3.784 ± 0.735
7.43AlaGly: 7.43 ± 0.871
1.238AlaHis: 1.238 ± 0.3
5.779AlaIle: 5.779 ± 0.651
4.678AlaLys: 4.678 ± 1.221
9.218AlaLeu: 9.218 ± 1.0
2.889AlaMet: 2.889 ± 0.486
3.302AlaAsn: 3.302 ± 0.556
5.228AlaPro: 5.228 ± 0.781
3.509AlaGln: 3.509 ± 0.493
6.329AlaArg: 6.329 ± 0.801
4.678AlaSer: 4.678 ± 0.649
5.985AlaThr: 5.985 ± 0.718
6.535AlaVal: 6.535 ± 0.691
2.064AlaTrp: 2.064 ± 0.326
1.995AlaTyr: 1.995 ± 0.346
0.0AlaXaa: 0.0 ± 0.0
Cys
0.55CysAla: 0.55 ± 0.23
0.069CysCys: 0.069 ± 0.067
0.482CysAsp: 0.482 ± 0.147
0.344CysGlu: 0.344 ± 0.159
0.344CysPhe: 0.344 ± 0.18
0.894CysGly: 0.894 ± 0.277
0.206CysHis: 0.206 ± 0.116
0.138CysIle: 0.138 ± 0.099
0.275CysLys: 0.275 ± 0.134
0.482CysLeu: 0.482 ± 0.227
0.069CysMet: 0.069 ± 0.062
0.069CysAsn: 0.069 ± 0.073
0.619CysPro: 0.619 ± 0.214
0.206CysGln: 0.206 ± 0.122
0.482CysArg: 0.482 ± 0.196
0.482CysSer: 0.482 ± 0.196
0.344CysThr: 0.344 ± 0.16
0.413CysVal: 0.413 ± 0.164
0.275CysTrp: 0.275 ± 0.167
0.413CysTyr: 0.413 ± 0.163
0.0CysXaa: 0.0 ± 0.0
Asp
5.435AspAla: 5.435 ± 0.725
0.344AspCys: 0.344 ± 0.165
5.228AspAsp: 5.228 ± 0.981
5.435AspGlu: 5.435 ± 0.713
2.339AspPhe: 2.339 ± 0.438
5.779AspGly: 5.779 ± 0.617
1.582AspHis: 1.582 ± 0.322
3.302AspIle: 3.302 ± 0.487
3.027AspLys: 3.027 ± 0.492
5.71AspLeu: 5.71 ± 0.599
1.376AspMet: 1.376 ± 0.39
2.27AspAsn: 2.27 ± 0.461
4.265AspPro: 4.265 ± 0.627
2.133AspGln: 2.133 ± 0.357
4.472AspArg: 4.472 ± 0.526
3.99AspSer: 3.99 ± 0.529
3.715AspThr: 3.715 ± 0.471
3.99AspVal: 3.99 ± 0.482
1.513AspTrp: 1.513 ± 0.268
2.27AspTyr: 2.27 ± 0.374
0.0AspXaa: 0.0 ± 0.0
Glu
6.192GluAla: 6.192 ± 0.572
0.619GluCys: 0.619 ± 0.237
4.54GluAsp: 4.54 ± 0.708
3.715GluGlu: 3.715 ± 0.649
1.238GluPhe: 1.238 ± 0.282
4.678GluGly: 4.678 ± 0.667
0.757GluHis: 0.757 ± 0.242
2.545GluIle: 2.545 ± 0.469
1.857GluLys: 1.857 ± 0.499
5.916GluLeu: 5.916 ± 0.867
1.307GluMet: 1.307 ± 0.327
1.445GluAsn: 1.445 ± 0.272
2.752GluPro: 2.752 ± 0.443
2.958GluGln: 2.958 ± 0.481
4.128GluArg: 4.128 ± 0.682
3.302GluSer: 3.302 ± 0.436
2.201GluThr: 2.201 ± 0.294
3.44GluVal: 3.44 ± 0.527
1.238GluTrp: 1.238 ± 0.395
2.339GluTyr: 2.339 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
3.233PheAla: 3.233 ± 0.454
0.55PheCys: 0.55 ± 0.208
2.752PheAsp: 2.752 ± 0.479
1.307PheGlu: 1.307 ± 0.267
1.032PhePhe: 1.032 ± 0.244
3.165PheGly: 3.165 ± 0.45
0.757PheHis: 0.757 ± 0.221
1.513PheIle: 1.513 ± 0.342
1.307PheLys: 1.307 ± 0.266
2.339PheLeu: 2.339 ± 0.333
0.826PheMet: 0.826 ± 0.267
1.307PheAsn: 1.307 ± 0.272
1.651PhePro: 1.651 ± 0.292
1.445PheGln: 1.445 ± 0.378
2.064PheArg: 2.064 ± 0.396
1.513PheSer: 1.513 ± 0.377
2.408PheThr: 2.408 ± 0.418
2.133PheVal: 2.133 ± 0.365
0.55PheTrp: 0.55 ± 0.2
0.55PheTyr: 0.55 ± 0.228
0.0PheXaa: 0.0 ± 0.0
Gly
6.26GlyAla: 6.26 ± 0.602
0.55GlyCys: 0.55 ± 0.264
7.017GlyAsp: 7.017 ± 0.785
3.921GlyGlu: 3.921 ± 0.517
3.44GlyPhe: 3.44 ± 0.505
7.774GlyGly: 7.774 ± 1.008
1.445GlyHis: 1.445 ± 0.281
4.403GlyIle: 4.403 ± 0.676
4.265GlyLys: 4.265 ± 0.535
6.811GlyLeu: 6.811 ± 0.846
1.789GlyMet: 1.789 ± 0.388
2.133GlyAsn: 2.133 ± 0.275
3.027GlyPro: 3.027 ± 0.551
3.165GlyGln: 3.165 ± 0.511
5.366GlyArg: 5.366 ± 0.615
6.742GlySer: 6.742 ± 0.763
6.054GlyThr: 6.054 ± 0.659
7.017GlyVal: 7.017 ± 0.702
2.27GlyTrp: 2.27 ± 0.406
2.752GlyTyr: 2.752 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
1.376HisAla: 1.376 ± 0.309
0.275HisCys: 0.275 ± 0.12
1.17HisAsp: 1.17 ± 0.265
1.032HisGlu: 1.032 ± 0.312
0.619HisPhe: 0.619 ± 0.268
1.17HisGly: 1.17 ± 0.324
0.413HisHis: 0.413 ± 0.153
0.206HisIle: 0.206 ± 0.121
1.032HisLys: 1.032 ± 0.324
1.995HisLeu: 1.995 ± 0.366
0.826HisMet: 0.826 ± 0.225
0.619HisAsn: 0.619 ± 0.17
1.513HisPro: 1.513 ± 0.358
0.482HisGln: 0.482 ± 0.192
1.376HisArg: 1.376 ± 0.332
1.032HisSer: 1.032 ± 0.251
1.513HisThr: 1.513 ± 0.271
0.826HisVal: 0.826 ± 0.215
0.344HisTrp: 0.344 ± 0.15
0.413HisTyr: 0.413 ± 0.198
0.0HisXaa: 0.0 ± 0.0
Ile
5.16IleAla: 5.16 ± 0.605
0.275IleCys: 0.275 ± 0.14
2.958IleAsp: 2.958 ± 0.407
2.408IleGlu: 2.408 ± 0.385
1.032IlePhe: 1.032 ± 0.248
3.853IleGly: 3.853 ± 0.65
0.757IleHis: 0.757 ± 0.259
2.064IleIle: 2.064 ± 0.343
2.752IleLys: 2.752 ± 0.598
4.059IleLeu: 4.059 ± 0.465
1.032IleMet: 1.032 ± 0.237
1.789IleAsn: 1.789 ± 0.304
3.096IlePro: 3.096 ± 0.371
1.926IleGln: 1.926 ± 0.383
3.784IleArg: 3.784 ± 0.582
2.545IleSer: 2.545 ± 0.458
2.821IleThr: 2.821 ± 0.424
3.44IleVal: 3.44 ± 0.496
0.826IleTrp: 0.826 ± 0.205
0.894IleTyr: 0.894 ± 0.226
0.0IleXaa: 0.0 ± 0.0
Lys
6.192LysAla: 6.192 ± 1.107
0.275LysCys: 0.275 ± 0.149
2.821LysAsp: 2.821 ± 0.422
2.683LysGlu: 2.683 ± 0.447
1.789LysPhe: 1.789 ± 0.447
3.99LysGly: 3.99 ± 0.675
1.032LysHis: 1.032 ± 0.317
2.064LysIle: 2.064 ± 0.304
3.44LysLys: 3.44 ± 0.848
3.853LysLeu: 3.853 ± 0.734
1.17LysMet: 1.17 ± 0.284
1.995LysAsn: 1.995 ± 0.321
2.958LysPro: 2.958 ± 0.544
1.101LysGln: 1.101 ± 0.279
3.096LysArg: 3.096 ± 0.557
2.958LysSer: 2.958 ± 0.423
3.233LysThr: 3.233 ± 0.51
3.44LysVal: 3.44 ± 0.468
0.894LysTrp: 0.894 ± 0.292
1.445LysTyr: 1.445 ± 0.269
0.0LysXaa: 0.0 ± 0.0
Leu
8.462LeuAla: 8.462 ± 0.98
0.482LeuCys: 0.482 ± 0.207
6.742LeuAsp: 6.742 ± 0.612
4.54LeuGlu: 4.54 ± 0.689
2.133LeuPhe: 2.133 ± 0.396
6.879LeuGly: 6.879 ± 0.716
1.307LeuHis: 1.307 ± 0.264
3.096LeuIle: 3.096 ± 0.478
4.884LeuLys: 4.884 ± 0.587
6.26LeuLeu: 6.26 ± 0.746
2.201LeuMet: 2.201 ± 0.377
2.614LeuAsn: 2.614 ± 0.47
4.334LeuPro: 4.334 ± 0.483
2.821LeuGln: 2.821 ± 0.559
4.609LeuArg: 4.609 ± 0.577
5.297LeuSer: 5.297 ± 0.595
4.265LeuThr: 4.265 ± 0.589
5.366LeuVal: 5.366 ± 0.559
2.064LeuTrp: 2.064 ± 0.369
2.614LeuTyr: 2.614 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
4.196MetAla: 4.196 ± 0.445
0.0MetCys: 0.0 ± 0.0
1.445MetAsp: 1.445 ± 0.3
1.101MetGlu: 1.101 ± 0.247
0.894MetPhe: 0.894 ± 0.277
1.857MetGly: 1.857 ± 0.398
0.413MetHis: 0.413 ± 0.143
1.789MetIle: 1.789 ± 0.373
2.064MetLys: 2.064 ± 0.345
1.582MetLeu: 1.582 ± 0.321
0.757MetMet: 0.757 ± 0.237
0.963MetAsn: 0.963 ± 0.274
1.376MetPro: 1.376 ± 0.262
0.482MetGln: 0.482 ± 0.177
0.826MetArg: 0.826 ± 0.25
2.064MetSer: 2.064 ± 0.461
2.064MetThr: 2.064 ± 0.286
1.995MetVal: 1.995 ± 0.428
0.344MetTrp: 0.344 ± 0.153
0.55MetTyr: 0.55 ± 0.275
0.0MetXaa: 0.0 ± 0.0
Asn
3.096AsnAla: 3.096 ± 0.473
0.206AsnCys: 0.206 ± 0.124
2.201AsnAsp: 2.201 ± 0.508
1.238AsnGlu: 1.238 ± 0.216
1.17AsnPhe: 1.17 ± 0.27
3.165AsnGly: 3.165 ± 0.438
0.619AsnHis: 0.619 ± 0.222
1.445AsnIle: 1.445 ± 0.32
1.513AsnLys: 1.513 ± 0.288
2.821AsnLeu: 2.821 ± 0.531
0.688AsnMet: 0.688 ± 0.222
1.376AsnAsn: 1.376 ± 0.329
3.165AsnPro: 3.165 ± 0.537
1.72AsnGln: 1.72 ± 0.341
2.752AsnArg: 2.752 ± 0.405
1.513AsnSer: 1.513 ± 0.239
1.72AsnThr: 1.72 ± 0.375
1.995AsnVal: 1.995 ± 0.344
0.55AsnTrp: 0.55 ± 0.172
0.55AsnTyr: 0.55 ± 0.194
0.0AsnXaa: 0.0 ± 0.0
Pro
5.366ProAla: 5.366 ± 1.218
0.619ProCys: 0.619 ± 0.206
3.646ProAsp: 3.646 ± 0.579
3.577ProGlu: 3.577 ± 0.626
1.857ProPhe: 1.857 ± 0.376
5.572ProGly: 5.572 ± 0.742
1.032ProHis: 1.032 ± 0.276
2.201ProIle: 2.201 ± 0.394
3.302ProLys: 3.302 ± 0.53
2.821ProLeu: 2.821 ± 0.503
1.307ProMet: 1.307 ± 0.303
1.926ProAsn: 1.926 ± 0.355
2.752ProPro: 2.752 ± 0.536
1.513ProGln: 1.513 ± 0.337
2.339ProArg: 2.339 ± 0.392
3.165ProSer: 3.165 ± 0.41
3.509ProThr: 3.509 ± 0.568
4.609ProVal: 4.609 ± 0.555
1.789ProTrp: 1.789 ± 0.266
1.513ProTyr: 1.513 ± 0.308
0.0ProXaa: 0.0 ± 0.0
Gln
4.196GlnAla: 4.196 ± 0.525
0.138GlnCys: 0.138 ± 0.102
1.307GlnAsp: 1.307 ± 0.26
2.752GlnGlu: 2.752 ± 0.412
1.445GlnPhe: 1.445 ± 0.289
2.339GlnGly: 2.339 ± 0.358
0.826GlnHis: 0.826 ± 0.216
2.27GlnIle: 2.27 ± 0.371
1.789GlnLys: 1.789 ± 0.36
2.27GlnLeu: 2.27 ± 0.343
1.307GlnMet: 1.307 ± 0.3
1.307GlnAsn: 1.307 ± 0.388
2.408GlnPro: 2.408 ± 0.4
1.101GlnGln: 1.101 ± 0.279
2.958GlnArg: 2.958 ± 0.458
2.27GlnSer: 2.27 ± 0.413
1.582GlnThr: 1.582 ± 0.308
2.821GlnVal: 2.821 ± 0.361
0.826GlnTrp: 0.826 ± 0.254
1.17GlnTyr: 1.17 ± 0.245
0.0GlnXaa: 0.0 ± 0.0
Arg
5.985ArgAla: 5.985 ± 0.725
0.55ArgCys: 0.55 ± 0.216
3.784ArgAsp: 3.784 ± 0.441
3.027ArgGlu: 3.027 ± 0.56
1.789ArgPhe: 1.789 ± 0.387
5.16ArgGly: 5.16 ± 0.643
1.032ArgHis: 1.032 ± 0.267
2.614ArgIle: 2.614 ± 0.375
3.99ArgLys: 3.99 ± 0.697
5.985ArgLeu: 5.985 ± 0.702
2.201ArgMet: 2.201 ± 0.582
2.201ArgAsn: 2.201 ± 0.359
2.27ArgPro: 2.27 ± 0.419
2.821ArgGln: 2.821 ± 0.397
5.572ArgArg: 5.572 ± 1.063
3.233ArgSer: 3.233 ± 0.459
3.44ArgThr: 3.44 ± 0.51
4.472ArgVal: 4.472 ± 0.65
1.582ArgTrp: 1.582 ± 0.297
1.72ArgTyr: 1.72 ± 0.41
0.0ArgXaa: 0.0 ± 0.0
Ser
5.641SerAla: 5.641 ± 0.762
0.55SerCys: 0.55 ± 0.161
3.371SerAsp: 3.371 ± 0.378
3.44SerGlu: 3.44 ± 0.443
1.651SerPhe: 1.651 ± 0.305
6.742SerGly: 6.742 ± 0.673
0.826SerHis: 0.826 ± 0.256
3.233SerIle: 3.233 ± 0.462
2.133SerLys: 2.133 ± 0.354
4.334SerLeu: 4.334 ± 0.448
1.789SerMet: 1.789 ± 0.395
1.995SerAsn: 1.995 ± 0.35
3.646SerPro: 3.646 ± 0.539
1.857SerGln: 1.857 ± 0.445
3.371SerArg: 3.371 ± 0.462
2.545SerSer: 2.545 ± 0.409
3.509SerThr: 3.509 ± 0.418
3.853SerVal: 3.853 ± 0.68
1.238SerTrp: 1.238 ± 0.222
1.376SerTyr: 1.376 ± 0.299
0.0SerXaa: 0.0 ± 0.0
Thr
5.435ThrAla: 5.435 ± 0.502
0.413ThrCys: 0.413 ± 0.196
4.059ThrAsp: 4.059 ± 0.616
3.44ThrGlu: 3.44 ± 0.49
1.376ThrPhe: 1.376 ± 0.275
5.779ThrGly: 5.779 ± 0.586
0.894ThrHis: 0.894 ± 0.326
3.784ThrIle: 3.784 ± 0.668
2.821ThrLys: 2.821 ± 0.479
4.54ThrLeu: 4.54 ± 0.59
2.201ThrMet: 2.201 ± 0.482
1.582ThrAsn: 1.582 ± 0.343
3.509ThrPro: 3.509 ± 0.52
2.545ThrGln: 2.545 ± 0.352
3.096ThrArg: 3.096 ± 0.439
3.096ThrSer: 3.096 ± 0.387
4.747ThrThr: 4.747 ± 0.674
5.091ThrVal: 5.091 ± 0.672
1.17ThrTrp: 1.17 ± 0.216
1.651ThrTyr: 1.651 ± 0.333
0.0ThrXaa: 0.0 ± 0.0
Val
6.467ValAla: 6.467 ± 0.564
0.206ValCys: 0.206 ± 0.122
5.022ValAsp: 5.022 ± 0.572
4.953ValGlu: 4.953 ± 0.538
2.201ValPhe: 2.201 ± 0.347
5.985ValGly: 5.985 ± 0.848
1.72ValHis: 1.72 ± 0.361
3.853ValIle: 3.853 ± 0.505
3.096ValLys: 3.096 ± 0.473
5.228ValLeu: 5.228 ± 0.566
1.445ValMet: 1.445 ± 0.284
2.339ValAsn: 2.339 ± 0.456
3.44ValPro: 3.44 ± 0.504
3.096ValGln: 3.096 ± 0.508
3.853ValArg: 3.853 ± 0.632
2.958ValSer: 2.958 ± 0.522
5.572ValThr: 5.572 ± 0.634
4.884ValVal: 4.884 ± 0.6
2.27ValTrp: 2.27 ± 0.402
1.926ValTyr: 1.926 ± 0.4
0.0ValXaa: 0.0 ± 0.0
Trp
2.201TrpAla: 2.201 ± 0.309
0.413TrpCys: 0.413 ± 0.147
1.857TrpAsp: 1.857 ± 0.306
0.757TrpGlu: 0.757 ± 0.234
0.894TrpPhe: 0.894 ± 0.202
1.445TrpGly: 1.445 ± 0.325
0.55TrpHis: 0.55 ± 0.168
0.344TrpIle: 0.344 ± 0.158
1.376TrpLys: 1.376 ± 0.304
2.752TrpLeu: 2.752 ± 0.45
0.688TrpMet: 0.688 ± 0.244
0.963TrpAsn: 0.963 ± 0.273
0.894TrpPro: 0.894 ± 0.239
1.101TrpGln: 1.101 ± 0.251
0.894TrpArg: 0.894 ± 0.232
1.376TrpSer: 1.376 ± 0.29
0.894TrpThr: 0.894 ± 0.202
2.133TrpVal: 2.133 ± 0.479
0.413TrpTrp: 0.413 ± 0.138
0.757TrpTyr: 0.757 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.789TyrAla: 1.789 ± 0.326
0.138TyrCys: 0.138 ± 0.1
1.582TyrAsp: 1.582 ± 0.349
1.582TyrGlu: 1.582 ± 0.375
1.376TyrPhe: 1.376 ± 0.322
2.545TyrGly: 2.545 ± 0.34
0.826TyrHis: 0.826 ± 0.24
0.826TyrIle: 0.826 ± 0.224
0.826TyrLys: 0.826 ± 0.285
1.995TyrLeu: 1.995 ± 0.377
0.826TyrMet: 0.826 ± 0.218
1.307TyrAsn: 1.307 ± 0.278
1.513TyrPro: 1.513 ± 0.275
1.101TyrGln: 1.101 ± 0.295
2.133TyrArg: 2.133 ± 0.408
2.339TyrSer: 2.339 ± 0.364
1.651TyrThr: 1.651 ± 0.3
2.064TyrVal: 2.064 ± 0.449
0.482TyrTrp: 0.482 ± 0.178
0.963TyrTyr: 0.963 ± 0.32
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (14537 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski