Amino acid dipepetide frequency for Streptomyces phage Alvy

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.647AlaAla: 12.647 ± 1.12
0.733AlaCys: 0.733 ± 0.245
7.27AlaAsp: 7.27 ± 0.713
8.614AlaGlu: 8.614 ± 0.85
2.81AlaPhe: 2.81 ± 0.415
9.592AlaGly: 9.592 ± 0.786
2.077AlaHis: 2.077 ± 0.407
4.338AlaIle: 4.338 ± 0.566
5.376AlaLys: 5.376 ± 0.746
10.814AlaLeu: 10.814 ± 0.983
2.322AlaMet: 2.322 ± 0.309
2.444AlaAsn: 2.444 ± 0.29
5.499AlaPro: 5.499 ± 0.506
3.971AlaGln: 3.971 ± 0.588
7.698AlaArg: 7.698 ± 0.864
5.804AlaSer: 5.804 ± 0.55
6.537AlaThr: 6.537 ± 0.541
9.042AlaVal: 9.042 ± 0.841
2.261AlaTrp: 2.261 ± 0.335
3.91AlaTyr: 3.91 ± 0.606
0.0AlaXaa: 0.0 ± 0.0
Cys
0.489CysAla: 0.489 ± 0.163
0.122CysCys: 0.122 ± 0.099
0.611CysAsp: 0.611 ± 0.209
0.611CysGlu: 0.611 ± 0.219
0.244CysPhe: 0.244 ± 0.141
0.916CysGly: 0.916 ± 0.259
0.305CysHis: 0.305 ± 0.127
0.489CysIle: 0.489 ± 0.184
0.55CysLys: 0.55 ± 0.199
0.244CysLeu: 0.244 ± 0.11
0.061CysMet: 0.061 ± 0.063
0.183CysAsn: 0.183 ± 0.105
0.55CysPro: 0.55 ± 0.199
0.183CysGln: 0.183 ± 0.113
0.305CysArg: 0.305 ± 0.123
0.916CysSer: 0.916 ± 0.227
0.428CysThr: 0.428 ± 0.177
0.122CysVal: 0.122 ± 0.113
0.367CysTrp: 0.367 ± 0.154
0.244CysTyr: 0.244 ± 0.137
0.0CysXaa: 0.0 ± 0.0
Asp
6.109AspAla: 6.109 ± 0.68
0.55AspCys: 0.55 ± 0.147
4.643AspAsp: 4.643 ± 0.478
5.376AspGlu: 5.376 ± 0.705
1.772AspPhe: 1.772 ± 0.341
6.659AspGly: 6.659 ± 0.736
1.283AspHis: 1.283 ± 0.334
2.81AspIle: 2.81 ± 0.424
2.199AspLys: 2.199 ± 0.445
5.682AspLeu: 5.682 ± 0.669
1.711AspMet: 1.711 ± 0.351
1.466AspAsn: 1.466 ± 0.301
4.093AspPro: 4.093 ± 0.479
2.077AspGln: 2.077 ± 0.338
3.666AspArg: 3.666 ± 0.503
3.971AspSer: 3.971 ± 0.55
3.116AspThr: 3.116 ± 0.405
4.154AspVal: 4.154 ± 0.506
1.833AspTrp: 1.833 ± 0.324
1.405AspTyr: 1.405 ± 0.261
0.0AspXaa: 0.0 ± 0.0
Glu
8.675GluAla: 8.675 ± 0.814
0.794GluCys: 0.794 ± 0.228
4.093GluAsp: 4.093 ± 0.533
5.376GluGlu: 5.376 ± 0.845
1.527GluPhe: 1.527 ± 0.331
6.72GluGly: 6.72 ± 0.583
1.344GluHis: 1.344 ± 0.35
2.933GluIle: 2.933 ± 0.49
2.261GluLys: 2.261 ± 0.328
7.454GluLeu: 7.454 ± 0.753
1.466GluMet: 1.466 ± 0.274
1.161GluAsn: 1.161 ± 0.249
2.688GluPro: 2.688 ± 0.593
2.688GluGln: 2.688 ± 0.404
4.46GluArg: 4.46 ± 0.675
2.627GluSer: 2.627 ± 0.462
4.154GluThr: 4.154 ± 0.636
5.56GluVal: 5.56 ± 0.636
1.283GluTrp: 1.283 ± 0.276
2.261GluTyr: 2.261 ± 0.438
0.0GluXaa: 0.0 ± 0.0
Phe
3.055PheAla: 3.055 ± 0.44
0.428PheCys: 0.428 ± 0.147
1.894PheAsp: 1.894 ± 0.392
2.077PheGlu: 2.077 ± 0.346
0.733PhePhe: 0.733 ± 0.273
2.933PheGly: 2.933 ± 0.383
0.428PheHis: 0.428 ± 0.18
1.161PheIle: 1.161 ± 0.27
1.405PheLys: 1.405 ± 0.301
1.833PheLeu: 1.833 ± 0.348
0.611PheMet: 0.611 ± 0.226
0.978PheAsn: 0.978 ± 0.187
1.405PhePro: 1.405 ± 0.311
0.978PheGln: 0.978 ± 0.272
1.955PheArg: 1.955 ± 0.334
1.772PheSer: 1.772 ± 0.316
2.138PheThr: 2.138 ± 0.385
1.833PheVal: 1.833 ± 0.354
0.55PheTrp: 0.55 ± 0.15
0.855PheTyr: 0.855 ± 0.227
0.0PheXaa: 0.0 ± 0.0
Gly
7.698GlyAla: 7.698 ± 0.831
0.672GlyCys: 0.672 ± 0.233
5.315GlyAsp: 5.315 ± 0.591
4.949GlyGlu: 4.949 ± 0.532
3.238GlyPhe: 3.238 ± 0.488
6.171GlyGly: 6.171 ± 0.958
2.444GlyHis: 2.444 ± 0.448
3.727GlyIle: 3.727 ± 0.653
4.826GlyLys: 4.826 ± 0.729
6.965GlyLeu: 6.965 ± 0.789
1.772GlyMet: 1.772 ± 0.283
2.566GlyAsn: 2.566 ± 0.408
3.91GlyPro: 3.91 ± 0.537
2.688GlyGln: 2.688 ± 0.398
5.804GlyArg: 5.804 ± 0.662
5.499GlySer: 5.499 ± 0.877
5.132GlyThr: 5.132 ± 0.861
5.56GlyVal: 5.56 ± 0.73
2.383GlyTrp: 2.383 ± 0.416
2.383GlyTyr: 2.383 ± 0.493
0.0GlyXaa: 0.0 ± 0.0
His
1.894HisAla: 1.894 ± 0.407
0.183HisCys: 0.183 ± 0.113
1.222HisAsp: 1.222 ± 0.24
1.527HisGlu: 1.527 ± 0.351
0.855HisPhe: 0.855 ± 0.295
1.466HisGly: 1.466 ± 0.312
0.733HisHis: 0.733 ± 0.188
0.672HisIle: 0.672 ± 0.188
0.55HisLys: 0.55 ± 0.202
1.833HisLeu: 1.833 ± 0.322
0.305HisMet: 0.305 ± 0.133
0.489HisAsn: 0.489 ± 0.174
1.527HisPro: 1.527 ± 0.272
0.733HisGln: 0.733 ± 0.263
1.527HisArg: 1.527 ± 0.406
0.916HisSer: 0.916 ± 0.323
1.405HisThr: 1.405 ± 0.288
1.833HisVal: 1.833 ± 0.406
0.611HisTrp: 0.611 ± 0.208
0.489HisTyr: 0.489 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
4.582IleAla: 4.582 ± 0.48
0.122IleCys: 0.122 ± 0.078
3.421IleAsp: 3.421 ± 0.457
3.238IleGlu: 3.238 ± 0.389
1.283IlePhe: 1.283 ± 0.314
3.238IleGly: 3.238 ± 0.636
0.672IleHis: 0.672 ± 0.209
1.894IleIle: 1.894 ± 0.447
1.711IleLys: 1.711 ± 0.314
3.238IleLeu: 3.238 ± 0.514
0.55IleMet: 0.55 ± 0.231
1.283IleAsn: 1.283 ± 0.317
2.199IlePro: 2.199 ± 0.261
1.405IleGln: 1.405 ± 0.261
2.871IleArg: 2.871 ± 0.376
1.711IleSer: 1.711 ± 0.421
2.688IleThr: 2.688 ± 0.445
3.238IleVal: 3.238 ± 0.488
0.305IleTrp: 0.305 ± 0.114
1.283IleTyr: 1.283 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
4.888LysAla: 4.888 ± 0.65
0.305LysCys: 0.305 ± 0.13
2.383LysAsp: 2.383 ± 0.535
2.871LysGlu: 2.871 ± 0.47
0.733LysPhe: 0.733 ± 0.183
4.399LysGly: 4.399 ± 0.608
0.855LysHis: 0.855 ± 0.244
1.711LysIle: 1.711 ± 0.308
2.81LysLys: 2.81 ± 0.486
3.727LysLeu: 3.727 ± 0.528
0.916LysMet: 0.916 ± 0.223
1.161LysAsn: 1.161 ± 0.269
2.566LysPro: 2.566 ± 0.454
1.466LysGln: 1.466 ± 0.26
3.727LysArg: 3.727 ± 0.652
2.261LysSer: 2.261 ± 0.476
3.299LysThr: 3.299 ± 0.448
2.566LysVal: 2.566 ± 0.351
0.489LysTrp: 0.489 ± 0.154
1.466LysTyr: 1.466 ± 0.356
0.0LysXaa: 0.0 ± 0.0
Leu
11.669LeuAla: 11.669 ± 1.015
0.55LeuCys: 0.55 ± 0.219
5.437LeuAsp: 5.437 ± 0.508
4.093LeuGlu: 4.093 ± 0.525
2.261LeuPhe: 2.261 ± 0.417
7.209LeuGly: 7.209 ± 0.596
1.1LeuHis: 1.1 ± 0.248
3.36LeuIle: 3.36 ± 0.426
2.933LeuLys: 2.933 ± 0.547
6.659LeuLeu: 6.659 ± 0.914
1.772LeuMet: 1.772 ± 0.357
3.177LeuAsn: 3.177 ± 0.446
5.071LeuPro: 5.071 ± 0.549
2.994LeuGln: 2.994 ± 0.371
5.682LeuArg: 5.682 ± 0.703
5.01LeuSer: 5.01 ± 0.492
5.926LeuThr: 5.926 ± 0.531
6.72LeuVal: 6.72 ± 0.729
0.855LeuTrp: 0.855 ± 0.237
1.222LeuTyr: 1.222 ± 0.243
0.0LeuXaa: 0.0 ± 0.0
Met
3.666MetAla: 3.666 ± 0.547
0.183MetCys: 0.183 ± 0.103
0.978MetAsp: 0.978 ± 0.218
1.039MetGlu: 1.039 ± 0.319
0.367MetPhe: 0.367 ± 0.129
0.978MetGly: 0.978 ± 0.192
0.489MetHis: 0.489 ± 0.155
0.916MetIle: 0.916 ± 0.227
0.611MetLys: 0.611 ± 0.164
1.711MetLeu: 1.711 ± 0.331
0.367MetMet: 0.367 ± 0.129
0.672MetAsn: 0.672 ± 0.163
1.283MetPro: 1.283 ± 0.276
0.611MetGln: 0.611 ± 0.171
1.405MetArg: 1.405 ± 0.303
2.444MetSer: 2.444 ± 0.364
1.65MetThr: 1.65 ± 0.348
1.1MetVal: 1.1 ± 0.281
0.367MetTrp: 0.367 ± 0.123
0.489MetTyr: 0.489 ± 0.152
0.0MetXaa: 0.0 ± 0.0
Asn
2.871AsnAla: 2.871 ± 0.402
0.367AsnCys: 0.367 ± 0.17
1.65AsnAsp: 1.65 ± 0.278
1.65AsnGlu: 1.65 ± 0.29
0.978AsnPhe: 0.978 ± 0.212
2.688AsnGly: 2.688 ± 0.729
0.794AsnHis: 0.794 ± 0.218
1.222AsnIle: 1.222 ± 0.272
0.916AsnLys: 0.916 ± 0.237
2.322AsnLeu: 2.322 ± 0.404
0.489AsnMet: 0.489 ± 0.191
1.1AsnAsn: 1.1 ± 0.262
1.711AsnPro: 1.711 ± 0.308
0.916AsnGln: 0.916 ± 0.212
1.65AsnArg: 1.65 ± 0.272
1.711AsnSer: 1.711 ± 0.376
1.283AsnThr: 1.283 ± 0.278
2.077AsnVal: 2.077 ± 0.294
0.611AsnTrp: 0.611 ± 0.17
0.489AsnTyr: 0.489 ± 0.145
0.0AsnXaa: 0.0 ± 0.0
Pro
6.598ProAla: 6.598 ± 0.741
0.733ProCys: 0.733 ± 0.223
3.727ProAsp: 3.727 ± 0.42
4.216ProGlu: 4.216 ± 0.623
1.466ProPhe: 1.466 ± 0.277
4.216ProGly: 4.216 ± 0.548
0.611ProHis: 0.611 ± 0.2
1.711ProIle: 1.711 ± 0.384
3.299ProLys: 3.299 ± 0.631
3.666ProLeu: 3.666 ± 0.526
1.1ProMet: 1.1 ± 0.282
0.916ProAsn: 0.916 ± 0.232
2.383ProPro: 2.383 ± 0.506
1.222ProGln: 1.222 ± 0.269
2.383ProArg: 2.383 ± 0.469
2.749ProSer: 2.749 ± 0.542
3.238ProThr: 3.238 ± 0.596
3.91ProVal: 3.91 ± 0.413
0.978ProTrp: 0.978 ± 0.245
1.405ProTyr: 1.405 ± 0.376
0.0ProXaa: 0.0 ± 0.0
Gln
4.582GlnAla: 4.582 ± 0.47
0.183GlnCys: 0.183 ± 0.113
1.527GlnAsp: 1.527 ± 0.309
2.138GlnGlu: 2.138 ± 0.418
1.1GlnPhe: 1.1 ± 0.339
2.322GlnGly: 2.322 ± 0.369
0.611GlnHis: 0.611 ± 0.182
1.772GlnIle: 1.772 ± 0.321
1.65GlnLys: 1.65 ± 0.372
3.116GlnLeu: 3.116 ± 0.408
0.978GlnMet: 0.978 ± 0.3
0.916GlnAsn: 0.916 ± 0.204
0.978GlnPro: 0.978 ± 0.224
1.222GlnGln: 1.222 ± 0.215
2.261GlnArg: 2.261 ± 0.365
1.711GlnSer: 1.711 ± 0.347
2.016GlnThr: 2.016 ± 0.442
2.322GlnVal: 2.322 ± 0.39
0.489GlnTrp: 0.489 ± 0.16
0.855GlnTyr: 0.855 ± 0.228
0.0GlnXaa: 0.0 ± 0.0
Arg
6.476ArgAla: 6.476 ± 0.818
0.489ArgCys: 0.489 ± 0.192
4.154ArgAsp: 4.154 ± 0.549
4.765ArgGlu: 4.765 ± 0.627
2.566ArgPhe: 2.566 ± 0.392
4.093ArgGly: 4.093 ± 0.644
1.711ArgHis: 1.711 ± 0.406
2.383ArgIle: 2.383 ± 0.318
3.543ArgLys: 3.543 ± 0.509
5.132ArgLeu: 5.132 ± 0.683
1.955ArgMet: 1.955 ± 0.371
1.955ArgAsn: 1.955 ± 0.315
3.605ArgPro: 3.605 ± 0.511
2.444ArgGln: 2.444 ± 0.451
6.171ArgArg: 6.171 ± 0.922
3.666ArgSer: 3.666 ± 0.595
4.154ArgThr: 4.154 ± 0.473
4.704ArgVal: 4.704 ± 0.595
1.1ArgTrp: 1.1 ± 0.34
2.322ArgTyr: 2.322 ± 0.326
0.0ArgXaa: 0.0 ± 0.0
Ser
6.109SerAla: 6.109 ± 0.688
0.061SerCys: 0.061 ± 0.063
3.788SerAsp: 3.788 ± 0.501
3.36SerGlu: 3.36 ± 0.405
1.405SerPhe: 1.405 ± 0.299
5.193SerGly: 5.193 ± 0.732
1.711SerHis: 1.711 ± 0.458
2.749SerIle: 2.749 ± 0.444
2.505SerLys: 2.505 ± 0.391
5.01SerLeu: 5.01 ± 0.554
1.1SerMet: 1.1 ± 0.252
1.405SerAsn: 1.405 ± 0.371
2.261SerPro: 2.261 ± 0.359
1.65SerGln: 1.65 ± 0.423
3.727SerArg: 3.727 ± 0.538
4.032SerSer: 4.032 ± 0.681
4.277SerThr: 4.277 ± 0.543
4.216SerVal: 4.216 ± 0.587
1.1SerTrp: 1.1 ± 0.216
2.077SerTyr: 2.077 ± 0.379
0.0SerXaa: 0.0 ± 0.0
Thr
7.454ThrAla: 7.454 ± 0.645
0.489ThrCys: 0.489 ± 0.209
4.277ThrAsp: 4.277 ± 0.518
4.216ThrGlu: 4.216 ± 0.588
2.261ThrPhe: 2.261 ± 0.392
5.437ThrGly: 5.437 ± 0.887
0.733ThrHis: 0.733 ± 0.21
2.505ThrIle: 2.505 ± 0.605
2.138ThrLys: 2.138 ± 0.269
5.499ThrLeu: 5.499 ± 0.563
0.855ThrMet: 0.855 ± 0.253
1.527ThrAsn: 1.527 ± 0.349
3.543ThrPro: 3.543 ± 0.53
1.772ThrGln: 1.772 ± 0.412
3.421ThrArg: 3.421 ± 0.473
3.788ThrSer: 3.788 ± 0.589
4.032ThrThr: 4.032 ± 0.652
6.048ThrVal: 6.048 ± 0.499
1.222ThrTrp: 1.222 ± 0.287
2.383ThrTyr: 2.383 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
8.798ValAla: 8.798 ± 0.664
0.55ValCys: 0.55 ± 0.185
4.154ValAsp: 4.154 ± 0.403
5.499ValGlu: 5.499 ± 0.481
1.833ValPhe: 1.833 ± 0.348
5.132ValGly: 5.132 ± 0.711
1.833ValHis: 1.833 ± 0.318
3.299ValIle: 3.299 ± 0.434
3.116ValLys: 3.116 ± 0.439
5.621ValLeu: 5.621 ± 0.533
2.016ValMet: 2.016 ± 0.289
2.81ValAsn: 2.81 ± 0.583
3.177ValPro: 3.177 ± 0.527
2.81ValGln: 2.81 ± 0.434
4.826ValArg: 4.826 ± 0.465
3.849ValSer: 3.849 ± 0.537
5.071ValThr: 5.071 ± 0.49
4.949ValVal: 4.949 ± 0.62
1.588ValTrp: 1.588 ± 0.274
1.955ValTyr: 1.955 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
2.383TrpAla: 2.383 ± 0.347
0.305TrpCys: 0.305 ± 0.122
1.466TrpAsp: 1.466 ± 0.321
1.466TrpGlu: 1.466 ± 0.299
0.611TrpPhe: 0.611 ± 0.202
1.283TrpGly: 1.283 ± 0.27
0.428TrpHis: 0.428 ± 0.159
0.367TrpIle: 0.367 ± 0.135
1.161TrpLys: 1.161 ± 0.271
1.222TrpLeu: 1.222 ± 0.3
0.489TrpMet: 0.489 ± 0.151
0.489TrpAsn: 0.489 ± 0.155
0.794TrpPro: 0.794 ± 0.21
0.305TrpGln: 0.305 ± 0.147
1.405TrpArg: 1.405 ± 0.256
1.283TrpSer: 1.283 ± 0.25
1.711TrpThr: 1.711 ± 0.321
1.405TrpVal: 1.405 ± 0.308
0.061TrpTrp: 0.061 ± 0.064
0.489TrpTyr: 0.489 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.116TyrAla: 3.116 ± 0.494
0.122TyrCys: 0.122 ± 0.076
2.566TyrAsp: 2.566 ± 0.32
2.444TyrGlu: 2.444 ± 0.451
0.978TyrPhe: 0.978 ± 0.232
3.055TyrGly: 3.055 ± 0.544
0.611TyrHis: 0.611 ± 0.208
0.916TyrIle: 0.916 ± 0.247
1.039TyrLys: 1.039 ± 0.263
1.955TyrLeu: 1.955 ± 0.299
0.428TyrMet: 0.428 ± 0.138
0.794TyrAsn: 0.794 ± 0.224
1.283TyrPro: 1.283 ± 0.398
0.611TyrGln: 0.611 ± 0.208
2.505TyrArg: 2.505 ± 0.463
2.016TyrSer: 2.016 ± 0.397
1.344TyrThr: 1.344 ± 0.275
1.527TyrVal: 1.527 ± 0.257
0.672TyrTrp: 0.672 ± 0.187
0.672TyrTyr: 0.672 ± 0.197
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (16369 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski