Amino acid dipeptide frequency for Streptococcus phage Javan181

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
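The counting and the comparison with the uniform expectation can be sketched in Python. This is only an illustration under stated assumptions: the page does not say how the database counts pairs or what threshold decides the red/blue marking. The sketch assumes overlapping adjacent pairs counted within each protein, normalization by the total number of pairs, and a plain comparison against 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes; anything else is mapped to X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: non-standard or unknown residues are collapsed into X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each protein is scanned on its own, so no pair spans two proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation: 1 in 400 pairs, i.e. 0.25% or 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(STANDARD) ** 2


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the Javan181 proteome).
toy = ["MKLLE", "MLLK"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

With real data one would pass the protein sequences of the proteome instead of `toy`; the classification of a table value works the same way, for example `classify(4.29)` for AlaAla.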

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
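As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (1 percent = 10 per mille)."""
    return value / 10.0


ala_ala = 4.29  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```

In the same units, the uniform expectation of 0.25% corresponds to 2.5‰, which is the value the table entries are compared against.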

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.29 ± 0.882
AlaCys: 0.585 ± 0.207
AlaAsp: 4.778 ± 0.54
AlaGlu: 3.9 ± 0.704
AlaPhe: 2.73 ± 0.454
AlaGly: 4.583 ± 0.777
AlaHis: 0.585 ± 0.205
AlaIle: 5.265 ± 1.167
AlaLys: 5.753 ± 0.547
AlaLeu: 6.24 ± 1.17
AlaMet: 1.755 ± 0.371
AlaAsn: 3.51 ± 0.488
AlaPro: 1.658 ± 0.388
AlaGln: 2.535 ± 0.544
AlaArg: 3.12 ± 0.625
AlaSer: 4.778 ± 0.775
AlaThr: 5.753 ± 0.915
AlaVal: 4.778 ± 0.782
AlaTrp: 0.585 ± 0.203
AlaTyr: 3.998 ± 0.627
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.195 ± 0.219
CysCys: 0.195 ± 0.116
CysAsp: 0.39 ± 0.201
CysGlu: 0.683 ± 0.218
CysPhe: 0.098 ± 0.108
CysGly: 0.488 ± 0.214
CysHis: 0.098 ± 0.112
CysIle: 0.39 ± 0.177
CysLys: 0.293 ± 0.208
CysLeu: 0.975 ± 0.308
CysMet: 0.098 ± 0.09
CysAsn: 0.195 ± 0.127
CysPro: 0.293 ± 0.205
CysGln: 0.39 ± 0.18
CysArg: 0.488 ± 0.206
CysSer: 0.585 ± 0.24
CysThr: 0.39 ± 0.186
CysVal: 0.488 ± 0.166
CysTrp: 0.0 ± 0.0
CysTyr: 0.878 ± 0.294
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.413 ± 0.475
AspCys: 0.585 ± 0.197
AspAsp: 3.803 ± 1.084
AspGlu: 5.655 ± 0.58
AspPhe: 2.925 ± 0.467
AspGly: 5.07 ± 0.664
AspHis: 0.975 ± 0.31
AspIle: 3.12 ± 0.458
AspLys: 4.68 ± 0.511
AspLeu: 6.435 ± 0.831
AspMet: 2.048 ± 0.414
AspAsn: 2.633 ± 0.553
AspPro: 1.463 ± 0.525
AspGln: 2.048 ± 0.49
AspArg: 2.145 ± 0.608
AspSer: 3.023 ± 0.496
AspThr: 3.12 ± 0.537
AspVal: 3.608 ± 0.537
AspTrp: 0.78 ± 0.244
AspTyr: 3.803 ± 0.881
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.753 ± 0.836
GluCys: 0.78 ± 0.305
GluAsp: 3.705 ± 0.611
GluGlu: 6.923 ± 0.89
GluPhe: 2.438 ± 0.458
GluGly: 4.973 ± 0.643
GluHis: 1.17 ± 0.315
GluIle: 3.608 ± 0.441
GluLys: 6.143 ± 0.713
GluLeu: 9.068 ± 1.077
GluMet: 2.145 ± 0.676
GluAsn: 3.023 ± 0.575
GluPro: 1.658 ± 0.485
GluGln: 3.413 ± 0.506
GluArg: 2.34 ± 0.443
GluSer: 3.218 ± 0.551
GluThr: 5.07 ± 0.691
GluVal: 5.07 ± 0.698
GluTrp: 0.585 ± 0.165
GluTyr: 1.56 ± 0.331
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.048 ± 0.591
PheCys: 0.585 ± 0.244
PheAsp: 2.925 ± 0.457
PheGlu: 2.73 ± 0.505
PhePhe: 1.463 ± 0.397
PheGly: 3.12 ± 0.431
PheHis: 0.78 ± 0.31
PheIle: 2.145 ± 0.511
PheLys: 3.413 ± 0.704
PheLeu: 2.828 ± 0.578
PheMet: 0.975 ± 0.315
PheAsn: 2.145 ± 0.339
PhePro: 0.683 ± 0.315
PheGln: 1.17 ± 0.315
PheArg: 1.073 ± 0.237
PheSer: 2.925 ± 0.556
PheThr: 1.95 ± 0.469
PheVal: 1.95 ± 0.403
PheTrp: 0.683 ± 0.238
PheTyr: 2.243 ± 0.418
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.413 ± 0.592
GlyCys: 0.293 ± 0.17
GlyAsp: 4.583 ± 0.618
GlyGlu: 3.12 ± 0.471
GlyPhe: 2.048 ± 0.325
GlyGly: 5.558 ± 0.805
GlyHis: 2.145 ± 0.468
GlyIle: 5.168 ± 0.692
GlyLys: 5.168 ± 0.778
GlyLeu: 5.753 ± 0.838
GlyMet: 2.34 ± 0.398
GlyAsn: 3.705 ± 0.814
GlyPro: 0.78 ± 0.206
GlyGln: 2.925 ± 0.466
GlyArg: 3.413 ± 0.572
GlySer: 5.07 ± 1.101
GlyThr: 4.68 ± 0.646
GlyVal: 5.46 ± 0.886
GlyTrp: 0.78 ± 0.233
GlyTyr: 2.438 ± 0.459
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.975 ± 0.26
HisCys: 0.098 ± 0.11
HisAsp: 1.365 ± 0.365
HisGlu: 0.878 ± 0.334
HisPhe: 1.073 ± 0.335
HisGly: 1.17 ± 0.315
HisHis: 0.78 ± 0.255
HisIle: 1.073 ± 0.29
HisLys: 1.073 ± 0.279
HisLeu: 1.853 ± 0.374
HisMet: 0.39 ± 0.168
HisAsn: 1.365 ± 0.368
HisPro: 1.073 ± 0.378
HisGln: 0.488 ± 0.177
HisArg: 1.17 ± 0.387
HisSer: 1.073 ± 0.27
HisThr: 0.78 ± 0.309
HisVal: 1.17 ± 0.342
HisTrp: 0.293 ± 0.146
HisTyr: 0.585 ± 0.31
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.778 ± 0.517
IleCys: 0.78 ± 0.315
IleAsp: 4.68 ± 0.727
IleGlu: 3.9 ± 0.522
IlePhe: 1.853 ± 0.539
IleGly: 4.485 ± 0.709
IleHis: 0.878 ± 0.257
IleIle: 3.315 ± 0.583
IleLys: 4.875 ± 0.965
IleLeu: 5.558 ± 0.57
IleMet: 0.975 ± 0.277
IleAsn: 2.633 ± 0.482
IlePro: 1.95 ± 0.416
IleGln: 2.243 ± 0.443
IleArg: 2.243 ± 0.539
IleSer: 3.413 ± 0.574
IleThr: 4.583 ± 0.66
IleVal: 3.9 ± 0.724
IleTrp: 0.878 ± 0.343
IleTyr: 1.853 ± 0.386
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.118 ± 0.839
LysCys: 0.293 ± 0.139
LysAsp: 3.998 ± 0.738
LysGlu: 5.363 ± 0.661
LysPhe: 1.755 ± 0.317
LysGly: 4.973 ± 0.577
LysHis: 2.145 ± 0.483
LysIle: 3.705 ± 0.464
LysLys: 4.388 ± 0.526
LysLeu: 6.435 ± 1.066
LysMet: 1.268 ± 0.383
LysAsn: 3.218 ± 0.548
LysPro: 2.438 ± 0.413
LysGln: 3.9 ± 0.626
LysArg: 3.9 ± 0.67
LysSer: 3.9 ± 0.51
LysThr: 4.778 ± 0.721
LysVal: 5.265 ± 0.763
LysTrp: 0.975 ± 0.271
LysTyr: 1.755 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.41 ± 1.052
LeuCys: 0.488 ± 0.218
LeuAsp: 6.24 ± 0.701
LeuGlu: 7.898 ± 0.83
LeuPhe: 2.633 ± 0.542
LeuGly: 5.46 ± 0.809
LeuHis: 1.17 ± 0.372
LeuIle: 4.29 ± 0.519
LeuLys: 8.093 ± 0.837
LeuLeu: 7.995 ± 0.977
LeuMet: 2.535 ± 0.417
LeuAsn: 4.583 ± 0.778
LeuPro: 3.413 ± 0.658
LeuGln: 3.9 ± 0.587
LeuArg: 3.9 ± 0.679
LeuSer: 7.41 ± 0.898
LeuThr: 6.435 ± 0.811
LeuVal: 6.63 ± 0.877
LeuTrp: 0.78 ± 0.211
LeuTyr: 3.413 ± 0.692
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.145 ± 0.367
MetCys: 0.098 ± 0.11
MetAsp: 1.463 ± 0.401
MetGlu: 1.658 ± 0.487
MetPhe: 0.78 ± 0.282
MetGly: 2.145 ± 0.525
MetHis: 0.0 ± 0.0
MetIle: 1.268 ± 0.301
MetLys: 1.755 ± 0.362
MetLeu: 1.268 ± 0.434
MetMet: 0.975 ± 0.29
MetAsn: 0.683 ± 0.238
MetPro: 0.585 ± 0.175
MetGln: 0.683 ± 0.231
MetArg: 1.073 ± 0.323
MetSer: 2.145 ± 0.413
MetThr: 2.633 ± 0.584
MetVal: 1.56 ± 0.317
MetTrp: 0.098 ± 0.086
MetTyr: 0.293 ± 0.209
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.9 ± 0.619
AsnCys: 0.098 ± 0.09
AsnAsp: 2.145 ± 0.391
AsnGlu: 3.023 ± 0.522
AsnPhe: 2.243 ± 0.435
AsnGly: 4.778 ± 0.83
AsnHis: 0.585 ± 0.27
AsnIle: 2.243 ± 0.509
AsnLys: 3.023 ± 0.594
AsnLeu: 3.51 ± 0.559
AsnMet: 1.073 ± 0.283
AsnAsn: 2.145 ± 0.662
AsnPro: 1.95 ± 0.316
AsnGln: 1.95 ± 0.426
AsnArg: 1.463 ± 0.291
AsnSer: 3.803 ± 0.477
AsnThr: 2.633 ± 0.579
AsnVal: 2.438 ± 0.529
AsnTrp: 0.975 ± 0.325
AsnTyr: 1.073 ± 0.259
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.073 ± 0.305
ProCys: 0.293 ± 0.166
ProAsp: 1.658 ± 0.403
ProGlu: 2.145 ± 0.509
ProPhe: 1.073 ± 0.322
ProGly: 1.268 ± 0.403
ProHis: 0.78 ± 0.265
ProIle: 1.853 ± 0.418
ProLys: 2.438 ± 0.472
ProLeu: 2.925 ± 0.47
ProMet: 0.195 ± 0.121
ProAsn: 1.365 ± 0.402
ProPro: 0.975 ± 0.323
ProGln: 0.975 ± 0.241
ProArg: 1.073 ± 0.308
ProSer: 3.12 ± 0.559
ProThr: 3.12 ± 0.615
ProVal: 1.755 ± 0.412
ProTrp: 0.488 ± 0.196
ProTyr: 1.755 ± 0.432
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.51 ± 0.791
GlnCys: 0.195 ± 0.109
GlnAsp: 1.755 ± 0.379
GlnGlu: 3.218 ± 0.554
GlnPhe: 2.243 ± 0.55
GlnGly: 2.438 ± 0.589
GlnHis: 0.78 ± 0.262
GlnIle: 2.925 ± 0.446
GlnLys: 2.243 ± 0.373
GlnLeu: 4.193 ± 0.676
GlnMet: 0.878 ± 0.327
GlnAsn: 1.95 ± 0.445
GlnPro: 1.853 ± 0.424
GlnGln: 1.755 ± 0.466
GlnArg: 1.365 ± 0.25
GlnSer: 2.048 ± 0.296
GlnThr: 3.023 ± 0.455
GlnVal: 2.925 ± 0.445
GlnTrp: 0.585 ± 0.241
GlnTyr: 0.878 ± 0.285
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.73 ± 0.586
ArgCys: 0.488 ± 0.211
ArgAsp: 2.34 ± 0.551
ArgGlu: 2.828 ± 0.456
ArgPhe: 1.463 ± 0.367
ArgGly: 2.633 ± 0.441
ArgHis: 1.073 ± 0.34
ArgIle: 2.438 ± 0.471
ArgLys: 2.828 ± 0.618
ArgLeu: 4.388 ± 0.742
ArgMet: 0.585 ± 0.219
ArgAsn: 1.658 ± 0.403
ArgPro: 1.365 ± 0.487
ArgGln: 2.73 ± 0.42
ArgArg: 1.658 ± 0.429
ArgSer: 2.145 ± 0.343
ArgThr: 2.048 ± 0.493
ArgVal: 2.73 ± 0.594
ArgTrp: 0.683 ± 0.232
ArgTyr: 1.463 ± 0.392
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.46 ± 0.869
SerCys: 0.293 ± 0.183
SerAsp: 4.485 ± 0.687
SerGlu: 4.68 ± 0.791
SerPhe: 3.218 ± 0.574
SerGly: 4.68 ± 0.809
SerHis: 1.658 ± 0.38
SerIle: 4.583 ± 0.942
SerLys: 3.9 ± 0.431
SerLeu: 6.533 ± 0.754
SerMet: 1.073 ± 0.321
SerAsn: 2.34 ± 0.511
SerPro: 2.34 ± 0.443
SerGln: 2.243 ± 0.424
SerArg: 2.243 ± 0.393
SerSer: 5.753 ± 1.115
SerThr: 4.29 ± 0.789
SerVal: 5.265 ± 0.597
SerTrp: 1.463 ± 0.237
SerTyr: 1.755 ± 0.389
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.973 ± 0.867
ThrCys: 0.293 ± 0.227
ThrAsp: 3.998 ± 0.669
ThrGlu: 4.875 ± 0.6
ThrPhe: 3.315 ± 0.725
ThrGly: 4.485 ± 0.647
ThrHis: 0.975 ± 0.269
ThrIle: 5.363 ± 1.043
ThrLys: 4.973 ± 0.55
ThrLeu: 6.143 ± 0.985
ThrMet: 1.365 ± 0.318
ThrAsn: 3.023 ± 0.506
ThrPro: 2.34 ± 0.814
ThrGln: 1.755 ± 0.564
ThrArg: 2.145 ± 0.435
ThrSer: 5.46 ± 0.979
ThrThr: 4.68 ± 0.701
ThrVal: 5.85 ± 0.697
ThrTrp: 1.365 ± 0.402
ThrTyr: 2.438 ± 0.394
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.9 ± 0.834
ValCys: 0.488 ± 0.247
ValAsp: 3.803 ± 0.693
ValGlu: 4.973 ± 0.721
ValPhe: 2.243 ± 0.452
ValGly: 3.413 ± 0.48
ValHis: 1.17 ± 0.332
ValIle: 4.68 ± 0.703
ValLys: 4.388 ± 0.628
ValLeu: 7.8 ± 0.894
ValMet: 1.56 ± 0.43
ValAsn: 2.048 ± 0.362
ValPro: 1.95 ± 0.511
ValGln: 2.535 ± 0.37
ValArg: 2.633 ± 0.472
ValSer: 5.363 ± 0.903
ValThr: 6.923 ± 0.762
ValVal: 4.388 ± 0.662
ValTrp: 1.268 ± 0.483
ValTyr: 2.828 ± 0.489
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.78 ± 0.291
TrpCys: 0.195 ± 0.144
TrpAsp: 0.585 ± 0.18
TrpGlu: 1.17 ± 0.284
TrpPhe: 0.78 ± 0.319
TrpGly: 0.585 ± 0.226
TrpHis: 0.195 ± 0.12
TrpIle: 0.683 ± 0.254
TrpLys: 0.585 ± 0.296
TrpLeu: 1.268 ± 0.243
TrpMet: 0.488 ± 0.166
TrpAsn: 1.268 ± 0.428
TrpPro: 0.098 ± 0.088
TrpGln: 0.878 ± 0.27
TrpArg: 0.975 ± 0.324
TrpSer: 1.073 ± 0.344
TrpThr: 0.78 ± 0.278
TrpVal: 0.878 ± 0.287
TrpTrp: 0.195 ± 0.12
TrpTyr: 0.195 ± 0.131
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.315 ± 0.561
TyrCys: 0.488 ± 0.215
TyrAsp: 2.633 ± 0.529
TyrGlu: 3.023 ± 0.541
TyrPhe: 1.463 ± 0.355
TyrGly: 2.535 ± 0.455
TyrHis: 0.78 ± 0.258
TyrIle: 1.658 ± 0.441
TyrLys: 1.56 ± 0.36
TyrLeu: 3.51 ± 0.751
TyrMet: 0.488 ± 0.2
TyrAsn: 1.658 ± 0.371
TyrPro: 1.463 ± 0.273
TyrGln: 2.243 ± 0.428
TyrArg: 1.853 ± 0.336
TyrSer: 1.95 ± 0.447
TyrThr: 2.145 ± 0.525
TyrVal: 2.243 ± 0.426
TyrTrp: 0.195 ± 0.131
TyrTyr: 0.878 ± 0.308
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (10257 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
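A protein-level bootstrap of this kind can be sketched in Python as follows. This is a minimal illustration, not the database's actual procedure, which the page does not describe. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation over replicates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, counted within each protein separately.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from the Javan181 proteome).
toy = ["MKLLE", "MLLK", "AAKL", "MEELLK"]
value = pair_per_mille(toy, "LL")
error = bootstrap_error(toy, "LL")
print(f"LeuLeu: {value:.3f} ± {error:.3f}")
```

A dipeptide that never occurs has a frequency of zero in every replicate, so its error is zero as well, which is consistent with the 0.0 ± 0.0 entries in the Xaa rows and columns.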

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski