Amino acid dipeptide frequency for Streptococcus phage Javan472

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
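A minimal sketch of how such frequencies could be computed. It assumes that overlapping pairs are counted within each protein and never across protein boundaries; the database's exact counting convention is not stated on this page. The function name dipeptide_per_mille and the toy sequences are illustrative only, and one-letter codes are used instead of the three-letter codes of the table.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale so that all frequencies sum to 1000 per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Javan472 proteome.
freqs = dipeptide_per_mille(["MKKLA", "MKA"])
print(freqs["MK"])  # "MK" occurs in 2 of the 6 counted pairs
```

Order matters here: "MK" and "KM" are counted as different dipeptides, which is why the table has separate AlaCys and CysAla entries.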

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and underrepresented ones in blue.
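The comparison against the uniform expectation can be sketched as follows. This assumes the baseline is simply 1/400 of all dipeptides, as the text suggests; the exact threshold used for the red and blue marking is not stated here, so the function representation and its labels are hypothetical. The four values are copied from the table.

```python
STANDARD_PAIRS = 20 * 20                       # 400 dipeptides without Xaa
EXPECTED_PER_MILLE = 1000.0 / STANDARD_PAIRS   # 2.5 per mille, i.e. 0.25%


def representation(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Values in per mille, taken from the table.
examples = {"GluLeu": 8.654, "AlaAla": 4.647, "AlaPhe": 2.564, "CysCys": 0.08}
labels = {pair: representation(value) for pair, value in examples.items()}
print(labels)
```

With this baseline, GluLeu, AlaAla and AlaPhe come out as overrepresented and CysCys as underrepresented.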

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
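As a small worked example of the unit conversion, the helpers below (hypothetical names) turn a tabulated per mille value into a fraction or a percentage, using the AlaAla entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 4.647  # AlaAla, in per mille, from the table
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```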

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.647 ± 0.903
AlaCys: 0.321 ± 0.117
AlaAsp: 4.968 ± 0.606
AlaGlu: 6.891 ± 0.727
AlaPhe: 2.564 ± 0.618
AlaGly: 4.808 ± 0.899
AlaHis: 0.481 ± 0.168
AlaIle: 6.41 ± 0.754
AlaLys: 6.571 ± 0.749
AlaLeu: 7.452 ± 0.87
AlaMet: 2.083 ± 0.457
AlaAsn: 5.609 ± 0.924
AlaPro: 1.522 ± 0.322
AlaGln: 3.846 ± 0.817
AlaArg: 2.564 ± 0.503
AlaSer: 4.808 ± 1.113
AlaThr: 6.01 ± 1.113
AlaVal: 4.888 ± 0.777
AlaTrp: 0.481 ± 0.188
AlaTyr: 3.446 ± 0.651
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.08 ± 0.085
CysCys: 0.08 ± 0.085
CysAsp: 0.24 ± 0.147
CysGlu: 0.561 ± 0.247
CysPhe: 0.0 ± 0.0
CysGly: 0.16 ± 0.113
CysHis: 0.08 ± 0.071
CysIle: 0.16 ± 0.105
CysLys: 0.321 ± 0.176
CysLeu: 0.561 ± 0.247
CysMet: 0.16 ± 0.122
CysAsn: 0.16 ± 0.126
CysPro: 0.16 ± 0.127
CysGln: 0.481 ± 0.164
CysArg: 0.24 ± 0.16
CysSer: 0.16 ± 0.107
CysThr: 0.321 ± 0.192
CysVal: 0.24 ± 0.13
CysTrp: 0.24 ± 0.108
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.446 ± 0.451
AspCys: 0.16 ± 0.11
AspAsp: 3.526 ± 0.546
AspGlu: 5.048 ± 0.763
AspPhe: 3.125 ± 0.603
AspGly: 4.487 ± 0.757
AspHis: 0.401 ± 0.203
AspIle: 4.647 ± 0.571
AspLys: 6.571 ± 0.8
AspLeu: 5.609 ± 0.867
AspMet: 1.522 ± 0.304
AspAsn: 3.125 ± 0.529
AspPro: 1.603 ± 0.379
AspGln: 1.202 ± 0.294
AspArg: 2.804 ± 0.558
AspSer: 3.125 ± 0.516
AspThr: 2.965 ± 0.459
AspVal: 3.125 ± 0.594
AspTrp: 0.641 ± 0.211
AspTyr: 3.365 ± 0.626
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.208 ± 0.717
GluCys: 0.321 ± 0.16
GluAsp: 3.365 ± 0.573
GluGlu: 6.571 ± 0.876
GluPhe: 2.564 ± 0.451
GluGly: 3.446 ± 0.486
GluHis: 1.282 ± 0.398
GluIle: 5.369 ± 0.848
GluLys: 5.529 ± 0.841
GluLeu: 8.654 ± 1.162
GluMet: 1.763 ± 0.328
GluAsn: 3.846 ± 0.634
GluPro: 2.083 ± 0.385
GluGln: 2.484 ± 0.483
GluArg: 3.686 ± 0.731
GluSer: 3.766 ± 0.432
GluThr: 3.766 ± 0.556
GluVal: 4.087 ± 0.647
GluTrp: 1.042 ± 0.271
GluTyr: 3.045 ± 0.549
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.285 ± 0.727
PheCys: 0.08 ± 0.076
PheAsp: 3.125 ± 0.554
PheGlu: 2.804 ± 0.454
PhePhe: 1.362 ± 0.385
PheGly: 2.244 ± 0.554
PheHis: 0.16 ± 0.118
PheIle: 2.003 ± 0.412
PheLys: 3.846 ± 0.617
PheLeu: 2.804 ± 0.538
PheMet: 0.721 ± 0.261
PheAsn: 2.724 ± 0.375
PhePro: 0.481 ± 0.162
PheGln: 0.721 ± 0.241
PheArg: 1.522 ± 0.282
PheSer: 2.804 ± 0.365
PheThr: 2.885 ± 0.404
PheVal: 1.843 ± 0.263
PheTrp: 0.641 ± 0.25
PheTyr: 1.282 ± 0.388
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.369 ± 0.645
GlyCys: 0.0 ± 0.0
GlyAsp: 3.365 ± 0.463
GlyGlu: 3.606 ± 0.562
GlyPhe: 2.804 ± 0.528
GlyGly: 4.167 ± 0.534
GlyHis: 1.122 ± 0.34
GlyIle: 4.567 ± 1.051
GlyLys: 6.17 ± 0.659
GlyLeu: 6.651 ± 0.728
GlyMet: 1.923 ± 0.419
GlyAsn: 2.083 ± 0.402
GlyPro: 0.721 ± 0.208
GlyGln: 2.003 ± 0.299
GlyArg: 2.885 ± 0.583
GlySer: 3.125 ± 0.466
GlyThr: 4.487 ± 0.618
GlyVal: 4.647 ± 0.608
GlyTrp: 0.801 ± 0.215
GlyTyr: 2.885 ± 0.511
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.962 ± 0.346
HisCys: 0.08 ± 0.083
HisAsp: 0.401 ± 0.211
HisGlu: 0.962 ± 0.266
HisPhe: 0.641 ± 0.226
HisGly: 0.801 ± 0.363
HisHis: 0.321 ± 0.168
HisIle: 0.962 ± 0.297
HisLys: 0.801 ± 0.213
HisLeu: 0.962 ± 0.257
HisMet: 0.0 ± 0.0
HisAsn: 0.641 ± 0.212
HisPro: 0.721 ± 0.271
HisGln: 0.16 ± 0.111
HisArg: 0.561 ± 0.206
HisSer: 0.881 ± 0.267
HisThr: 0.962 ± 0.315
HisVal: 0.721 ± 0.271
HisTrp: 0.0 ± 0.0
HisTyr: 0.401 ± 0.173
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.689 ± 1.03
IleCys: 0.24 ± 0.134
IleAsp: 6.571 ± 0.603
IleGlu: 4.728 ± 0.871
IlePhe: 2.324 ± 0.401
IleGly: 3.846 ± 0.595
IleHis: 0.721 ± 0.207
IleIle: 4.647 ± 0.807
IleLys: 7.692 ± 0.86
IleLeu: 5.609 ± 0.675
IleMet: 1.202 ± 0.226
IleAsn: 4.087 ± 0.77
IlePro: 2.324 ± 0.403
IleGln: 2.244 ± 0.613
IleArg: 1.923 ± 0.348
IleSer: 4.167 ± 0.671
IleThr: 4.006 ± 0.573
IleVal: 4.167 ± 0.589
IleTrp: 0.561 ± 0.217
IleTyr: 3.365 ± 0.605
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.971 ± 0.654
LysCys: 0.321 ± 0.144
LysAsp: 5.208 ± 0.738
LysGlu: 5.849 ± 0.705
LysPhe: 1.923 ± 0.361
LysGly: 4.888 ± 0.602
LysHis: 1.362 ± 0.459
LysIle: 7.292 ± 0.752
LysLys: 7.372 ± 1.088
LysLeu: 7.692 ± 0.765
LysMet: 2.564 ± 0.359
LysAsn: 4.567 ± 0.605
LysPro: 2.885 ± 0.48
LysGln: 3.926 ± 0.602
LysArg: 4.407 ± 0.717
LysSer: 3.766 ± 0.433
LysThr: 5.609 ± 0.702
LysVal: 4.888 ± 0.675
LysTrp: 1.282 ± 0.38
LysTyr: 3.446 ± 0.552
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.891 ± 0.723
LeuCys: 0.16 ± 0.112
LeuAsp: 6.33 ± 0.658
LeuGlu: 6.41 ± 0.922
LeuPhe: 2.724 ± 0.411
LeuGly: 6.33 ± 0.758
LeuHis: 0.801 ± 0.263
LeuIle: 4.888 ± 0.588
LeuLys: 8.574 ± 0.926
LeuLeu: 6.811 ± 0.811
LeuMet: 2.324 ± 0.447
LeuAsn: 5.689 ± 0.557
LeuPro: 4.167 ± 0.813
LeuGln: 3.926 ± 0.407
LeuArg: 3.766 ± 0.744
LeuSer: 6.891 ± 0.845
LeuThr: 5.128 ± 0.588
LeuVal: 6.01 ± 0.579
LeuTrp: 0.721 ± 0.23
LeuTyr: 2.484 ± 0.584
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.125 ± 0.681
MetCys: 0.08 ± 0.085
MetAsp: 1.282 ± 0.301
MetGlu: 1.603 ± 0.382
MetPhe: 0.801 ± 0.309
MetGly: 1.362 ± 0.312
MetHis: 0.16 ± 0.101
MetIle: 1.603 ± 0.322
MetLys: 1.122 ± 0.286
MetLeu: 1.843 ± 0.344
MetMet: 0.561 ± 0.236
MetAsn: 1.362 ± 0.352
MetPro: 0.801 ± 0.232
MetGln: 0.962 ± 0.236
MetArg: 1.442 ± 0.314
MetSer: 2.564 ± 0.56
MetThr: 1.603 ± 0.312
MetVal: 0.881 ± 0.319
MetTrp: 0.08 ± 0.061
MetTyr: 0.321 ± 0.153
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.048 ± 0.687
AsnCys: 0.321 ± 0.151
AsnAsp: 2.804 ± 0.471
AsnGlu: 3.125 ± 0.628
AsnPhe: 2.404 ± 0.557
AsnGly: 4.247 ± 0.53
AsnHis: 0.641 ± 0.223
AsnIle: 4.087 ± 0.488
AsnLys: 3.526 ± 0.506
AsnLeu: 4.888 ± 0.559
AsnMet: 1.042 ± 0.231
AsnAsn: 3.125 ± 0.59
AsnPro: 2.163 ± 0.427
AsnGln: 1.763 ± 0.297
AsnArg: 2.324 ± 0.384
AsnSer: 4.487 ± 0.892
AsnThr: 3.045 ± 0.483
AsnVal: 2.804 ± 0.494
AsnTrp: 0.481 ± 0.176
AsnTyr: 2.244 ± 0.439
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.522 ± 0.365
ProCys: 0.24 ± 0.193
ProAsp: 1.282 ± 0.341
ProGlu: 2.324 ± 0.503
ProPhe: 1.282 ± 0.404
ProGly: 1.923 ± 0.449
ProHis: 0.321 ± 0.198
ProIle: 1.202 ± 0.311
ProLys: 2.484 ± 0.503
ProLeu: 2.083 ± 0.398
ProMet: 0.481 ± 0.225
ProAsn: 2.003 ± 0.356
ProPro: 0.481 ± 0.198
ProGln: 1.763 ± 0.396
ProArg: 1.202 ± 0.38
ProSer: 1.843 ± 0.411
ProThr: 1.843 ± 0.343
ProVal: 2.244 ± 0.491
ProTrp: 0.16 ± 0.128
ProTyr: 0.881 ± 0.346
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.686 ± 0.648
GlnCys: 0.16 ± 0.104
GlnAsp: 1.843 ± 0.323
GlnGlu: 3.205 ± 0.603
GlnPhe: 1.763 ± 0.324
GlnGly: 2.644 ± 0.429
GlnHis: 0.24 ± 0.161
GlnIle: 3.446 ± 0.685
GlnLys: 3.045 ± 0.501
GlnLeu: 3.846 ± 0.617
GlnMet: 1.122 ± 0.291
GlnAsn: 2.564 ± 0.546
GlnPro: 0.24 ± 0.126
GlnGln: 1.522 ± 0.461
GlnArg: 1.522 ± 0.219
GlnSer: 3.125 ± 0.622
GlnThr: 2.404 ± 0.485
GlnVal: 1.122 ± 0.23
GlnTrp: 0.401 ± 0.203
GlnTyr: 0.881 ± 0.28
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.285 ± 0.581
ArgCys: 0.401 ± 0.172
ArgAsp: 2.083 ± 0.54
ArgGlu: 2.484 ± 0.431
ArgPhe: 2.083 ± 0.38
ArgGly: 1.843 ± 0.356
ArgHis: 0.641 ± 0.201
ArgIle: 3.365 ± 0.479
ArgLys: 3.365 ± 0.633
ArgLeu: 5.849 ± 0.833
ArgMet: 1.603 ± 0.349
ArgAsn: 2.163 ± 0.429
ArgPro: 0.801 ± 0.188
ArgGln: 1.522 ± 0.316
ArgArg: 1.442 ± 0.397
ArgSer: 2.163 ± 0.405
ArgThr: 2.644 ± 0.552
ArgVal: 2.404 ± 0.541
ArgTrp: 0.561 ± 0.212
ArgTyr: 1.522 ± 0.381
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.811 ± 1.53
SerCys: 0.24 ± 0.142
SerAsp: 3.606 ± 0.707
SerGlu: 4.087 ± 0.546
SerPhe: 1.923 ± 0.512
SerGly: 4.888 ± 0.851
SerHis: 0.801 ± 0.33
SerIle: 4.327 ± 0.722
SerLys: 5.048 ± 0.558
SerLeu: 5.609 ± 1.424
SerMet: 1.282 ± 0.479
SerAsn: 3.606 ± 0.636
SerPro: 1.603 ± 0.316
SerGln: 2.404 ± 0.448
SerArg: 2.724 ± 0.483
SerSer: 4.728 ± 1.457
SerThr: 3.526 ± 0.582
SerVal: 4.888 ± 0.75
SerTrp: 0.561 ± 0.19
SerTyr: 2.404 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.288 ± 0.725
ThrCys: 0.24 ± 0.182
ThrAsp: 3.446 ± 0.551
ThrGlu: 3.205 ± 0.476
ThrPhe: 2.885 ± 0.47
ThrGly: 5.128 ± 0.704
ThrHis: 0.881 ± 0.307
ThrIle: 3.846 ± 0.499
ThrLys: 5.208 ± 0.719
ThrLeu: 5.048 ± 0.612
ThrMet: 1.282 ± 0.299
ThrAsn: 1.923 ± 0.477
ThrPro: 2.163 ± 0.369
ThrGln: 2.484 ± 0.533
ThrArg: 2.244 ± 0.429
ThrSer: 4.968 ± 1.402
ThrThr: 4.808 ± 0.745
ThrVal: 4.247 ± 0.604
ThrTrp: 0.801 ± 0.35
ThrTyr: 2.484 ± 0.42
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.128 ± 0.579
ValCys: 0.321 ± 0.166
ValAsp: 3.606 ± 0.478
ValGlu: 4.407 ± 0.676
ValPhe: 2.083 ± 0.36
ValGly: 3.686 ± 0.615
ValHis: 0.881 ± 0.261
ValIle: 4.647 ± 0.655
ValLys: 4.808 ± 0.511
ValLeu: 4.487 ± 0.628
ValMet: 1.522 ± 0.363
ValAsn: 3.285 ± 0.497
ValPro: 1.282 ± 0.352
ValGln: 2.244 ± 0.61
ValArg: 2.484 ± 0.47
ValSer: 4.327 ± 0.83
ValThr: 4.167 ± 0.687
ValVal: 3.926 ± 0.401
ValTrp: 0.24 ± 0.122
ValTyr: 2.163 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.801 ± 0.231
TrpCys: 0.08 ± 0.085
TrpAsp: 0.721 ± 0.201
TrpGlu: 1.122 ± 0.259
TrpPhe: 0.561 ± 0.186
TrpGly: 0.401 ± 0.172
TrpHis: 0.16 ± 0.098
TrpIle: 0.641 ± 0.176
TrpLys: 0.801 ± 0.204
TrpLeu: 1.122 ± 0.351
TrpMet: 0.0 ± 0.0
TrpAsn: 0.481 ± 0.153
TrpPro: 0.481 ± 0.224
TrpGln: 0.801 ± 0.26
TrpArg: 0.481 ± 0.234
TrpSer: 0.641 ± 0.182
TrpThr: 0.08 ± 0.083
TrpVal: 0.561 ± 0.305
TrpTrp: 0.08 ± 0.071
TrpTyr: 0.08 ± 0.083
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.125 ± 0.464
TyrCys: 0.561 ± 0.218
TyrAsp: 2.724 ± 0.607
TyrGlu: 2.885 ± 0.502
TyrPhe: 1.522 ± 0.316
TyrGly: 2.083 ± 0.42
TyrHis: 0.481 ± 0.157
TyrIle: 2.003 ± 0.412
TyrLys: 3.446 ± 0.512
TyrLeu: 3.446 ± 0.623
TyrMet: 0.321 ± 0.15
TyrAsn: 1.522 ± 0.348
TyrPro: 0.881 ± 0.309
TyrGln: 2.324 ± 0.443
TyrArg: 2.003 ± 0.46
TyrSer: 2.724 ± 0.488
TyrThr: 2.324 ± 0.571
TyrVal: 2.003 ± 0.405
TyrTrp: 0.321 ± 0.148
TyrTyr: 2.003 ± 0.396
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12481 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
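A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is taken as the error. This is only an illustration under stated assumptions. Reporting the sample standard deviation, counting pairs within each protein, the fixed seed, and the names pair_per_mille and bootstrap_error are all choices made here, not details confirmed by the source.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not the Javan472 proteome.
toy = ["MKKLAEL", "MKAEL", "MSTKKL", "MAELK"]
print(bootstrap_error(toy, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a few proteins gets a larger error than one spread evenly across the proteome.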

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski