Amino acid dipepetide frequency for Streptococcus phage Javan169

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.668AlaAla: 3.668 ± 1.272
0.764AlaCys: 0.764 ± 0.219
4.508AlaAsp: 4.508 ± 0.565
5.654AlaGlu: 5.654 ± 0.707
1.299AlaPhe: 1.299 ± 0.303
4.508AlaGly: 4.508 ± 0.807
0.611AlaHis: 0.611 ± 0.201
4.432AlaIle: 4.432 ± 0.497
5.654AlaLys: 5.654 ± 0.641
5.654AlaLeu: 5.654 ± 1.095
1.834AlaMet: 1.834 ± 0.51
3.591AlaAsn: 3.591 ± 0.556
1.299AlaPro: 1.299 ± 0.31
3.515AlaGln: 3.515 ± 0.539
2.216AlaArg: 2.216 ± 0.361
4.661AlaSer: 4.661 ± 0.724
3.362AlaThr: 3.362 ± 0.704
4.585AlaVal: 4.585 ± 0.845
0.841AlaTrp: 0.841 ± 0.223
2.674AlaTyr: 2.674 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
0.153CysAla: 0.153 ± 0.098
0.153CysCys: 0.153 ± 0.087
0.229CysAsp: 0.229 ± 0.115
0.993CysGlu: 0.993 ± 0.322
0.458CysPhe: 0.458 ± 0.155
0.535CysGly: 0.535 ± 0.222
0.229CysHis: 0.229 ± 0.119
0.306CysIle: 0.306 ± 0.16
0.611CysLys: 0.611 ± 0.235
0.306CysLeu: 0.306 ± 0.149
0.153CysMet: 0.153 ± 0.1
0.458CysAsn: 0.458 ± 0.204
0.229CysPro: 0.229 ± 0.194
0.229CysGln: 0.229 ± 0.128
0.153CysArg: 0.153 ± 0.119
0.229CysSer: 0.229 ± 0.12
0.153CysThr: 0.153 ± 0.113
0.535CysVal: 0.535 ± 0.189
0.153CysTrp: 0.153 ± 0.121
0.076CysTyr: 0.076 ± 0.071
0.0CysXaa: 0.0 ± 0.0
Asp
3.133AspAla: 3.133 ± 0.544
0.688AspCys: 0.688 ± 0.238
5.502AspAsp: 5.502 ± 0.773
4.814AspGlu: 4.814 ± 0.704
3.591AspPhe: 3.591 ± 0.487
5.349AspGly: 5.349 ± 0.83
0.917AspHis: 0.917 ± 0.251
4.279AspIle: 4.279 ± 0.562
6.877AspLys: 6.877 ± 0.599
7.183AspLeu: 7.183 ± 0.59
2.063AspMet: 2.063 ± 0.424
3.821AspAsn: 3.821 ± 0.387
1.605AspPro: 1.605 ± 0.401
0.841AspGln: 0.841 ± 0.256
2.445AspArg: 2.445 ± 0.429
3.973AspSer: 3.973 ± 0.601
3.821AspThr: 3.821 ± 0.444
3.744AspVal: 3.744 ± 0.559
0.764AspTrp: 0.764 ± 0.268
3.668AspTyr: 3.668 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
4.661GluAla: 4.661 ± 0.624
0.306GluCys: 0.306 ± 0.17
3.821GluAsp: 3.821 ± 0.689
6.877GluGlu: 6.877 ± 0.725
2.216GluPhe: 2.216 ± 0.39
2.445GluGly: 2.445 ± 0.458
1.146GluHis: 1.146 ± 0.271
7.412GluIle: 7.412 ± 0.779
7.565GluLys: 7.565 ± 0.677
8.176GluLeu: 8.176 ± 0.938
2.292GluMet: 2.292 ± 0.42
3.286GluAsn: 3.286 ± 0.504
1.528GluPro: 1.528 ± 0.334
3.973GluGln: 3.973 ± 0.538
3.515GluArg: 3.515 ± 0.597
3.973GluSer: 3.973 ± 0.625
4.279GluThr: 4.279 ± 0.528
4.585GluVal: 4.585 ± 0.655
0.841GluTrp: 0.841 ± 0.206
3.209GluTyr: 3.209 ± 0.528
0.0GluXaa: 0.0 ± 0.0
Phe
2.369PheAla: 2.369 ± 0.436
0.153PheCys: 0.153 ± 0.11
3.439PheAsp: 3.439 ± 0.455
3.133PheGlu: 3.133 ± 0.501
0.764PhePhe: 0.764 ± 0.29
2.98PheGly: 2.98 ± 0.496
0.382PheHis: 0.382 ± 0.165
2.445PheIle: 2.445 ± 0.402
3.362PheLys: 3.362 ± 0.459
2.751PheLeu: 2.751 ± 0.606
1.07PheMet: 1.07 ± 0.205
2.292PheAsn: 2.292 ± 0.442
1.299PhePro: 1.299 ± 0.374
0.841PheGln: 0.841 ± 0.311
1.757PheArg: 1.757 ± 0.382
2.522PheSer: 2.522 ± 0.459
2.598PheThr: 2.598 ± 0.391
1.91PheVal: 1.91 ± 0.346
0.229PheTrp: 0.229 ± 0.146
1.91PheTyr: 1.91 ± 0.413
0.0PheXaa: 0.0 ± 0.0
Gly
3.897GlyAla: 3.897 ± 0.605
0.306GlyCys: 0.306 ± 0.131
3.973GlyAsp: 3.973 ± 0.7
3.897GlyGlu: 3.897 ± 0.656
2.369GlyPhe: 2.369 ± 0.423
4.126GlyGly: 4.126 ± 0.62
0.688GlyHis: 0.688 ± 0.224
5.043GlyIle: 5.043 ± 0.55
5.884GlyLys: 5.884 ± 0.739
5.578GlyLeu: 5.578 ± 0.794
1.91GlyMet: 1.91 ± 0.397
3.362GlyAsn: 3.362 ± 0.522
2.598GlyPro: 2.598 ± 1.531
2.598GlyGln: 2.598 ± 0.467
2.904GlyArg: 2.904 ± 0.376
2.674GlySer: 2.674 ± 0.331
2.292GlyThr: 2.292 ± 0.408
3.973GlyVal: 3.973 ± 0.455
1.146GlyTrp: 1.146 ± 0.267
2.674GlyTyr: 2.674 ± 0.569
0.0GlyXaa: 0.0 ± 0.0
His
0.458HisAla: 0.458 ± 0.172
0.076HisCys: 0.076 ± 0.064
0.611HisAsp: 0.611 ± 0.273
0.993HisGlu: 0.993 ± 0.24
0.535HisPhe: 0.535 ± 0.164
0.382HisGly: 0.382 ± 0.13
0.076HisHis: 0.076 ± 0.079
0.993HisIle: 0.993 ± 0.286
1.07HisLys: 1.07 ± 0.228
0.993HisLeu: 0.993 ± 0.259
0.229HisMet: 0.229 ± 0.122
0.841HisAsn: 0.841 ± 0.237
0.611HisPro: 0.611 ± 0.188
0.764HisGln: 0.764 ± 0.243
0.688HisArg: 0.688 ± 0.191
0.535HisSer: 0.535 ± 0.224
0.917HisThr: 0.917 ± 0.252
0.764HisVal: 0.764 ± 0.233
0.076HisTrp: 0.076 ± 0.075
0.611HisTyr: 0.611 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
4.126IleAla: 4.126 ± 0.513
0.535IleCys: 0.535 ± 0.216
5.196IleAsp: 5.196 ± 0.78
6.113IleGlu: 6.113 ± 0.614
2.14IlePhe: 2.14 ± 0.466
4.05IleGly: 4.05 ± 0.552
0.993IleHis: 0.993 ± 0.286
3.668IleIle: 3.668 ± 0.598
7.03IleLys: 7.03 ± 0.764
5.349IleLeu: 5.349 ± 0.668
1.452IleMet: 1.452 ± 0.311
4.432IleAsn: 4.432 ± 0.521
2.063IlePro: 2.063 ± 0.428
1.834IleGln: 1.834 ± 0.422
2.827IleArg: 2.827 ± 0.503
4.585IleSer: 4.585 ± 0.611
4.967IleThr: 4.967 ± 0.561
4.967IleVal: 4.967 ± 0.563
0.535IleTrp: 0.535 ± 0.184
2.751IleTyr: 2.751 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
7.259LysAla: 7.259 ± 0.711
0.458LysCys: 0.458 ± 0.225
5.196LysAsp: 5.196 ± 0.608
6.495LysGlu: 6.495 ± 0.678
2.751LysPhe: 2.751 ± 0.443
4.738LysGly: 4.738 ± 0.616
0.764LysHis: 0.764 ± 0.196
7.03LysIle: 7.03 ± 1.022
7.794LysLys: 7.794 ± 0.776
7.488LysLeu: 7.488 ± 1.009
3.056LysMet: 3.056 ± 0.427
5.654LysAsn: 5.654 ± 0.662
2.522LysPro: 2.522 ± 0.444
4.967LysGln: 4.967 ± 0.601
3.744LysArg: 3.744 ± 0.478
4.89LysSer: 4.89 ± 0.581
6.037LysThr: 6.037 ± 0.786
6.877LysVal: 6.877 ± 0.843
0.917LysTrp: 0.917 ± 0.27
3.668LysTyr: 3.668 ± 0.638
0.0LysXaa: 0.0 ± 0.0
Leu
5.502LeuAla: 5.502 ± 1.007
0.382LeuCys: 0.382 ± 0.197
7.87LeuAsp: 7.87 ± 0.706
7.259LeuGlu: 7.259 ± 0.938
3.056LeuPhe: 3.056 ± 0.633
4.967LeuGly: 4.967 ± 0.78
0.917LeuHis: 0.917 ± 0.284
5.884LeuIle: 5.884 ± 0.72
8.329LeuLys: 8.329 ± 0.879
6.113LeuLeu: 6.113 ± 0.753
1.757LeuMet: 1.757 ± 0.427
5.196LeuAsn: 5.196 ± 0.532
2.522LeuPro: 2.522 ± 0.46
3.133LeuGln: 3.133 ± 0.455
4.126LeuArg: 4.126 ± 0.42
4.738LeuSer: 4.738 ± 0.904
6.266LeuThr: 6.266 ± 0.647
4.661LeuVal: 4.661 ± 0.684
0.611LeuTrp: 0.611 ± 0.212
2.904LeuTyr: 2.904 ± 0.508
0.0LeuXaa: 0.0 ± 0.0
Met
2.522MetAla: 2.522 ± 0.441
0.076MetCys: 0.076 ± 0.08
1.223MetAsp: 1.223 ± 0.332
1.528MetGlu: 1.528 ± 0.305
1.528MetPhe: 1.528 ± 0.295
1.605MetGly: 1.605 ± 0.302
0.306MetHis: 0.306 ± 0.154
1.605MetIle: 1.605 ± 0.357
1.681MetLys: 1.681 ± 0.336
2.292MetLeu: 2.292 ± 0.41
0.458MetMet: 0.458 ± 0.167
1.223MetAsn: 1.223 ± 0.332
0.688MetPro: 0.688 ± 0.214
0.917MetGln: 0.917 ± 0.314
1.452MetArg: 1.452 ± 0.251
1.834MetSer: 1.834 ± 0.392
1.91MetThr: 1.91 ± 0.331
1.223MetVal: 1.223 ± 0.331
0.229MetTrp: 0.229 ± 0.115
0.764MetTyr: 0.764 ± 0.256
0.0MetXaa: 0.0 ± 0.0
Asn
3.439AsnAla: 3.439 ± 0.523
0.153AsnCys: 0.153 ± 0.097
4.279AsnAsp: 4.279 ± 0.514
3.821AsnGlu: 3.821 ± 0.608
2.063AsnPhe: 2.063 ± 0.389
4.738AsnGly: 4.738 ± 0.651
0.764AsnHis: 0.764 ± 0.275
3.209AsnIle: 3.209 ± 0.55
3.897AsnLys: 3.897 ± 0.499
4.89AsnLeu: 4.89 ± 0.574
1.452AsnMet: 1.452 ± 0.38
3.668AsnAsn: 3.668 ± 0.6
2.292AsnPro: 2.292 ± 0.358
2.674AsnGln: 2.674 ± 0.372
2.369AsnArg: 2.369 ± 0.497
3.515AsnSer: 3.515 ± 0.596
2.292AsnThr: 2.292 ± 0.42
3.133AsnVal: 3.133 ± 0.539
1.07AsnTrp: 1.07 ± 0.269
2.445AsnTyr: 2.445 ± 0.356
0.0AsnXaa: 0.0 ± 0.0
Pro
1.605ProAla: 1.605 ± 0.45
0.076ProCys: 0.076 ± 0.062
1.757ProAsp: 1.757 ± 0.345
1.757ProGlu: 1.757 ± 0.379
1.528ProPhe: 1.528 ± 0.277
1.146ProGly: 1.146 ± 0.378
0.611ProHis: 0.611 ± 0.171
1.91ProIle: 1.91 ± 0.352
3.439ProLys: 3.439 ± 0.634
2.598ProLeu: 2.598 ± 0.562
0.764ProMet: 0.764 ± 0.267
1.452ProAsn: 1.452 ± 0.415
0.917ProPro: 0.917 ± 0.395
2.14ProGln: 2.14 ± 0.412
0.917ProArg: 0.917 ± 0.291
1.757ProSer: 1.757 ± 0.378
2.14ProThr: 2.14 ± 0.349
1.299ProVal: 1.299 ± 0.376
0.153ProTrp: 0.153 ± 0.109
0.993ProTyr: 0.993 ± 0.328
0.0ProXaa: 0.0 ± 0.0
Gln
2.904GlnAla: 2.904 ± 0.439
0.229GlnCys: 0.229 ± 0.124
1.987GlnAsp: 1.987 ± 0.342
3.515GlnGlu: 3.515 ± 0.525
1.757GlnPhe: 1.757 ± 0.388
2.674GlnGly: 2.674 ± 0.549
0.535GlnHis: 0.535 ± 0.192
2.751GlnIle: 2.751 ± 0.318
3.515GlnLys: 3.515 ± 0.608
3.515GlnLeu: 3.515 ± 0.523
1.223GlnMet: 1.223 ± 0.309
2.369GlnAsn: 2.369 ± 0.523
0.764GlnPro: 0.764 ± 0.25
1.757GlnGln: 1.757 ± 0.415
1.834GlnArg: 1.834 ± 0.374
4.355GlnSer: 4.355 ± 0.586
2.598GlnThr: 2.598 ± 0.446
2.216GlnVal: 2.216 ± 0.509
0.535GlnTrp: 0.535 ± 0.225
0.917GlnTyr: 0.917 ± 0.24
0.0GlnXaa: 0.0 ± 0.0
Arg
2.369ArgAla: 2.369 ± 0.401
0.153ArgCys: 0.153 ± 0.102
2.674ArgAsp: 2.674 ± 0.446
2.904ArgGlu: 2.904 ± 0.448
1.605ArgPhe: 1.605 ± 0.263
2.216ArgGly: 2.216 ± 0.723
0.688ArgHis: 0.688 ± 0.238
3.515ArgIle: 3.515 ± 0.564
4.661ArgLys: 4.661 ± 0.57
3.973ArgLeu: 3.973 ± 0.583
0.993ArgMet: 0.993 ± 0.295
2.98ArgAsn: 2.98 ± 0.506
0.611ArgPro: 0.611 ± 0.203
2.063ArgGln: 2.063 ± 0.379
2.369ArgArg: 2.369 ± 0.425
1.375ArgSer: 1.375 ± 0.323
2.445ArgThr: 2.445 ± 0.526
2.445ArgVal: 2.445 ± 0.451
0.458ArgTrp: 0.458 ± 0.19
1.834ArgTyr: 1.834 ± 0.383
0.0ArgXaa: 0.0 ± 0.0
Ser
4.432SerAla: 4.432 ± 0.888
0.382SerCys: 0.382 ± 0.194
4.355SerAsp: 4.355 ± 0.538
4.203SerGlu: 4.203 ± 0.497
2.598SerPhe: 2.598 ± 0.454
4.661SerGly: 4.661 ± 0.624
0.841SerHis: 0.841 ± 0.253
4.432SerIle: 4.432 ± 0.652
5.043SerLys: 5.043 ± 0.551
3.973SerLeu: 3.973 ± 0.534
1.223SerMet: 1.223 ± 0.25
3.515SerAsn: 3.515 ± 0.693
1.605SerPro: 1.605 ± 0.334
3.591SerGln: 3.591 ± 0.514
2.216SerArg: 2.216 ± 0.361
2.904SerSer: 2.904 ± 0.46
2.063SerThr: 2.063 ± 0.463
3.668SerVal: 3.668 ± 0.468
0.535SerTrp: 0.535 ± 0.231
2.751SerTyr: 2.751 ± 0.447
0.0SerXaa: 0.0 ± 0.0
Thr
4.661ThrAla: 4.661 ± 0.779
0.306ThrCys: 0.306 ± 0.162
3.362ThrAsp: 3.362 ± 0.564
3.515ThrGlu: 3.515 ± 0.581
3.286ThrPhe: 3.286 ± 0.445
4.508ThrGly: 4.508 ± 0.681
0.458ThrHis: 0.458 ± 0.184
3.897ThrIle: 3.897 ± 0.487
5.502ThrLys: 5.502 ± 0.625
5.272ThrLeu: 5.272 ± 0.533
1.146ThrMet: 1.146 ± 0.332
2.674ThrAsn: 2.674 ± 0.475
1.91ThrPro: 1.91 ± 0.369
2.292ThrGln: 2.292 ± 0.373
2.292ThrArg: 2.292 ± 0.35
3.439ThrSer: 3.439 ± 0.506
3.897ThrThr: 3.897 ± 0.742
3.362ThrVal: 3.362 ± 0.428
0.764ThrTrp: 0.764 ± 0.217
2.216ThrTyr: 2.216 ± 0.419
0.0ThrXaa: 0.0 ± 0.0
Val
5.196ValAla: 5.196 ± 0.795
0.229ValCys: 0.229 ± 0.124
5.043ValAsp: 5.043 ± 0.606
4.738ValGlu: 4.738 ± 0.698
2.369ValPhe: 2.369 ± 0.411
3.362ValGly: 3.362 ± 0.427
0.611ValHis: 0.611 ± 0.175
4.05ValIle: 4.05 ± 0.52
5.654ValLys: 5.654 ± 0.631
6.189ValLeu: 6.189 ± 0.713
0.917ValMet: 0.917 ± 0.274
2.98ValAsn: 2.98 ± 0.511
1.834ValPro: 1.834 ± 0.346
0.917ValGln: 0.917 ± 0.256
2.14ValArg: 2.14 ± 0.366
4.738ValSer: 4.738 ± 0.559
3.362ValThr: 3.362 ± 0.508
3.056ValVal: 3.056 ± 0.474
0.764ValTrp: 0.764 ± 0.247
1.681ValTyr: 1.681 ± 0.414
0.0ValXaa: 0.0 ± 0.0
Trp
0.458TrpAla: 0.458 ± 0.169
0.076TrpCys: 0.076 ± 0.081
0.917TrpAsp: 0.917 ± 0.273
1.07TrpGlu: 1.07 ± 0.268
0.688TrpPhe: 0.688 ± 0.265
0.764TrpGly: 0.764 ± 0.258
0.229TrpHis: 0.229 ± 0.132
0.382TrpIle: 0.382 ± 0.156
1.146TrpLys: 1.146 ± 0.375
1.146TrpLeu: 1.146 ± 0.334
0.229TrpMet: 0.229 ± 0.127
0.458TrpAsn: 0.458 ± 0.193
0.306TrpPro: 0.306 ± 0.143
0.458TrpGln: 0.458 ± 0.2
0.535TrpArg: 0.535 ± 0.201
0.535TrpSer: 0.535 ± 0.165
0.688TrpThr: 0.688 ± 0.257
0.535TrpVal: 0.535 ± 0.215
0.0TrpTrp: 0.0 ± 0.0
0.458TrpTyr: 0.458 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.522TyrAla: 2.522 ± 0.424
0.841TyrCys: 0.841 ± 0.273
3.133TyrAsp: 3.133 ± 0.587
2.827TyrGlu: 2.827 ± 0.39
1.605TyrPhe: 1.605 ± 0.358
2.14TyrGly: 2.14 ± 0.428
0.458TyrHis: 0.458 ± 0.179
2.369TyrIle: 2.369 ± 0.467
3.591TyrLys: 3.591 ± 0.583
2.827TyrLeu: 2.827 ± 0.449
0.764TyrMet: 0.764 ± 0.24
1.91TyrAsn: 1.91 ± 0.43
1.757TyrPro: 1.757 ± 0.401
2.445TyrGln: 2.445 ± 0.378
1.834TyrArg: 1.834 ± 0.374
1.757TyrSer: 1.757 ± 0.403
2.674TyrThr: 2.674 ± 0.446
2.292TyrVal: 2.292 ± 0.391
0.458TyrTrp: 0.458 ± 0.19
1.375TyrTyr: 1.375 ± 0.291
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13088 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski