Amino acid dipepetide frequency for Streptococcus phage SW24

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.425AlaAla: 5.425 ± 2.766
0.301AlaCys: 0.301 ± 0.177
3.717AlaAsp: 3.717 ± 0.661
4.923AlaGlu: 4.923 ± 0.863
2.813AlaPhe: 2.813 ± 0.745
4.621AlaGly: 4.621 ± 1.226
0.703AlaHis: 0.703 ± 0.242
6.831AlaIle: 6.831 ± 1.796
6.229AlaLys: 6.229 ± 0.806
6.329AlaLeu: 6.329 ± 1.012
2.813AlaMet: 2.813 ± 1.023
4.822AlaAsn: 4.822 ± 0.804
2.913AlaPro: 2.913 ± 0.621
3.416AlaGln: 3.416 ± 0.778
2.712AlaArg: 2.712 ± 0.661
4.119AlaSer: 4.119 ± 0.885
5.124AlaThr: 5.124 ± 1.474
5.023AlaVal: 5.023 ± 1.365
0.502AlaTrp: 0.502 ± 0.236
2.009AlaTyr: 2.009 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
0.201CysAla: 0.201 ± 0.15
0.201CysCys: 0.201 ± 0.203
0.603CysAsp: 0.603 ± 0.257
0.703CysGlu: 0.703 ± 0.272
0.201CysPhe: 0.201 ± 0.147
0.301CysGly: 0.301 ± 0.163
0.201CysHis: 0.201 ± 0.154
0.301CysIle: 0.301 ± 0.138
0.502CysLys: 0.502 ± 0.269
0.502CysLeu: 0.502 ± 0.312
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.1CysPro: 0.1 ± 0.091
0.1CysGln: 0.1 ± 0.097
0.1CysArg: 0.1 ± 0.097
0.703CysSer: 0.703 ± 0.244
0.1CysThr: 0.1 ± 0.114
0.201CysVal: 0.201 ± 0.171
0.1CysTrp: 0.1 ± 0.102
0.301CysTyr: 0.301 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
3.114AspAla: 3.114 ± 0.622
0.301AspCys: 0.301 ± 0.179
5.124AspAsp: 5.124 ± 1.12
4.119AspGlu: 4.119 ± 0.643
4.018AspPhe: 4.018 ± 0.839
4.621AspGly: 4.621 ± 0.824
0.402AspHis: 0.402 ± 0.214
4.32AspIle: 4.32 ± 0.822
4.621AspLys: 4.621 ± 0.795
3.918AspLeu: 3.918 ± 0.971
1.808AspMet: 1.808 ± 0.358
3.918AspAsn: 3.918 ± 0.799
1.105AspPro: 1.105 ± 0.355
1.507AspGln: 1.507 ± 0.353
2.913AspArg: 2.913 ± 0.694
3.215AspSer: 3.215 ± 0.982
3.215AspThr: 3.215 ± 0.492
3.014AspVal: 3.014 ± 0.743
0.703AspTrp: 0.703 ± 0.309
2.411AspTyr: 2.411 ± 0.635
0.0AspXaa: 0.0 ± 0.0
Glu
4.822GluAla: 4.822 ± 0.857
0.1GluCys: 0.1 ± 0.095
3.617GluAsp: 3.617 ± 0.754
7.635GluGlu: 7.635 ± 1.721
2.913GluPhe: 2.913 ± 0.649
2.712GluGly: 2.712 ± 0.392
1.206GluHis: 1.206 ± 0.345
4.42GluIle: 4.42 ± 0.943
5.927GluLys: 5.927 ± 1.156
7.233GluLeu: 7.233 ± 1.212
2.612GluMet: 2.612 ± 0.763
4.521GluAsn: 4.521 ± 1.035
1.306GluPro: 1.306 ± 0.38
3.416GluGln: 3.416 ± 0.704
3.416GluArg: 3.416 ± 0.631
3.014GluSer: 3.014 ± 0.487
3.617GluThr: 3.617 ± 0.793
6.128GluVal: 6.128 ± 0.852
0.904GluTrp: 0.904 ± 0.308
3.315GluTyr: 3.315 ± 0.789
0.0GluXaa: 0.0 ± 0.0
Phe
2.311PheAla: 2.311 ± 0.493
0.201PheCys: 0.201 ± 0.133
3.717PheAsp: 3.717 ± 0.716
4.119PheGlu: 4.119 ± 0.76
1.306PhePhe: 1.306 ± 0.281
3.416PheGly: 3.416 ± 0.741
0.402PheHis: 0.402 ± 0.161
2.612PheIle: 2.612 ± 0.506
3.818PheLys: 3.818 ± 0.651
1.808PheLeu: 1.808 ± 0.512
1.105PheMet: 1.105 ± 0.403
4.219PheAsn: 4.219 ± 0.736
0.502PhePro: 0.502 ± 0.176
1.306PheGln: 1.306 ± 0.388
1.406PheArg: 1.406 ± 0.356
3.617PheSer: 3.617 ± 0.672
2.21PheThr: 2.21 ± 0.366
2.21PheVal: 2.21 ± 0.384
0.201PheTrp: 0.201 ± 0.159
1.808PheTyr: 1.808 ± 0.492
0.0PheXaa: 0.0 ± 0.0
Gly
4.119GlyAla: 4.119 ± 1.243
0.502GlyCys: 0.502 ± 0.26
3.315GlyAsp: 3.315 ± 0.713
3.818GlyGlu: 3.818 ± 0.624
3.114GlyPhe: 3.114 ± 0.633
2.411GlyGly: 2.411 ± 0.482
1.406GlyHis: 1.406 ± 0.557
7.736GlyIle: 7.736 ± 1.755
4.621GlyLys: 4.621 ± 0.76
5.927GlyLeu: 5.927 ± 0.987
2.21GlyMet: 2.21 ± 0.684
2.913GlyAsn: 2.913 ± 0.706
0.402GlyPro: 0.402 ± 0.179
2.712GlyGln: 2.712 ± 0.489
1.909GlyArg: 1.909 ± 0.487
3.818GlySer: 3.818 ± 0.957
5.224GlyThr: 5.224 ± 1.439
1.909GlyVal: 1.909 ± 0.342
0.402GlyTrp: 0.402 ± 0.171
2.411GlyTyr: 2.411 ± 0.613
0.0GlyXaa: 0.0 ± 0.0
His
0.301HisAla: 0.301 ± 0.201
0.1HisCys: 0.1 ± 0.101
0.703HisAsp: 0.703 ± 0.303
0.804HisGlu: 0.804 ± 0.267
0.603HisPhe: 0.603 ± 0.278
0.904HisGly: 0.904 ± 0.333
0.301HisHis: 0.301 ± 0.208
0.603HisIle: 0.603 ± 0.281
1.005HisLys: 1.005 ± 0.364
1.206HisLeu: 1.206 ± 0.38
0.1HisMet: 0.1 ± 0.093
0.703HisAsn: 0.703 ± 0.339
0.603HisPro: 0.603 ± 0.256
0.603HisGln: 0.603 ± 0.295
0.703HisArg: 0.703 ± 0.24
0.603HisSer: 0.603 ± 0.251
0.603HisThr: 0.603 ± 0.257
0.804HisVal: 0.804 ± 0.292
0.301HisTrp: 0.301 ± 0.167
0.301HisTyr: 0.301 ± 0.202
0.0HisXaa: 0.0 ± 0.0
Ile
6.831IleAla: 6.831 ± 1.487
0.201IleCys: 0.201 ± 0.164
4.923IleAsp: 4.923 ± 0.662
4.923IleGlu: 4.923 ± 1.001
1.306IlePhe: 1.306 ± 0.421
5.425IleGly: 5.425 ± 1.6
1.005IleHis: 1.005 ± 0.424
4.42IleIle: 4.42 ± 1.289
8.037IleLys: 8.037 ± 0.66
4.219IleLeu: 4.219 ± 0.675
1.306IleMet: 1.306 ± 0.423
4.722IleAsn: 4.722 ± 0.876
2.11IlePro: 2.11 ± 0.38
2.612IleGln: 2.612 ± 0.452
2.813IleArg: 2.813 ± 0.595
6.329IleSer: 6.329 ± 1.463
4.621IleThr: 4.621 ± 0.865
3.818IleVal: 3.818 ± 0.549
0.402IleTrp: 0.402 ± 0.232
2.311IleTyr: 2.311 ± 0.707
0.0IleXaa: 0.0 ± 0.0
Lys
7.133LysAla: 7.133 ± 0.831
0.703LysCys: 0.703 ± 0.294
4.119LysAsp: 4.119 ± 0.716
6.329LysGlu: 6.329 ± 1.275
2.311LysPhe: 2.311 ± 0.441
4.521LysGly: 4.521 ± 0.487
0.804LysHis: 0.804 ± 0.286
5.124LysIle: 5.124 ± 0.62
6.128LysLys: 6.128 ± 1.4
6.831LysLeu: 6.831 ± 1.073
2.411LysMet: 2.411 ± 0.449
3.315LysAsn: 3.315 ± 0.56
3.014LysPro: 3.014 ± 0.605
4.119LysGln: 4.119 ± 0.821
4.521LysArg: 4.521 ± 0.86
6.229LysSer: 6.229 ± 0.711
6.229LysThr: 6.229 ± 0.859
4.621LysVal: 4.621 ± 0.543
0.804LysTrp: 0.804 ± 0.304
4.219LysTyr: 4.219 ± 0.996
0.0LysXaa: 0.0 ± 0.0
Leu
6.43LeuAla: 6.43 ± 1.117
0.301LeuCys: 0.301 ± 0.239
4.621LeuAsp: 4.621 ± 0.716
6.329LeuGlu: 6.329 ± 1.345
2.913LeuPhe: 2.913 ± 0.569
5.525LeuGly: 5.525 ± 1.007
0.703LeuHis: 0.703 ± 0.319
3.617LeuIle: 3.617 ± 0.752
7.736LeuLys: 7.736 ± 0.978
4.119LeuLeu: 4.119 ± 0.728
1.607LeuMet: 1.607 ± 0.485
4.923LeuAsn: 4.923 ± 0.751
2.411LeuPro: 2.411 ± 0.554
3.014LeuGln: 3.014 ± 0.542
3.617LeuArg: 3.617 ± 0.747
6.028LeuSer: 6.028 ± 0.84
5.525LeuThr: 5.525 ± 0.667
4.521LeuVal: 4.521 ± 0.552
0.502LeuTrp: 0.502 ± 0.216
2.913LeuTyr: 2.913 ± 0.637
0.0LeuXaa: 0.0 ± 0.0
Met
2.21MetAla: 2.21 ± 1.006
0.1MetCys: 0.1 ± 0.122
1.105MetAsp: 1.105 ± 0.531
1.105MetGlu: 1.105 ± 0.292
1.005MetPhe: 1.005 ± 0.276
1.708MetGly: 1.708 ± 0.452
0.402MetHis: 0.402 ± 0.233
2.913MetIle: 2.913 ± 0.505
2.712MetLys: 2.712 ± 0.443
2.411MetLeu: 2.411 ± 0.627
1.306MetMet: 1.306 ± 0.438
1.607MetAsn: 1.607 ± 0.338
0.402MetPro: 0.402 ± 0.173
1.808MetGln: 1.808 ± 0.641
1.306MetArg: 1.306 ± 0.476
2.11MetSer: 2.11 ± 0.56
2.21MetThr: 2.21 ± 0.538
1.909MetVal: 1.909 ± 0.671
0.201MetTrp: 0.201 ± 0.152
1.005MetTyr: 1.005 ± 0.456
0.0MetXaa: 0.0 ± 0.0
Asn
3.818AsnAla: 3.818 ± 0.535
0.402AsnCys: 0.402 ± 0.207
3.918AsnAsp: 3.918 ± 0.755
3.114AsnGlu: 3.114 ± 0.741
3.315AsnPhe: 3.315 ± 0.717
3.918AsnGly: 3.918 ± 0.761
0.804AsnHis: 0.804 ± 0.274
4.018AsnIle: 4.018 ± 0.868
4.722AsnLys: 4.722 ± 0.854
3.918AsnLeu: 3.918 ± 0.559
1.206AsnMet: 1.206 ± 0.35
2.512AsnAsn: 2.512 ± 0.39
2.311AsnPro: 2.311 ± 0.395
3.617AsnGln: 3.617 ± 0.767
1.005AsnArg: 1.005 ± 0.341
4.018AsnSer: 4.018 ± 0.769
2.913AsnThr: 2.913 ± 0.643
3.315AsnVal: 3.315 ± 0.565
1.105AsnTrp: 1.105 ± 0.374
2.512AsnTyr: 2.512 ± 0.564
0.0AsnXaa: 0.0 ± 0.0
Pro
1.005ProAla: 1.005 ± 0.374
0.0ProCys: 0.0 ± 0.0
1.105ProAsp: 1.105 ± 0.332
1.808ProGlu: 1.808 ± 0.528
1.607ProPhe: 1.607 ± 0.463
1.306ProGly: 1.306 ± 0.337
0.201ProHis: 0.201 ± 0.153
2.009ProIle: 2.009 ± 0.539
2.913ProLys: 2.913 ± 0.649
1.909ProLeu: 1.909 ± 0.676
0.201ProMet: 0.201 ± 0.136
1.607ProAsn: 1.607 ± 0.54
0.603ProPro: 0.603 ± 0.208
1.708ProGln: 1.708 ± 0.407
1.005ProArg: 1.005 ± 0.383
2.11ProSer: 2.11 ± 0.437
1.607ProThr: 1.607 ± 0.412
2.11ProVal: 2.11 ± 0.507
0.1ProTrp: 0.1 ± 0.102
1.909ProTyr: 1.909 ± 0.43
0.0ProXaa: 0.0 ± 0.0
Gln
5.726GlnAla: 5.726 ± 1.155
0.301GlnCys: 0.301 ± 0.172
2.11GlnAsp: 2.11 ± 0.45
2.712GlnGlu: 2.712 ± 0.679
2.11GlnPhe: 2.11 ± 0.411
2.512GlnGly: 2.512 ± 0.737
0.301GlnHis: 0.301 ± 0.19
3.215GlnIle: 3.215 ± 0.869
3.617GlnLys: 3.617 ± 0.697
4.521GlnLeu: 4.521 ± 0.933
2.512GlnMet: 2.512 ± 0.506
1.708GlnAsn: 1.708 ± 0.422
0.502GlnPro: 0.502 ± 0.254
3.114GlnGln: 3.114 ± 0.808
1.206GlnArg: 1.206 ± 0.424
3.617GlnSer: 3.617 ± 0.722
2.11GlnThr: 2.11 ± 0.344
2.411GlnVal: 2.411 ± 0.442
0.703GlnTrp: 0.703 ± 0.227
0.904GlnTyr: 0.904 ± 0.384
0.0GlnXaa: 0.0 ± 0.0
Arg
2.311ArgAla: 2.311 ± 0.438
0.201ArgCys: 0.201 ± 0.126
3.416ArgAsp: 3.416 ± 0.681
3.416ArgGlu: 3.416 ± 0.842
0.904ArgPhe: 0.904 ± 0.343
1.607ArgGly: 1.607 ± 0.417
0.603ArgHis: 0.603 ± 0.244
2.009ArgIle: 2.009 ± 0.486
3.516ArgLys: 3.516 ± 0.945
3.416ArgLeu: 3.416 ± 0.519
1.406ArgMet: 1.406 ± 0.506
2.311ArgAsn: 2.311 ± 0.421
1.206ArgPro: 1.206 ± 0.443
1.607ArgGln: 1.607 ± 0.417
1.406ArgArg: 1.406 ± 0.537
2.009ArgSer: 2.009 ± 0.417
2.21ArgThr: 2.21 ± 0.628
2.21ArgVal: 2.21 ± 0.628
0.402ArgTrp: 0.402 ± 0.174
2.311ArgTyr: 2.311 ± 0.486
0.0ArgXaa: 0.0 ± 0.0
Ser
6.731SerAla: 6.731 ± 3.059
0.402SerCys: 0.402 ± 0.222
2.913SerAsp: 2.913 ± 0.713
3.717SerGlu: 3.717 ± 0.728
3.114SerPhe: 3.114 ± 0.531
4.722SerGly: 4.722 ± 0.919
0.703SerHis: 0.703 ± 0.208
5.726SerIle: 5.726 ± 0.934
4.923SerLys: 4.923 ± 0.799
6.731SerLeu: 6.731 ± 0.926
2.21SerMet: 2.21 ± 0.499
3.516SerAsn: 3.516 ± 0.665
2.009SerPro: 2.009 ± 0.373
3.617SerGln: 3.617 ± 0.947
2.21SerArg: 2.21 ± 0.557
4.521SerSer: 4.521 ± 0.901
3.717SerThr: 3.717 ± 0.8
5.224SerVal: 5.224 ± 0.747
0.502SerTrp: 0.502 ± 0.221
1.909SerTyr: 1.909 ± 0.531
0.0SerXaa: 0.0 ± 0.0
Thr
5.425ThrAla: 5.425 ± 1.289
0.201ThrCys: 0.201 ± 0.158
3.617ThrAsp: 3.617 ± 0.622
3.918ThrGlu: 3.918 ± 0.659
2.712ThrPhe: 2.712 ± 0.533
3.315ThrGly: 3.315 ± 0.805
0.502ThrHis: 0.502 ± 0.227
4.722ThrIle: 4.722 ± 0.688
4.923ThrLys: 4.923 ± 0.831
5.124ThrLeu: 5.124 ± 0.828
1.406ThrMet: 1.406 ± 0.624
3.315ThrAsn: 3.315 ± 0.783
2.512ThrPro: 2.512 ± 0.522
3.215ThrGln: 3.215 ± 0.923
2.21ThrArg: 2.21 ± 0.562
4.219ThrSer: 4.219 ± 1.157
4.018ThrThr: 4.018 ± 0.819
4.42ThrVal: 4.42 ± 0.562
0.402ThrTrp: 0.402 ± 0.205
2.813ThrTyr: 2.813 ± 0.585
0.0ThrXaa: 0.0 ± 0.0
Val
5.023ValAla: 5.023 ± 0.701
0.402ValCys: 0.402 ± 0.235
2.311ValAsp: 2.311 ± 0.54
5.425ValGlu: 5.425 ± 1.111
3.315ValPhe: 3.315 ± 0.526
4.219ValGly: 4.219 ± 1.022
0.603ValHis: 0.603 ± 0.263
4.621ValIle: 4.621 ± 0.571
4.32ValLys: 4.32 ± 0.707
2.512ValLeu: 2.512 ± 0.563
1.507ValMet: 1.507 ± 0.352
2.913ValAsn: 2.913 ± 0.626
1.708ValPro: 1.708 ± 0.364
2.11ValGln: 2.11 ± 0.635
1.607ValArg: 1.607 ± 0.417
4.621ValSer: 4.621 ± 0.811
5.224ValThr: 5.224 ± 0.625
3.717ValVal: 3.717 ± 0.728
1.005ValTrp: 1.005 ± 0.406
2.512ValTyr: 2.512 ± 0.602
0.0ValXaa: 0.0 ± 0.0
Trp
0.502TrpAla: 0.502 ± 0.264
0.0TrpCys: 0.0 ± 0.0
0.603TrpAsp: 0.603 ± 0.217
0.603TrpGlu: 0.603 ± 0.257
0.402TrpPhe: 0.402 ± 0.208
0.301TrpGly: 0.301 ± 0.174
0.201TrpHis: 0.201 ± 0.151
0.603TrpIle: 0.603 ± 0.233
0.502TrpLys: 0.502 ± 0.197
0.804TrpLeu: 0.804 ± 0.263
0.301TrpMet: 0.301 ± 0.151
0.703TrpAsn: 0.703 ± 0.249
0.201TrpPro: 0.201 ± 0.136
0.502TrpGln: 0.502 ± 0.191
0.703TrpArg: 0.703 ± 0.368
0.603TrpSer: 0.603 ± 0.248
0.904TrpThr: 0.904 ± 0.343
0.502TrpVal: 0.502 ± 0.272
0.1TrpTrp: 0.1 ± 0.084
0.703TrpTyr: 0.703 ± 0.248
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.909TyrAla: 1.909 ± 0.56
0.502TyrCys: 0.502 ± 0.258
2.813TyrAsp: 2.813 ± 0.735
3.315TyrGlu: 3.315 ± 0.719
2.311TyrPhe: 2.311 ± 0.696
2.813TyrGly: 2.813 ± 0.601
0.402TyrHis: 0.402 ± 0.22
2.512TyrIle: 2.512 ± 0.548
2.712TyrLys: 2.712 ± 0.723
4.018TyrLeu: 4.018 ± 0.821
1.406TyrMet: 1.406 ± 0.425
2.21TyrAsn: 2.21 ± 0.561
0.904TyrPro: 0.904 ± 0.358
1.808TyrGln: 1.808 ± 0.469
1.708TyrArg: 1.708 ± 0.602
3.617TyrSer: 3.617 ± 0.66
1.607TyrThr: 1.607 ± 0.514
1.607TyrVal: 1.607 ± 0.546
0.402TyrTrp: 0.402 ± 0.215
2.512TyrTyr: 2.512 ± 0.862
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 42 proteins (9955 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski