Amino acid dipeptide frequency for Streptococcus phage Javan221

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
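The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the use of one-letter codes (with X standing in for Xaa), and the normalization by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def representation(value_per_mille):
    """Classify a value against the uniform expectation."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# Toy input: the dipeptides are AA, AA, AC (first protein) and AC (second).
freqs = dipeptide_per_mille(["AAAC", "AC"])
print(freqs["AA"], representation(freqs["AA"]))
print(per_mille_to_fraction(5.547))
```

With real data, `proteins` would be the list of protein sequences of the proteome; the table below uses three-letter codes for the same pairs.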

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.547 ± 0.961
AlaCys: 0.564 ± 0.251
AlaAsp: 5.829 ± 0.963
AlaGlu: 5.641 ± 0.763
AlaPhe: 1.88 ± 0.475
AlaGly: 5.641 ± 1.221
AlaHis: 1.128 ± 0.232
AlaIle: 6.017 ± 1.036
AlaLys: 6.299 ± 0.786
AlaLeu: 5.923 ± 0.871
AlaMet: 1.692 ± 0.474
AlaAsn: 4.513 ± 0.909
AlaPro: 1.88 ± 0.418
AlaGln: 4.043 ± 0.518
AlaArg: 2.821 ± 0.501
AlaSer: 6.769 ± 0.922
AlaThr: 4.889 ± 0.708
AlaVal: 5.359 ± 0.729
AlaTrp: 1.128 ± 0.364
AlaTyr: 1.88 ± 0.503
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.188 ± 0.121
CysCys: 0.188 ± 0.109
CysAsp: 0.47 ± 0.214
CysGlu: 0.47 ± 0.218
CysPhe: 0.188 ± 0.114
CysGly: 0.658 ± 0.33
CysHis: 0.094 ± 0.091
CysIle: 0.094 ± 0.099
CysLys: 0.188 ± 0.12
CysLeu: 0.47 ± 0.189
CysMet: 0.188 ± 0.143
CysAsn: 0.094 ± 0.095
CysPro: 0.094 ± 0.113
CysGln: 0.376 ± 0.156
CysArg: 0.188 ± 0.112
CysSer: 0.094 ± 0.099
CysThr: 0.188 ± 0.127
CysVal: 0.564 ± 0.207
CysTrp: 0.094 ± 0.111
CysTyr: 0.282 ± 0.137
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.761 ± 0.63
AspCys: 0.376 ± 0.163
AspAsp: 3.385 ± 0.594
AspGlu: 5.359 ± 0.87
AspPhe: 2.351 ± 0.499
AspGly: 5.829 ± 0.712
AspHis: 0.94 ± 0.296
AspIle: 2.915 ± 0.589
AspLys: 3.949 ± 0.64
AspLeu: 5.171 ± 0.744
AspMet: 1.88 ± 0.336
AspAsn: 5.735 ± 0.529
AspPro: 1.222 ± 0.396
AspGln: 1.128 ± 0.367
AspArg: 1.692 ± 0.376
AspSer: 3.385 ± 0.485
AspThr: 2.539 ± 0.47
AspVal: 4.231 ± 0.669
AspTrp: 1.034 ± 0.277
AspTyr: 2.445 ± 0.523
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.889 ± 0.787
GluCys: 0.376 ± 0.16
GluAsp: 3.385 ± 0.583
GluGlu: 5.641 ± 1.099
GluPhe: 3.573 ± 0.544
GluGly: 2.445 ± 0.488
GluHis: 1.222 ± 0.379
GluIle: 6.017 ± 0.902
GluLys: 5.641 ± 1.085
GluLeu: 8.932 ± 1.233
GluMet: 2.162 ± 0.477
GluAsn: 3.949 ± 0.707
GluPro: 2.727 ± 0.473
GluGln: 3.197 ± 0.435
GluArg: 2.915 ± 0.543
GluSer: 3.949 ± 0.664
GluThr: 4.419 ± 0.589
GluVal: 4.701 ± 0.893
GluTrp: 1.034 ± 0.343
GluTyr: 3.009 ± 0.471
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.256 ± 0.474
PheCys: 0.094 ± 0.084
PheAsp: 2.915 ± 0.648
PheGlu: 4.419 ± 0.781
PhePhe: 1.034 ± 0.262
PheGly: 3.197 ± 0.405
PheHis: 0.752 ± 0.325
PheIle: 2.445 ± 0.438
PheLys: 3.761 ± 0.438
PheLeu: 1.598 ± 0.345
PheMet: 0.94 ± 0.333
PheAsn: 2.162 ± 0.378
PhePro: 0.658 ± 0.227
PheGln: 1.128 ± 0.212
PheArg: 1.504 ± 0.356
PheSer: 2.915 ± 0.739
PheThr: 1.786 ± 0.348
PheVal: 2.633 ± 0.381
PheTrp: 0.282 ± 0.138
PheTyr: 0.94 ± 0.415
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.949 ± 0.689
GlyCys: 0.094 ± 0.072
GlyAsp: 2.915 ± 0.436
GlyGlu: 4.701 ± 0.468
GlyPhe: 2.727 ± 0.523
GlyGly: 5.171 ± 0.884
GlyHis: 1.034 ± 0.379
GlyIle: 4.043 ± 0.774
GlyLys: 5.077 ± 0.652
GlyLeu: 6.299 ± 0.824
GlyMet: 1.786 ± 0.514
GlyAsn: 4.513 ± 0.804
GlyPro: 0.846 ± 0.316
GlyGln: 3.103 ± 0.601
GlyArg: 2.915 ± 0.458
GlySer: 4.231 ± 0.825
GlyThr: 4.325 ± 0.623
GlyVal: 4.983 ± 0.596
GlyTrp: 1.316 ± 0.338
GlyTyr: 3.573 ± 0.626
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.034 ± 0.228
HisCys: 0.094 ± 0.084
HisAsp: 0.376 ± 0.179
HisGlu: 1.222 ± 0.306
HisPhe: 0.47 ± 0.212
HisGly: 1.034 ± 0.261
HisHis: 0.094 ± 0.086
HisIle: 0.846 ± 0.324
HisLys: 0.94 ± 0.19
HisLeu: 0.846 ± 0.244
HisMet: 0.0 ± 0.0
HisAsn: 0.47 ± 0.237
HisPro: 0.47 ± 0.166
HisGln: 0.658 ± 0.215
HisArg: 0.376 ± 0.186
HisSer: 0.846 ± 0.309
HisThr: 0.846 ± 0.3
HisVal: 0.564 ± 0.211
HisTrp: 0.282 ± 0.201
HisTyr: 0.752 ± 0.237
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.513 ± 0.707
IleCys: 0.282 ± 0.166
IleAsp: 6.111 ± 0.978
IleGlu: 6.111 ± 0.756
IlePhe: 1.598 ± 0.357
IleGly: 3.197 ± 0.647
IleHis: 0.376 ± 0.162
IleIle: 3.385 ± 0.553
IleLys: 5.265 ± 0.562
IleLeu: 4.795 ± 0.805
IleMet: 0.564 ± 0.236
IleAsn: 2.727 ± 0.452
IlePro: 2.445 ± 0.623
IleGln: 2.915 ± 0.523
IleArg: 2.727 ± 0.357
IleSer: 5.077 ± 0.622
IleThr: 4.607 ± 0.482
IleVal: 3.197 ± 0.51
IleTrp: 0.752 ± 0.23
IleTyr: 1.974 ± 0.529
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.487 ± 0.805
LysCys: 0.282 ± 0.155
LysAsp: 4.231 ± 0.638
LysGlu: 7.334 ± 1.243
LysPhe: 2.539 ± 0.501
LysGly: 4.701 ± 0.654
LysHis: 1.034 ± 0.274
LysIle: 4.513 ± 0.839
LysLys: 5.735 ± 1.107
LysLeu: 4.137 ± 0.488
LysMet: 2.068 ± 0.422
LysAsn: 4.701 ± 0.592
LysPro: 1.88 ± 0.378
LysGln: 2.915 ± 0.459
LysArg: 4.137 ± 0.785
LysSer: 5.923 ± 0.921
LysThr: 5.359 ± 0.65
LysVal: 5.265 ± 0.703
LysTrp: 1.034 ± 0.345
LysTyr: 2.633 ± 0.613
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.24 ± 0.763
LeuCys: 0.188 ± 0.127
LeuAsp: 4.419 ± 0.533
LeuGlu: 6.675 ± 0.992
LeuPhe: 2.633 ± 0.501
LeuGly: 5.547 ± 0.939
LeuHis: 0.846 ± 0.243
LeuIle: 3.949 ± 0.571
LeuLys: 7.898 ± 1.194
LeuLeu: 6.581 ± 0.796
LeuMet: 1.222 ± 0.292
LeuAsn: 6.017 ± 0.515
LeuPro: 2.256 ± 0.429
LeuGln: 3.667 ± 0.593
LeuArg: 3.009 ± 0.535
LeuSer: 5.077 ± 0.683
LeuThr: 5.923 ± 0.794
LeuVal: 3.667 ± 0.489
LeuTrp: 1.316 ± 0.339
LeuTyr: 2.633 ± 0.54
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.068 ± 0.44
MetCys: 0.094 ± 0.091
MetAsp: 1.598 ± 0.278
MetGlu: 0.564 ± 0.197
MetPhe: 0.94 ± 0.212
MetGly: 1.034 ± 0.271
MetHis: 0.188 ± 0.129
MetIle: 1.41 ± 0.368
MetLys: 2.162 ± 0.48
MetLeu: 1.128 ± 0.289
MetMet: 0.188 ± 0.101
MetAsn: 1.128 ± 0.283
MetPro: 0.752 ± 0.202
MetGln: 1.128 ± 0.33
MetArg: 0.282 ± 0.154
MetSer: 2.162 ± 0.424
MetThr: 1.692 ± 0.46
MetVal: 1.128 ± 0.307
MetTrp: 0.188 ± 0.116
MetTyr: 0.564 ± 0.21
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.231 ± 0.709
AsnCys: 0.47 ± 0.247
AsnAsp: 2.821 ± 0.469
AsnGlu: 2.821 ± 0.478
AsnPhe: 1.88 ± 0.333
AsnGly: 6.299 ± 0.741
AsnHis: 0.564 ± 0.195
AsnIle: 3.855 ± 0.559
AsnLys: 3.855 ± 0.607
AsnLeu: 6.017 ± 0.77
AsnMet: 1.222 ± 0.283
AsnAsn: 3.197 ± 0.508
AsnPro: 2.068 ± 0.556
AsnGln: 3.385 ± 0.604
AsnArg: 2.351 ± 0.417
AsnSer: 3.855 ± 0.627
AsnThr: 3.103 ± 0.595
AsnVal: 4.795 ± 0.618
AsnTrp: 0.658 ± 0.193
AsnTyr: 3.291 ± 0.58
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.727 ± 0.508
ProCys: 0.094 ± 0.084
ProAsp: 1.598 ± 0.409
ProGlu: 1.88 ± 0.363
ProPhe: 1.504 ± 0.426
ProGly: 0.846 ± 0.234
ProHis: 0.47 ± 0.245
ProIle: 1.974 ± 0.457
ProLys: 2.539 ± 0.542
ProLeu: 1.692 ± 0.458
ProMet: 0.47 ± 0.16
ProAsn: 1.504 ± 0.532
ProPro: 0.564 ± 0.234
ProGln: 1.222 ± 0.258
ProArg: 0.846 ± 0.345
ProSer: 2.256 ± 0.339
ProThr: 1.41 ± 0.446
ProVal: 3.009 ± 0.563
ProTrp: 0.282 ± 0.15
ProTyr: 0.846 ± 0.263
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.231 ± 0.644
GlnCys: 0.094 ± 0.095
GlnAsp: 1.692 ± 0.317
GlnGlu: 2.633 ± 0.623
GlnPhe: 2.256 ± 0.509
GlnGly: 2.821 ± 0.631
GlnHis: 0.188 ± 0.147
GlnIle: 2.821 ± 0.502
GlnLys: 3.009 ± 0.566
GlnLeu: 3.103 ± 0.423
GlnMet: 0.94 ± 0.304
GlnAsn: 3.009 ± 0.466
GlnPro: 1.316 ± 0.303
GlnGln: 2.256 ± 0.546
GlnArg: 2.162 ± 0.493
GlnSer: 3.197 ± 0.615
GlnThr: 3.761 ± 0.761
GlnVal: 2.633 ± 0.681
GlnTrp: 0.47 ± 0.206
GlnTyr: 1.598 ± 0.32
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.633 ± 0.575
ArgCys: 0.47 ± 0.229
ArgAsp: 1.88 ± 0.377
ArgGlu: 2.068 ± 0.415
ArgPhe: 1.974 ± 0.395
ArgGly: 1.222 ± 0.31
ArgHis: 0.658 ± 0.265
ArgIle: 3.291 ± 0.509
ArgLys: 2.256 ± 0.409
ArgLeu: 3.855 ± 0.552
ArgMet: 0.188 ± 0.129
ArgAsn: 2.539 ± 0.568
ArgPro: 1.128 ± 0.261
ArgGln: 1.41 ± 0.293
ArgArg: 1.504 ± 0.442
ArgSer: 1.504 ± 0.359
ArgThr: 2.539 ± 0.429
ArgVal: 3.009 ± 0.421
ArgTrp: 0.47 ± 0.193
ArgTyr: 1.88 ± 0.4
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.052 ± 1.672
SerCys: 0.188 ± 0.105
SerAsp: 4.607 ± 0.486
SerGlu: 4.231 ± 0.693
SerPhe: 3.197 ± 0.653
SerGly: 6.205 ± 1.025
SerHis: 0.846 ± 0.284
SerIle: 3.385 ± 0.51
SerLys: 5.077 ± 0.931
SerLeu: 6.863 ± 0.664
SerMet: 0.846 ± 0.273
SerAsn: 4.231 ± 0.681
SerPro: 1.786 ± 0.397
SerGln: 2.727 ± 0.873
SerArg: 1.598 ± 0.332
SerSer: 6.958 ± 1.238
SerThr: 4.795 ± 0.788
SerVal: 4.231 ± 0.661
SerTrp: 0.376 ± 0.207
SerTyr: 2.445 ± 0.336
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.522 ± 1.713
ThrCys: 0.282 ± 0.17
ThrAsp: 3.667 ± 0.709
ThrGlu: 3.761 ± 0.628
ThrPhe: 2.256 ± 0.466
ThrGly: 4.043 ± 0.626
ThrHis: 0.846 ± 0.189
ThrIle: 4.137 ± 0.694
ThrLys: 4.701 ± 0.735
ThrLeu: 5.735 ± 0.666
ThrMet: 0.846 ± 0.244
ThrAsn: 3.667 ± 0.718
ThrPro: 2.162 ± 0.531
ThrGln: 3.009 ± 0.41
ThrArg: 1.598 ± 0.355
ThrSer: 4.137 ± 0.585
ThrThr: 4.795 ± 0.959
ThrVal: 5.265 ± 0.888
ThrTrp: 0.47 ± 0.238
ThrTyr: 2.821 ± 0.561
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.265 ± 0.555
ValCys: 0.47 ± 0.207
ValAsp: 4.325 ± 0.566
ValGlu: 5.359 ± 0.866
ValPhe: 2.633 ± 0.503
ValGly: 4.043 ± 0.612
ValHis: 0.376 ± 0.174
ValIle: 4.043 ± 0.77
ValLys: 5.359 ± 0.626
ValLeu: 4.701 ± 0.577
ValMet: 1.504 ± 0.307
ValAsn: 3.667 ± 0.623
ValPro: 2.162 ± 0.447
ValGln: 2.915 ± 0.654
ValArg: 2.068 ± 0.368
ValSer: 4.983 ± 0.766
ValThr: 5.359 ± 0.927
ValVal: 4.701 ± 0.607
ValTrp: 0.658 ± 0.222
ValTyr: 2.162 ± 0.381
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.94 ± 0.267
TrpCys: 0.188 ± 0.12
TrpAsp: 0.282 ± 0.151
TrpGlu: 0.752 ± 0.264
TrpPhe: 0.282 ± 0.143
TrpGly: 0.752 ± 0.252
TrpHis: 0.47 ± 0.235
TrpIle: 0.846 ± 0.197
TrpLys: 0.94 ± 0.329
TrpLeu: 0.47 ± 0.251
TrpMet: 0.282 ± 0.142
TrpAsn: 0.752 ± 0.231
TrpPro: 0.188 ± 0.131
TrpGln: 0.752 ± 0.237
TrpArg: 0.47 ± 0.21
TrpSer: 1.88 ± 0.43
TrpThr: 0.846 ± 0.243
TrpVal: 0.658 ± 0.263
TrpTrp: 0.094 ± 0.105
TrpTyr: 0.564 ± 0.209
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.915 ± 0.494
TyrCys: 0.282 ± 0.155
TyrAsp: 2.821 ± 0.469
TyrGlu: 2.539 ± 0.577
TyrPhe: 1.504 ± 0.378
TyrGly: 2.915 ± 0.527
TyrHis: 0.094 ± 0.087
TyrIle: 2.633 ± 0.536
TyrLys: 2.256 ± 0.534
TyrLeu: 2.445 ± 0.429
TyrMet: 1.316 ± 0.306
TyrAsn: 2.256 ± 0.496
TyrPro: 1.128 ± 0.347
TyrGln: 2.256 ± 0.494
TyrArg: 1.316 ± 0.383
TyrSer: 2.445 ± 0.551
TyrThr: 2.539 ± 0.512
TyrVal: 2.162 ± 0.381
TyrTrp: 0.47 ± 0.192
TyrTyr: 1.974 ± 0.436
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10637 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
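The page does not spell out the bootstrap procedure beyond "100 replicates at the protein level". One plausible reading, sketched below under that assumption, is to resample whole proteins with replacement, recompute the dipeptide frequency for each replicate, and report the standard deviation of the replicate values. The function names, the one-letter sequence encoding, and the use of the sample standard deviation are all hypothetical choices for this example.

```python
import random
import statistics
from collections import Counter


def dipeptide_freq_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Protein-level bootstrap: resample whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq_per_mille(sample, pair))
    # Spread of the replicate estimates serves as the error (assumption).
    return statistics.stdev(estimates)


toy_proteome = ["AAAA", "ACAC", "CCCC", "AACC"]
value = dipeptide_freq_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that cluster in a few proteins produce a larger error than evenly spread ones.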

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski