Amino acid dipeptide frequency for Streptococcus phage Javan135

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
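As an illustration of what is being counted, here is a minimal Python sketch. The function name `dipeptide_frequencies`, the one-letter sequence strings, and the choice to normalise by the total number of counted pairs are assumptions made for this example; the database may use a different denominator.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of ordered adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 inside each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalise by the number of counted pairs and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy input, not taken from the Javan135 proteome.
toy = ["MKKA", "AKK"]
freqs = dipeptide_frequencies(toy)
```

In the toy input, "KK" accounts for two of the five counted pairs. The pair "AA" is not counted, because the two alanines belong to different proteins.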

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
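The uniform expectation and the over/under labelling can be sketched as follows. This assumes a plain comparison against 1/400; the exact rule the site uses for colouring (for example, whether it accounts for the error estimate) is not stated here, and the helper name `classify` is hypothetical. The three sample values are copied from the table.

```python
STANDARD_AMINO_ACIDS = 20

# 1/400 expressed in per mille, the unit used in the table.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS ** 2

def classify(value, expected=expected_per_mille):
    """Label a per-mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"    # shown in red on the original page
    if value < expected:
        return "under"   # shown in blue on the original page
    return "as expected"

# Values taken from the table, in per mille.
examples = {"LeuLys": 8.91, "AlaAla": 5.15, "CysCys": 0.163}
labels = {pair: classify(v) for pair, v in examples.items()}
```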

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
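A short worked conversion may help. The sketch below turns a per-mille value into a fraction and then into an approximate number of occurrences. Using the 12234 amino acids reported on this page as the denominator is an assumption; the page does not state which denominator was used, although the smallest non-zero value in the table (0.082‰) is consistent with a single occurrence under it.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

# Assumption: the denominator is the total number of amino acids reported on the page.
TOTAL_AMINO_ACIDS = 12234

def approx_count(value_per_mille, total=TOTAL_AMINO_ACIDS):
    """Approximate number of occurrences behind a per-mille frequency."""
    return round(per_mille_to_fraction(value_per_mille) * total)

ala_ala = approx_count(5.15)    # AlaAla from the table
cys_asp = approx_count(0.082)   # CysAsp, the smallest non-zero value
```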

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.15 ± 0.908
AlaCys: 0.245 ± 0.108
AlaAsp: 5.15 ± 0.625
AlaGlu: 6.376 ± 0.674
AlaPhe: 2.698 ± 0.7
AlaGly: 4.169 ± 0.75
AlaHis: 0.572 ± 0.23
AlaIle: 7.194 ± 0.871
AlaLys: 7.03 ± 0.879
AlaLeu: 7.03 ± 0.872
AlaMet: 2.289 ± 0.528
AlaAsn: 5.232 ± 1.052
AlaPro: 1.553 ± 0.393
AlaGln: 4.169 ± 0.685
AlaArg: 2.616 ± 0.544
AlaSer: 4.905 ± 0.865
AlaThr: 5.64 ± 1.092
AlaVal: 5.804 ± 1.075
AlaTrp: 0.327 ± 0.148
AlaTyr: 3.025 ± 0.522
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.163 ± 0.113
CysCys: 0.163 ± 0.102
CysAsp: 0.082 ± 0.081
CysGlu: 0.327 ± 0.169
CysPhe: 0.082 ± 0.08
CysGly: 0.245 ± 0.139
CysHis: 0.082 ± 0.077
CysIle: 0.327 ± 0.157
CysLys: 0.654 ± 0.206
CysLeu: 0.49 ± 0.224
CysMet: 0.163 ± 0.108
CysAsn: 0.245 ± 0.138
CysPro: 0.245 ± 0.184
CysGln: 0.409 ± 0.154
CysArg: 0.245 ± 0.198
CysSer: 0.245 ± 0.169
CysThr: 0.327 ± 0.227
CysVal: 0.327 ± 0.165
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.76 ± 0.493
AspCys: 0.163 ± 0.106
AspAsp: 3.27 ± 0.517
AspGlu: 4.333 ± 0.791
AspPhe: 3.27 ± 0.591
AspGly: 5.15 ± 0.722
AspHis: 0.572 ± 0.196
AspIle: 5.068 ± 0.757
AspLys: 5.967 ± 0.746
AspLeu: 5.886 ± 0.852
AspMet: 1.144 ± 0.35
AspAsn: 3.188 ± 0.566
AspPro: 1.39 ± 0.393
AspGln: 1.144 ± 0.262
AspArg: 2.943 ± 0.65
AspSer: 3.352 ± 0.439
AspThr: 3.106 ± 0.515
AspVal: 3.27 ± 0.645
AspTrp: 1.226 ± 0.353
AspTyr: 3.025 ± 0.508
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.804 ± 0.787
GluCys: 0.245 ± 0.121
GluAsp: 3.106 ± 0.632
GluGlu: 6.703 ± 1.129
GluPhe: 2.534 ± 0.488
GluGly: 4.006 ± 0.64
GluHis: 1.308 ± 0.492
GluIle: 4.578 ± 0.517
GluLys: 7.275 ± 1.176
GluLeu: 8.256 ± 1.171
GluMet: 2.044 ± 0.424
GluAsn: 3.679 ± 0.703
GluPro: 2.044 ± 0.433
GluGln: 2.943 ± 0.697
GluArg: 4.333 ± 0.903
GluSer: 3.352 ± 0.515
GluThr: 3.515 ± 0.66
GluVal: 4.66 ± 0.639
GluTrp: 1.063 ± 0.245
GluTyr: 2.861 ± 0.525
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.188 ± 0.707
PheCys: 0.245 ± 0.134
PheAsp: 3.025 ± 0.52
PheGlu: 2.698 ± 0.485
PhePhe: 1.063 ± 0.359
PheGly: 2.698 ± 0.499
PheHis: 0.245 ± 0.127
PheIle: 2.044 ± 0.49
PheLys: 3.597 ± 0.676
PheLeu: 2.289 ± 0.51
PheMet: 0.572 ± 0.239
PheAsn: 2.779 ± 0.48
PhePro: 0.49 ± 0.178
PheGln: 0.327 ± 0.149
PheArg: 1.39 ± 0.306
PheSer: 2.207 ± 0.454
PheThr: 2.452 ± 0.452
PheVal: 2.125 ± 0.37
PheTrp: 0.736 ± 0.245
PheTyr: 1.635 ± 0.402
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.64 ± 0.731
GlyCys: 0.0 ± 0.0
GlyAsp: 4.087 ± 0.683
GlyGlu: 4.333 ± 0.616
GlyPhe: 2.534 ± 0.601
GlyGly: 4.087 ± 0.649
GlyHis: 0.981 ± 0.296
GlyIle: 4.333 ± 1.006
GlyLys: 6.294 ± 0.82
GlyLeu: 6.785 ± 0.812
GlyMet: 2.289 ± 0.441
GlyAsn: 1.553 ± 0.392
GlyPro: 2.207 ± 1.318
GlyGln: 2.207 ± 0.376
GlyArg: 3.106 ± 0.53
GlySer: 3.679 ± 0.428
GlyThr: 4.087 ± 0.556
GlyVal: 4.087 ± 0.603
GlyTrp: 0.899 ± 0.254
GlyTyr: 2.207 ± 0.394
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.736 ± 0.268
HisCys: 0.245 ± 0.126
HisAsp: 0.817 ± 0.264
HisGlu: 1.063 ± 0.372
HisPhe: 0.654 ± 0.245
HisGly: 0.899 ± 0.343
HisHis: 0.409 ± 0.186
HisIle: 0.899 ± 0.301
HisLys: 0.736 ± 0.257
HisLeu: 0.817 ± 0.27
HisMet: 0.163 ± 0.123
HisAsn: 0.572 ± 0.217
HisPro: 0.654 ± 0.256
HisGln: 0.409 ± 0.231
HisArg: 0.736 ± 0.24
HisSer: 0.899 ± 0.345
HisThr: 0.817 ± 0.287
HisVal: 0.654 ± 0.255
HisTrp: 0.0 ± 0.0
HisTyr: 0.654 ± 0.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.477 ± 0.635
IleCys: 0.409 ± 0.162
IleAsp: 6.867 ± 0.651
IleGlu: 4.987 ± 0.847
IlePhe: 1.717 ± 0.312
IleGly: 3.679 ± 0.639
IleHis: 0.736 ± 0.228
IleIle: 4.578 ± 0.809
IleLys: 7.684 ± 1.043
IleLeu: 4.578 ± 0.633
IleMet: 1.226 ± 0.199
IleAsn: 4.006 ± 0.548
IlePro: 1.88 ± 0.365
IleGln: 2.452 ± 0.65
IleArg: 1.962 ± 0.425
IleSer: 4.741 ± 0.66
IleThr: 4.823 ± 0.642
IleVal: 4.087 ± 0.523
IleTrp: 0.49 ± 0.197
IleTyr: 3.597 ± 0.603
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.03 ± 0.783
LysCys: 0.572 ± 0.246
LysAsp: 4.905 ± 0.754
LysGlu: 6.458 ± 0.884
LysPhe: 1.962 ± 0.409
LysGly: 5.559 ± 0.682
LysHis: 1.226 ± 0.407
LysIle: 6.703 ± 0.935
LysLys: 6.54 ± 1.129
LysLeu: 6.867 ± 0.799
LysMet: 3.025 ± 0.443
LysAsn: 5.232 ± 0.855
LysPro: 2.779 ± 0.492
LysGln: 4.006 ± 0.593
LysArg: 3.924 ± 0.761
LysSer: 3.842 ± 0.466
LysThr: 5.313 ± 0.667
LysVal: 5.15 ± 0.869
LysTrp: 1.471 ± 0.46
LysTyr: 3.433 ± 0.512
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.621 ± 0.765
LeuCys: 0.327 ± 0.164
LeuAsp: 5.477 ± 0.594
LeuGlu: 7.439 ± 1.197
LeuPhe: 2.289 ± 0.448
LeuGly: 5.64 ± 0.764
LeuHis: 0.736 ± 0.243
LeuIle: 4.823 ± 0.607
LeuLys: 8.91 ± 0.996
LeuLeu: 6.213 ± 0.728
LeuMet: 2.371 ± 0.49
LeuAsn: 5.15 ± 0.506
LeuPro: 3.76 ± 0.696
LeuGln: 3.679 ± 0.53
LeuArg: 4.251 ± 0.686
LeuSer: 7.275 ± 0.85
LeuThr: 5.068 ± 0.751
LeuVal: 4.823 ± 0.653
LeuTrp: 0.736 ± 0.217
LeuTyr: 1.962 ± 0.456
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.106 ± 0.646
MetCys: 0.082 ± 0.081
MetAsp: 1.226 ± 0.313
MetGlu: 2.207 ± 0.453
MetPhe: 1.226 ± 0.308
MetGly: 1.39 ± 0.322
MetHis: 0.409 ± 0.214
MetIle: 1.553 ± 0.289
MetLys: 0.981 ± 0.271
MetLeu: 2.044 ± 0.327
MetMet: 0.409 ± 0.221
MetAsn: 1.144 ± 0.32
MetPro: 0.654 ± 0.213
MetGln: 0.736 ± 0.203
MetArg: 1.226 ± 0.372
MetSer: 2.861 ± 0.628
MetThr: 1.471 ± 0.498
MetVal: 0.736 ± 0.245
MetTrp: 0.082 ± 0.064
MetTyr: 0.327 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.333 ± 0.711
AsnCys: 0.082 ± 0.087
AsnAsp: 2.616 ± 0.508
AsnGlu: 3.106 ± 0.556
AsnPhe: 2.125 ± 0.554
AsnGly: 3.27 ± 0.607
AsnHis: 1.144 ± 0.408
AsnIle: 3.924 ± 0.465
AsnLys: 3.679 ± 0.535
AsnLeu: 4.905 ± 0.61
AsnMet: 0.817 ± 0.216
AsnAsn: 2.452 ± 0.495
AsnPro: 1.553 ± 0.381
AsnGln: 1.39 ± 0.311
AsnArg: 1.88 ± 0.283
AsnSer: 4.333 ± 0.864
AsnThr: 2.943 ± 0.461
AsnVal: 2.534 ± 0.457
AsnTrp: 0.899 ± 0.205
AsnTyr: 2.207 ± 0.499
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.798 ± 0.473
ProCys: 0.327 ± 0.177
ProAsp: 1.39 ± 0.316
ProGlu: 2.534 ± 0.532
ProPhe: 0.981 ± 0.409
ProGly: 1.962 ± 0.381
ProHis: 0.163 ± 0.125
ProIle: 1.553 ± 0.388
ProLys: 2.289 ± 0.547
ProLeu: 2.125 ± 0.448
ProMet: 0.49 ± 0.227
ProAsn: 1.226 ± 0.343
ProPro: 0.654 ± 0.272
ProGln: 2.371 ± 0.507
ProArg: 1.226 ± 0.41
ProSer: 2.452 ± 0.392
ProThr: 1.635 ± 0.415
ProVal: 2.125 ± 0.483
ProTrp: 0.245 ± 0.166
ProTyr: 0.49 ± 0.261
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.924 ± 0.583
GlnCys: 0.082 ± 0.081
GlnAsp: 2.125 ± 0.327
GlnGlu: 3.025 ± 0.538
GlnPhe: 1.635 ± 0.366
GlnGly: 3.515 ± 0.643
GlnHis: 0.736 ± 0.293
GlnIle: 3.025 ± 0.698
GlnLys: 2.861 ± 0.457
GlnLeu: 3.76 ± 0.573
GlnMet: 0.981 ± 0.294
GlnAsn: 2.207 ± 0.446
GlnPro: 0.572 ± 0.197
GlnGln: 1.144 ± 0.343
GlnArg: 1.308 ± 0.289
GlnSer: 4.006 ± 0.992
GlnThr: 2.616 ± 0.382
GlnVal: 1.308 ± 0.34
GlnTrp: 0.327 ± 0.188
GlnTyr: 0.981 ± 0.295
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.27 ± 0.633
ArgCys: 0.163 ± 0.115
ArgAsp: 2.044 ± 0.539
ArgGlu: 2.616 ± 0.547
ArgPhe: 1.962 ± 0.475
ArgGly: 2.452 ± 0.628
ArgHis: 0.736 ± 0.233
ArgIle: 3.679 ± 0.581
ArgLys: 3.76 ± 0.712
ArgLeu: 5.559 ± 0.886
ArgMet: 1.308 ± 0.32
ArgAsn: 2.207 ± 0.489
ArgPro: 0.817 ± 0.235
ArgGln: 1.717 ± 0.389
ArgArg: 1.553 ± 0.389
ArgSer: 2.207 ± 0.439
ArgThr: 2.207 ± 0.483
ArgVal: 2.616 ± 0.624
ArgTrp: 0.49 ± 0.238
ArgTyr: 1.635 ± 0.416
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.948 ± 1.876
SerCys: 0.327 ± 0.162
SerAsp: 3.188 ± 0.558
SerGlu: 4.414 ± 0.568
SerPhe: 2.125 ± 0.592
SerGly: 5.477 ± 1.088
SerHis: 0.981 ± 0.36
SerIle: 4.578 ± 0.709
SerLys: 4.987 ± 0.488
SerLeu: 5.395 ± 1.291
SerMet: 1.226 ± 0.46
SerAsn: 2.861 ± 0.535
SerPro: 1.635 ± 0.372
SerGln: 2.779 ± 0.554
SerArg: 3.025 ± 0.373
SerSer: 5.068 ± 1.51
SerThr: 3.025 ± 0.632
SerVal: 5.804 ± 0.864
SerTrp: 0.49 ± 0.209
SerTyr: 2.616 ± 0.614
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.313 ± 0.772
ThrCys: 0.327 ± 0.159
ThrAsp: 3.515 ± 0.532
ThrGlu: 3.106 ± 0.58
ThrPhe: 2.861 ± 0.457
ThrGly: 4.905 ± 0.752
ThrHis: 0.49 ± 0.214
ThrIle: 4.169 ± 0.532
ThrLys: 4.169 ± 0.614
ThrLeu: 4.741 ± 0.722
ThrMet: 1.226 ± 0.33
ThrAsn: 1.717 ± 0.427
ThrPro: 1.553 ± 0.393
ThrGln: 2.371 ± 0.543
ThrArg: 1.962 ± 0.417
ThrSer: 4.169 ± 1.725
ThrThr: 4.414 ± 0.996
ThrVal: 4.169 ± 0.536
ThrTrp: 0.899 ± 0.299
ThrTyr: 2.779 ± 0.535
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.15 ± 0.58
ValCys: 0.409 ± 0.168
ValAsp: 4.414 ± 0.742
ValGlu: 4.741 ± 0.996
ValPhe: 2.125 ± 0.387
ValGly: 3.924 ± 0.676
ValHis: 0.736 ± 0.285
ValIle: 4.414 ± 0.663
ValLys: 4.66 ± 0.419
ValLeu: 5.477 ± 0.646
ValMet: 1.39 ± 0.311
ValAsn: 2.452 ± 0.51
ValPro: 1.798 ± 0.448
ValGln: 2.698 ± 0.781
ValArg: 2.289 ± 0.486
ValSer: 4.578 ± 0.842
ValThr: 3.188 ± 0.333
ValVal: 4.169 ± 0.512
ValTrp: 0.327 ± 0.148
ValTyr: 2.289 ± 0.455
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.981 ± 0.24
TrpCys: 0.0 ± 0.0
TrpAsp: 0.817 ± 0.194
TrpGlu: 1.308 ± 0.357
TrpPhe: 0.49 ± 0.165
TrpGly: 0.736 ± 0.282
TrpHis: 0.163 ± 0.105
TrpIle: 0.49 ± 0.165
TrpLys: 0.736 ± 0.205
TrpLeu: 1.144 ± 0.298
TrpMet: 0.163 ± 0.159
TrpAsn: 0.572 ± 0.234
TrpPro: 0.736 ± 0.323
TrpGln: 0.572 ± 0.239
TrpArg: 0.572 ± 0.272
TrpSer: 0.572 ± 0.24
TrpThr: 0.245 ± 0.142
TrpVal: 0.572 ± 0.242
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.163 ± 0.124
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.779 ± 0.523
TyrCys: 0.409 ± 0.193
TyrAsp: 3.106 ± 0.588
TyrGlu: 2.534 ± 0.434
TyrPhe: 1.553 ± 0.33
TyrGly: 1.635 ± 0.407
TyrHis: 0.409 ± 0.176
TyrIle: 2.044 ± 0.459
TyrLys: 3.188 ± 0.602
TyrLeu: 3.188 ± 0.544
TyrMet: 0.49 ± 0.196
TyrAsn: 1.717 ± 0.414
TyrPro: 1.063 ± 0.325
TyrGln: 2.616 ± 0.598
TyrArg: 2.452 ± 0.445
TyrSer: 2.207 ± 0.408
TyrThr: 1.962 ± 0.397
TyrVal: 2.207 ± 0.499
TyrTrp: 0.327 ± 0.132
TyrTyr: 1.717 ± 0.439
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12234 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
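One way such an error estimate could be obtained is sketched below, assuming whole proteins are resampled with replacement and the standard deviation of the resampled frequencies is reported. The function names, the fixed seed, the toy sequences, and the normalisation by the number of counted pairs are all assumptions for illustration; this is not the database's actual code.

```python
import random
from statistics import stdev

def pair_per_mille(proteins, pair):
    """Per-mille frequency of one ordered dipeptide across a set of proteins."""
    count = sum(1 for s in proteins
                for i in range(len(s) - 1) if s[i:i + 2] == pair)
    total = sum(max(len(s) - 1, 0) for s in proteins)
    return 1000.0 * count / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return stdev(estimates)

# Hypothetical toy sequences in one-letter code.
toy = ["MKKA", "AKK", "AAAA"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")  # pair absent from every protein
```

A dipeptide that never occurs has the same frequency, zero, in every resample, so its estimated error is also zero. This matches the "0.0 ± 0.0" entries in the table.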

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski