Amino acid dipepetide frequency for Streptococcus phage Javan407

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.791AlaAla: 0.791 ± 0.334
0.352AlaCys: 0.352 ± 0.183
4.396AlaAsp: 4.396 ± 0.571
5.187AlaGlu: 5.187 ± 0.659
3.077AlaPhe: 3.077 ± 0.442
4.572AlaGly: 4.572 ± 0.777
0.879AlaHis: 0.879 ± 0.282
5.539AlaIle: 5.539 ± 0.907
5.715AlaLys: 5.715 ± 0.698
4.748AlaLeu: 4.748 ± 0.622
1.495AlaMet: 1.495 ± 0.339
4.22AlaAsn: 4.22 ± 0.696
1.934AlaPro: 1.934 ± 0.451
1.231AlaGln: 1.231 ± 0.329
2.022AlaArg: 2.022 ± 0.548
3.165AlaSer: 3.165 ± 0.572
3.605AlaThr: 3.605 ± 0.697
4.836AlaVal: 4.836 ± 0.831
0.967AlaTrp: 0.967 ± 0.295
2.989AlaTyr: 2.989 ± 0.437
0.0AlaXaa: 0.0 ± 0.0
Cys
0.176CysAla: 0.176 ± 0.117
0.0CysCys: 0.0 ± 0.0
0.088CysAsp: 0.088 ± 0.082
0.176CysGlu: 0.176 ± 0.106
0.352CysPhe: 0.352 ± 0.19
0.352CysGly: 0.352 ± 0.19
0.0CysHis: 0.0 ± 0.0
0.44CysIle: 0.44 ± 0.204
0.264CysLys: 0.264 ± 0.157
0.352CysLeu: 0.352 ± 0.191
0.0CysMet: 0.0 ± 0.0
0.352CysAsn: 0.352 ± 0.141
0.088CysPro: 0.088 ± 0.104
0.088CysGln: 0.088 ± 0.09
0.088CysArg: 0.088 ± 0.08
0.264CysSer: 0.264 ± 0.144
0.352CysThr: 0.352 ± 0.187
0.352CysVal: 0.352 ± 0.18
0.0CysTrp: 0.0 ± 0.0
0.703CysTyr: 0.703 ± 0.248
0.0CysXaa: 0.0 ± 0.0
Asp
3.165AspAla: 3.165 ± 0.449
0.264AspCys: 0.264 ± 0.152
3.341AspAsp: 3.341 ± 0.561
4.484AspGlu: 4.484 ± 0.835
3.429AspPhe: 3.429 ± 0.572
5.011AspGly: 5.011 ± 0.748
0.44AspHis: 0.44 ± 0.191
5.451AspIle: 5.451 ± 0.798
6.154AspLys: 6.154 ± 0.685
4.572AspLeu: 4.572 ± 0.523
1.231AspMet: 1.231 ± 0.29
4.748AspAsn: 4.748 ± 0.698
1.407AspPro: 1.407 ± 0.335
1.143AspGln: 1.143 ± 0.253
2.813AspArg: 2.813 ± 0.569
3.253AspSer: 3.253 ± 0.506
3.781AspThr: 3.781 ± 0.868
3.429AspVal: 3.429 ± 0.557
1.143AspTrp: 1.143 ± 0.35
2.726AspTyr: 2.726 ± 0.411
0.0AspXaa: 0.0 ± 0.0
Glu
5.275GluAla: 5.275 ± 0.688
0.088GluCys: 0.088 ± 0.067
4.132GluAsp: 4.132 ± 0.607
6.066GluGlu: 6.066 ± 0.863
2.286GluPhe: 2.286 ± 0.337
2.462GluGly: 2.462 ± 0.395
1.055GluHis: 1.055 ± 0.334
7.649GluIle: 7.649 ± 0.906
6.242GluLys: 6.242 ± 1.029
7.913GluLeu: 7.913 ± 0.937
1.846GluMet: 1.846 ± 0.391
3.956GluAsn: 3.956 ± 0.652
1.934GluPro: 1.934 ± 0.415
4.044GluGln: 4.044 ± 0.428
3.165GluArg: 3.165 ± 0.459
3.605GluSer: 3.605 ± 0.532
4.396GluThr: 4.396 ± 0.578
4.572GluVal: 4.572 ± 0.713
1.143GluTrp: 1.143 ± 0.352
2.813GluTyr: 2.813 ± 0.379
0.0GluXaa: 0.0 ± 0.0
Phe
2.901PheAla: 2.901 ± 0.511
0.088PheCys: 0.088 ± 0.086
3.253PheAsp: 3.253 ± 0.462
3.781PheGlu: 3.781 ± 0.705
1.055PhePhe: 1.055 ± 0.281
3.165PheGly: 3.165 ± 0.603
0.264PheHis: 0.264 ± 0.139
3.165PheIle: 3.165 ± 0.484
3.781PheLys: 3.781 ± 0.652
2.726PheLeu: 2.726 ± 0.471
1.231PheMet: 1.231 ± 0.373
2.989PheAsn: 2.989 ± 0.642
1.055PhePro: 1.055 ± 0.308
0.791PheGln: 0.791 ± 0.265
1.67PheArg: 1.67 ± 0.387
2.55PheSer: 2.55 ± 0.444
2.989PheThr: 2.989 ± 0.425
2.55PheVal: 2.55 ± 0.426
0.352PheTrp: 0.352 ± 0.157
1.407PheTyr: 1.407 ± 0.285
0.0PheXaa: 0.0 ± 0.0
Gly
4.132GlyAla: 4.132 ± 0.707
0.352GlyCys: 0.352 ± 0.186
3.605GlyAsp: 3.605 ± 0.54
2.462GlyGlu: 2.462 ± 0.372
2.462GlyPhe: 2.462 ± 0.445
5.187GlyGly: 5.187 ± 1.206
1.231GlyHis: 1.231 ± 0.364
5.275GlyIle: 5.275 ± 0.828
5.627GlyLys: 5.627 ± 0.81
4.572GlyLeu: 4.572 ± 0.666
1.67GlyMet: 1.67 ± 0.396
3.341GlyAsn: 3.341 ± 0.48
0.967GlyPro: 0.967 ± 0.257
2.286GlyGln: 2.286 ± 0.468
1.846GlyArg: 1.846 ± 0.438
5.451GlySer: 5.451 ± 0.715
5.451GlyThr: 5.451 ± 1.066
4.924GlyVal: 4.924 ± 0.94
0.791GlyTrp: 0.791 ± 0.211
3.077GlyTyr: 3.077 ± 0.495
0.0GlyXaa: 0.0 ± 0.0
His
0.967HisAla: 0.967 ± 0.369
0.088HisCys: 0.088 ± 0.084
1.055HisAsp: 1.055 ± 0.29
0.703HisGlu: 0.703 ± 0.293
0.352HisPhe: 0.352 ± 0.16
1.407HisGly: 1.407 ± 0.346
0.352HisHis: 0.352 ± 0.191
1.407HisIle: 1.407 ± 0.318
1.143HisLys: 1.143 ± 0.326
0.703HisLeu: 0.703 ± 0.256
0.088HisMet: 0.088 ± 0.071
0.528HisAsn: 0.528 ± 0.181
0.352HisPro: 0.352 ± 0.147
0.352HisGln: 0.352 ± 0.173
0.44HisArg: 0.44 ± 0.196
0.615HisSer: 0.615 ± 0.202
0.703HisThr: 0.703 ± 0.233
1.143HisVal: 1.143 ± 0.349
0.176HisTrp: 0.176 ± 0.121
0.528HisTyr: 0.528 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.627IleAla: 5.627 ± 0.687
0.44IleCys: 0.44 ± 0.234
4.484IleAsp: 4.484 ± 0.555
5.803IleGlu: 5.803 ± 0.73
2.901IlePhe: 2.901 ± 0.488
4.66IleGly: 4.66 ± 0.549
1.319IleHis: 1.319 ± 0.297
5.011IleIle: 5.011 ± 0.814
6.946IleLys: 6.946 ± 0.716
5.363IleLeu: 5.363 ± 0.702
1.495IleMet: 1.495 ± 0.341
4.308IleAsn: 4.308 ± 0.595
2.813IlePro: 2.813 ± 0.45
2.462IleGln: 2.462 ± 0.426
2.813IleArg: 2.813 ± 0.455
7.034IleSer: 7.034 ± 0.664
5.451IleThr: 5.451 ± 0.898
4.22IleVal: 4.22 ± 0.678
1.143IleTrp: 1.143 ± 0.355
2.901IleTyr: 2.901 ± 0.548
0.0IleXaa: 0.0 ± 0.0
Lys
6.154LysAla: 6.154 ± 0.936
0.176LysCys: 0.176 ± 0.127
5.539LysAsp: 5.539 ± 0.704
8.352LysGlu: 8.352 ± 1.209
3.077LysPhe: 3.077 ± 0.55
4.924LysGly: 4.924 ± 0.631
0.791LysHis: 0.791 ± 0.216
6.506LysIle: 6.506 ± 0.891
6.594LysLys: 6.594 ± 0.922
7.122LysLeu: 7.122 ± 0.709
3.517LysMet: 3.517 ± 0.677
4.484LysAsn: 4.484 ± 0.818
2.55LysPro: 2.55 ± 0.386
4.22LysGln: 4.22 ± 0.773
3.693LysArg: 3.693 ± 0.629
4.396LysSer: 4.396 ± 0.515
6.682LysThr: 6.682 ± 0.805
6.242LysVal: 6.242 ± 0.816
1.583LysTrp: 1.583 ± 0.318
4.572LysTyr: 4.572 ± 0.67
0.0LysXaa: 0.0 ± 0.0
Leu
5.891LeuAla: 5.891 ± 0.757
0.615LeuCys: 0.615 ± 0.227
5.187LeuAsp: 5.187 ± 0.659
6.242LeuGlu: 6.242 ± 0.785
3.165LeuPhe: 3.165 ± 0.688
4.836LeuGly: 4.836 ± 0.71
1.055LeuHis: 1.055 ± 0.319
5.803LeuIle: 5.803 ± 0.88
7.473LeuLys: 7.473 ± 0.8
5.715LeuLeu: 5.715 ± 0.756
1.846LeuMet: 1.846 ± 0.369
4.396LeuAsn: 4.396 ± 0.58
1.758LeuPro: 1.758 ± 0.356
2.813LeuGln: 2.813 ± 0.482
2.638LeuArg: 2.638 ± 0.423
6.858LeuSer: 6.858 ± 0.598
5.451LeuThr: 5.451 ± 0.67
4.836LeuVal: 4.836 ± 0.581
0.791LeuTrp: 0.791 ± 0.269
2.638LeuTyr: 2.638 ± 0.458
0.0LeuXaa: 0.0 ± 0.0
Met
1.407MetAla: 1.407 ± 0.365
0.0MetCys: 0.0 ± 0.0
1.055MetAsp: 1.055 ± 0.26
1.67MetGlu: 1.67 ± 0.44
0.967MetPhe: 0.967 ± 0.336
0.615MetGly: 0.615 ± 0.168
0.615MetHis: 0.615 ± 0.209
1.758MetIle: 1.758 ± 0.381
2.286MetLys: 2.286 ± 0.441
2.022MetLeu: 2.022 ± 0.588
0.528MetMet: 0.528 ± 0.243
2.198MetAsn: 2.198 ± 0.572
0.967MetPro: 0.967 ± 0.268
1.407MetGln: 1.407 ± 0.457
1.143MetArg: 1.143 ± 0.302
1.495MetSer: 1.495 ± 0.355
1.583MetThr: 1.583 ± 0.367
0.967MetVal: 0.967 ± 0.225
0.176MetTrp: 0.176 ± 0.131
0.967MetTyr: 0.967 ± 0.244
0.0MetXaa: 0.0 ± 0.0
Asn
3.253AsnAla: 3.253 ± 0.546
0.264AsnCys: 0.264 ± 0.119
3.605AsnAsp: 3.605 ± 0.63
4.572AsnGlu: 4.572 ± 0.759
2.374AsnPhe: 2.374 ± 0.395
4.396AsnGly: 4.396 ± 0.5
0.791AsnHis: 0.791 ± 0.328
3.429AsnIle: 3.429 ± 0.437
5.363AsnLys: 5.363 ± 0.885
5.099AsnLeu: 5.099 ± 0.694
1.319AsnMet: 1.319 ± 0.3
3.341AsnAsn: 3.341 ± 0.418
2.813AsnPro: 2.813 ± 0.439
2.638AsnGln: 2.638 ± 0.413
1.934AsnArg: 1.934 ± 0.465
3.605AsnSer: 3.605 ± 0.44
2.198AsnThr: 2.198 ± 0.485
4.044AsnVal: 4.044 ± 0.548
0.967AsnTrp: 0.967 ± 0.301
2.638AsnTyr: 2.638 ± 0.469
0.0AsnXaa: 0.0 ± 0.0
Pro
0.791ProAla: 0.791 ± 0.238
0.0ProCys: 0.0 ± 0.0
1.934ProAsp: 1.934 ± 0.356
1.758ProGlu: 1.758 ± 0.465
1.319ProPhe: 1.319 ± 0.357
1.055ProGly: 1.055 ± 0.243
0.703ProHis: 0.703 ± 0.183
2.022ProIle: 2.022 ± 0.417
2.813ProLys: 2.813 ± 0.469
2.11ProLeu: 2.11 ± 0.334
0.44ProMet: 0.44 ± 0.217
1.758ProAsn: 1.758 ± 0.408
0.528ProPro: 0.528 ± 0.148
1.319ProGln: 1.319 ± 0.369
1.67ProArg: 1.67 ± 0.511
1.758ProSer: 1.758 ± 0.456
1.758ProThr: 1.758 ± 0.354
2.55ProVal: 2.55 ± 0.564
0.088ProTrp: 0.088 ± 0.081
1.231ProTyr: 1.231 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
2.813GlnAla: 2.813 ± 0.425
0.264GlnCys: 0.264 ± 0.123
1.846GlnAsp: 1.846 ± 0.4
2.989GlnGlu: 2.989 ± 0.645
1.407GlnPhe: 1.407 ± 0.422
2.638GlnGly: 2.638 ± 0.566
0.528GlnHis: 0.528 ± 0.202
2.11GlnIle: 2.11 ± 0.404
4.044GlnLys: 4.044 ± 0.622
3.165GlnLeu: 3.165 ± 0.623
0.791GlnMet: 0.791 ± 0.239
1.407GlnAsn: 1.407 ± 0.334
0.615GlnPro: 0.615 ± 0.199
1.67GlnGln: 1.67 ± 0.559
0.967GlnArg: 0.967 ± 0.213
1.846GlnSer: 1.846 ± 0.398
2.813GlnThr: 2.813 ± 0.589
2.726GlnVal: 2.726 ± 0.51
0.088GlnTrp: 0.088 ± 0.083
1.758GlnTyr: 1.758 ± 0.455
0.0GlnXaa: 0.0 ± 0.0
Arg
1.934ArgAla: 1.934 ± 0.385
0.176ArgCys: 0.176 ± 0.11
2.198ArgAsp: 2.198 ± 0.522
2.11ArgGlu: 2.11 ± 0.437
1.319ArgPhe: 1.319 ± 0.288
1.67ArgGly: 1.67 ± 0.366
0.352ArgHis: 0.352 ± 0.183
2.989ArgIle: 2.989 ± 0.481
4.396ArgLys: 4.396 ± 0.825
3.517ArgLeu: 3.517 ± 0.665
1.143ArgMet: 1.143 ± 0.307
1.758ArgAsn: 1.758 ± 0.404
1.143ArgPro: 1.143 ± 0.371
1.407ArgGln: 1.407 ± 0.385
1.055ArgArg: 1.055 ± 0.347
1.407ArgSer: 1.407 ± 0.338
2.198ArgThr: 2.198 ± 0.452
1.407ArgVal: 1.407 ± 0.32
0.615ArgTrp: 0.615 ± 0.207
1.67ArgTyr: 1.67 ± 0.481
0.0ArgXaa: 0.0 ± 0.0
Ser
4.132SerAla: 4.132 ± 0.7
0.176SerCys: 0.176 ± 0.131
5.099SerAsp: 5.099 ± 0.646
5.099SerGlu: 5.099 ± 0.676
2.989SerPhe: 2.989 ± 0.53
4.836SerGly: 4.836 ± 1.087
1.055SerHis: 1.055 ± 0.262
4.836SerIle: 4.836 ± 0.675
5.187SerLys: 5.187 ± 0.495
5.627SerLeu: 5.627 ± 0.596
1.055SerMet: 1.055 ± 0.287
3.693SerAsn: 3.693 ± 0.478
1.495SerPro: 1.495 ± 0.374
2.286SerGln: 2.286 ± 0.388
1.319SerArg: 1.319 ± 0.345
4.396SerSer: 4.396 ± 0.759
4.22SerThr: 4.22 ± 0.69
3.517SerVal: 3.517 ± 0.76
0.967SerTrp: 0.967 ± 0.299
1.846SerTyr: 1.846 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
4.924ThrAla: 4.924 ± 0.614
0.176ThrCys: 0.176 ± 0.155
3.781ThrAsp: 3.781 ± 0.551
3.781ThrGlu: 3.781 ± 0.645
3.693ThrPhe: 3.693 ± 0.623
4.924ThrGly: 4.924 ± 0.697
0.791ThrHis: 0.791 ± 0.36
5.451ThrIle: 5.451 ± 0.85
6.33ThrLys: 6.33 ± 0.777
5.627ThrLeu: 5.627 ± 0.589
2.022ThrMet: 2.022 ± 0.353
3.165ThrAsn: 3.165 ± 0.572
1.846ThrPro: 1.846 ± 0.371
1.934ThrGln: 1.934 ± 0.406
1.407ThrArg: 1.407 ± 0.337
4.308ThrSer: 4.308 ± 0.811
3.868ThrThr: 3.868 ± 0.568
4.924ThrVal: 4.924 ± 0.828
0.615ThrTrp: 0.615 ± 0.22
2.286ThrTyr: 2.286 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
4.308ValAla: 4.308 ± 0.735
0.264ValCys: 0.264 ± 0.146
3.956ValAsp: 3.956 ± 0.63
5.803ValGlu: 5.803 ± 0.639
2.374ValPhe: 2.374 ± 0.395
4.308ValGly: 4.308 ± 0.746
0.44ValHis: 0.44 ± 0.28
4.572ValIle: 4.572 ± 0.627
5.451ValLys: 5.451 ± 0.89
4.396ValLeu: 4.396 ± 0.64
0.967ValMet: 0.967 ± 0.276
5.011ValAsn: 5.011 ± 0.743
2.022ValPro: 2.022 ± 0.41
1.758ValGln: 1.758 ± 0.399
2.022ValArg: 2.022 ± 0.367
4.66ValSer: 4.66 ± 0.739
4.66ValThr: 4.66 ± 0.756
4.836ValVal: 4.836 ± 0.752
0.615ValTrp: 0.615 ± 0.258
2.198ValTyr: 2.198 ± 0.484
0.0ValXaa: 0.0 ± 0.0
Trp
0.791TrpAla: 0.791 ± 0.242
0.088TrpCys: 0.088 ± 0.086
0.615TrpAsp: 0.615 ± 0.167
0.615TrpGlu: 0.615 ± 0.274
0.703TrpPhe: 0.703 ± 0.281
0.967TrpGly: 0.967 ± 0.263
0.176TrpHis: 0.176 ± 0.142
0.879TrpIle: 0.879 ± 0.33
1.055TrpLys: 1.055 ± 0.303
1.055TrpLeu: 1.055 ± 0.379
0.088TrpMet: 0.088 ± 0.083
0.967TrpAsn: 0.967 ± 0.289
0.0TrpPro: 0.0 ± 0.0
0.703TrpGln: 0.703 ± 0.286
0.528TrpArg: 0.528 ± 0.229
0.967TrpSer: 0.967 ± 0.293
1.055TrpThr: 1.055 ± 0.366
0.791TrpVal: 0.791 ± 0.257
0.264TrpTrp: 0.264 ± 0.143
0.528TrpTyr: 0.528 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.286TyrAla: 2.286 ± 0.399
0.615TyrCys: 0.615 ± 0.241
2.813TyrAsp: 2.813 ± 0.639
2.989TyrGlu: 2.989 ± 0.555
2.462TyrPhe: 2.462 ± 0.502
2.726TyrGly: 2.726 ± 0.428
0.264TyrHis: 0.264 ± 0.165
2.901TyrIle: 2.901 ± 0.512
4.308TyrLys: 4.308 ± 0.548
3.605TyrLeu: 3.605 ± 0.401
0.967TyrMet: 0.967 ± 0.267
2.022TyrAsn: 2.022 ± 0.337
1.319TyrPro: 1.319 ± 0.328
1.934TyrGln: 1.934 ± 0.428
1.231TyrArg: 1.231 ± 0.351
2.198TyrSer: 2.198 ± 0.56
2.638TyrThr: 2.638 ± 0.52
1.846TyrVal: 1.846 ± 0.348
0.352TyrTrp: 0.352 ± 0.171
1.583TyrTyr: 1.583 ± 0.389
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (11375 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski