Amino acid dipepetide frequency for Streptococcus phage Javan425

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.574AlaAla: 1.574 ± 0.453
0.437AlaCys: 0.437 ± 0.187
3.148AlaAsp: 3.148 ± 0.496
5.247AlaGlu: 5.247 ± 0.73
3.323AlaPhe: 3.323 ± 0.499
3.935AlaGly: 3.935 ± 0.771
0.612AlaHis: 0.612 ± 0.186
6.034AlaIle: 6.034 ± 0.998
6.296AlaLys: 6.296 ± 0.703
5.072AlaLeu: 5.072 ± 0.687
2.186AlaMet: 2.186 ± 0.42
3.061AlaAsn: 3.061 ± 0.448
2.186AlaPro: 2.186 ± 0.48
1.836AlaGln: 1.836 ± 0.299
2.798AlaArg: 2.798 ± 0.423
4.722AlaSer: 4.722 ± 0.687
3.935AlaThr: 3.935 ± 0.641
3.498AlaVal: 3.498 ± 0.457
1.049AlaTrp: 1.049 ± 0.299
2.623AlaTyr: 2.623 ± 0.415
0.0AlaXaa: 0.0 ± 0.0
Cys
0.087CysAla: 0.087 ± 0.094
0.175CysCys: 0.175 ± 0.148
0.7CysAsp: 0.7 ± 0.286
0.7CysGlu: 0.7 ± 0.203
0.087CysPhe: 0.087 ± 0.081
0.612CysGly: 0.612 ± 0.259
0.175CysHis: 0.175 ± 0.126
0.437CysIle: 0.437 ± 0.215
0.35CysLys: 0.35 ± 0.173
0.612CysLeu: 0.612 ± 0.205
0.0CysMet: 0.0 ± 0.0
0.262CysAsn: 0.262 ± 0.157
0.175CysPro: 0.175 ± 0.119
0.087CysGln: 0.087 ± 0.09
0.262CysArg: 0.262 ± 0.158
0.612CysSer: 0.612 ± 0.186
0.087CysThr: 0.087 ± 0.087
0.437CysVal: 0.437 ± 0.218
0.087CysTrp: 0.087 ± 0.09
0.35CysTyr: 0.35 ± 0.224
0.0CysXaa: 0.0 ± 0.0
Asp
4.022AspAla: 4.022 ± 0.508
0.437AspCys: 0.437 ± 0.187
3.76AspAsp: 3.76 ± 0.616
3.935AspGlu: 3.935 ± 0.554
3.061AspPhe: 3.061 ± 0.432
5.684AspGly: 5.684 ± 0.768
0.874AspHis: 0.874 ± 0.263
4.722AspIle: 4.722 ± 0.615
5.072AspLys: 5.072 ± 0.621
5.159AspLeu: 5.159 ± 0.709
1.749AspMet: 1.749 ± 0.371
4.197AspAsn: 4.197 ± 0.591
1.049AspPro: 1.049 ± 0.295
1.137AspGln: 1.137 ± 0.273
2.623AspArg: 2.623 ± 0.423
4.022AspSer: 4.022 ± 0.608
3.585AspThr: 3.585 ± 0.51
2.798AspVal: 2.798 ± 0.562
1.312AspTrp: 1.312 ± 0.393
3.498AspTyr: 3.498 ± 0.649
0.0AspXaa: 0.0 ± 0.0
Glu
5.247GluAla: 5.247 ± 0.528
0.262GluCys: 0.262 ± 0.211
4.372GluAsp: 4.372 ± 0.677
6.208GluGlu: 6.208 ± 0.804
3.76GluPhe: 3.76 ± 0.6
3.061GluGly: 3.061 ± 0.601
1.137GluHis: 1.137 ± 0.32
5.072GluIle: 5.072 ± 0.769
5.247GluLys: 5.247 ± 0.726
7.345GluLeu: 7.345 ± 0.773
2.011GluMet: 2.011 ± 0.457
4.897GluAsn: 4.897 ± 0.719
1.049GluPro: 1.049 ± 0.319
3.235GluGln: 3.235 ± 0.546
2.711GluArg: 2.711 ± 0.517
3.061GluSer: 3.061 ± 0.565
3.235GluThr: 3.235 ± 0.556
4.547GluVal: 4.547 ± 0.585
1.224GluTrp: 1.224 ± 0.273
3.323GluTyr: 3.323 ± 0.423
0.0GluXaa: 0.0 ± 0.0
Phe
2.186PheAla: 2.186 ± 0.333
0.262PheCys: 0.262 ± 0.167
3.76PheAsp: 3.76 ± 0.528
3.76PheGlu: 3.76 ± 0.608
1.312PhePhe: 1.312 ± 0.383
3.061PheGly: 3.061 ± 0.565
0.35PheHis: 0.35 ± 0.167
3.061PheIle: 3.061 ± 0.463
4.722PheLys: 4.722 ± 0.774
2.886PheLeu: 2.886 ± 0.551
1.224PheMet: 1.224 ± 0.378
2.274PheAsn: 2.274 ± 0.367
0.874PhePro: 0.874 ± 0.335
1.224PheGln: 1.224 ± 0.328
1.836PheArg: 1.836 ± 0.438
2.099PheSer: 2.099 ± 0.498
3.498PheThr: 3.498 ± 0.402
2.361PheVal: 2.361 ± 0.38
0.087PheTrp: 0.087 ± 0.099
1.661PheTyr: 1.661 ± 0.326
0.0PheXaa: 0.0 ± 0.0
Gly
3.498GlyAla: 3.498 ± 0.886
0.35GlyCys: 0.35 ± 0.181
3.673GlyAsp: 3.673 ± 0.59
2.536GlyGlu: 2.536 ± 0.528
4.022GlyPhe: 4.022 ± 0.709
4.634GlyGly: 4.634 ± 0.599
1.312GlyHis: 1.312 ± 0.319
5.509GlyIle: 5.509 ± 0.675
5.684GlyLys: 5.684 ± 0.864
5.421GlyLeu: 5.421 ± 0.618
1.487GlyMet: 1.487 ± 0.427
3.41GlyAsn: 3.41 ± 0.44
0.7GlyPro: 0.7 ± 0.23
3.061GlyGln: 3.061 ± 0.453
2.711GlyArg: 2.711 ± 0.508
3.76GlySer: 3.76 ± 0.638
5.596GlyThr: 5.596 ± 1.062
4.197GlyVal: 4.197 ± 0.877
1.312GlyTrp: 1.312 ± 0.381
3.41GlyTyr: 3.41 ± 0.573
0.0GlyXaa: 0.0 ± 0.0
His
1.049HisAla: 1.049 ± 0.334
0.087HisCys: 0.087 ± 0.087
0.787HisAsp: 0.787 ± 0.251
1.137HisGlu: 1.137 ± 0.321
0.437HisPhe: 0.437 ± 0.189
0.612HisGly: 0.612 ± 0.246
0.175HisHis: 0.175 ± 0.13
0.787HisIle: 0.787 ± 0.259
1.312HisLys: 1.312 ± 0.293
1.399HisLeu: 1.399 ± 0.352
0.262HisMet: 0.262 ± 0.153
0.7HisAsn: 0.7 ± 0.239
0.612HisPro: 0.612 ± 0.253
0.612HisGln: 0.612 ± 0.274
0.525HisArg: 0.525 ± 0.143
0.437HisSer: 0.437 ± 0.158
0.874HisThr: 0.874 ± 0.215
0.874HisVal: 0.874 ± 0.255
0.175HisTrp: 0.175 ± 0.11
0.874HisTyr: 0.874 ± 0.254
0.0HisXaa: 0.0 ± 0.0
Ile
5.072IleAla: 5.072 ± 0.747
0.612IleCys: 0.612 ± 0.245
6.121IleAsp: 6.121 ± 0.825
6.558IleGlu: 6.558 ± 0.666
2.798IlePhe: 2.798 ± 0.548
5.072IleGly: 5.072 ± 0.594
1.137IleHis: 1.137 ± 0.211
4.547IleIle: 4.547 ± 0.553
7.258IleLys: 7.258 ± 0.947
5.159IleLeu: 5.159 ± 0.568
1.749IleMet: 1.749 ± 0.401
5.596IleAsn: 5.596 ± 0.825
2.448IlePro: 2.448 ± 0.584
2.274IleGln: 2.274 ± 0.323
1.399IleArg: 1.399 ± 0.389
4.722IleSer: 4.722 ± 0.779
4.46IleThr: 4.46 ± 0.717
5.247IleVal: 5.247 ± 0.727
0.525IleTrp: 0.525 ± 0.217
2.623IleTyr: 2.623 ± 0.654
0.0IleXaa: 0.0 ± 0.0
Lys
7.17LysAla: 7.17 ± 0.78
0.262LysCys: 0.262 ± 0.175
4.897LysAsp: 4.897 ± 0.721
6.296LysGlu: 6.296 ± 0.789
2.099LysPhe: 2.099 ± 0.482
5.334LysGly: 5.334 ± 0.528
0.787LysHis: 0.787 ± 0.285
6.733LysIle: 6.733 ± 0.797
7.258LysLys: 7.258 ± 1.075
6.733LysLeu: 6.733 ± 0.77
2.274LysMet: 2.274 ± 0.477
5.859LysAsn: 5.859 ± 0.707
2.274LysPro: 2.274 ± 0.307
3.061LysGln: 3.061 ± 0.444
4.11LysArg: 4.11 ± 0.609
4.372LysSer: 4.372 ± 0.5
6.995LysThr: 6.995 ± 0.713
4.46LysVal: 4.46 ± 0.607
1.487LysTrp: 1.487 ± 0.356
3.498LysTyr: 3.498 ± 0.449
0.0LysXaa: 0.0 ± 0.0
Leu
4.634LeuAla: 4.634 ± 0.844
0.612LeuCys: 0.612 ± 0.306
5.247LeuAsp: 5.247 ± 0.701
5.509LeuGlu: 5.509 ± 0.617
3.673LeuPhe: 3.673 ± 0.544
4.547LeuGly: 4.547 ± 0.952
0.7LeuHis: 0.7 ± 0.288
7.083LeuIle: 7.083 ± 0.837
7.782LeuLys: 7.782 ± 0.826
5.509LeuLeu: 5.509 ± 0.576
1.661LeuMet: 1.661 ± 0.325
4.809LeuAsn: 4.809 ± 0.741
2.361LeuPro: 2.361 ± 0.386
3.323LeuGln: 3.323 ± 0.638
2.798LeuArg: 2.798 ± 0.475
5.684LeuSer: 5.684 ± 0.648
6.733LeuThr: 6.733 ± 0.823
3.235LeuVal: 3.235 ± 0.53
1.224LeuTrp: 1.224 ± 0.275
2.011LeuTyr: 2.011 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
2.274MetAla: 2.274 ± 0.555
0.262MetCys: 0.262 ± 0.138
1.661MetAsp: 1.661 ± 0.374
1.312MetGlu: 1.312 ± 0.354
0.874MetPhe: 0.874 ± 0.247
1.137MetGly: 1.137 ± 0.296
0.35MetHis: 0.35 ± 0.192
2.011MetIle: 2.011 ± 0.359
2.623MetLys: 2.623 ± 0.492
1.661MetLeu: 1.661 ± 0.328
0.787MetMet: 0.787 ± 0.282
0.874MetAsn: 0.874 ± 0.298
0.787MetPro: 0.787 ± 0.241
1.487MetGln: 1.487 ± 0.346
1.137MetArg: 1.137 ± 0.279
1.836MetSer: 1.836 ± 0.335
2.099MetThr: 2.099 ± 0.401
0.874MetVal: 0.874 ± 0.25
0.262MetTrp: 0.262 ± 0.142
1.049MetTyr: 1.049 ± 0.323
0.0MetXaa: 0.0 ± 0.0
Asn
3.235AsnAla: 3.235 ± 0.573
0.35AsnCys: 0.35 ± 0.175
3.498AsnAsp: 3.498 ± 0.555
3.235AsnGlu: 3.235 ± 0.489
2.798AsnPhe: 2.798 ± 0.423
5.421AsnGly: 5.421 ± 0.855
0.437AsnHis: 0.437 ± 0.2
3.673AsnIle: 3.673 ± 0.598
4.984AsnLys: 4.984 ± 0.778
3.847AsnLeu: 3.847 ± 0.61
1.487AsnMet: 1.487 ± 0.454
3.235AsnAsn: 3.235 ± 0.611
1.749AsnPro: 1.749 ± 0.376
2.361AsnGln: 2.361 ± 0.331
2.186AsnArg: 2.186 ± 0.354
4.372AsnSer: 4.372 ± 0.574
3.148AsnThr: 3.148 ± 0.51
4.372AsnVal: 4.372 ± 0.545
1.049AsnTrp: 1.049 ± 0.375
2.623AsnTyr: 2.623 ± 0.447
0.0AsnXaa: 0.0 ± 0.0
Pro
1.574ProAla: 1.574 ± 0.404
0.0ProCys: 0.0 ± 0.0
1.487ProAsp: 1.487 ± 0.46
1.836ProGlu: 1.836 ± 0.411
0.962ProPhe: 0.962 ± 0.285
1.137ProGly: 1.137 ± 0.4
0.7ProHis: 0.7 ± 0.221
2.361ProIle: 2.361 ± 0.371
2.798ProLys: 2.798 ± 0.539
2.448ProLeu: 2.448 ± 0.418
0.7ProMet: 0.7 ± 0.234
1.224ProAsn: 1.224 ± 0.329
0.612ProPro: 0.612 ± 0.205
1.399ProGln: 1.399 ± 0.324
0.962ProArg: 0.962 ± 0.221
2.711ProSer: 2.711 ± 0.587
1.924ProThr: 1.924 ± 0.4
1.749ProVal: 1.749 ± 0.338
0.087ProTrp: 0.087 ± 0.094
0.874ProTyr: 0.874 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
2.623GlnAla: 2.623 ± 0.392
0.35GlnCys: 0.35 ± 0.159
1.574GlnAsp: 1.574 ± 0.347
1.924GlnGlu: 1.924 ± 0.447
1.574GlnPhe: 1.574 ± 0.421
1.924GlnGly: 1.924 ± 0.593
0.7GlnHis: 0.7 ± 0.285
3.76GlnIle: 3.76 ± 0.772
3.061GlnLys: 3.061 ± 0.571
2.886GlnLeu: 2.886 ± 0.438
0.787GlnMet: 0.787 ± 0.275
2.011GlnAsn: 2.011 ± 0.422
1.487GlnPro: 1.487 ± 0.341
1.049GlnGln: 1.049 ± 0.292
1.399GlnArg: 1.399 ± 0.309
2.798GlnSer: 2.798 ± 0.561
2.448GlnThr: 2.448 ± 0.443
1.836GlnVal: 1.836 ± 0.429
1.224GlnTrp: 1.224 ± 0.28
1.312GlnTyr: 1.312 ± 0.302
0.0GlnXaa: 0.0 ± 0.0
Arg
2.099ArgAla: 2.099 ± 0.372
0.0ArgCys: 0.0 ± 0.0
1.661ArgAsp: 1.661 ± 0.377
2.361ArgGlu: 2.361 ± 0.492
1.661ArgPhe: 1.661 ± 0.461
2.099ArgGly: 2.099 ± 0.38
0.7ArgHis: 0.7 ± 0.221
3.061ArgIle: 3.061 ± 0.486
2.886ArgLys: 2.886 ± 0.476
3.235ArgLeu: 3.235 ± 0.577
1.312ArgMet: 1.312 ± 0.314
2.099ArgAsn: 2.099 ± 0.395
1.137ArgPro: 1.137 ± 0.284
0.874ArgGln: 0.874 ± 0.282
0.525ArgArg: 0.525 ± 0.189
1.836ArgSer: 1.836 ± 0.465
2.186ArgThr: 2.186 ± 0.53
2.798ArgVal: 2.798 ± 0.484
0.087ArgTrp: 0.087 ± 0.097
2.536ArgTyr: 2.536 ± 0.412
0.0ArgXaa: 0.0 ± 0.0
Ser
4.547SerAla: 4.547 ± 0.812
0.437SerCys: 0.437 ± 0.203
3.498SerAsp: 3.498 ± 0.507
4.809SerGlu: 4.809 ± 0.608
2.536SerPhe: 2.536 ± 0.481
5.946SerGly: 5.946 ± 0.777
0.874SerHis: 0.874 ± 0.315
3.41SerIle: 3.41 ± 0.609
5.247SerLys: 5.247 ± 0.613
5.159SerLeu: 5.159 ± 0.709
2.011SerMet: 2.011 ± 0.461
4.372SerAsn: 4.372 ± 0.671
1.487SerPro: 1.487 ± 0.301
2.361SerGln: 2.361 ± 0.511
1.137SerArg: 1.137 ± 0.363
4.372SerSer: 4.372 ± 0.673
3.498SerThr: 3.498 ± 0.669
4.984SerVal: 4.984 ± 0.623
1.049SerTrp: 1.049 ± 0.28
2.623SerTyr: 2.623 ± 0.407
0.0SerXaa: 0.0 ± 0.0
Thr
4.809ThrAla: 4.809 ± 0.752
0.175ThrCys: 0.175 ± 0.109
4.809ThrAsp: 4.809 ± 0.902
4.197ThrGlu: 4.197 ± 0.691
2.886ThrPhe: 2.886 ± 0.498
5.946ThrGly: 5.946 ± 0.588
0.874ThrHis: 0.874 ± 0.291
5.859ThrIle: 5.859 ± 1.001
5.072ThrLys: 5.072 ± 0.498
4.809ThrLeu: 4.809 ± 0.66
0.962ThrMet: 0.962 ± 0.338
2.536ThrAsn: 2.536 ± 0.459
2.798ThrPro: 2.798 ± 0.339
2.711ThrGln: 2.711 ± 0.718
1.749ThrArg: 1.749 ± 0.44
3.935ThrSer: 3.935 ± 0.583
4.722ThrThr: 4.722 ± 0.881
5.334ThrVal: 5.334 ± 0.588
0.787ThrTrp: 0.787 ± 0.249
3.323ThrTyr: 3.323 ± 0.476
0.0ThrXaa: 0.0 ± 0.0
Val
4.11ValAla: 4.11 ± 0.484
0.612ValCys: 0.612 ± 0.191
4.197ValAsp: 4.197 ± 0.619
5.072ValGlu: 5.072 ± 0.657
1.836ValPhe: 1.836 ± 0.327
3.061ValGly: 3.061 ± 0.543
1.049ValHis: 1.049 ± 0.351
4.547ValIle: 4.547 ± 0.84
4.022ValLys: 4.022 ± 0.52
5.421ValLeu: 5.421 ± 0.691
1.312ValMet: 1.312 ± 0.26
3.76ValAsn: 3.76 ± 0.466
2.099ValPro: 2.099 ± 0.459
1.924ValGln: 1.924 ± 0.368
2.099ValArg: 2.099 ± 0.493
4.722ValSer: 4.722 ± 0.62
5.859ValThr: 5.859 ± 1.145
3.935ValVal: 3.935 ± 0.655
0.262ValTrp: 0.262 ± 0.152
1.836ValTyr: 1.836 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
0.962TrpAla: 0.962 ± 0.287
0.35TrpCys: 0.35 ± 0.185
0.525TrpAsp: 0.525 ± 0.213
1.049TrpGlu: 1.049 ± 0.277
0.525TrpPhe: 0.525 ± 0.234
1.049TrpGly: 1.049 ± 0.309
0.087TrpHis: 0.087 ± 0.096
1.049TrpIle: 1.049 ± 0.261
1.049TrpLys: 1.049 ± 0.238
0.874TrpLeu: 0.874 ± 0.276
0.175TrpMet: 0.175 ± 0.115
0.612TrpAsn: 0.612 ± 0.248
0.262TrpPro: 0.262 ± 0.139
0.35TrpGln: 0.35 ± 0.174
0.612TrpArg: 0.612 ± 0.214
1.312TrpSer: 1.312 ± 0.304
1.137TrpThr: 1.137 ± 0.431
1.312TrpVal: 1.312 ± 0.291
0.262TrpTrp: 0.262 ± 0.167
0.437TrpTyr: 0.437 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.798TyrAla: 2.798 ± 0.577
0.262TyrCys: 0.262 ± 0.159
3.323TyrAsp: 3.323 ± 0.357
3.323TyrGlu: 3.323 ± 0.512
2.274TyrPhe: 2.274 ± 0.488
2.011TyrGly: 2.011 ± 0.357
0.787TyrHis: 0.787 ± 0.279
1.661TyrIle: 1.661 ± 0.292
3.235TyrLys: 3.235 ± 0.538
3.498TyrLeu: 3.498 ± 0.496
1.137TyrMet: 1.137 ± 0.263
2.274TyrAsn: 2.274 ± 0.506
1.399TyrPro: 1.399 ± 0.355
2.361TyrGln: 2.361 ± 0.404
1.574TyrArg: 1.574 ± 0.427
2.973TyrSer: 2.973 ± 0.506
2.274TyrThr: 2.274 ± 0.389
2.886TyrVal: 2.886 ± 0.459
0.437TyrTrp: 0.437 ± 0.157
1.749TyrTyr: 1.749 ± 0.372
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 58 proteins (11437 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski