Amino acid dipepetide frequency for Streptococcus phage Javan150

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.545AlaAla: 4.545 ± 0.952
0.387AlaCys: 0.387 ± 0.227
5.029AlaAsp: 5.029 ± 0.597
5.705AlaGlu: 5.705 ± 0.742
2.031AlaPhe: 2.031 ± 0.427
5.029AlaGly: 5.029 ± 0.991
1.064AlaHis: 1.064 ± 0.316
6.286AlaIle: 6.286 ± 1.028
7.349AlaLys: 7.349 ± 0.761
5.125AlaLeu: 5.125 ± 0.733
1.354AlaMet: 1.354 ± 0.3
4.932AlaAsn: 4.932 ± 0.69
1.547AlaPro: 1.547 ± 0.314
2.224AlaGln: 2.224 ± 0.446
2.321AlaArg: 2.321 ± 0.537
3.965AlaSer: 3.965 ± 0.55
4.062AlaThr: 4.062 ± 0.698
5.125AlaVal: 5.125 ± 0.858
0.677AlaTrp: 0.677 ± 0.267
2.031AlaTyr: 2.031 ± 0.385
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.29CysCys: 0.29 ± 0.153
0.58CysAsp: 0.58 ± 0.237
0.58CysGlu: 0.58 ± 0.211
0.58CysPhe: 0.58 ± 0.228
0.29CysGly: 0.29 ± 0.162
0.097CysHis: 0.097 ± 0.088
0.193CysIle: 0.193 ± 0.186
0.484CysLys: 0.484 ± 0.204
0.677CysLeu: 0.677 ± 0.256
0.29CysMet: 0.29 ± 0.167
0.29CysAsn: 0.29 ± 0.164
0.193CysPro: 0.193 ± 0.2
0.387CysGln: 0.387 ± 0.2
0.097CysArg: 0.097 ± 0.093
0.484CysSer: 0.484 ± 0.246
0.097CysThr: 0.097 ± 0.1
0.29CysVal: 0.29 ± 0.23
0.0CysTrp: 0.0 ± 0.0
0.387CysTyr: 0.387 ± 0.223
0.0CysXaa: 0.0 ± 0.0
Asp
3.481AspAla: 3.481 ± 0.64
0.677AspCys: 0.677 ± 0.258
4.352AspAsp: 4.352 ± 0.681
5.125AspGlu: 5.125 ± 0.825
2.611AspPhe: 2.611 ± 0.31
5.512AspGly: 5.512 ± 0.727
0.387AspHis: 0.387 ± 0.192
5.512AspIle: 5.512 ± 0.637
5.996AspLys: 5.996 ± 0.7
6.382AspLeu: 6.382 ± 0.748
0.87AspMet: 0.87 ± 0.28
4.545AspAsn: 4.545 ± 0.677
2.321AspPro: 2.321 ± 0.484
1.451AspGln: 1.451 ± 0.391
2.804AspArg: 2.804 ± 0.492
2.418AspSer: 2.418 ± 0.551
4.545AspThr: 4.545 ± 0.844
4.352AspVal: 4.352 ± 0.706
1.451AspTrp: 1.451 ± 0.413
2.514AspTyr: 2.514 ± 0.476
0.0AspXaa: 0.0 ± 0.0
Glu
6.189GluAla: 6.189 ± 0.813
0.484GluCys: 0.484 ± 0.223
3.288GluAsp: 3.288 ± 0.666
5.319GluGlu: 5.319 ± 1.086
2.804GluPhe: 2.804 ± 0.512
3.481GluGly: 3.481 ± 0.614
1.451GluHis: 1.451 ± 0.306
5.222GluIle: 5.222 ± 0.706
4.932GluLys: 4.932 ± 0.549
7.833GluLeu: 7.833 ± 0.982
2.127GluMet: 2.127 ± 0.401
2.998GluAsn: 2.998 ± 0.517
2.031GluPro: 2.031 ± 0.486
3.481GluGln: 3.481 ± 0.577
2.611GluArg: 2.611 ± 0.486
4.062GluSer: 4.062 ± 0.624
4.932GluThr: 4.932 ± 0.847
4.158GluVal: 4.158 ± 0.812
0.967GluTrp: 0.967 ± 0.27
2.418GluTyr: 2.418 ± 0.466
0.0GluXaa: 0.0 ± 0.0
Phe
3.578PheAla: 3.578 ± 0.639
0.0PheCys: 0.0 ± 0.0
3.578PheAsp: 3.578 ± 0.587
4.448PheGlu: 4.448 ± 0.772
2.031PhePhe: 2.031 ± 0.39
2.804PheGly: 2.804 ± 0.385
0.58PheHis: 0.58 ± 0.207
1.741PheIle: 1.741 ± 0.386
3.385PheLys: 3.385 ± 0.52
1.934PheLeu: 1.934 ± 0.421
1.354PheMet: 1.354 ± 0.351
1.547PheAsn: 1.547 ± 0.401
0.774PhePro: 0.774 ± 0.295
0.967PheGln: 0.967 ± 0.275
2.031PheArg: 2.031 ± 0.494
2.708PheSer: 2.708 ± 0.603
2.127PheThr: 2.127 ± 0.518
3.094PheVal: 3.094 ± 0.614
0.484PheTrp: 0.484 ± 0.226
1.644PheTyr: 1.644 ± 0.388
0.0PheXaa: 0.0 ± 0.0
Gly
4.545GlyAla: 4.545 ± 0.952
0.0GlyCys: 0.0 ± 0.0
4.642GlyAsp: 4.642 ± 0.807
3.868GlyGlu: 3.868 ± 0.854
4.062GlyPhe: 4.062 ± 0.643
5.125GlyGly: 5.125 ± 0.838
1.257GlyHis: 1.257 ± 0.405
4.932GlyIle: 4.932 ± 0.757
6.576GlyLys: 6.576 ± 0.875
5.705GlyLeu: 5.705 ± 1.014
2.127GlyMet: 2.127 ± 0.482
3.578GlyAsn: 3.578 ± 0.736
2.127GlyPro: 2.127 ± 1.454
3.481GlyGln: 3.481 ± 0.682
2.031GlyArg: 2.031 ± 0.424
3.191GlySer: 3.191 ± 0.522
4.062GlyThr: 4.062 ± 0.705
4.062GlyVal: 4.062 ± 0.852
1.064GlyTrp: 1.064 ± 0.328
3.385GlyTyr: 3.385 ± 0.492
0.0GlyXaa: 0.0 ± 0.0
His
0.58HisAla: 0.58 ± 0.239
0.387HisCys: 0.387 ± 0.174
1.064HisAsp: 1.064 ± 0.334
1.547HisGlu: 1.547 ± 0.383
0.387HisPhe: 0.387 ± 0.183
1.354HisGly: 1.354 ± 0.367
0.387HisHis: 0.387 ± 0.189
0.58HisIle: 0.58 ± 0.206
0.58HisLys: 0.58 ± 0.209
1.547HisLeu: 1.547 ± 0.432
0.29HisMet: 0.29 ± 0.162
0.484HisAsn: 0.484 ± 0.201
0.677HisPro: 0.677 ± 0.247
0.774HisGln: 0.774 ± 0.268
0.58HisArg: 0.58 ± 0.196
0.774HisSer: 0.774 ± 0.276
0.967HisThr: 0.967 ± 0.3
0.677HisVal: 0.677 ± 0.294
0.29HisTrp: 0.29 ± 0.167
0.774HisTyr: 0.774 ± 0.289
0.0HisXaa: 0.0 ± 0.0
Ile
5.415IleAla: 5.415 ± 0.734
0.58IleCys: 0.58 ± 0.241
7.253IleAsp: 7.253 ± 0.963
5.415IleGlu: 5.415 ± 0.918
1.934IlePhe: 1.934 ± 0.487
3.481IleGly: 3.481 ± 0.599
1.354IleHis: 1.354 ± 0.348
4.545IleIle: 4.545 ± 0.756
6.866IleLys: 6.866 ± 0.92
4.158IleLeu: 4.158 ± 0.577
1.644IleMet: 1.644 ± 0.515
3.578IleAsn: 3.578 ± 0.527
2.031IlePro: 2.031 ± 0.483
2.031IleGln: 2.031 ± 0.334
2.901IleArg: 2.901 ± 0.56
5.125IleSer: 5.125 ± 1.003
4.255IleThr: 4.255 ± 0.677
5.029IleVal: 5.029 ± 0.762
0.484IleTrp: 0.484 ± 0.184
1.741IleTyr: 1.741 ± 0.443
0.0IleXaa: 0.0 ± 0.0
Lys
6.189LysAla: 6.189 ± 0.91
0.58LysCys: 0.58 ± 0.226
4.642LysAsp: 4.642 ± 0.725
7.833LysGlu: 7.833 ± 0.993
2.611LysPhe: 2.611 ± 0.416
6.286LysGly: 6.286 ± 1.021
1.16LysHis: 1.16 ± 0.447
5.802LysIle: 5.802 ± 0.903
7.93LysLys: 7.93 ± 1.408
5.609LysLeu: 5.609 ± 0.697
2.708LysMet: 2.708 ± 0.464
4.255LysAsn: 4.255 ± 0.845
3.094LysPro: 3.094 ± 0.52
3.481LysGln: 3.481 ± 0.605
3.868LysArg: 3.868 ± 0.661
6.963LysSer: 6.963 ± 0.751
5.609LysThr: 5.609 ± 0.571
5.899LysVal: 5.899 ± 0.759
1.547LysTrp: 1.547 ± 0.337
2.901LysTyr: 2.901 ± 0.407
0.0LysXaa: 0.0 ± 0.0
Leu
7.059LeuAla: 7.059 ± 0.762
0.387LeuCys: 0.387 ± 0.198
5.802LeuAsp: 5.802 ± 0.775
4.642LeuGlu: 4.642 ± 0.718
3.578LeuPhe: 3.578 ± 0.56
5.899LeuGly: 5.899 ± 0.921
0.677LeuHis: 0.677 ± 0.331
4.545LeuIle: 4.545 ± 0.652
9.574LeuLys: 9.574 ± 0.872
6.672LeuLeu: 6.672 ± 0.748
1.354LeuMet: 1.354 ± 0.327
4.255LeuAsn: 4.255 ± 0.578
2.998LeuPro: 2.998 ± 0.523
3.191LeuGln: 3.191 ± 0.628
2.224LeuArg: 2.224 ± 0.484
5.899LeuSer: 5.899 ± 1.087
4.352LeuThr: 4.352 ± 0.679
5.029LeuVal: 5.029 ± 0.705
0.677LeuTrp: 0.677 ± 0.251
2.127LeuTyr: 2.127 ± 0.365
0.0LeuXaa: 0.0 ± 0.0
Met
2.127MetAla: 2.127 ± 0.511
0.193MetCys: 0.193 ± 0.126
1.934MetAsp: 1.934 ± 0.513
1.354MetGlu: 1.354 ± 0.378
1.064MetPhe: 1.064 ± 0.307
1.451MetGly: 1.451 ± 0.572
0.29MetHis: 0.29 ± 0.182
2.031MetIle: 2.031 ± 0.511
2.031MetLys: 2.031 ± 0.441
1.547MetLeu: 1.547 ± 0.39
0.484MetMet: 0.484 ± 0.186
1.064MetAsn: 1.064 ± 0.323
0.87MetPro: 0.87 ± 0.344
0.87MetGln: 0.87 ± 0.299
0.967MetArg: 0.967 ± 0.363
2.127MetSer: 2.127 ± 0.437
1.547MetThr: 1.547 ± 0.379
1.741MetVal: 1.741 ± 0.433
0.29MetTrp: 0.29 ± 0.151
0.967MetTyr: 0.967 ± 0.272
0.0MetXaa: 0.0 ± 0.0
Asn
3.288AsnAla: 3.288 ± 0.649
0.484AsnCys: 0.484 ± 0.224
2.031AsnAsp: 2.031 ± 0.421
2.901AsnGlu: 2.901 ± 0.51
2.127AsnPhe: 2.127 ± 0.512
3.965AsnGly: 3.965 ± 0.623
0.58AsnHis: 0.58 ± 0.28
2.611AsnIle: 2.611 ± 0.42
3.771AsnLys: 3.771 ± 0.665
4.158AsnLeu: 4.158 ± 0.617
1.547AsnMet: 1.547 ± 0.345
2.418AsnAsn: 2.418 ± 0.413
2.514AsnPro: 2.514 ± 0.49
2.611AsnGln: 2.611 ± 0.484
2.708AsnArg: 2.708 ± 0.674
3.191AsnSer: 3.191 ± 0.728
3.675AsnThr: 3.675 ± 0.494
3.675AsnVal: 3.675 ± 0.567
0.774AsnTrp: 0.774 ± 0.186
1.934AsnTyr: 1.934 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
2.031ProAla: 2.031 ± 0.576
0.0ProCys: 0.0 ± 0.0
2.127ProAsp: 2.127 ± 0.446
1.644ProGlu: 1.644 ± 0.42
1.354ProPhe: 1.354 ± 0.347
1.16ProGly: 1.16 ± 0.318
0.774ProHis: 0.774 ± 0.301
2.224ProIle: 2.224 ± 0.407
3.578ProLys: 3.578 ± 0.648
2.127ProLeu: 2.127 ± 0.39
0.967ProMet: 0.967 ± 0.245
1.451ProAsn: 1.451 ± 0.426
0.87ProPro: 0.87 ± 0.331
1.547ProGln: 1.547 ± 0.506
0.677ProArg: 0.677 ± 0.273
2.708ProSer: 2.708 ± 0.614
1.741ProThr: 1.741 ± 0.364
2.127ProVal: 2.127 ± 0.461
0.097ProTrp: 0.097 ± 0.088
0.87ProTyr: 0.87 ± 0.301
0.0ProXaa: 0.0 ± 0.0
Gln
2.708GlnAla: 2.708 ± 0.665
0.193GlnCys: 0.193 ± 0.131
2.031GlnAsp: 2.031 ± 0.443
2.514GlnGlu: 2.514 ± 0.701
1.644GlnPhe: 1.644 ± 0.371
2.514GlnGly: 2.514 ± 0.651
0.774GlnHis: 0.774 ± 0.255
2.418GlnIle: 2.418 ± 0.769
3.481GlnLys: 3.481 ± 0.567
3.288GlnLeu: 3.288 ± 0.634
0.87GlnMet: 0.87 ± 0.263
2.127GlnAsn: 2.127 ± 0.463
1.257GlnPro: 1.257 ± 0.296
1.354GlnGln: 1.354 ± 0.335
2.031GlnArg: 2.031 ± 0.568
2.321GlnSer: 2.321 ± 0.453
2.611GlnThr: 2.611 ± 0.684
3.094GlnVal: 3.094 ± 0.648
0.387GlnTrp: 0.387 ± 0.185
1.354GlnTyr: 1.354 ± 0.509
0.0GlnXaa: 0.0 ± 0.0
Arg
2.031ArgAla: 2.031 ± 0.431
0.097ArgCys: 0.097 ± 0.101
2.611ArgAsp: 2.611 ± 0.55
2.708ArgGlu: 2.708 ± 0.477
1.257ArgPhe: 1.257 ± 0.315
2.127ArgGly: 2.127 ± 0.608
0.774ArgHis: 0.774 ± 0.273
2.901ArgIle: 2.901 ± 0.759
3.578ArgLys: 3.578 ± 0.599
3.481ArgLeu: 3.481 ± 0.602
1.064ArgMet: 1.064 ± 0.393
2.224ArgAsn: 2.224 ± 0.448
0.967ArgPro: 0.967 ± 0.264
1.837ArgGln: 1.837 ± 0.347
1.451ArgArg: 1.451 ± 0.399
1.837ArgSer: 1.837 ± 0.486
2.514ArgThr: 2.514 ± 0.493
1.934ArgVal: 1.934 ± 0.402
0.484ArgTrp: 0.484 ± 0.246
2.031ArgTyr: 2.031 ± 0.447
0.0ArgXaa: 0.0 ± 0.0
Ser
4.642SerAla: 4.642 ± 1.246
0.58SerCys: 0.58 ± 0.249
3.578SerAsp: 3.578 ± 0.534
2.998SerGlu: 2.998 ± 0.596
3.578SerPhe: 3.578 ± 0.711
5.802SerGly: 5.802 ± 0.977
0.677SerHis: 0.677 ± 0.262
4.545SerIle: 4.545 ± 0.637
4.352SerLys: 4.352 ± 0.667
5.899SerLeu: 5.899 ± 0.838
1.741SerMet: 1.741 ± 0.374
3.481SerAsn: 3.481 ± 0.699
1.064SerPro: 1.064 ± 0.294
2.321SerGln: 2.321 ± 0.48
2.321SerArg: 2.321 ± 0.698
3.771SerSer: 3.771 ± 0.756
3.771SerThr: 3.771 ± 0.539
4.448SerVal: 4.448 ± 0.583
0.387SerTrp: 0.387 ± 0.168
2.514SerTyr: 2.514 ± 0.541
0.0SerXaa: 0.0 ± 0.0
Thr
4.545ThrAla: 4.545 ± 0.724
0.29ThrCys: 0.29 ± 0.176
4.158ThrAsp: 4.158 ± 0.665
3.481ThrGlu: 3.481 ± 0.525
2.998ThrPhe: 2.998 ± 0.504
5.705ThrGly: 5.705 ± 1.036
0.774ThrHis: 0.774 ± 0.282
5.222ThrIle: 5.222 ± 0.812
4.835ThrLys: 4.835 ± 0.622
5.512ThrLeu: 5.512 ± 0.707
1.547ThrMet: 1.547 ± 0.327
2.611ThrAsn: 2.611 ± 0.424
1.741ThrPro: 1.741 ± 0.5
2.418ThrGln: 2.418 ± 0.562
1.837ThrArg: 1.837 ± 0.438
3.094ThrSer: 3.094 ± 0.569
3.675ThrThr: 3.675 ± 0.757
4.738ThrVal: 4.738 ± 0.748
0.677ThrTrp: 0.677 ± 0.231
1.934ThrTyr: 1.934 ± 0.494
0.0ThrXaa: 0.0 ± 0.0
Val
3.771ValAla: 3.771 ± 0.559
0.484ValCys: 0.484 ± 0.188
5.415ValAsp: 5.415 ± 0.78
5.222ValGlu: 5.222 ± 0.843
2.224ValPhe: 2.224 ± 0.437
4.352ValGly: 4.352 ± 0.606
0.58ValHis: 0.58 ± 0.201
5.609ValIle: 5.609 ± 0.681
6.092ValLys: 6.092 ± 0.737
5.899ValLeu: 5.899 ± 0.691
1.451ValMet: 1.451 ± 0.354
2.804ValAsn: 2.804 ± 0.51
1.451ValPro: 1.451 ± 0.336
2.127ValGln: 2.127 ± 0.417
2.031ValArg: 2.031 ± 0.532
4.738ValSer: 4.738 ± 0.634
4.932ValThr: 4.932 ± 0.598
4.352ValVal: 4.352 ± 0.656
0.193ValTrp: 0.193 ± 0.124
2.224ValTyr: 2.224 ± 0.568
0.0ValXaa: 0.0 ± 0.0
Trp
0.967TrpAla: 0.967 ± 0.398
0.097TrpCys: 0.097 ± 0.093
0.58TrpAsp: 0.58 ± 0.276
0.967TrpGlu: 0.967 ± 0.303
0.484TrpPhe: 0.484 ± 0.234
1.451TrpGly: 1.451 ± 0.395
0.29TrpHis: 0.29 ± 0.218
0.58TrpIle: 0.58 ± 0.303
0.677TrpLys: 0.677 ± 0.269
0.774TrpLeu: 0.774 ± 0.276
0.29TrpMet: 0.29 ± 0.196
0.58TrpAsn: 0.58 ± 0.214
0.097TrpPro: 0.097 ± 0.088
0.484TrpGln: 0.484 ± 0.248
0.677TrpArg: 0.677 ± 0.266
0.87TrpSer: 0.87 ± 0.268
0.677TrpThr: 0.677 ± 0.219
0.193TrpVal: 0.193 ± 0.139
0.097TrpTrp: 0.097 ± 0.106
0.484TrpTyr: 0.484 ± 0.184
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.094TyrAla: 3.094 ± 0.548
0.097TyrCys: 0.097 ± 0.093
2.611TyrAsp: 2.611 ± 0.569
2.514TyrGlu: 2.514 ± 0.464
1.451TyrPhe: 1.451 ± 0.298
2.031TyrGly: 2.031 ± 0.333
0.774TyrHis: 0.774 ± 0.345
2.321TyrIle: 2.321 ± 0.374
2.901TyrLys: 2.901 ± 0.468
2.611TyrLeu: 2.611 ± 0.467
0.774TyrMet: 0.774 ± 0.317
1.741TyrAsn: 1.741 ± 0.474
1.451TyrPro: 1.451 ± 0.322
1.934TyrGln: 1.934 ± 0.511
1.741TyrArg: 1.741 ± 0.441
2.224TyrSer: 2.224 ± 0.516
1.644TyrThr: 1.644 ± 0.414
2.031TyrVal: 2.031 ± 0.372
0.29TyrTrp: 0.29 ± 0.129
1.16TyrTyr: 1.16 ± 0.315
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10342 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski