Amino acid dipepetide frequency for Streptococcus phage Javan268

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.336AlaAla: 4.336 ± 1.615
0.197AlaCys: 0.197 ± 0.137
4.238AlaAsp: 4.238 ± 1.07
5.716AlaGlu: 5.716 ± 0.903
2.957AlaPhe: 2.957 ± 0.909
6.8AlaGly: 6.8 ± 1.671
1.084AlaHis: 1.084 ± 0.371
5.716AlaIle: 5.716 ± 0.871
5.815AlaLys: 5.815 ± 0.711
6.997AlaLeu: 6.997 ± 1.169
2.661AlaMet: 2.661 ± 1.025
3.646AlaAsn: 3.646 ± 0.631
2.365AlaPro: 2.365 ± 0.47
3.548AlaGln: 3.548 ± 0.806
2.07AlaArg: 2.07 ± 0.502
5.42AlaSer: 5.42 ± 1.291
4.435AlaThr: 4.435 ± 0.68
4.632AlaVal: 4.632 ± 1.211
0.887AlaTrp: 0.887 ± 0.318
3.055AlaTyr: 3.055 ± 0.582
0.0AlaXaa: 0.0 ± 0.0
Cys
0.394CysAla: 0.394 ± 0.22
0.0CysCys: 0.0 ± 0.0
0.197CysAsp: 0.197 ± 0.151
0.296CysGlu: 0.296 ± 0.256
0.099CysPhe: 0.099 ± 0.1
0.69CysGly: 0.69 ± 0.306
0.099CysHis: 0.099 ± 0.113
0.197CysIle: 0.197 ± 0.168
0.099CysLys: 0.099 ± 0.106
0.099CysLeu: 0.099 ± 0.113
0.197CysMet: 0.197 ± 0.156
0.099CysAsn: 0.099 ± 0.089
0.197CysPro: 0.197 ± 0.168
0.197CysGln: 0.197 ± 0.224
0.099CysArg: 0.099 ± 0.101
0.296CysSer: 0.296 ± 0.178
0.197CysThr: 0.197 ± 0.14
0.099CysVal: 0.099 ± 0.1
0.197CysTrp: 0.197 ± 0.168
0.296CysTyr: 0.296 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
2.365AspAla: 2.365 ± 0.519
0.197AspCys: 0.197 ± 0.158
4.336AspAsp: 4.336 ± 0.85
3.646AspGlu: 3.646 ± 0.994
3.548AspPhe: 3.548 ± 0.509
6.504AspGly: 6.504 ± 0.999
0.887AspHis: 0.887 ± 0.279
3.548AspIle: 3.548 ± 0.677
5.125AspLys: 5.125 ± 0.807
5.815AspLeu: 5.815 ± 0.627
1.971AspMet: 1.971 ± 0.365
3.351AspAsn: 3.351 ± 0.555
1.971AspPro: 1.971 ± 0.547
1.38AspGln: 1.38 ± 0.375
3.449AspArg: 3.449 ± 0.639
3.942AspSer: 3.942 ± 0.637
4.336AspThr: 4.336 ± 0.726
3.745AspVal: 3.745 ± 0.75
0.887AspTrp: 0.887 ± 0.325
2.957AspTyr: 2.957 ± 0.559
0.0AspXaa: 0.0 ± 0.0
Glu
4.533GluAla: 4.533 ± 0.698
0.394GluCys: 0.394 ± 0.226
3.942GluAsp: 3.942 ± 0.799
3.942GluGlu: 3.942 ± 0.652
3.055GluPhe: 3.055 ± 0.573
2.562GluGly: 2.562 ± 0.46
0.788GluHis: 0.788 ± 0.287
3.942GluIle: 3.942 ± 0.918
5.026GluLys: 5.026 ± 1.08
7.588GluLeu: 7.588 ± 1.038
2.464GluMet: 2.464 ± 0.575
4.435GluAsn: 4.435 ± 0.751
2.365GluPro: 2.365 ± 0.561
4.435GluGln: 4.435 ± 0.785
3.548GluArg: 3.548 ± 0.449
2.661GluSer: 2.661 ± 0.495
2.661GluThr: 2.661 ± 0.424
5.026GluVal: 5.026 ± 0.88
0.788GluTrp: 0.788 ± 0.241
2.661GluTyr: 2.661 ± 0.56
0.0GluXaa: 0.0 ± 0.0
Phe
3.252PheAla: 3.252 ± 0.592
0.296PheCys: 0.296 ± 0.167
4.139PheAsp: 4.139 ± 0.531
4.336PheGlu: 4.336 ± 0.847
1.084PhePhe: 1.084 ± 0.313
2.464PheGly: 2.464 ± 0.654
0.296PheHis: 0.296 ± 0.156
2.562PheIle: 2.562 ± 0.46
3.745PheLys: 3.745 ± 0.676
2.759PheLeu: 2.759 ± 0.591
0.887PheMet: 0.887 ± 0.276
2.07PheAsn: 2.07 ± 0.387
0.493PhePro: 0.493 ± 0.231
1.38PheGln: 1.38 ± 0.373
0.986PheArg: 0.986 ± 0.259
1.971PheSer: 1.971 ± 0.447
2.562PheThr: 2.562 ± 0.588
2.168PheVal: 2.168 ± 0.618
0.0PheTrp: 0.0 ± 0.0
1.774PheTyr: 1.774 ± 0.439
0.0PheXaa: 0.0 ± 0.0
Gly
5.519GlyAla: 5.519 ± 1.156
0.099GlyCys: 0.099 ± 0.112
3.351GlyAsp: 3.351 ± 0.738
4.336GlyGlu: 4.336 ± 0.543
3.154GlyPhe: 3.154 ± 0.546
2.562GlyGly: 2.562 ± 0.512
0.591GlyHis: 0.591 ± 0.266
5.716GlyIle: 5.716 ± 1.726
5.125GlyLys: 5.125 ± 0.595
5.42GlyLeu: 5.42 ± 0.892
1.971GlyMet: 1.971 ± 0.368
4.632GlyAsn: 4.632 ± 0.53
1.478GlyPro: 1.478 ± 0.58
3.351GlyGln: 3.351 ± 0.588
3.548GlyArg: 3.548 ± 0.673
4.829GlySer: 4.829 ± 1.144
5.026GlyThr: 5.026 ± 0.944
5.42GlyVal: 5.42 ± 0.628
0.788GlyTrp: 0.788 ± 0.236
2.759GlyTyr: 2.759 ± 0.715
0.0GlyXaa: 0.0 ± 0.0
His
0.296HisAla: 0.296 ± 0.154
0.0HisCys: 0.0 ± 0.0
1.183HisAsp: 1.183 ± 0.316
0.986HisGlu: 0.986 ± 0.342
0.394HisPhe: 0.394 ± 0.227
1.084HisGly: 1.084 ± 0.348
0.197HisHis: 0.197 ± 0.129
1.675HisIle: 1.675 ± 0.387
1.084HisLys: 1.084 ± 0.354
0.69HisLeu: 0.69 ± 0.286
0.099HisMet: 0.099 ± 0.116
0.493HisAsn: 0.493 ± 0.241
0.591HisPro: 0.591 ± 0.25
0.788HisGln: 0.788 ± 0.289
0.591HisArg: 0.591 ± 0.293
1.084HisSer: 1.084 ± 0.365
0.887HisThr: 0.887 ± 0.265
0.591HisVal: 0.591 ± 0.271
0.296HisTrp: 0.296 ± 0.175
0.493HisTyr: 0.493 ± 0.227
0.0HisXaa: 0.0 ± 0.0
Ile
4.632IleAla: 4.632 ± 1.075
0.099IleCys: 0.099 ± 0.093
5.125IleAsp: 5.125 ± 0.707
5.026IleGlu: 5.026 ± 0.845
2.07IlePhe: 2.07 ± 0.4
3.942IleGly: 3.942 ± 0.936
1.084IleHis: 1.084 ± 0.269
3.745IleIle: 3.745 ± 0.747
5.913IleLys: 5.913 ± 0.658
3.055IleLeu: 3.055 ± 0.543
1.675IleMet: 1.675 ± 0.422
4.73IleAsn: 4.73 ± 0.542
2.661IlePro: 2.661 ± 0.526
2.858IleGln: 2.858 ± 0.628
3.844IleArg: 3.844 ± 0.463
5.913IleSer: 5.913 ± 0.853
4.336IleThr: 4.336 ± 0.756
3.252IleVal: 3.252 ± 0.495
0.394IleTrp: 0.394 ± 0.183
2.07IleTyr: 2.07 ± 0.438
0.0IleXaa: 0.0 ± 0.0
Lys
6.209LysAla: 6.209 ± 0.94
0.197LysCys: 0.197 ± 0.158
4.041LysAsp: 4.041 ± 0.694
7.096LysGlu: 7.096 ± 0.898
2.661LysPhe: 2.661 ± 0.451
5.42LysGly: 5.42 ± 0.773
0.986LysHis: 0.986 ± 0.274
5.322LysIle: 5.322 ± 0.74
4.336LysLys: 4.336 ± 0.863
6.012LysLeu: 6.012 ± 0.902
1.872LysMet: 1.872 ± 0.522
4.139LysAsn: 4.139 ± 0.625
1.971LysPro: 1.971 ± 0.432
3.844LysGln: 3.844 ± 0.779
2.464LysArg: 2.464 ± 0.508
5.026LysSer: 5.026 ± 0.666
4.928LysThr: 4.928 ± 0.583
5.125LysVal: 5.125 ± 0.799
0.69LysTrp: 0.69 ± 0.258
3.548LysTyr: 3.548 ± 0.725
0.0LysXaa: 0.0 ± 0.0
Leu
6.899LeuAla: 6.899 ± 0.818
0.493LeuCys: 0.493 ± 0.23
6.11LeuAsp: 6.11 ± 0.754
5.42LeuGlu: 5.42 ± 0.946
3.055LeuPhe: 3.055 ± 0.575
5.223LeuGly: 5.223 ± 0.708
1.183LeuHis: 1.183 ± 0.362
4.238LeuIle: 4.238 ± 0.585
6.701LeuLys: 6.701 ± 1.033
4.435LeuLeu: 4.435 ± 0.681
1.577LeuMet: 1.577 ± 0.545
4.435LeuAsn: 4.435 ± 0.492
2.464LeuPro: 2.464 ± 0.475
4.041LeuGln: 4.041 ± 0.523
3.351LeuArg: 3.351 ± 0.733
6.701LeuSer: 6.701 ± 0.869
5.322LeuThr: 5.322 ± 0.59
4.829LeuVal: 4.829 ± 0.718
0.788LeuTrp: 0.788 ± 0.32
1.872LeuTyr: 1.872 ± 0.365
0.0LeuXaa: 0.0 ± 0.0
Met
3.154MetAla: 3.154 ± 1.091
0.197MetCys: 0.197 ± 0.152
1.281MetAsp: 1.281 ± 0.341
1.183MetGlu: 1.183 ± 0.41
0.493MetPhe: 0.493 ± 0.241
1.478MetGly: 1.478 ± 0.42
0.296MetHis: 0.296 ± 0.141
2.464MetIle: 2.464 ± 0.569
1.971MetLys: 1.971 ± 0.47
1.872MetLeu: 1.872 ± 0.38
1.084MetMet: 1.084 ± 0.737
1.281MetAsn: 1.281 ± 0.32
0.788MetPro: 0.788 ± 0.308
1.577MetGln: 1.577 ± 0.297
1.38MetArg: 1.38 ± 0.369
2.267MetSer: 2.267 ± 0.787
2.168MetThr: 2.168 ± 0.359
0.986MetVal: 0.986 ± 0.27
0.197MetTrp: 0.197 ± 0.138
0.986MetTyr: 0.986 ± 0.321
0.0MetXaa: 0.0 ± 0.0
Asn
4.73AsnAla: 4.73 ± 0.68
0.197AsnCys: 0.197 ± 0.142
3.252AsnAsp: 3.252 ± 0.614
3.154AsnGlu: 3.154 ± 0.641
1.872AsnPhe: 1.872 ± 0.418
5.026AsnGly: 5.026 ± 0.67
0.69AsnHis: 0.69 ± 0.262
2.661AsnIle: 2.661 ± 0.453
4.73AsnLys: 4.73 ± 0.671
4.238AsnLeu: 4.238 ± 0.662
1.183AsnMet: 1.183 ± 0.334
2.168AsnAsn: 2.168 ± 0.464
3.548AsnPro: 3.548 ± 0.727
1.971AsnGln: 1.971 ± 0.532
1.971AsnArg: 1.971 ± 0.406
3.942AsnSer: 3.942 ± 0.46
2.759AsnThr: 2.759 ± 0.47
3.745AsnVal: 3.745 ± 0.614
0.788AsnTrp: 0.788 ± 0.305
1.774AsnTyr: 1.774 ± 0.551
0.0AsnXaa: 0.0 ± 0.0
Pro
2.464ProAla: 2.464 ± 0.497
0.197ProCys: 0.197 ± 0.165
1.971ProAsp: 1.971 ± 0.51
1.675ProGlu: 1.675 ± 0.34
1.183ProPhe: 1.183 ± 0.427
1.872ProGly: 1.872 ± 0.368
0.394ProHis: 0.394 ± 0.191
1.478ProIle: 1.478 ± 0.421
3.646ProLys: 3.646 ± 0.651
2.858ProLeu: 2.858 ± 0.508
0.69ProMet: 0.69 ± 0.199
1.872ProAsn: 1.872 ± 0.565
0.986ProPro: 0.986 ± 0.323
1.38ProGln: 1.38 ± 0.347
1.577ProArg: 1.577 ± 0.51
1.478ProSer: 1.478 ± 0.32
2.464ProThr: 2.464 ± 0.488
1.872ProVal: 1.872 ± 0.528
0.394ProTrp: 0.394 ± 0.176
1.478ProTyr: 1.478 ± 0.435
0.0ProXaa: 0.0 ± 0.0
Gln
5.42GlnAla: 5.42 ± 1.351
0.493GlnCys: 0.493 ± 0.232
2.168GlnAsp: 2.168 ± 0.444
2.759GlnGlu: 2.759 ± 0.727
1.675GlnPhe: 1.675 ± 0.411
3.154GlnGly: 3.154 ± 1.018
0.296GlnHis: 0.296 ± 0.173
2.464GlnIle: 2.464 ± 0.451
3.154GlnLys: 3.154 ± 0.443
3.548GlnLeu: 3.548 ± 0.593
0.986GlnMet: 0.986 ± 0.322
2.562GlnAsn: 2.562 ± 0.503
0.887GlnPro: 0.887 ± 0.271
2.267GlnGln: 2.267 ± 0.529
1.577GlnArg: 1.577 ± 0.38
3.351GlnSer: 3.351 ± 0.553
1.774GlnThr: 1.774 ± 0.315
2.759GlnVal: 2.759 ± 0.567
0.394GlnTrp: 0.394 ± 0.214
1.774GlnTyr: 1.774 ± 0.478
0.0GlnXaa: 0.0 ± 0.0
Arg
3.351ArgAla: 3.351 ± 0.511
0.0ArgCys: 0.0 ± 0.0
2.858ArgAsp: 2.858 ± 0.536
3.252ArgGlu: 3.252 ± 0.58
1.281ArgPhe: 1.281 ± 0.344
2.661ArgGly: 2.661 ± 0.514
0.493ArgHis: 0.493 ± 0.256
2.957ArgIle: 2.957 ± 0.696
3.351ArgLys: 3.351 ± 0.784
3.252ArgLeu: 3.252 ± 0.485
0.986ArgMet: 0.986 ± 0.294
2.464ArgAsn: 2.464 ± 0.465
1.183ArgPro: 1.183 ± 0.356
1.38ArgGln: 1.38 ± 0.395
0.394ArgArg: 0.394 ± 0.181
2.267ArgSer: 2.267 ± 0.47
2.365ArgThr: 2.365 ± 0.396
1.675ArgVal: 1.675 ± 0.377
1.281ArgTrp: 1.281 ± 0.457
2.365ArgTyr: 2.365 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
6.209SerAla: 6.209 ± 1.443
0.296SerCys: 0.296 ± 0.187
4.928SerAsp: 4.928 ± 0.469
3.646SerGlu: 3.646 ± 0.672
2.562SerPhe: 2.562 ± 0.491
5.125SerGly: 5.125 ± 0.876
1.281SerHis: 1.281 ± 0.268
3.055SerIle: 3.055 ± 0.549
3.646SerLys: 3.646 ± 0.627
6.406SerLeu: 6.406 ± 0.892
1.971SerMet: 1.971 ± 0.474
4.139SerAsn: 4.139 ± 0.899
2.267SerPro: 2.267 ± 0.476
2.957SerGln: 2.957 ± 0.563
2.07SerArg: 2.07 ± 0.444
5.125SerSer: 5.125 ± 1.08
4.928SerThr: 4.928 ± 0.836
4.829SerVal: 4.829 ± 1.162
1.084SerTrp: 1.084 ± 0.413
2.562SerTyr: 2.562 ± 0.596
0.0SerXaa: 0.0 ± 0.0
Thr
4.829ThrAla: 4.829 ± 1.049
0.099ThrCys: 0.099 ± 0.101
2.759ThrAsp: 2.759 ± 0.669
3.942ThrGlu: 3.942 ± 0.669
3.055ThrPhe: 3.055 ± 0.421
4.829ThrGly: 4.829 ± 0.779
0.788ThrHis: 0.788 ± 0.28
5.913ThrIle: 5.913 ± 0.877
3.844ThrLys: 3.844 ± 0.706
4.829ThrLeu: 4.829 ± 0.691
1.577ThrMet: 1.577 ± 0.364
2.267ThrAsn: 2.267 ± 0.364
2.07ThrPro: 2.07 ± 0.488
2.661ThrGln: 2.661 ± 0.417
1.478ThrArg: 1.478 ± 0.446
4.041ThrSer: 4.041 ± 0.739
3.548ThrThr: 3.548 ± 0.628
5.913ThrVal: 5.913 ± 0.945
0.788ThrTrp: 0.788 ± 0.305
2.957ThrTyr: 2.957 ± 0.615
0.0ThrXaa: 0.0 ± 0.0
Val
5.322ValAla: 5.322 ± 0.898
0.099ValCys: 0.099 ± 0.099
4.336ValAsp: 4.336 ± 0.615
2.858ValGlu: 2.858 ± 0.557
2.759ValPhe: 2.759 ± 0.671
4.73ValGly: 4.73 ± 0.984
0.887ValHis: 0.887 ± 0.374
4.73ValIle: 4.73 ± 0.847
5.026ValLys: 5.026 ± 0.725
5.125ValLeu: 5.125 ± 0.971
1.675ValMet: 1.675 ± 0.344
2.759ValAsn: 2.759 ± 0.385
1.774ValPro: 1.774 ± 0.444
1.675ValGln: 1.675 ± 0.464
2.464ValArg: 2.464 ± 0.533
5.617ValSer: 5.617 ± 0.815
4.829ValThr: 4.829 ± 0.592
5.223ValVal: 5.223 ± 0.861
0.493ValTrp: 0.493 ± 0.236
2.661ValTyr: 2.661 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
0.69TrpAla: 0.69 ± 0.225
0.0TrpCys: 0.0 ± 0.0
0.493TrpAsp: 0.493 ± 0.23
1.183TrpGlu: 1.183 ± 0.355
0.493TrpPhe: 0.493 ± 0.224
0.788TrpGly: 0.788 ± 0.255
0.296TrpHis: 0.296 ± 0.189
0.986TrpIle: 0.986 ± 0.366
1.084TrpLys: 1.084 ± 0.31
0.591TrpLeu: 0.591 ± 0.409
0.394TrpMet: 0.394 ± 0.169
1.084TrpAsn: 1.084 ± 0.404
0.099TrpPro: 0.099 ± 0.114
0.296TrpGln: 0.296 ± 0.207
0.887TrpArg: 0.887 ± 0.341
0.788TrpSer: 0.788 ± 0.329
0.69TrpThr: 0.69 ± 0.29
0.887TrpVal: 0.887 ± 0.275
0.296TrpTrp: 0.296 ± 0.16
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.971TyrAla: 1.971 ± 0.503
0.493TyrCys: 0.493 ± 0.292
3.252TyrAsp: 3.252 ± 0.564
2.365TyrGlu: 2.365 ± 0.512
1.774TyrPhe: 1.774 ± 0.673
2.858TyrGly: 2.858 ± 0.649
0.887TyrHis: 0.887 ± 0.262
3.055TyrIle: 3.055 ± 0.521
2.464TyrLys: 2.464 ± 0.614
3.548TyrLeu: 3.548 ± 0.717
1.084TyrMet: 1.084 ± 0.308
1.774TyrAsn: 1.774 ± 0.502
1.774TyrPro: 1.774 ± 0.401
1.478TyrGln: 1.478 ± 0.328
2.07TyrArg: 2.07 ± 0.476
2.365TyrSer: 2.365 ± 0.47
1.971TyrThr: 1.971 ± 0.395
2.267TyrVal: 2.267 ± 0.299
0.591TyrTrp: 0.591 ± 0.299
1.675TyrTyr: 1.675 ± 0.448
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10148 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski