Amino acid dipeptide frequency for Streptococcus phage Javan14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
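The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the use of one-letter codes, the mapping of every non-standard character to X, and the normalisation by the total number of dipeptides are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)
EXPECTED_PERMILLE = 1000.0 / 400    # uniform expectation: 0.25 % = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(permille):
    """Label a frequency relative to the uniform expectation."""
    if permille > EXPECTED_PERMILLE:
        return "over"
    if permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


freqs = dipeptide_permille(["AAK", "KAA"])   # toy input, not real data
print(freqs["AA"], classify(freqs["AA"]))    # 500.0 over
print(len(freqs))                            # 441
```

The toy input contains four dipeptides (AA, AK, KA, AA), so AA accounts for half of them, i.e. 500‰, far above the 2.5‰ expected under the uniform model.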

For more information, see the sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.862AlaAla: 3.862 ± 1.198
0.158AlaCys: 0.158 ± 0.112
3.231AlaAsp: 3.231 ± 0.548
3.704AlaGlu: 3.704 ± 0.536
3.074AlaPhe: 3.074 ± 0.451
3.31AlaGly: 3.31 ± 0.665
1.103AlaHis: 1.103 ± 0.305
4.886AlaIle: 4.886 ± 0.6
5.832AlaLys: 5.832 ± 1.005
5.911AlaLeu: 5.911 ± 0.752
1.813AlaMet: 1.813 ± 0.371
4.019AlaAsn: 4.019 ± 0.508
1.813AlaPro: 1.813 ± 0.487
1.813AlaGln: 1.813 ± 0.313
2.443AlaArg: 2.443 ± 0.398
4.334AlaSer: 4.334 ± 0.608
4.256AlaThr: 4.256 ± 0.508
3.783AlaVal: 3.783 ± 0.543
0.709AlaTrp: 0.709 ± 0.258
3.31AlaTyr: 3.31 ± 0.536
0.0AlaXaa: 0.0 ± 0.0
Cys
0.158CysAla: 0.158 ± 0.104
0.0CysCys: 0.0 ± 0.0
0.236CysAsp: 0.236 ± 0.157
0.552CysGlu: 0.552 ± 0.249
0.158CysPhe: 0.158 ± 0.104
0.236CysGly: 0.236 ± 0.133
0.079CysHis: 0.079 ± 0.078
0.315CysIle: 0.315 ± 0.15
0.552CysLys: 0.552 ± 0.226
0.394CysLeu: 0.394 ± 0.145
0.236CysMet: 0.236 ± 0.142
0.236CysAsn: 0.236 ± 0.124
0.0CysPro: 0.0 ± 0.0
0.236CysGln: 0.236 ± 0.129
0.236CysArg: 0.236 ± 0.144
0.394CysSer: 0.394 ± 0.203
0.315CysThr: 0.315 ± 0.148
0.394CysVal: 0.394 ± 0.152
0.158CysTrp: 0.158 ± 0.105
0.315CysTyr: 0.315 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
2.601AspAla: 2.601 ± 0.429
0.552AspCys: 0.552 ± 0.222
4.729AspAsp: 4.729 ± 0.796
5.123AspGlu: 5.123 ± 0.668
3.389AspPhe: 3.389 ± 0.539
4.729AspGly: 4.729 ± 1.168
0.709AspHis: 0.709 ± 0.244
5.28AspIle: 5.28 ± 0.777
5.595AspLys: 5.595 ± 0.527
4.965AspLeu: 4.965 ± 0.623
2.364AspMet: 2.364 ± 0.312
3.546AspAsn: 3.546 ± 0.449
1.813AspPro: 1.813 ± 0.357
1.261AspGln: 1.261 ± 0.308
1.97AspArg: 1.97 ± 0.322
4.965AspSer: 4.965 ± 0.558
3.625AspThr: 3.625 ± 0.493
3.389AspVal: 3.389 ± 0.526
1.025AspTrp: 1.025 ± 0.247
4.256AspTyr: 4.256 ± 0.741
0.0AspXaa: 0.0 ± 0.0
Glu
4.413GluAla: 4.413 ± 0.892
0.473GluCys: 0.473 ± 0.21
4.65GluAsp: 4.65 ± 0.575
5.911GluGlu: 5.911 ± 0.699
3.389GluPhe: 3.389 ± 0.557
2.522GluGly: 2.522 ± 0.417
1.182GluHis: 1.182 ± 0.337
6.226GluIle: 6.226 ± 0.784
5.753GluLys: 5.753 ± 0.824
8.905GluLeu: 8.905 ± 1.279
2.049GluMet: 2.049 ± 0.484
4.413GluAsn: 4.413 ± 0.751
2.364GluPro: 2.364 ± 0.568
2.758GluGln: 2.758 ± 0.743
2.522GluArg: 2.522 ± 0.438
4.334GluSer: 4.334 ± 0.649
4.413GluThr: 4.413 ± 0.588
5.28GluVal: 5.28 ± 0.682
1.103GluTrp: 1.103 ± 0.254
2.837GluTyr: 2.837 ± 0.476
0.0GluXaa: 0.0 ± 0.0
Phe
2.837PheAla: 2.837 ± 0.487
0.236PheCys: 0.236 ± 0.144
2.916PheAsp: 2.916 ± 0.526
3.862PheGlu: 3.862 ± 0.543
1.734PhePhe: 1.734 ± 0.417
1.97PheGly: 1.97 ± 0.423
0.315PheHis: 0.315 ± 0.159
3.389PheIle: 3.389 ± 0.576
4.807PheLys: 4.807 ± 0.571
2.679PheLeu: 2.679 ± 0.486
1.261PheMet: 1.261 ± 0.367
2.128PheAsn: 2.128 ± 0.418
0.946PhePro: 0.946 ± 0.265
0.867PheGln: 0.867 ± 0.214
2.128PheArg: 2.128 ± 0.412
3.074PheSer: 3.074 ± 0.56
2.285PheThr: 2.285 ± 0.476
2.995PheVal: 2.995 ± 0.497
0.315PheTrp: 0.315 ± 0.167
1.97PheTyr: 1.97 ± 0.407
0.0PheXaa: 0.0 ± 0.0
Gly
4.334GlyAla: 4.334 ± 0.965
0.079GlyCys: 0.079 ± 0.08
4.413GlyAsp: 4.413 ± 0.584
3.468GlyGlu: 3.468 ± 0.525
3.783GlyPhe: 3.783 ± 0.675
3.94GlyGly: 3.94 ± 0.696
0.867GlyHis: 0.867 ± 0.302
4.571GlyIle: 4.571 ± 0.748
5.438GlyLys: 5.438 ± 0.687
4.177GlyLeu: 4.177 ± 0.662
1.34GlyMet: 1.34 ± 0.422
3.31GlyAsn: 3.31 ± 0.434
1.182GlyPro: 1.182 ± 0.391
1.734GlyGln: 1.734 ± 0.341
2.207GlyArg: 2.207 ± 0.376
3.152GlySer: 3.152 ± 0.612
4.413GlyThr: 4.413 ± 0.787
3.704GlyVal: 3.704 ± 0.614
1.103GlyTrp: 1.103 ± 0.257
2.522GlyTyr: 2.522 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
0.867HisAla: 0.867 ± 0.292
0.079HisCys: 0.079 ± 0.079
0.788HisAsp: 0.788 ± 0.25
1.261HisGlu: 1.261 ± 0.362
0.158HisPhe: 0.158 ± 0.121
0.552HisGly: 0.552 ± 0.233
0.236HisHis: 0.236 ± 0.176
0.788HisIle: 0.788 ± 0.231
1.182HisLys: 1.182 ± 0.352
1.182HisLeu: 1.182 ± 0.321
0.236HisMet: 0.236 ± 0.145
0.867HisAsn: 0.867 ± 0.32
0.473HisPro: 0.473 ± 0.152
0.552HisGln: 0.552 ± 0.187
0.552HisArg: 0.552 ± 0.168
0.946HisSer: 0.946 ± 0.317
0.315HisThr: 0.315 ± 0.195
1.025HisVal: 1.025 ± 0.264
0.0HisTrp: 0.0 ± 0.0
0.552HisTyr: 0.552 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
5.438IleAla: 5.438 ± 0.582
0.236IleCys: 0.236 ± 0.119
6.462IleAsp: 6.462 ± 0.668
6.226IleGlu: 6.226 ± 0.854
2.522IlePhe: 2.522 ± 0.622
4.177IleGly: 4.177 ± 0.747
0.552IleHis: 0.552 ± 0.211
3.546IleIle: 3.546 ± 0.517
6.62IleLys: 6.62 ± 0.778
5.123IleLeu: 5.123 ± 0.574
1.497IleMet: 1.497 ± 0.371
4.729IleAsn: 4.729 ± 0.739
2.995IlePro: 2.995 ± 0.43
2.443IleGln: 2.443 ± 0.46
2.601IleArg: 2.601 ± 0.458
4.729IleSer: 4.729 ± 0.867
3.704IleThr: 3.704 ± 0.505
5.595IleVal: 5.595 ± 0.592
0.63IleTrp: 0.63 ± 0.179
2.837IleTyr: 2.837 ± 0.46
0.0IleXaa: 0.0 ± 0.0
Lys
6.541LysAla: 6.541 ± 1.002
0.315LysCys: 0.315 ± 0.187
5.911LysAsp: 5.911 ± 0.667
7.566LysGlu: 7.566 ± 1.12
3.389LysPhe: 3.389 ± 0.62
5.438LysGly: 5.438 ± 0.618
0.788LysHis: 0.788 ± 0.258
6.147LysIle: 6.147 ± 0.958
9.221LysLys: 9.221 ± 1.014
8.196LysLeu: 8.196 ± 0.974
2.285LysMet: 2.285 ± 0.454
5.989LysAsn: 5.989 ± 0.565
2.364LysPro: 2.364 ± 0.533
4.098LysGln: 4.098 ± 0.578
4.413LysArg: 4.413 ± 0.493
4.492LysSer: 4.492 ± 0.69
5.989LysThr: 5.989 ± 0.664
4.334LysVal: 4.334 ± 0.693
0.946LysTrp: 0.946 ± 0.252
3.704LysTyr: 3.704 ± 0.488
0.0LysXaa: 0.0 ± 0.0
Leu
5.044LeuAla: 5.044 ± 0.621
0.473LeuCys: 0.473 ± 0.166
5.911LeuAsp: 5.911 ± 0.505
6.935LeuGlu: 6.935 ± 0.792
2.995LeuPhe: 2.995 ± 0.554
5.517LeuGly: 5.517 ± 0.781
0.946LeuHis: 0.946 ± 0.297
5.044LeuIle: 5.044 ± 0.67
7.881LeuLys: 7.881 ± 0.922
6.935LeuLeu: 6.935 ± 0.747
2.049LeuMet: 2.049 ± 0.376
5.438LeuAsn: 5.438 ± 0.595
2.443LeuPro: 2.443 ± 0.359
2.758LeuGln: 2.758 ± 0.515
2.916LeuArg: 2.916 ± 0.43
5.832LeuSer: 5.832 ± 0.716
4.807LeuThr: 4.807 ± 0.667
4.492LeuVal: 4.492 ± 0.662
1.025LeuTrp: 1.025 ± 0.273
3.152LeuTyr: 3.152 ± 0.386
0.0LeuXaa: 0.0 ± 0.0
Met
1.655MetAla: 1.655 ± 0.299
0.0MetCys: 0.0 ± 0.0
1.103MetAsp: 1.103 ± 0.353
1.734MetGlu: 1.734 ± 0.523
1.182MetPhe: 1.182 ± 0.281
0.867MetGly: 0.867 ± 0.351
0.236MetHis: 0.236 ± 0.122
1.576MetIle: 1.576 ± 0.517
2.601MetLys: 2.601 ± 0.472
2.285MetLeu: 2.285 ± 0.401
0.552MetMet: 0.552 ± 0.198
0.946MetAsn: 0.946 ± 0.213
0.867MetPro: 0.867 ± 0.285
1.103MetGln: 1.103 ± 0.251
0.867MetArg: 0.867 ± 0.262
1.97MetSer: 1.97 ± 0.37
2.049MetThr: 2.049 ± 0.465
1.497MetVal: 1.497 ± 0.286
0.315MetTrp: 0.315 ± 0.123
0.394MetTyr: 0.394 ± 0.185
0.0MetXaa: 0.0 ± 0.0
Asn
4.729AsnAla: 4.729 ± 0.725
0.236AsnCys: 0.236 ± 0.123
3.783AsnAsp: 3.783 ± 0.62
3.862AsnGlu: 3.862 ± 0.582
2.679AsnPhe: 2.679 ± 0.4
4.807AsnGly: 4.807 ± 0.631
0.788AsnHis: 0.788 ± 0.271
4.256AsnIle: 4.256 ± 0.5
4.729AsnLys: 4.729 ± 0.591
5.044AsnLeu: 5.044 ± 0.587
1.025AsnMet: 1.025 ± 0.261
3.31AsnAsn: 3.31 ± 0.52
2.207AsnPro: 2.207 ± 0.33
2.207AsnGln: 2.207 ± 0.5
2.285AsnArg: 2.285 ± 0.435
3.231AsnSer: 3.231 ± 0.451
2.995AsnThr: 2.995 ± 0.632
3.231AsnVal: 3.231 ± 0.539
0.867AsnTrp: 0.867 ± 0.293
3.231AsnTyr: 3.231 ± 0.602
0.0AsnXaa: 0.0 ± 0.0
Pro
1.497ProAla: 1.497 ± 0.339
0.079ProCys: 0.079 ± 0.079
1.97ProAsp: 1.97 ± 0.55
2.285ProGlu: 2.285 ± 0.441
1.182ProPhe: 1.182 ± 0.285
1.576ProGly: 1.576 ± 0.442
0.315ProHis: 0.315 ± 0.167
2.601ProIle: 2.601 ± 0.37
2.758ProLys: 2.758 ± 0.563
2.049ProLeu: 2.049 ± 0.491
0.63ProMet: 0.63 ± 0.215
1.103ProAsn: 1.103 ± 0.265
0.867ProPro: 0.867 ± 0.276
1.103ProGln: 1.103 ± 0.243
1.34ProArg: 1.34 ± 0.371
2.207ProSer: 2.207 ± 0.371
1.813ProThr: 1.813 ± 0.339
1.34ProVal: 1.34 ± 0.299
0.079ProTrp: 0.079 ± 0.081
1.182ProTyr: 1.182 ± 0.38
0.0ProXaa: 0.0 ± 0.0
Gln
2.522GlnAla: 2.522 ± 0.391
0.473GlnCys: 0.473 ± 0.188
1.34GlnAsp: 1.34 ± 0.323
2.758GlnGlu: 2.758 ± 0.508
0.946GlnPhe: 0.946 ± 0.209
1.97GlnGly: 1.97 ± 0.667
0.63GlnHis: 0.63 ± 0.186
2.995GlnIle: 2.995 ± 0.454
4.019GlnLys: 4.019 ± 0.456
4.098GlnLeu: 4.098 ± 0.484
0.867GlnMet: 0.867 ± 0.243
2.128GlnAsn: 2.128 ± 0.461
0.552GlnPro: 0.552 ± 0.218
1.576GlnGln: 1.576 ± 0.384
1.734GlnArg: 1.734 ± 0.398
2.916GlnSer: 2.916 ± 0.606
1.103GlnThr: 1.103 ± 0.441
1.419GlnVal: 1.419 ± 0.314
0.315GlnTrp: 0.315 ± 0.126
1.419GlnTyr: 1.419 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
1.734ArgAla: 1.734 ± 0.326
0.079ArgCys: 0.079 ± 0.072
2.995ArgAsp: 2.995 ± 0.583
3.152ArgGlu: 3.152 ± 0.545
1.655ArgPhe: 1.655 ± 0.328
2.758ArgGly: 2.758 ± 0.49
0.63ArgHis: 0.63 ± 0.242
3.783ArgIle: 3.783 ± 0.469
2.995ArgLys: 2.995 ± 0.522
3.546ArgLeu: 3.546 ± 0.511
1.813ArgMet: 1.813 ± 0.415
1.891ArgAsn: 1.891 ± 0.531
1.182ArgPro: 1.182 ± 0.324
1.419ArgGln: 1.419 ± 0.276
2.128ArgArg: 2.128 ± 0.501
1.261ArgSer: 1.261 ± 0.267
2.758ArgThr: 2.758 ± 0.337
2.285ArgVal: 2.285 ± 0.454
0.709ArgTrp: 0.709 ± 0.223
1.497ArgTyr: 1.497 ± 0.281
0.0ArgXaa: 0.0 ± 0.0
Ser
3.94SerAla: 3.94 ± 0.569
0.236SerCys: 0.236 ± 0.127
4.965SerAsp: 4.965 ± 0.669
5.044SerGlu: 5.044 ± 0.62
2.758SerPhe: 2.758 ± 0.502
4.65SerGly: 4.65 ± 0.734
0.946SerHis: 0.946 ± 0.253
3.625SerIle: 3.625 ± 0.507
6.541SerLys: 6.541 ± 0.825
3.862SerLeu: 3.862 ± 0.479
1.182SerMet: 1.182 ± 0.317
4.256SerAsn: 4.256 ± 0.672
1.025SerPro: 1.025 ± 0.339
2.285SerGln: 2.285 ± 0.428
2.364SerArg: 2.364 ± 0.355
5.123SerSer: 5.123 ± 0.735
3.704SerThr: 3.704 ± 0.509
4.965SerVal: 4.965 ± 0.633
0.394SerTrp: 0.394 ± 0.164
2.837SerTyr: 2.837 ± 0.499
0.0SerXaa: 0.0 ± 0.0
Thr
3.783ThrAla: 3.783 ± 0.661
0.315ThrCys: 0.315 ± 0.149
3.468ThrAsp: 3.468 ± 0.568
3.625ThrGlu: 3.625 ± 0.384
3.468ThrPhe: 3.468 ± 0.521
4.256ThrGly: 4.256 ± 0.773
0.552ThrHis: 0.552 ± 0.176
5.123ThrIle: 5.123 ± 0.651
5.201ThrLys: 5.201 ± 0.423
4.571ThrLeu: 4.571 ± 0.606
1.025ThrMet: 1.025 ± 0.224
3.389ThrAsn: 3.389 ± 0.555
1.891ThrPro: 1.891 ± 0.347
2.679ThrGln: 2.679 ± 0.448
2.285ThrArg: 2.285 ± 0.468
3.468ThrSer: 3.468 ± 0.541
4.019ThrThr: 4.019 ± 0.507
5.044ThrVal: 5.044 ± 0.749
0.315ThrTrp: 0.315 ± 0.154
2.522ThrTyr: 2.522 ± 0.505
0.0ThrXaa: 0.0 ± 0.0
Val
3.625ValAla: 3.625 ± 0.531
0.315ValCys: 0.315 ± 0.161
3.546ValAsp: 3.546 ± 0.463
4.807ValGlu: 4.807 ± 0.55
1.97ValPhe: 1.97 ± 0.379
3.546ValGly: 3.546 ± 0.61
0.709ValHis: 0.709 ± 0.237
4.807ValIle: 4.807 ± 0.618
5.832ValLys: 5.832 ± 0.611
4.177ValLeu: 4.177 ± 0.537
0.709ValMet: 0.709 ± 0.279
4.413ValAsn: 4.413 ± 0.55
1.576ValPro: 1.576 ± 0.391
2.049ValGln: 2.049 ± 0.538
3.074ValArg: 3.074 ± 0.563
4.492ValSer: 4.492 ± 0.569
4.807ValThr: 4.807 ± 0.498
3.152ValVal: 3.152 ± 0.535
0.552ValTrp: 0.552 ± 0.16
2.443ValTyr: 2.443 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
0.709TrpAla: 0.709 ± 0.182
0.158TrpCys: 0.158 ± 0.136
0.394TrpAsp: 0.394 ± 0.145
0.63TrpGlu: 0.63 ± 0.224
0.63TrpPhe: 0.63 ± 0.207
0.63TrpGly: 0.63 ± 0.299
0.236TrpHis: 0.236 ± 0.146
1.025TrpIle: 1.025 ± 0.259
1.103TrpLys: 1.103 ± 0.238
0.946TrpLeu: 0.946 ± 0.393
0.0TrpMet: 0.0 ± 0.0
0.709TrpAsn: 0.709 ± 0.264
0.0TrpPro: 0.0 ± 0.0
0.63TrpGln: 0.63 ± 0.178
0.552TrpArg: 0.552 ± 0.199
0.709TrpSer: 0.709 ± 0.233
1.182TrpThr: 1.182 ± 0.282
0.315TrpVal: 0.315 ± 0.127
0.158TrpTrp: 0.158 ± 0.106
0.236TrpTyr: 0.236 ± 0.12
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.074TyrAla: 3.074 ± 0.588
0.709TyrCys: 0.709 ± 0.207
3.074TyrAsp: 3.074 ± 0.524
2.995TyrGlu: 2.995 ± 0.506
1.655TyrPhe: 1.655 ± 0.357
2.364TyrGly: 2.364 ± 0.438
0.788TyrHis: 0.788 ± 0.218
2.679TyrIle: 2.679 ± 0.485
3.468TyrLys: 3.468 ± 0.481
3.546TyrLeu: 3.546 ± 0.551
0.788TyrMet: 0.788 ± 0.237
2.679TyrAsn: 2.679 ± 0.454
1.419TyrPro: 1.419 ± 0.41
2.207TyrGln: 2.207 ± 0.445
1.655TyrArg: 1.655 ± 0.393
3.074TyrSer: 3.074 ± 0.396
2.443TyrThr: 2.443 ± 0.57
2.364TyrVal: 2.364 ± 0.386
0.236TyrTrp: 0.236 ± 0.153
1.813TyrTyr: 1.813 ± 0.487
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12690 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
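A protein-level bootstrap of this kind could look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_permille(proteins, dipeptide, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) over protein-level resamples."""
    rng = random.Random(seed)               # fixed seed for reproducibility
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


mean, err = bootstrap_permille(["AAKAA", "KKAK", "AKAK"], "AA")  # toy input
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps the residues of each protein together, so the spread reflects variation between proteins in the proteome.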

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski