Amino acid dipepetide frequency for Streptococcus satellite phage Javan581

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.962AlaAla: 0.962 ± 0.744
0.321AlaCys: 0.321 ± 0.316
4.489AlaAsp: 4.489 ± 1.028
6.412AlaGlu: 6.412 ± 1.468
1.924AlaPhe: 1.924 ± 0.716
3.206AlaGly: 3.206 ± 0.884
0.962AlaHis: 0.962 ± 0.636
4.489AlaIle: 4.489 ± 0.856
6.412AlaLys: 6.412 ± 1.179
5.771AlaLeu: 5.771 ± 0.921
1.924AlaMet: 1.924 ± 0.937
3.206AlaAsn: 3.206 ± 0.693
1.603AlaPro: 1.603 ± 0.757
3.206AlaGln: 3.206 ± 0.873
1.603AlaArg: 1.603 ± 0.524
3.847AlaSer: 3.847 ± 1.063
5.45AlaThr: 5.45 ± 2.107
1.924AlaVal: 1.924 ± 0.503
0.641AlaTrp: 0.641 ± 0.369
4.168AlaTyr: 4.168 ± 0.864
0.0AlaXaa: 0.0 ± 0.0
Cys
0.641CysAla: 0.641 ± 0.4
0.0CysCys: 0.0 ± 0.0
0.321CysAsp: 0.321 ± 0.342
0.0CysGlu: 0.0 ± 0.0
0.641CysPhe: 0.641 ± 0.408
0.641CysGly: 0.641 ± 0.47
0.321CysHis: 0.321 ± 0.284
0.962CysIle: 0.962 ± 0.405
0.641CysLys: 0.641 ± 0.387
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.321CysPro: 0.321 ± 0.317
0.0CysGln: 0.0 ± 0.0
0.641CysArg: 0.641 ± 0.426
0.321CysSer: 0.321 ± 0.326
0.0CysThr: 0.0 ± 0.0
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.321CysTyr: 0.321 ± 0.346
0.0CysXaa: 0.0 ± 0.0
Asp
2.565AspAla: 2.565 ± 1.069
0.321AspCys: 0.321 ± 0.317
1.282AspAsp: 1.282 ± 0.795
3.847AspGlu: 3.847 ± 0.909
2.565AspPhe: 2.565 ± 0.76
1.924AspGly: 1.924 ± 0.887
0.641AspHis: 0.641 ± 0.503
8.015AspIle: 8.015 ± 1.424
5.13AspLys: 5.13 ± 1.317
6.092AspLeu: 6.092 ± 1.297
2.565AspMet: 2.565 ± 0.869
3.847AspAsn: 3.847 ± 1.419
1.282AspPro: 1.282 ± 0.699
1.603AspGln: 1.603 ± 0.575
0.962AspArg: 0.962 ± 0.475
4.809AspSer: 4.809 ± 0.979
2.886AspThr: 2.886 ± 1.83
2.565AspVal: 2.565 ± 0.604
1.282AspTrp: 1.282 ± 0.526
4.489AspTyr: 4.489 ± 1.007
0.0AspXaa: 0.0 ± 0.0
Glu
5.45GluAla: 5.45 ± 1.153
0.641GluCys: 0.641 ± 0.454
6.092GluAsp: 6.092 ± 1.296
5.45GluGlu: 5.45 ± 1.586
4.168GluPhe: 4.168 ± 1.851
0.962GluGly: 0.962 ± 0.481
2.565GluHis: 2.565 ± 1.082
3.527GluIle: 3.527 ± 0.764
6.412GluLys: 6.412 ± 1.437
12.504GluLeu: 12.504 ± 2.003
3.847GluMet: 3.847 ± 1.258
4.168GluAsn: 4.168 ± 0.742
1.282GluPro: 1.282 ± 0.589
5.13GluGln: 5.13 ± 1.232
4.489GluArg: 4.489 ± 1.242
1.924GluSer: 1.924 ± 0.745
3.847GluThr: 3.847 ± 1.309
5.45GluVal: 5.45 ± 1.741
0.0GluTrp: 0.0 ± 0.0
2.244GluTyr: 2.244 ± 0.846
0.0GluXaa: 0.0 ± 0.0
Phe
1.603PheAla: 1.603 ± 0.996
0.0PheCys: 0.0 ± 0.0
0.962PheAsp: 0.962 ± 0.455
4.809PheGlu: 4.809 ± 1.086
0.962PhePhe: 0.962 ± 0.523
2.565PheGly: 2.565 ± 0.673
0.641PheHis: 0.641 ± 0.443
1.603PheIle: 1.603 ± 0.48
3.847PheLys: 3.847 ± 0.948
3.527PheLeu: 3.527 ± 0.752
0.321PheMet: 0.321 ± 0.296
0.962PheAsn: 0.962 ± 0.474
0.641PhePro: 0.641 ± 0.569
3.206PheGln: 3.206 ± 0.966
2.565PheArg: 2.565 ± 0.745
2.244PheSer: 2.244 ± 0.653
0.962PheThr: 0.962 ± 0.564
1.924PheVal: 1.924 ± 0.941
0.641PheTrp: 0.641 ± 0.414
2.565PheTyr: 2.565 ± 0.74
0.0PheXaa: 0.0 ± 0.0
Gly
2.886GlyAla: 2.886 ± 1.213
0.321GlyCys: 0.321 ± 0.309
1.603GlyAsp: 1.603 ± 0.712
1.603GlyGlu: 1.603 ± 0.85
1.282GlyPhe: 1.282 ± 0.564
2.244GlyGly: 2.244 ± 1.017
0.962GlyHis: 0.962 ± 0.51
4.168GlyIle: 4.168 ± 0.837
3.527GlyLys: 3.527 ± 1.015
5.771GlyLeu: 5.771 ± 1.223
2.244GlyMet: 2.244 ± 0.919
3.847GlyAsn: 3.847 ± 0.934
0.0GlyPro: 0.0 ± 0.0
1.282GlyGln: 1.282 ± 0.623
2.244GlyArg: 2.244 ± 0.61
0.962GlySer: 0.962 ± 0.519
1.603GlyThr: 1.603 ± 0.642
2.244GlyVal: 2.244 ± 0.653
0.321GlyTrp: 0.321 ± 0.326
4.168GlyTyr: 4.168 ± 1.292
0.0GlyXaa: 0.0 ± 0.0
His
1.924HisAla: 1.924 ± 0.931
0.0HisCys: 0.0 ± 0.0
1.282HisAsp: 1.282 ± 0.653
0.962HisGlu: 0.962 ± 0.73
1.603HisPhe: 1.603 ± 0.778
0.962HisGly: 0.962 ± 0.417
0.321HisHis: 0.321 ± 0.303
1.603HisIle: 1.603 ± 0.693
0.962HisLys: 0.962 ± 0.589
0.641HisLeu: 0.641 ± 0.401
0.0HisMet: 0.0 ± 0.0
1.924HisAsn: 1.924 ± 0.886
0.321HisPro: 0.321 ± 0.296
0.641HisGln: 0.641 ± 0.379
1.282HisArg: 1.282 ± 0.512
1.282HisSer: 1.282 ± 0.449
1.603HisThr: 1.603 ± 0.677
0.641HisVal: 0.641 ± 0.415
0.321HisTrp: 0.321 ± 0.309
0.962HisTyr: 0.962 ± 0.421
0.0HisXaa: 0.0 ± 0.0
Ile
7.695IleAla: 7.695 ± 1.802
0.321IleCys: 0.321 ± 0.342
5.13IleAsp: 5.13 ± 1.838
6.733IleGlu: 6.733 ± 1.995
2.244IlePhe: 2.244 ± 0.707
3.847IleGly: 3.847 ± 0.939
2.565IleHis: 2.565 ± 0.65
2.244IleIle: 2.244 ± 0.785
5.13IleLys: 5.13 ± 1.211
5.771IleLeu: 5.771 ± 1.147
1.924IleMet: 1.924 ± 0.534
4.809IleAsn: 4.809 ± 1.295
3.206IlePro: 3.206 ± 0.781
1.924IleGln: 1.924 ± 0.653
4.809IleArg: 4.809 ± 1.0
3.206IleSer: 3.206 ± 0.835
4.809IleThr: 4.809 ± 1.375
2.244IleVal: 2.244 ± 0.682
0.0IleTrp: 0.0 ± 0.0
2.565IleTyr: 2.565 ± 0.916
0.0IleXaa: 0.0 ± 0.0
Lys
4.168LysAla: 4.168 ± 0.97
1.282LysCys: 1.282 ± 0.534
6.092LysAsp: 6.092 ± 1.303
8.977LysGlu: 8.977 ± 1.741
0.641LysPhe: 0.641 ± 0.436
1.603LysGly: 1.603 ± 0.944
1.924LysHis: 1.924 ± 0.76
6.092LysIle: 6.092 ± 1.098
8.015LysLys: 8.015 ± 2.088
8.336LysLeu: 8.336 ± 1.07
0.962LysMet: 0.962 ± 0.437
8.015LysAsn: 8.015 ± 1.675
2.565LysPro: 2.565 ± 0.831
3.206LysGln: 3.206 ± 0.564
6.092LysArg: 6.092 ± 1.48
2.565LysSer: 2.565 ± 0.766
7.695LysThr: 7.695 ± 1.838
4.809LysVal: 4.809 ± 0.865
0.962LysTrp: 0.962 ± 0.46
2.244LysTyr: 2.244 ± 0.735
0.0LysXaa: 0.0 ± 0.0
Leu
8.015LeuAla: 8.015 ± 0.889
0.0LeuCys: 0.0 ± 0.0
8.657LeuAsp: 8.657 ± 1.28
9.618LeuGlu: 9.618 ± 2.091
3.206LeuPhe: 3.206 ± 0.951
5.771LeuGly: 5.771 ± 1.781
1.924LeuHis: 1.924 ± 0.747
4.809LeuIle: 4.809 ± 0.984
8.657LeuLys: 8.657 ± 2.06
8.977LeuLeu: 8.977 ± 1.594
2.244LeuMet: 2.244 ± 0.671
8.336LeuAsn: 8.336 ± 1.737
3.206LeuPro: 3.206 ± 1.216
4.168LeuGln: 4.168 ± 0.812
2.565LeuArg: 2.565 ± 0.83
6.412LeuSer: 6.412 ± 2.186
6.733LeuThr: 6.733 ± 1.071
3.847LeuVal: 3.847 ± 0.957
0.962LeuTrp: 0.962 ± 0.534
3.847LeuTyr: 3.847 ± 0.775
0.0LeuXaa: 0.0 ± 0.0
Met
2.565MetAla: 2.565 ± 0.907
0.0MetCys: 0.0 ± 0.0
1.924MetAsp: 1.924 ± 0.707
1.282MetGlu: 1.282 ± 0.614
0.321MetPhe: 0.321 ± 0.296
0.321MetGly: 0.321 ± 0.342
0.321MetHis: 0.321 ± 0.284
1.603MetIle: 1.603 ± 0.491
2.886MetLys: 2.886 ± 1.198
2.244MetLeu: 2.244 ± 0.693
0.321MetMet: 0.321 ± 0.309
0.641MetAsn: 0.641 ± 0.383
0.321MetPro: 0.321 ± 0.328
1.603MetGln: 1.603 ± 0.644
2.244MetArg: 2.244 ± 0.798
0.962MetSer: 0.962 ± 0.456
5.13MetThr: 5.13 ± 1.539
1.924MetVal: 1.924 ± 0.606
0.321MetTrp: 0.321 ± 0.326
0.641MetTyr: 0.641 ± 0.389
0.0MetXaa: 0.0 ± 0.0
Asn
5.771AsnAla: 5.771 ± 1.762
0.0AsnCys: 0.0 ± 0.0
3.206AsnAsp: 3.206 ± 1.194
2.565AsnGlu: 2.565 ± 0.695
2.886AsnPhe: 2.886 ± 0.883
2.886AsnGly: 2.886 ± 0.683
0.962AsnHis: 0.962 ± 0.421
4.489AsnIle: 4.489 ± 1.257
4.489AsnLys: 4.489 ± 1.271
5.771AsnLeu: 5.771 ± 0.975
2.244AsnMet: 2.244 ± 1.023
2.886AsnAsn: 2.886 ± 0.967
3.527AsnPro: 3.527 ± 0.874
2.244AsnGln: 2.244 ± 0.975
5.13AsnArg: 5.13 ± 1.045
3.206AsnSer: 3.206 ± 1.311
2.565AsnThr: 2.565 ± 0.878
1.924AsnVal: 1.924 ± 0.689
0.962AsnTrp: 0.962 ± 0.632
2.565AsnTyr: 2.565 ± 0.388
0.0AsnXaa: 0.0 ± 0.0
Pro
0.641ProAla: 0.641 ± 0.369
0.641ProCys: 0.641 ± 0.569
1.282ProAsp: 1.282 ± 0.646
3.527ProGlu: 3.527 ± 0.807
0.962ProPhe: 0.962 ± 0.489
0.962ProGly: 0.962 ± 0.409
0.962ProHis: 0.962 ± 0.487
1.924ProIle: 1.924 ± 0.825
2.244ProLys: 2.244 ± 0.816
3.527ProLeu: 3.527 ± 0.937
0.641ProMet: 0.641 ± 0.387
1.603ProAsn: 1.603 ± 0.694
1.282ProPro: 1.282 ± 0.875
1.603ProGln: 1.603 ± 0.616
0.962ProArg: 0.962 ± 0.424
0.641ProSer: 0.641 ± 0.4
1.924ProThr: 1.924 ± 0.7
0.962ProVal: 0.962 ± 0.426
0.0ProTrp: 0.0 ± 0.0
1.924ProTyr: 1.924 ± 0.636
0.0ProXaa: 0.0 ± 0.0
Gln
2.565GlnAla: 2.565 ± 1.099
0.321GlnCys: 0.321 ± 0.346
2.244GlnAsp: 2.244 ± 0.624
3.527GlnGlu: 3.527 ± 1.102
2.244GlnPhe: 2.244 ± 0.475
1.603GlnGly: 1.603 ± 0.669
0.641GlnHis: 0.641 ± 0.415
2.244GlnIle: 2.244 ± 0.437
2.886GlnLys: 2.886 ± 1.118
3.527GlnLeu: 3.527 ± 0.823
0.962GlnMet: 0.962 ± 0.542
2.886GlnAsn: 2.886 ± 0.91
1.924GlnPro: 1.924 ± 0.708
2.244GlnGln: 2.244 ± 0.944
3.527GlnArg: 3.527 ± 1.003
5.13GlnSer: 5.13 ± 1.964
3.527GlnThr: 3.527 ± 0.954
2.244GlnVal: 2.244 ± 0.584
0.321GlnTrp: 0.321 ± 0.316
2.565GlnTyr: 2.565 ± 0.807
0.0GlnXaa: 0.0 ± 0.0
Arg
2.565ArgAla: 2.565 ± 0.757
0.0ArgCys: 0.0 ± 0.0
2.886ArgAsp: 2.886 ± 0.72
3.847ArgGlu: 3.847 ± 0.956
1.282ArgPhe: 1.282 ± 0.626
1.924ArgGly: 1.924 ± 0.745
0.641ArgHis: 0.641 ± 0.338
4.489ArgIle: 4.489 ± 0.831
4.809ArgLys: 4.809 ± 1.183
8.657ArgLeu: 8.657 ± 1.308
0.962ArgMet: 0.962 ± 0.612
2.565ArgAsn: 2.565 ± 0.826
0.641ArgPro: 0.641 ± 0.607
4.168ArgGln: 4.168 ± 1.341
2.565ArgArg: 2.565 ± 0.63
1.924ArgSer: 1.924 ± 0.948
2.565ArgThr: 2.565 ± 0.689
3.847ArgVal: 3.847 ± 1.156
0.321ArgTrp: 0.321 ± 0.291
3.206ArgTyr: 3.206 ± 0.961
0.0ArgXaa: 0.0 ± 0.0
Ser
1.924SerAla: 1.924 ± 0.81
0.962SerCys: 0.962 ± 0.596
3.847SerAsp: 3.847 ± 0.675
4.168SerGlu: 4.168 ± 0.699
1.603SerPhe: 1.603 ± 0.541
2.565SerGly: 2.565 ± 0.552
0.962SerHis: 0.962 ± 0.448
4.489SerIle: 4.489 ± 1.517
5.45SerLys: 5.45 ± 1.148
5.45SerLeu: 5.45 ± 1.124
0.962SerMet: 0.962 ± 0.658
2.565SerAsn: 2.565 ± 0.639
1.924SerPro: 1.924 ± 0.625
2.244SerGln: 2.244 ± 0.744
3.527SerArg: 3.527 ± 1.306
1.924SerSer: 1.924 ± 0.798
3.527SerThr: 3.527 ± 1.363
1.282SerVal: 1.282 ± 0.757
0.321SerTrp: 0.321 ± 0.293
1.282SerTyr: 1.282 ± 0.619
0.0SerXaa: 0.0 ± 0.0
Thr
3.206ThrAla: 3.206 ± 1.114
0.0ThrCys: 0.0 ± 0.0
3.847ThrAsp: 3.847 ± 1.122
5.13ThrGlu: 5.13 ± 1.335
4.168ThrPhe: 4.168 ± 2.407
4.168ThrGly: 4.168 ± 1.169
0.641ThrHis: 0.641 ± 0.409
6.412ThrIle: 6.412 ± 1.243
3.847ThrLys: 3.847 ± 1.272
5.45ThrLeu: 5.45 ± 1.391
2.565ThrMet: 2.565 ± 0.854
2.886ThrAsn: 2.886 ± 1.019
2.565ThrPro: 2.565 ± 0.801
1.924ThrGln: 1.924 ± 0.956
2.886ThrArg: 2.886 ± 1.164
3.847ThrSer: 3.847 ± 1.487
4.489ThrThr: 4.489 ± 1.475
3.847ThrVal: 3.847 ± 1.464
0.641ThrTrp: 0.641 ± 0.383
1.603ThrTyr: 1.603 ± 0.691
0.0ThrXaa: 0.0 ± 0.0
Val
4.809ValAla: 4.809 ± 0.947
0.321ValCys: 0.321 ± 0.306
1.282ValAsp: 1.282 ± 0.721
3.206ValGlu: 3.206 ± 1.168
2.244ValPhe: 2.244 ± 0.705
2.244ValGly: 2.244 ± 1.042
0.962ValHis: 0.962 ± 0.63
3.527ValIle: 3.527 ± 0.926
6.733ValLys: 6.733 ± 1.666
3.847ValLeu: 3.847 ± 1.182
0.641ValMet: 0.641 ± 0.436
2.565ValAsn: 2.565 ± 0.93
0.641ValPro: 0.641 ± 0.377
1.924ValGln: 1.924 ± 0.656
1.924ValArg: 1.924 ± 0.838
3.527ValSer: 3.527 ± 0.686
0.962ValThr: 0.962 ± 0.448
1.603ValVal: 1.603 ± 0.579
0.0ValTrp: 0.0 ± 0.0
2.565ValTyr: 2.565 ± 0.686
0.0ValXaa: 0.0 ± 0.0
Trp
0.962TrpAla: 0.962 ± 0.664
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.641TrpGlu: 0.641 ± 0.397
0.321TrpPhe: 0.321 ± 0.36
0.321TrpGly: 0.321 ± 0.315
0.0TrpHis: 0.0 ± 0.0
0.641TrpIle: 0.641 ± 0.457
0.0TrpLys: 0.0 ± 0.0
0.962TrpLeu: 0.962 ± 0.467
0.321TrpMet: 0.321 ± 0.293
0.641TrpAsn: 0.641 ± 0.338
0.0TrpPro: 0.0 ± 0.0
0.641TrpGln: 0.641 ± 0.391
0.321TrpArg: 0.321 ± 0.306
0.641TrpSer: 0.641 ± 0.376
0.962TrpThr: 0.962 ± 0.512
0.321TrpVal: 0.321 ± 0.317
0.321TrpTrp: 0.321 ± 0.309
0.641TrpTyr: 0.641 ± 0.503
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.924TyrAla: 1.924 ± 1.112
0.321TyrCys: 0.321 ± 0.284
2.244TyrAsp: 2.244 ± 0.558
3.847TyrGlu: 3.847 ± 0.679
1.282TyrPhe: 1.282 ± 0.959
2.565TyrGly: 2.565 ± 0.735
0.321TyrHis: 0.321 ± 0.296
3.847TyrIle: 3.847 ± 1.2
4.168TyrLys: 4.168 ± 1.538
5.13TyrLeu: 5.13 ± 1.443
1.603TyrMet: 1.603 ± 0.73
1.924TyrAsn: 1.924 ± 1.114
0.962TyrPro: 0.962 ± 0.432
4.168TyrGln: 4.168 ± 1.07
3.527TyrArg: 3.527 ± 1.013
1.603TyrSer: 1.603 ± 0.445
2.886TyrThr: 2.886 ± 0.702
1.924TyrVal: 1.924 ± 0.723
0.321TyrTrp: 0.321 ± 0.284
3.527TyrTyr: 3.527 ± 1.092
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 17 proteins (3120 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski