Amino acid dipepetide frequency for Streptococcus satellite phage Javan397

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.319AlaCys: 0.319 ± 0.35
2.553AlaAsp: 2.553 ± 1.043
4.148AlaGlu: 4.148 ± 1.072
2.234AlaPhe: 2.234 ± 0.822
1.914AlaGly: 1.914 ± 0.687
0.0AlaHis: 0.0 ± 0.0
3.829AlaIle: 3.829 ± 1.165
4.148AlaLys: 4.148 ± 0.98
5.105AlaLeu: 5.105 ± 1.186
0.638AlaMet: 0.638 ± 0.396
4.467AlaAsn: 4.467 ± 1.152
1.276AlaPro: 1.276 ± 0.735
1.595AlaGln: 1.595 ± 0.62
1.276AlaArg: 1.276 ± 0.503
1.914AlaSer: 1.914 ± 0.651
4.148AlaThr: 4.148 ± 0.704
2.553AlaVal: 2.553 ± 0.779
0.638AlaTrp: 0.638 ± 0.329
1.276AlaTyr: 1.276 ± 0.663
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.319CysCys: 0.319 ± 0.342
0.638CysAsp: 0.638 ± 0.374
0.638CysGlu: 0.638 ± 0.389
0.638CysPhe: 0.638 ± 0.479
0.638CysGly: 0.638 ± 0.465
0.319CysHis: 0.319 ± 0.268
1.595CysIle: 1.595 ± 0.768
0.0CysLys: 0.0 ± 0.0
0.957CysLeu: 0.957 ± 0.514
0.0CysMet: 0.0 ± 0.0
0.319CysAsn: 0.319 ± 0.285
0.319CysPro: 0.319 ± 0.31
0.0CysGln: 0.0 ± 0.0
0.319CysArg: 0.319 ± 0.31
0.319CysSer: 0.319 ± 0.268
0.319CysThr: 0.319 ± 0.283
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.957AspAla: 0.957 ± 0.511
0.638AspCys: 0.638 ± 0.475
3.829AspAsp: 3.829 ± 1.048
2.872AspGlu: 2.872 ± 0.929
5.743AspPhe: 5.743 ± 1.589
0.957AspGly: 0.957 ± 0.578
0.319AspHis: 0.319 ± 0.31
10.211AspIle: 10.211 ± 1.599
4.786AspLys: 4.786 ± 1.216
4.786AspLeu: 4.786 ± 1.245
0.638AspMet: 0.638 ± 0.505
3.829AspAsn: 3.829 ± 1.115
1.276AspPro: 1.276 ± 0.405
4.148AspGln: 4.148 ± 1.175
2.553AspArg: 2.553 ± 0.852
3.829AspSer: 3.829 ± 1.047
2.553AspThr: 2.553 ± 0.781
4.467AspVal: 4.467 ± 1.159
0.638AspTrp: 0.638 ± 0.436
4.148AspTyr: 4.148 ± 1.189
0.0AspXaa: 0.0 ± 0.0
Glu
3.191GluAla: 3.191 ± 1.058
0.0GluCys: 0.0 ± 0.0
2.872GluAsp: 2.872 ± 1.17
6.063GluGlu: 6.063 ± 1.441
3.51GluPhe: 3.51 ± 1.309
2.872GluGly: 2.872 ± 0.989
2.553GluHis: 2.553 ± 0.935
7.658GluIle: 7.658 ± 1.578
6.063GluLys: 6.063 ± 1.37
10.211GluLeu: 10.211 ± 1.912
0.957GluMet: 0.957 ± 0.568
3.829GluAsn: 3.829 ± 0.982
2.234GluPro: 2.234 ± 0.879
0.638GluGln: 0.638 ± 0.45
2.872GluArg: 2.872 ± 1.259
3.191GluSer: 3.191 ± 1.336
2.553GluThr: 2.553 ± 0.786
4.786GluVal: 4.786 ± 1.196
0.638GluTrp: 0.638 ± 0.537
3.829GluTyr: 3.829 ± 0.895
0.0GluXaa: 0.0 ± 0.0
Phe
0.957PheAla: 0.957 ± 0.533
0.0PheCys: 0.0 ± 0.0
6.063PheAsp: 6.063 ± 2.152
2.872PheGlu: 2.872 ± 1.081
2.872PhePhe: 2.872 ± 1.47
3.829PheGly: 3.829 ± 0.976
0.638PheHis: 0.638 ± 0.374
2.872PheIle: 2.872 ± 0.975
4.467PheLys: 4.467 ± 1.228
6.382PheLeu: 6.382 ± 2.05
0.319PheMet: 0.319 ± 0.31
3.51PheAsn: 3.51 ± 0.852
0.638PhePro: 0.638 ± 0.442
0.638PheGln: 0.638 ± 0.411
1.276PheArg: 1.276 ± 0.603
3.829PheSer: 3.829 ± 0.976
1.595PheThr: 1.595 ± 0.692
2.872PheVal: 2.872 ± 0.741
0.957PheTrp: 0.957 ± 0.616
1.914PheTyr: 1.914 ± 0.678
0.0PheXaa: 0.0 ± 0.0
Gly
1.276GlyAla: 1.276 ± 0.754
0.957GlyCys: 0.957 ± 0.618
3.191GlyAsp: 3.191 ± 0.958
1.595GlyGlu: 1.595 ± 0.614
3.191GlyPhe: 3.191 ± 0.737
1.595GlyGly: 1.595 ± 0.509
1.276GlyHis: 1.276 ± 0.607
3.829GlyIle: 3.829 ± 0.822
3.191GlyLys: 3.191 ± 1.316
5.743GlyLeu: 5.743 ± 1.726
1.595GlyMet: 1.595 ± 0.729
1.914GlyAsn: 1.914 ± 0.852
0.638GlyPro: 0.638 ± 0.407
2.553GlyGln: 2.553 ± 0.793
1.276GlyArg: 1.276 ± 0.474
2.234GlySer: 2.234 ± 0.606
2.234GlyThr: 2.234 ± 0.757
1.276GlyVal: 1.276 ± 0.425
0.638GlyTrp: 0.638 ± 0.537
2.872GlyTyr: 2.872 ± 0.909
0.0GlyXaa: 0.0 ± 0.0
His
1.276HisAla: 1.276 ± 0.947
0.0HisCys: 0.0 ± 0.0
0.638HisAsp: 0.638 ± 0.479
0.957HisGlu: 0.957 ± 0.499
0.957HisPhe: 0.957 ± 0.669
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.638HisIle: 0.638 ± 0.628
1.914HisLys: 1.914 ± 0.858
1.914HisLeu: 1.914 ± 0.968
0.0HisMet: 0.0 ± 0.0
0.638HisAsn: 0.638 ± 0.421
0.957HisPro: 0.957 ± 0.585
0.957HisGln: 0.957 ± 0.678
0.957HisArg: 0.957 ± 0.486
0.957HisSer: 0.957 ± 0.713
0.957HisThr: 0.957 ± 0.433
1.595HisVal: 1.595 ± 0.86
0.0HisTrp: 0.0 ± 0.0
0.319HisTyr: 0.319 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
5.424IleAla: 5.424 ± 1.209
0.957IleCys: 0.957 ± 0.554
5.105IleAsp: 5.105 ± 1.467
3.51IleGlu: 3.51 ± 0.983
3.191IlePhe: 3.191 ± 1.0
4.467IleGly: 4.467 ± 0.866
1.595IleHis: 1.595 ± 0.727
6.701IleIle: 6.701 ± 2.332
7.339IleLys: 7.339 ± 1.873
8.296IleLeu: 8.296 ± 1.511
1.276IleMet: 1.276 ± 0.67
8.296IleAsn: 8.296 ± 1.547
3.51IlePro: 3.51 ± 1.497
2.234IleGln: 2.234 ± 0.721
4.786IleArg: 4.786 ± 0.809
6.701IleSer: 6.701 ± 1.621
5.743IleThr: 5.743 ± 1.501
5.105IleVal: 5.105 ± 1.508
0.638IleTrp: 0.638 ± 0.378
3.191IleTyr: 3.191 ± 0.792
0.0IleXaa: 0.0 ± 0.0
Lys
4.148LysAla: 4.148 ± 1.434
0.319LysCys: 0.319 ± 0.31
6.063LysAsp: 6.063 ± 1.486
8.934LysGlu: 8.934 ± 1.825
1.595LysPhe: 1.595 ± 0.633
2.234LysGly: 2.234 ± 0.721
1.914LysHis: 1.914 ± 0.856
7.658LysIle: 7.658 ± 1.088
8.615LysLys: 8.615 ± 1.692
5.424LysLeu: 5.424 ± 0.903
1.914LysMet: 1.914 ± 0.799
7.339LysAsn: 7.339 ± 1.725
4.148LysPro: 4.148 ± 1.247
2.872LysGln: 2.872 ± 0.941
4.467LysArg: 4.467 ± 1.268
5.424LysSer: 5.424 ± 1.27
4.786LysThr: 4.786 ± 1.38
4.467LysVal: 4.467 ± 0.861
0.638LysTrp: 0.638 ± 0.487
5.105LysTyr: 5.105 ± 1.039
0.0LysXaa: 0.0 ± 0.0
Leu
7.339LeuAla: 7.339 ± 1.548
0.957LeuCys: 0.957 ± 0.805
6.382LeuAsp: 6.382 ± 1.236
7.977LeuGlu: 7.977 ± 1.628
4.786LeuPhe: 4.786 ± 1.363
5.743LeuGly: 5.743 ± 1.4
0.319LeuHis: 0.319 ± 0.297
9.253LeuIle: 9.253 ± 2.696
10.53LeuLys: 10.53 ± 1.799
14.678LeuLeu: 14.678 ± 2.34
1.914LeuMet: 1.914 ± 0.622
6.382LeuAsn: 6.382 ± 1.268
2.553LeuPro: 2.553 ± 1.072
4.148LeuGln: 4.148 ± 1.235
1.276LeuArg: 1.276 ± 0.472
5.424LeuSer: 5.424 ± 0.99
6.063LeuThr: 6.063 ± 1.73
4.148LeuVal: 4.148 ± 1.392
0.957LeuTrp: 0.957 ± 0.573
5.743LeuTyr: 5.743 ± 0.873
0.0LeuXaa: 0.0 ± 0.0
Met
1.276MetAla: 1.276 ± 0.668
0.319MetCys: 0.319 ± 0.342
0.957MetAsp: 0.957 ± 0.632
1.914MetGlu: 1.914 ± 0.746
0.957MetPhe: 0.957 ± 0.706
0.638MetGly: 0.638 ± 0.479
0.0MetHis: 0.0 ± 0.0
1.595MetIle: 1.595 ± 0.96
2.553MetLys: 2.553 ± 0.946
0.957MetLeu: 0.957 ± 0.489
0.0MetMet: 0.0 ± 0.0
1.276MetAsn: 1.276 ± 0.627
1.276MetPro: 1.276 ± 0.486
0.638MetGln: 0.638 ± 0.45
0.319MetArg: 0.319 ± 0.297
1.914MetSer: 1.914 ± 0.725
0.957MetThr: 0.957 ± 0.51
1.276MetVal: 1.276 ± 0.623
0.0MetTrp: 0.0 ± 0.0
0.319MetTyr: 0.319 ± 0.314
0.0MetXaa: 0.0 ± 0.0
Asn
3.51AsnAla: 3.51 ± 0.831
0.319AsnCys: 0.319 ± 0.297
6.063AsnAsp: 6.063 ± 1.768
3.191AsnGlu: 3.191 ± 0.848
2.553AsnPhe: 2.553 ± 1.157
5.424AsnGly: 5.424 ± 2.284
1.595AsnHis: 1.595 ± 0.64
4.148AsnIle: 4.148 ± 0.882
5.105AsnLys: 5.105 ± 1.417
6.063AsnLeu: 6.063 ± 1.135
1.914AsnMet: 1.914 ± 0.721
4.786AsnAsn: 4.786 ± 1.044
2.234AsnPro: 2.234 ± 0.844
2.553AsnGln: 2.553 ± 1.123
3.191AsnArg: 3.191 ± 0.913
2.872AsnSer: 2.872 ± 0.917
2.553AsnThr: 2.553 ± 1.17
2.872AsnVal: 2.872 ± 0.749
0.319AsnTrp: 0.319 ± 0.33
2.872AsnTyr: 2.872 ± 1.012
0.0AsnXaa: 0.0 ± 0.0
Pro
0.638ProAla: 0.638 ± 0.502
0.319ProCys: 0.319 ± 0.283
0.957ProAsp: 0.957 ± 0.573
3.51ProGlu: 3.51 ± 1.212
1.595ProPhe: 1.595 ± 0.599
0.319ProGly: 0.319 ± 0.33
0.0ProHis: 0.0 ± 0.0
2.872ProIle: 2.872 ± 0.709
4.148ProLys: 4.148 ± 1.322
3.191ProLeu: 3.191 ± 1.144
1.276ProMet: 1.276 ± 0.491
0.638ProAsn: 0.638 ± 0.411
0.319ProPro: 0.319 ± 0.297
1.914ProGln: 1.914 ± 0.63
2.553ProArg: 2.553 ± 1.04
0.957ProSer: 0.957 ± 0.493
2.234ProThr: 2.234 ± 1.008
1.914ProVal: 1.914 ± 0.584
0.0ProTrp: 0.0 ± 0.0
2.234ProTyr: 2.234 ± 0.955
0.0ProXaa: 0.0 ± 0.0
Gln
2.553GlnAla: 2.553 ± 1.061
0.319GlnCys: 0.319 ± 0.394
0.319GlnAsp: 0.319 ± 0.31
3.829GlnGlu: 3.829 ± 1.068
2.553GlnPhe: 2.553 ± 0.874
0.638GlnGly: 0.638 ± 0.393
0.319GlnHis: 0.319 ± 0.342
3.191GlnIle: 3.191 ± 1.058
1.914GlnLys: 1.914 ± 0.961
5.743GlnLeu: 5.743 ± 1.453
0.638GlnMet: 0.638 ± 0.491
1.276GlnAsn: 1.276 ± 0.525
1.276GlnPro: 1.276 ± 0.725
2.234GlnGln: 2.234 ± 0.683
2.234GlnArg: 2.234 ± 0.958
2.234GlnSer: 2.234 ± 0.942
3.51GlnThr: 3.51 ± 0.73
1.914GlnVal: 1.914 ± 0.518
0.319GlnTrp: 0.319 ± 0.268
2.234GlnTyr: 2.234 ± 0.856
0.0GlnXaa: 0.0 ± 0.0
Arg
1.595ArgAla: 1.595 ± 0.653
0.0ArgCys: 0.0 ± 0.0
3.51ArgAsp: 3.51 ± 0.76
2.234ArgGlu: 2.234 ± 0.638
1.914ArgPhe: 1.914 ± 1.023
2.234ArgGly: 2.234 ± 0.929
0.957ArgHis: 0.957 ± 0.513
2.872ArgIle: 2.872 ± 1.09
2.872ArgLys: 2.872 ± 0.758
4.786ArgLeu: 4.786 ± 1.293
0.319ArgMet: 0.319 ± 0.35
2.553ArgAsn: 2.553 ± 0.738
2.234ArgPro: 2.234 ± 0.744
2.872ArgGln: 2.872 ± 1.285
1.914ArgArg: 1.914 ± 0.934
1.914ArgSer: 1.914 ± 0.692
2.872ArgThr: 2.872 ± 1.009
2.234ArgVal: 2.234 ± 0.889
0.319ArgTrp: 0.319 ± 0.332
1.276ArgTyr: 1.276 ± 0.531
0.0ArgXaa: 0.0 ± 0.0
Ser
2.234SerAla: 2.234 ± 1.127
0.638SerCys: 0.638 ± 0.418
3.191SerAsp: 3.191 ± 0.922
6.382SerGlu: 6.382 ± 1.054
3.191SerPhe: 3.191 ± 0.835
1.595SerGly: 1.595 ± 0.749
0.957SerHis: 0.957 ± 0.55
6.063SerIle: 6.063 ± 1.521
6.701SerLys: 6.701 ± 1.264
6.063SerLeu: 6.063 ± 0.898
1.276SerMet: 1.276 ± 0.646
3.191SerAsn: 3.191 ± 0.618
0.957SerPro: 0.957 ± 0.528
2.553SerGln: 2.553 ± 1.281
2.553SerArg: 2.553 ± 0.79
3.191SerSer: 3.191 ± 0.908
4.148SerThr: 4.148 ± 1.321
1.276SerVal: 1.276 ± 0.638
0.638SerTrp: 0.638 ± 0.518
4.467SerTyr: 4.467 ± 1.127
0.0SerXaa: 0.0 ± 0.0
Thr
3.51ThrAla: 3.51 ± 1.453
0.319ThrCys: 0.319 ± 0.283
4.786ThrAsp: 4.786 ± 1.086
3.191ThrGlu: 3.191 ± 1.004
1.276ThrPhe: 1.276 ± 0.405
3.829ThrGly: 3.829 ± 1.064
1.595ThrHis: 1.595 ± 0.879
5.424ThrIle: 5.424 ± 1.296
4.148ThrLys: 4.148 ± 1.151
4.786ThrLeu: 4.786 ± 1.611
1.276ThrMet: 1.276 ± 0.582
2.872ThrAsn: 2.872 ± 1.31
2.553ThrPro: 2.553 ± 0.812
2.234ThrGln: 2.234 ± 0.55
3.191ThrArg: 3.191 ± 1.029
2.872ThrSer: 2.872 ± 0.885
2.872ThrThr: 2.872 ± 1.004
3.829ThrVal: 3.829 ± 1.016
0.638ThrTrp: 0.638 ± 0.375
3.191ThrTyr: 3.191 ± 1.023
0.0ThrXaa: 0.0 ± 0.0
Val
2.872ValAla: 2.872 ± 0.706
0.319ValCys: 0.319 ± 0.268
1.595ValAsp: 1.595 ± 0.593
1.914ValGlu: 1.914 ± 0.761
3.829ValPhe: 3.829 ± 0.876
1.595ValGly: 1.595 ± 0.625
0.638ValHis: 0.638 ± 0.444
2.553ValIle: 2.553 ± 0.947
4.148ValLys: 4.148 ± 0.816
5.105ValLeu: 5.105 ± 1.103
1.276ValMet: 1.276 ± 0.574
4.148ValAsn: 4.148 ± 1.128
1.595ValPro: 1.595 ± 0.596
1.276ValGln: 1.276 ± 0.5
0.957ValArg: 0.957 ± 0.623
5.424ValSer: 5.424 ± 1.637
7.02ValThr: 7.02 ± 1.792
1.914ValVal: 1.914 ± 1.005
0.638ValTrp: 0.638 ± 0.474
1.276ValTyr: 1.276 ± 0.549
0.0ValXaa: 0.0 ± 0.0
Trp
0.319TrpAla: 0.319 ± 0.314
0.0TrpCys: 0.0 ± 0.0
0.638TrpAsp: 0.638 ± 0.458
0.319TrpGlu: 0.319 ± 0.332
0.638TrpPhe: 0.638 ± 0.381
0.319TrpGly: 0.319 ± 0.268
0.0TrpHis: 0.0 ± 0.0
0.638TrpIle: 0.638 ± 0.475
0.638TrpLys: 0.638 ± 0.363
1.914TrpLeu: 1.914 ± 0.575
0.319TrpMet: 0.319 ± 0.342
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.319TrpGln: 0.319 ± 0.268
0.638TrpArg: 0.638 ± 0.411
1.914TrpSer: 1.914 ± 0.804
0.0TrpThr: 0.0 ± 0.0
0.638TrpVal: 0.638 ± 0.392
0.0TrpTrp: 0.0 ± 0.0
0.319TrpTyr: 0.319 ± 0.268
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.957TyrAla: 0.957 ± 0.51
0.319TyrCys: 0.319 ± 0.297
3.829TyrAsp: 3.829 ± 0.981
4.467TyrGlu: 4.467 ± 0.774
1.276TyrPhe: 1.276 ± 0.53
2.234TyrGly: 2.234 ± 0.845
0.638TyrHis: 0.638 ± 0.363
4.467TyrIle: 4.467 ± 1.035
4.786TyrLys: 4.786 ± 1.098
4.467TyrLeu: 4.467 ± 0.919
1.276TyrMet: 1.276 ± 0.566
2.872TyrAsn: 2.872 ± 0.88
1.595TyrPro: 1.595 ± 0.535
2.872TyrGln: 2.872 ± 0.944
2.872TyrArg: 2.872 ± 1.059
4.148TyrSer: 4.148 ± 1.218
1.595TyrThr: 1.595 ± 0.473
0.957TyrVal: 0.957 ± 0.486
0.957TyrTrp: 0.957 ± 0.793
3.51TyrTyr: 3.51 ± 0.779
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 18 proteins (3135 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski