Amino acid dipeptide frequency for Streptococcus satellite phage Javan378

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
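The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the toy sequences, and the choice to pool overlapping dipeptides over all proteins and divide by the total number of dipeptides are assumptions, and the database may normalize differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)

# Expectation if all 400 standard dipeptides were equally likely: 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return the frequency (in per mille) of all 441 dipeptides.

    Assumption: overlapping dipeptides are counted within each protein
    (never across protein boundaries) and pooled over the whole proteome.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard alphabet to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a dipeptide as over- or underrepresented versus the expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Hypothetical toy proteome, used only to exercise the functions.
toy = ["MKKLE", "AKK"]
freqs = dipeptide_permille(toy)
print(len(freqs), round(freqs["KK"], 3), classify(freqs["KK"]))
```

With real data, the list passed to dipeptide_permille would hold the 9 protein sequences of this proteome.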

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
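As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and a percentage, and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the input value is the AlaLys entry from the table.

```python
# Uniform expectation for 400 equally likely dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


def fold_over_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_PERMILLE


ala_lys = 10.215  # AlaLys value taken from the table, in per mille
print(permille_to_fraction(ala_lys))
print(permille_to_percent(ala_lys))
print(fold_over_uniform(ala_lys))
```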

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.0AlaCys: 0.0 ± 0.0
2.554AlaAsp: 2.554 ± 0.703
3.064AlaGlu: 3.064 ± 1.862
2.554AlaPhe: 2.554 ± 0.96
4.086AlaGly: 4.086 ± 0.948
0.0AlaHis: 0.0 ± 0.0
3.575AlaIle: 3.575 ± 1.645
10.215AlaLys: 10.215 ± 1.381
8.682AlaLeu: 8.682 ± 1.107
1.021AlaMet: 1.021 ± 0.62
1.532AlaAsn: 1.532 ± 0.69
1.532AlaPro: 1.532 ± 1.153
1.021AlaGln: 1.021 ± 0.67
1.532AlaArg: 1.532 ± 1.394
3.575AlaSer: 3.575 ± 1.103
5.107AlaThr: 5.107 ± 1.423
3.064AlaVal: 3.064 ± 1.242
1.021AlaTrp: 1.021 ± 0.462
1.021AlaTyr: 1.021 ± 0.818
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.511CysAsp: 0.511 ± 0.519
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.511CysGly: 0.511 ± 0.519
0.0CysHis: 0.0 ± 0.0
1.021CysIle: 1.021 ± 0.767
0.511CysLys: 0.511 ± 0.519
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.511CysArg: 0.511 ± 0.465
0.511CysSer: 0.511 ± 0.384
0.0CysThr: 0.0 ± 0.0
0.511CysVal: 0.511 ± 0.465
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
2.043AspAsp: 2.043 ± 1.149
4.597AspGlu: 4.597 ± 1.804
3.064AspPhe: 3.064 ± 1.121
2.043AspGly: 2.043 ± 1.164
0.0AspHis: 0.0 ± 0.0
5.618AspIle: 5.618 ± 1.663
7.15AspLys: 7.15 ± 1.172
5.107AspLeu: 5.107 ± 0.921
1.021AspMet: 1.021 ± 0.866
2.554AspAsn: 2.554 ± 0.723
2.554AspPro: 2.554 ± 0.915
3.064AspGln: 3.064 ± 0.541
2.043AspArg: 2.043 ± 1.269
5.618AspSer: 5.618 ± 1.279
5.107AspThr: 5.107 ± 2.085
3.064AspVal: 3.064 ± 1.216
1.021AspTrp: 1.021 ± 1.17
4.597AspTyr: 4.597 ± 1.939
0.0AspXaa: 0.0 ± 0.0
Glu
3.575GluAla: 3.575 ± 1.549
0.0GluCys: 0.0 ± 0.0
4.086GluAsp: 4.086 ± 1.461
4.086GluGlu: 4.086 ± 1.824
3.575GluPhe: 3.575 ± 1.316
1.532GluGly: 1.532 ± 0.704
0.511GluHis: 0.511 ± 0.384
6.129GluIle: 6.129 ± 2.105
10.725GluLys: 10.725 ± 3.385
8.682GluLeu: 8.682 ± 1.393
1.021GluMet: 1.021 ± 0.812
5.107GluAsn: 5.107 ± 1.93
2.043GluPro: 2.043 ± 1.107
5.107GluGln: 5.107 ± 1.889
3.575GluArg: 3.575 ± 2.602
6.129GluSer: 6.129 ± 1.621
3.064GluThr: 3.064 ± 1.162
5.618GluVal: 5.618 ± 2.208
0.511GluTrp: 0.511 ± 0.465
2.043GluTyr: 2.043 ± 1.543
0.0GluXaa: 0.0 ± 0.0
Phe
1.532PheAla: 1.532 ± 1.133
0.511PheCys: 0.511 ± 0.465
2.554PheAsp: 2.554 ± 0.973
6.129PheGlu: 6.129 ± 1.417
2.554PhePhe: 2.554 ± 1.215
1.532PheGly: 1.532 ± 0.858
1.021PheHis: 1.021 ± 0.462
3.575PheIle: 3.575 ± 1.222
3.575PheLys: 3.575 ± 1.299
6.639PheLeu: 6.639 ± 2.146
0.511PheMet: 0.511 ± 0.697
2.043PheAsn: 2.043 ± 1.269
0.511PhePro: 0.511 ± 0.384
0.511PheGln: 0.511 ± 0.384
1.021PheArg: 1.021 ± 0.768
3.575PheSer: 3.575 ± 1.053
1.532PheThr: 1.532 ± 0.843
1.021PheVal: 1.021 ± 0.681
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.064GlyAla: 3.064 ± 1.688
1.021GlyCys: 1.021 ± 0.462
2.043GlyAsp: 2.043 ± 0.942
4.086GlyGlu: 4.086 ± 2.34
3.575GlyPhe: 3.575 ± 1.299
4.086GlyGly: 4.086 ± 1.983
0.511GlyHis: 0.511 ± 0.465
2.554GlyIle: 2.554 ± 0.909
3.064GlyLys: 3.064 ± 1.079
4.597GlyLeu: 4.597 ± 1.565
1.021GlyMet: 1.021 ± 0.739
2.043GlyAsn: 2.043 ± 0.968
0.511GlyPro: 0.511 ± 0.585
1.021GlyGln: 1.021 ± 0.462
2.043GlyArg: 2.043 ± 1.045
3.575GlySer: 3.575 ± 0.926
1.532GlyThr: 1.532 ± 1.153
3.064GlyVal: 3.064 ± 0.752
0.511GlyTrp: 0.511 ± 0.697
2.043GlyTyr: 2.043 ± 1.109
0.0GlyXaa: 0.0 ± 0.0
His
1.532HisAla: 1.532 ± 0.873
0.511HisCys: 0.511 ± 0.519
0.0HisAsp: 0.0 ± 0.0
1.021HisGlu: 1.021 ± 0.462
2.043HisPhe: 2.043 ± 0.828
0.511HisGly: 0.511 ± 0.465
0.0HisHis: 0.0 ± 0.0
1.021HisIle: 1.021 ± 0.462
0.0HisLys: 0.0 ± 0.0
1.532HisLeu: 1.532 ± 0.529
0.0HisMet: 0.0 ± 0.0
1.021HisAsn: 1.021 ± 0.767
0.0HisPro: 0.0 ± 0.0
0.511HisGln: 0.511 ± 0.57
1.021HisArg: 1.021 ± 0.929
1.532HisSer: 1.532 ± 1.127
1.021HisThr: 1.021 ± 0.462
0.511HisVal: 0.511 ± 0.384
0.511HisTrp: 0.511 ± 0.519
1.532HisTyr: 1.532 ± 1.068
0.0HisXaa: 0.0 ± 0.0
Ile
5.107IleAla: 5.107 ± 1.821
0.0IleCys: 0.0 ± 0.0
10.215IleAsp: 10.215 ± 2.804
5.618IleGlu: 5.618 ± 2.651
1.532IlePhe: 1.532 ± 0.778
3.064IleGly: 3.064 ± 1.017
1.532IleHis: 1.532 ± 0.909
4.597IleIle: 4.597 ± 1.379
8.172IleLys: 8.172 ± 2.639
5.618IleLeu: 5.618 ± 1.454
0.511IleMet: 0.511 ± 0.578
4.597IleAsn: 4.597 ± 1.206
1.532IlePro: 1.532 ± 0.711
3.064IleGln: 3.064 ± 0.928
2.043IleArg: 2.043 ± 1.458
6.639IleSer: 6.639 ± 2.376
4.086IleThr: 4.086 ± 1.625
1.532IleVal: 1.532 ± 0.846
0.511IleTrp: 0.511 ± 0.384
6.129IleTyr: 6.129 ± 2.136
0.0IleXaa: 0.0 ± 0.0
Lys
7.15LysAla: 7.15 ± 1.938
0.511LysCys: 0.511 ± 0.519
5.107LysAsp: 5.107 ± 1.112
11.747LysGlu: 11.747 ± 2.578
2.554LysPhe: 2.554 ± 1.105
3.575LysGly: 3.575 ± 1.21
2.043LysHis: 2.043 ± 1.001
12.768LysIle: 12.768 ± 1.76
9.193LysLys: 9.193 ± 2.079
8.682LysLeu: 8.682 ± 1.866
3.064LysMet: 3.064 ± 1.877
7.661LysAsn: 7.661 ± 1.375
1.021LysPro: 1.021 ± 0.553
5.618LysGln: 5.618 ± 2.042
5.618LysArg: 5.618 ± 1.221
9.193LysSer: 9.193 ± 1.824
8.172LysThr: 8.172 ± 1.964
4.597LysVal: 4.597 ± 1.568
0.511LysTrp: 0.511 ± 0.657
1.532LysTyr: 1.532 ± 0.837
0.0LysXaa: 0.0 ± 0.0
Leu
6.639LeuAla: 6.639 ± 1.424
0.511LeuCys: 0.511 ± 0.384
9.193LeuAsp: 9.193 ± 1.78
9.704LeuGlu: 9.704 ± 4.19
3.064LeuPhe: 3.064 ± 1.117
4.086LeuGly: 4.086 ± 1.565
1.021LeuHis: 1.021 ± 0.462
6.129LeuIle: 6.129 ± 2.12
12.257LeuLys: 12.257 ± 2.374
8.682LeuLeu: 8.682 ± 2.913
2.043LeuMet: 2.043 ± 0.914
7.15LeuAsn: 7.15 ± 1.483
2.554LeuPro: 2.554 ± 1.586
4.597LeuGln: 4.597 ± 1.982
3.064LeuArg: 3.064 ± 1.251
8.172LeuSer: 8.172 ± 1.948
5.618LeuThr: 5.618 ± 1.412
4.086LeuVal: 4.086 ± 1.211
0.0LeuTrp: 0.0 ± 0.0
3.064LeuTyr: 3.064 ± 0.85
0.0LeuXaa: 0.0 ± 0.0
Met
1.532MetAla: 1.532 ± 1.394
0.0MetCys: 0.0 ± 0.0
1.532MetAsp: 1.532 ± 0.739
2.554MetGlu: 2.554 ± 0.955
0.511MetPhe: 0.511 ± 0.561
0.511MetGly: 0.511 ± 0.384
0.0MetHis: 0.0 ± 0.0
1.532MetIle: 1.532 ± 0.843
1.532MetLys: 1.532 ± 0.834
1.021MetLeu: 1.021 ± 1.17
0.511MetMet: 0.511 ± 0.465
3.575MetAsn: 3.575 ± 1.604
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
0.511MetSer: 0.511 ± 0.384
1.021MetThr: 1.021 ± 0.707
1.021MetVal: 1.021 ± 0.611
0.0MetTrp: 0.0 ± 0.0
1.532MetTyr: 1.532 ± 0.926
0.0MetXaa: 0.0 ± 0.0
Asn
3.064AsnAla: 3.064 ± 0.963
0.0AsnCys: 0.0 ± 0.0
3.064AsnAsp: 3.064 ± 1.004
3.575AsnGlu: 3.575 ± 0.527
2.554AsnPhe: 2.554 ± 1.307
5.618AsnGly: 5.618 ± 1.301
1.021AsnHis: 1.021 ± 1.037
5.618AsnIle: 5.618 ± 0.924
7.661AsnLys: 7.661 ± 1.714
3.575AsnLeu: 3.575 ± 1.239
0.0AsnMet: 0.0 ± 0.0
5.107AsnAsn: 5.107 ± 1.091
2.043AsnPro: 2.043 ± 0.683
3.064AsnGln: 3.064 ± 1.242
4.086AsnArg: 4.086 ± 1.25
3.575AsnSer: 3.575 ± 1.009
2.554AsnThr: 2.554 ± 0.822
2.554AsnVal: 2.554 ± 0.902
2.043AsnTrp: 2.043 ± 0.838
1.532AsnTyr: 1.532 ± 0.843
0.0AsnXaa: 0.0 ± 0.0
Pro
2.043ProAla: 2.043 ± 1.281
0.511ProCys: 0.511 ± 0.519
2.043ProAsp: 2.043 ± 1.19
1.532ProGlu: 1.532 ± 0.766
0.511ProPhe: 0.511 ± 0.519
0.511ProGly: 0.511 ± 0.57
1.021ProHis: 1.021 ± 0.553
1.532ProIle: 1.532 ± 1.048
4.086ProLys: 4.086 ± 1.113
1.532ProLeu: 1.532 ± 1.011
0.0ProMet: 0.0 ± 0.0
2.554ProAsn: 2.554 ± 0.629
0.511ProPro: 0.511 ± 0.561
0.0ProGln: 0.0 ± 0.0
0.511ProArg: 0.511 ± 0.384
0.0ProSer: 0.0 ± 0.0
1.021ProThr: 1.021 ± 0.708
2.043ProVal: 2.043 ± 1.107
0.0ProTrp: 0.0 ± 0.0
1.021ProTyr: 1.021 ± 0.708
0.0ProXaa: 0.0 ± 0.0
Gln
3.064GlnAla: 3.064 ± 1.671
0.0GlnCys: 0.0 ± 0.0
1.021GlnAsp: 1.021 ± 0.611
1.532GlnGlu: 1.532 ± 0.93
1.532GlnPhe: 1.532 ± 0.843
2.043GlnGly: 2.043 ± 1.074
1.021GlnHis: 1.021 ± 0.67
3.064GlnIle: 3.064 ± 1.317
4.597GlnLys: 4.597 ± 1.273
5.107GlnLeu: 5.107 ± 1.256
1.021GlnMet: 1.021 ± 0.682
1.532GlnAsn: 1.532 ± 0.876
1.021GlnPro: 1.021 ± 1.122
3.064GlnGln: 3.064 ± 1.38
1.532GlnArg: 1.532 ± 0.78
1.532GlnSer: 1.532 ± 0.922
3.064GlnThr: 3.064 ± 0.747
3.064GlnVal: 3.064 ± 0.802
0.0GlnTrp: 0.0 ± 0.0
2.043GlnTyr: 2.043 ± 0.812
0.0GlnXaa: 0.0 ± 0.0
Arg
1.021ArgAla: 1.021 ± 0.86
0.0ArgCys: 0.0 ± 0.0
1.021ArgAsp: 1.021 ± 0.929
0.511ArgGlu: 0.511 ± 0.384
2.043ArgPhe: 2.043 ± 0.977
1.532ArgGly: 1.532 ± 1.177
1.532ArgHis: 1.532 ± 0.562
3.575ArgIle: 3.575 ± 1.074
3.575ArgLys: 3.575 ± 1.246
5.618ArgLeu: 5.618 ± 2.607
0.511ArgMet: 0.511 ± 0.657
2.554ArgAsn: 2.554 ± 1.118
1.021ArgPro: 1.021 ± 0.462
0.511ArgGln: 0.511 ± 0.465
1.021ArgArg: 1.021 ± 0.889
2.554ArgSer: 2.554 ± 1.303
5.107ArgThr: 5.107 ± 1.81
0.511ArgVal: 0.511 ± 0.384
1.021ArgTrp: 1.021 ± 0.707
1.532ArgTyr: 1.532 ± 0.631
0.0ArgXaa: 0.0 ± 0.0
Ser
4.086SerAla: 4.086 ± 1.38
0.511SerCys: 0.511 ± 0.384
4.086SerAsp: 4.086 ± 1.272
5.618SerGlu: 5.618 ± 1.683
3.575SerPhe: 3.575 ± 1.329
2.554SerGly: 2.554 ± 1.569
2.554SerHis: 2.554 ± 1.114
4.597SerIle: 4.597 ± 1.277
6.639SerLys: 6.639 ± 1.984
8.172SerLeu: 8.172 ± 2.071
1.532SerMet: 1.532 ± 1.352
6.129SerAsn: 6.129 ± 2.315
1.532SerPro: 1.532 ± 0.758
2.554SerGln: 2.554 ± 1.053
0.511SerArg: 0.511 ± 0.519
3.575SerSer: 3.575 ± 1.056
1.532SerThr: 1.532 ± 0.894
3.575SerVal: 3.575 ± 1.566
0.511SerTrp: 0.511 ± 0.465
5.618SerTyr: 5.618 ± 2.011
0.0SerXaa: 0.0 ± 0.0
Thr
5.107ThrAla: 5.107 ± 3.466
0.0ThrCys: 0.0 ± 0.0
3.064ThrAsp: 3.064 ± 1.162
6.129ThrGlu: 6.129 ± 2.896
1.532ThrPhe: 1.532 ± 0.963
2.554ThrGly: 2.554 ± 0.932
1.021ThrHis: 1.021 ± 0.462
4.597ThrIle: 4.597 ± 0.994
5.107ThrLys: 5.107 ± 1.383
7.661ThrLeu: 7.661 ± 2.042
2.043ThrMet: 2.043 ± 1.334
2.043ThrAsn: 2.043 ± 0.638
1.021ThrPro: 1.021 ± 0.576
2.043ThrGln: 2.043 ± 1.262
1.532ThrArg: 1.532 ± 0.631
2.043ThrSer: 2.043 ± 0.683
2.554ThrThr: 2.554 ± 0.68
3.064ThrVal: 3.064 ± 1.067
0.0ThrTrp: 0.0 ± 0.0
3.575ThrTyr: 3.575 ± 1.213
0.0ThrXaa: 0.0 ± 0.0
Val
1.532ValAla: 1.532 ± 0.926
0.0ValCys: 0.0 ± 0.0
5.107ValAsp: 5.107 ± 2.023
3.575ValGlu: 3.575 ± 0.883
1.532ValPhe: 1.532 ± 0.711
1.532ValGly: 1.532 ± 0.911
0.511ValHis: 0.511 ± 0.465
3.064ValIle: 3.064 ± 1.169
3.575ValLys: 3.575 ± 1.057
4.597ValLeu: 4.597 ± 1.14
2.043ValMet: 2.043 ± 0.937
2.554ValAsn: 2.554 ± 1.629
2.043ValPro: 2.043 ± 1.154
2.043ValGln: 2.043 ± 1.498
2.554ValArg: 2.554 ± 1.601
3.575ValSer: 3.575 ± 1.193
2.554ValThr: 2.554 ± 0.834
4.597ValVal: 4.597 ± 1.071
0.0ValTrp: 0.0 ± 0.0
3.575ValTyr: 3.575 ± 0.911
0.0ValXaa: 0.0 ± 0.0
Trp
2.554TrpAla: 2.554 ± 0.89
0.0TrpCys: 0.0 ± 0.0
0.511TrpAsp: 0.511 ± 0.657
2.043TrpGlu: 2.043 ± 1.388
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.532TrpLys: 1.532 ± 0.963
1.021TrpLeu: 1.021 ± 0.812
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.021TrpThr: 1.021 ± 0.462
0.0TrpVal: 0.0 ± 0.0
0.511TrpTrp: 0.511 ± 0.465
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.554TyrAla: 2.554 ± 1.068
0.0TyrCys: 0.0 ± 0.0
0.511TyrAsp: 0.511 ± 0.384
0.511TyrGlu: 0.511 ± 0.384
2.043TyrPhe: 2.043 ± 1.053
4.086TyrGly: 4.086 ± 1.027
0.511TyrHis: 0.511 ± 0.57
1.532TyrIle: 1.532 ± 0.685
7.15TyrLys: 7.15 ± 1.751
5.107TyrLeu: 5.107 ± 0.772
0.511TyrMet: 0.511 ± 0.384
2.554TyrAsn: 2.554 ± 0.71
1.532TyrPro: 1.532 ± 0.687
3.064TyrGln: 3.064 ± 1.337
2.554TyrArg: 2.554 ± 1.245
3.575TyrSer: 3.575 ± 1.398
1.021TyrThr: 1.021 ± 0.781
3.064TyrVal: 3.064 ± 1.451
0.511TyrTrp: 0.511 ± 0.384
3.064TyrTyr: 3.064 ± 0.989
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1959 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
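A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of the 100 replicates, and the sample standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide, pooled over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by protein-level bootstrap.

    Assumption: the error is the sample standard deviation over replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(values)


# Hypothetical toy proteome, used only to exercise the functions.
toy = ["MKKLE", "AKKA", "MLEKK"]
print(dipeptide_permille(toy, "KK"), bootstrap_error(toy, "KK"))
```

A dipeptide that never occurs has a frequency of 0 in every replicate, which is consistent with the "0.0 ± 0.0" entries in the table.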

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski