Amino acid dipepetide frequency for Streptococcus satellite phage Javan376

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.0AlaAla: 0.0 ± 0.0
0.583AlaCys: 0.583 ± 0.482
1.166AlaAsp: 1.166 ± 0.529
3.499AlaGlu: 3.499 ± 1.65
0.583AlaPhe: 0.583 ± 0.566
3.499AlaGly: 3.499 ± 1.791
0.0AlaHis: 0.0 ± 0.0
2.332AlaIle: 2.332 ± 0.774
7.58AlaLys: 7.58 ± 1.686
6.997AlaLeu: 6.997 ± 2.208
1.749AlaMet: 1.749 ± 0.941
2.915AlaAsn: 2.915 ± 1.83
0.0AlaPro: 0.0 ± 0.0
1.166AlaGln: 1.166 ± 0.692
3.499AlaArg: 3.499 ± 0.957
6.414AlaSer: 6.414 ± 1.679
4.665AlaThr: 4.665 ± 1.261
1.749AlaVal: 1.749 ± 0.91
1.166AlaTrp: 1.166 ± 0.53
3.499AlaTyr: 3.499 ± 1.791
0.0AlaXaa: 0.0 ± 0.0
Cys
1.166CysAla: 1.166 ± 0.579
0.0CysCys: 0.0 ± 0.0
0.583CysAsp: 0.583 ± 0.51
0.583CysGlu: 0.583 ± 0.444
0.0CysPhe: 0.0 ± 0.0
0.583CysGly: 0.583 ± 0.444
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.583CysLeu: 0.583 ± 0.482
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.583CysGln: 0.583 ± 0.444
0.583CysArg: 0.583 ± 0.482
0.583CysSer: 0.583 ± 0.566
0.583CysThr: 0.583 ± 0.566
0.583CysVal: 0.583 ± 0.444
0.583CysTrp: 0.583 ± 0.482
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.749AspAla: 1.749 ± 1.076
0.583AspCys: 0.583 ± 0.444
1.749AspAsp: 1.749 ± 1.332
4.082AspGlu: 4.082 ± 1.52
2.332AspPhe: 2.332 ± 1.297
0.0AspGly: 0.0 ± 0.0
0.0AspHis: 0.0 ± 0.0
4.082AspIle: 4.082 ± 1.26
4.082AspLys: 4.082 ± 1.21
5.831AspLeu: 5.831 ± 1.897
1.166AspMet: 1.166 ± 0.752
2.915AspAsn: 2.915 ± 1.494
1.749AspPro: 1.749 ± 0.849
2.332AspGln: 2.332 ± 0.327
4.665AspArg: 4.665 ± 1.344
2.915AspSer: 2.915 ± 1.67
1.749AspThr: 1.749 ± 0.765
2.915AspVal: 2.915 ± 1.582
0.583AspTrp: 0.583 ± 0.444
2.332AspTyr: 2.332 ± 1.398
0.0AspXaa: 0.0 ± 0.0
Glu
6.997GluAla: 6.997 ± 2.56
1.749GluCys: 1.749 ± 0.529
1.749GluAsp: 1.749 ± 0.913
8.163GluGlu: 8.163 ± 2.449
2.915GluPhe: 2.915 ± 1.684
3.499GluGly: 3.499 ± 0.711
1.166GluHis: 1.166 ± 1.019
8.163GluIle: 8.163 ± 2.747
7.58GluLys: 7.58 ± 1.879
9.913GluLeu: 9.913 ± 3.79
4.082GluMet: 4.082 ± 1.663
6.414GluAsn: 6.414 ± 1.126
0.583GluPro: 0.583 ± 0.51
2.332GluGln: 2.332 ± 0.809
5.248GluArg: 5.248 ± 2.739
1.166GluSer: 1.166 ± 0.529
6.414GluThr: 6.414 ± 2.204
4.665GluVal: 4.665 ± 0.927
2.332GluTrp: 2.332 ± 1.368
3.499GluTyr: 3.499 ± 1.348
0.0GluXaa: 0.0 ± 0.0
Phe
0.583PheAla: 0.583 ± 0.482
0.0PheCys: 0.0 ± 0.0
2.915PheAsp: 2.915 ± 1.67
2.915PheGlu: 2.915 ± 0.804
3.499PhePhe: 3.499 ± 1.542
2.332PheGly: 2.332 ± 0.959
0.583PheHis: 0.583 ± 0.482
1.749PheIle: 1.749 ± 0.913
3.499PheLys: 3.499 ± 1.28
4.665PheLeu: 4.665 ± 1.516
1.749PheMet: 1.749 ± 1.529
2.332PheAsn: 2.332 ± 1.359
0.583PhePro: 0.583 ± 0.444
1.749PheGln: 1.749 ± 1.034
1.166PheArg: 1.166 ± 0.529
2.915PheSer: 2.915 ± 0.804
5.831PheThr: 5.831 ± 1.403
2.915PheVal: 2.915 ± 1.108
0.583PheTrp: 0.583 ± 0.444
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
3.499GlyAla: 3.499 ± 1.048
0.583GlyCys: 0.583 ± 0.482
4.082GlyAsp: 4.082 ± 1.048
4.082GlyGlu: 4.082 ± 1.074
2.915GlyPhe: 2.915 ± 0.804
2.332GlyGly: 2.332 ± 0.931
1.166GlyHis: 1.166 ± 0.964
2.332GlyIle: 2.332 ± 0.932
6.414GlyLys: 6.414 ± 1.095
6.997GlyLeu: 6.997 ± 1.376
0.583GlyMet: 0.583 ± 0.444
1.166GlyAsn: 1.166 ± 0.53
0.0GlyPro: 0.0 ± 0.0
2.332GlyGln: 2.332 ± 1.449
2.332GlyArg: 2.332 ± 0.689
2.332GlySer: 2.332 ± 1.061
0.583GlyThr: 0.583 ± 0.482
5.831GlyVal: 5.831 ± 0.933
0.583GlyTrp: 0.583 ± 0.444
2.332GlyTyr: 2.332 ± 0.689
0.0GlyXaa: 0.0 ± 0.0
His
1.749HisAla: 1.749 ± 0.94
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
2.332HisGlu: 2.332 ± 0.802
0.583HisPhe: 0.583 ± 0.482
0.583HisGly: 0.583 ± 0.482
0.0HisHis: 0.0 ± 0.0
0.583HisIle: 0.583 ± 0.482
0.583HisLys: 0.583 ± 0.51
1.166HisLeu: 1.166 ± 0.692
0.0HisMet: 0.0 ± 0.0
1.166HisAsn: 1.166 ± 0.579
0.583HisPro: 0.583 ± 0.566
0.583HisGln: 0.583 ± 0.444
0.0HisArg: 0.0 ± 0.0
0.583HisSer: 0.583 ± 0.482
0.583HisThr: 0.583 ± 0.482
0.583HisVal: 0.583 ± 0.51
0.0HisTrp: 0.0 ± 0.0
0.583HisTyr: 0.583 ± 0.566
0.0HisXaa: 0.0 ± 0.0
Ile
2.915IleAla: 2.915 ± 1.374
0.583IleCys: 0.583 ± 0.566
4.082IleAsp: 4.082 ± 0.559
6.997IleGlu: 6.997 ± 2.609
1.166IlePhe: 1.166 ± 0.529
1.749IleGly: 1.749 ± 0.61
1.166IleHis: 1.166 ± 0.53
4.665IleIle: 4.665 ± 0.849
5.831IleLys: 5.831 ± 1.976
5.248IleLeu: 5.248 ± 1.79
0.583IleMet: 0.583 ± 0.482
1.749IleAsn: 1.749 ± 0.936
1.749IlePro: 1.749 ± 0.61
2.915IleGln: 2.915 ± 0.958
4.082IleArg: 4.082 ± 0.869
5.248IleSer: 5.248 ± 2.005
2.915IleThr: 2.915 ± 0.941
4.665IleVal: 4.665 ± 0.849
1.166IleTrp: 1.166 ± 1.016
1.749IleTyr: 1.749 ± 0.946
0.0IleXaa: 0.0 ± 0.0
Lys
6.414LysAla: 6.414 ± 2.243
0.0LysCys: 0.0 ± 0.0
6.414LysAsp: 6.414 ± 1.461
11.662LysGlu: 11.662 ± 3.446
2.332LysPhe: 2.332 ± 0.765
4.665LysGly: 4.665 ± 1.427
0.0LysHis: 0.0 ± 0.0
9.329LysIle: 9.329 ± 1.902
14.577LysLys: 14.577 ± 2.66
6.414LysLeu: 6.414 ± 0.791
2.332LysMet: 2.332 ± 1.223
5.248LysAsn: 5.248 ± 1.226
1.749LysPro: 1.749 ± 1.476
5.248LysGln: 5.248 ± 0.737
3.499LysArg: 3.499 ± 1.048
5.248LysSer: 5.248 ± 1.378
7.58LysThr: 7.58 ± 2.045
6.414LysVal: 6.414 ± 2.645
0.583LysTrp: 0.583 ± 0.482
3.499LysTyr: 3.499 ± 1.542
0.0LysXaa: 0.0 ± 0.0
Leu
5.248LeuAla: 5.248 ± 1.448
0.0LeuCys: 0.0 ± 0.0
5.248LeuAsp: 5.248 ± 1.386
8.746LeuGlu: 8.746 ± 2.648
5.248LeuPhe: 5.248 ± 0.966
6.414LeuGly: 6.414 ± 1.863
1.166LeuHis: 1.166 ± 0.529
1.166LeuIle: 1.166 ± 0.865
11.079LeuLys: 11.079 ± 1.684
6.414LeuLeu: 6.414 ± 1.581
1.749LeuMet: 1.749 ± 1.436
10.496LeuAsn: 10.496 ± 1.257
3.499LeuPro: 3.499 ± 1.442
3.499LeuGln: 3.499 ± 1.929
3.499LeuArg: 3.499 ± 0.738
4.665LeuSer: 4.665 ± 1.868
8.163LeuThr: 8.163 ± 2.261
5.831LeuVal: 5.831 ± 1.289
0.583LeuTrp: 0.583 ± 0.482
4.082LeuTyr: 4.082 ± 1.301
0.0LeuXaa: 0.0 ± 0.0
Met
1.166MetAla: 1.166 ± 1.019
0.0MetCys: 0.0 ± 0.0
0.583MetAsp: 0.583 ± 0.566
4.665MetGlu: 4.665 ± 2.472
0.583MetPhe: 0.583 ± 0.51
0.583MetGly: 0.583 ± 0.719
0.0MetHis: 0.0 ± 0.0
0.583MetIle: 0.583 ± 0.51
1.749MetLys: 1.749 ± 0.946
2.332MetLeu: 2.332 ± 0.945
0.583MetMet: 0.583 ± 0.719
1.749MetAsn: 1.749 ± 1.144
0.0MetPro: 0.0 ± 0.0
1.749MetGln: 1.749 ± 0.529
0.583MetArg: 0.583 ± 0.563
1.166MetSer: 1.166 ± 0.53
2.332MetThr: 2.332 ± 1.159
2.332MetVal: 2.332 ± 1.228
0.0MetTrp: 0.0 ± 0.0
0.583MetTyr: 0.583 ± 0.444
0.0MetXaa: 0.0 ± 0.0
Asn
1.166AsnAla: 1.166 ± 0.53
0.0AsnCys: 0.0 ± 0.0
3.499AsnAsp: 3.499 ± 1.543
2.915AsnGlu: 2.915 ± 1.249
2.332AsnPhe: 2.332 ± 1.297
4.665AsnGly: 4.665 ± 1.199
0.583AsnHis: 0.583 ± 0.566
3.499AsnIle: 3.499 ± 1.666
5.831AsnLys: 5.831 ± 1.347
7.58AsnLeu: 7.58 ± 0.805
1.166AsnMet: 1.166 ± 1.039
5.248AsnAsn: 5.248 ± 1.704
1.749AsnPro: 1.749 ± 1.105
1.166AsnGln: 1.166 ± 0.529
3.499AsnArg: 3.499 ± 1.592
5.248AsnSer: 5.248 ± 1.477
4.665AsnThr: 4.665 ± 1.93
1.749AsnVal: 1.749 ± 0.941
1.749AsnTrp: 1.749 ± 1.034
3.499AsnTyr: 3.499 ± 2.706
0.0AsnXaa: 0.0 ± 0.0
Pro
1.166ProAla: 1.166 ± 0.964
0.0ProCys: 0.0 ± 0.0
1.749ProAsp: 1.749 ± 1.105
2.915ProGlu: 2.915 ± 1.108
0.583ProPhe: 0.583 ± 0.482
0.0ProGly: 0.0 ± 0.0
0.0ProHis: 0.0 ± 0.0
1.749ProIle: 1.749 ± 1.19
1.166ProLys: 1.166 ± 0.529
1.166ProLeu: 1.166 ± 0.617
1.166ProMet: 1.166 ± 0.875
3.499ProAsn: 3.499 ± 1.109
2.332ProPro: 2.332 ± 1.297
0.0ProGln: 0.0 ± 0.0
0.583ProArg: 0.583 ± 0.482
2.915ProSer: 2.915 ± 1.136
3.499ProThr: 3.499 ± 1.043
1.749ProVal: 1.749 ± 1.144
0.0ProTrp: 0.0 ± 0.0
2.332ProTyr: 2.332 ± 0.802
0.0ProXaa: 0.0 ± 0.0
Gln
1.749GlnAla: 1.749 ± 0.61
0.0GlnCys: 0.0 ± 0.0
0.583GlnAsp: 0.583 ± 0.51
4.665GlnGlu: 4.665 ± 1.529
0.583GlnPhe: 0.583 ± 0.444
3.499GlnGly: 3.499 ± 0.638
0.583GlnHis: 0.583 ± 0.482
1.166GlnIle: 1.166 ± 1.019
4.082GlnLys: 4.082 ± 1.248
0.583GlnLeu: 0.583 ± 0.719
1.749GlnMet: 1.749 ± 0.979
1.749GlnAsn: 1.749 ± 0.603
1.166GlnPro: 1.166 ± 0.53
4.082GlnGln: 4.082 ± 2.142
1.166GlnArg: 1.166 ± 0.958
1.749GlnSer: 1.749 ± 0.979
1.749GlnThr: 1.749 ± 1.495
2.332GlnVal: 2.332 ± 1.371
0.0GlnTrp: 0.0 ± 0.0
3.499GlnTyr: 3.499 ± 1.587
0.0GlnXaa: 0.0 ± 0.0
Arg
4.665ArgAla: 4.665 ± 1.139
0.0ArgCys: 0.0 ± 0.0
2.915ArgAsp: 2.915 ± 0.894
4.082ArgGlu: 4.082 ± 1.01
4.082ArgPhe: 4.082 ± 1.296
1.749ArgGly: 1.749 ± 0.849
2.915ArgHis: 2.915 ± 1.212
2.332ArgIle: 2.332 ± 0.803
4.665ArgLys: 4.665 ± 1.468
5.831ArgLeu: 5.831 ± 1.809
1.749ArgMet: 1.749 ± 0.768
1.749ArgAsn: 1.749 ± 1.034
1.749ArgPro: 1.749 ± 1.221
0.583ArgGln: 0.583 ± 0.482
4.082ArgArg: 4.082 ± 1.907
1.166ArgSer: 1.166 ± 0.579
1.749ArgThr: 1.749 ± 0.529
0.583ArgVal: 0.583 ± 0.444
0.0ArgTrp: 0.0 ± 0.0
2.332ArgTyr: 2.332 ± 0.765
0.0ArgXaa: 0.0 ± 0.0
Ser
0.583SerAla: 0.583 ± 0.482
1.166SerCys: 1.166 ± 0.53
4.665SerAsp: 4.665 ± 1.458
2.332SerGlu: 2.332 ± 0.945
2.915SerPhe: 2.915 ± 0.822
3.499SerGly: 3.499 ± 0.618
0.0SerHis: 0.0 ± 0.0
5.248SerIle: 5.248 ± 2.323
6.414SerLys: 6.414 ± 2.025
5.248SerLeu: 5.248 ± 1.289
0.583SerMet: 0.583 ± 0.482
1.749SerAsn: 1.749 ± 1.446
4.082SerPro: 4.082 ± 1.274
1.166SerGln: 1.166 ± 0.579
2.332SerArg: 2.332 ± 0.809
4.082SerSer: 4.082 ± 1.313
2.332SerThr: 2.332 ± 1.175
4.665SerVal: 4.665 ± 1.26
0.0SerTrp: 0.0 ± 0.0
2.915SerTyr: 2.915 ± 1.274
0.0SerXaa: 0.0 ± 0.0
Thr
2.915ThrAla: 2.915 ± 1.44
0.0ThrCys: 0.0 ± 0.0
2.915ThrAsp: 2.915 ± 1.84
3.499ThrGlu: 3.499 ± 1.827
1.749ThrPhe: 1.749 ± 0.849
5.248ThrGly: 5.248 ± 1.322
1.166ThrHis: 1.166 ± 0.529
6.414ThrIle: 6.414 ± 2.408
4.082ThrLys: 4.082 ± 1.078
9.329ThrLeu: 9.329 ± 2.584
1.166ThrMet: 1.166 ± 0.964
4.082ThrAsn: 4.082 ± 2.004
4.665ThrPro: 4.665 ± 0.946
2.332ThrGln: 2.332 ± 0.983
1.749ThrArg: 1.749 ± 0.831
2.332ThrSer: 2.332 ± 0.948
3.499ThrThr: 3.499 ± 1.666
4.665ThrVal: 4.665 ± 1.364
0.583ThrTrp: 0.583 ± 0.51
2.332ThrTyr: 2.332 ± 0.854
0.0ThrXaa: 0.0 ± 0.0
Val
5.831ValAla: 5.831 ± 1.898
0.583ValCys: 0.583 ± 0.51
2.915ValAsp: 2.915 ± 0.673
7.58ValGlu: 7.58 ± 2.192
3.499ValPhe: 3.499 ± 0.543
1.749ValGly: 1.749 ± 0.835
1.166ValHis: 1.166 ± 0.608
4.665ValIle: 4.665 ± 1.843
6.414ValLys: 6.414 ± 0.904
3.499ValLeu: 3.499 ± 0.543
0.583ValMet: 0.583 ± 0.51
3.499ValAsn: 3.499 ± 0.618
1.166ValPro: 1.166 ± 0.579
0.583ValGln: 0.583 ± 0.566
2.332ValArg: 2.332 ± 0.984
2.915ValSer: 2.915 ± 1.507
2.915ValThr: 2.915 ± 0.864
2.332ValVal: 2.332 ± 0.689
1.166ValTrp: 1.166 ± 2.036
2.915ValTyr: 2.915 ± 1.155
0.0ValXaa: 0.0 ± 0.0
Trp
1.166TrpAla: 1.166 ± 0.529
0.583TrpCys: 0.583 ± 0.444
0.0TrpAsp: 0.0 ± 0.0
1.749TrpGlu: 1.749 ± 0.874
0.583TrpPhe: 0.583 ± 0.51
1.166TrpGly: 1.166 ± 0.888
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.749TrpLys: 1.749 ± 1.19
2.332TrpLeu: 2.332 ± 1.22
0.0TrpMet: 0.0 ± 0.0
1.166TrpAsn: 1.166 ± 2.036
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.583TrpArg: 0.583 ± 0.444
0.583TrpSer: 0.583 ± 0.482
1.166TrpThr: 1.166 ± 0.53
0.0TrpVal: 0.0 ± 0.0
0.583TrpTrp: 0.583 ± 0.482
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.332TyrAla: 2.332 ± 1.928
0.583TyrCys: 0.583 ± 0.482
0.0TyrAsp: 0.0 ± 0.0
1.166TyrGlu: 1.166 ± 0.687
4.082TyrPhe: 4.082 ± 2.292
4.665TyrGly: 4.665 ± 1.705
0.583TyrHis: 0.583 ± 0.566
1.166TyrIle: 1.166 ± 0.617
5.831TyrLys: 5.831 ± 1.692
5.248TyrLeu: 5.248 ± 0.809
0.0TyrMet: 0.0 ± 0.0
2.915TyrAsn: 2.915 ± 1.71
1.166TyrPro: 1.166 ± 0.617
2.332TyrGln: 2.332 ± 1.449
3.499TyrArg: 3.499 ± 1.47
1.749TyrSer: 1.749 ± 0.979
1.749TyrThr: 1.749 ± 1.157
1.749TyrVal: 1.749 ± 1.476
1.166TyrTrp: 1.166 ± 1.132
1.166TyrTyr: 1.166 ± 0.958
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1716 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski