Amino acid dipeptide frequency for Streptococcus satellite phage Javan373

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present with a frequency of 0.25%. This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
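The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names (`three_letter`, `dipeptide_per_mille`, `mark`), the toy sequences, and the choice to count only dipeptides inside each protein (never across protein boundaries) are all hypothetical. The page also does not state the exact threshold used for the red and blue marking, so the uniform expectation of 2.5‰ (0.25%) is assumed here.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
LABELS = list(ONE_TO_THREE.values()) + ["Xaa"]

# Expected value if all 400 standard dipeptides were equally likely.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def three_letter(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return all 441 dipeptide frequencies in per mille.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        codes = [three_letter(r) for r in seq]
        for first, second in zip(codes, codes[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(LABELS, repeat=2)
    }


def mark(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Assumed colouring rule: red above the expectation, blue below."""
    if value_per_mille > expected:
        return "red"
    if value_per_mille < expected:
        return "blue"
    return "unmarked"


# Toy input, not the Javan373 proteome.
toy_proteins = ["MKKLLE", "MAEKX"]
freqs = dipeptide_per_mille(toy_proteins)
print(freqs["LysLys"], mark(freqs["LysLys"]))
print(freqs["AlaAla"] * 1e-3)  # per mille converted back to a fraction
```

Because every possible pair is listed, including those that never occur, the result has the same 21 × 21 shape as the table and its values sum to 1000‰.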

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 0.0 ± 0.0
AlaCys: 0.476 ± 0.443
AlaAsp: 2.378 ± 0.789
AlaGlu: 0.951 ± 0.476
AlaPhe: 2.378 ± 1.624
AlaGly: 2.853 ± 1.334
AlaHis: 0.476 ± 0.414
AlaIle: 3.329 ± 0.651
AlaLys: 5.706 ± 2.052
AlaLeu: 6.182 ± 1.004
AlaMet: 0.476 ± 0.413
AlaAsn: 1.902 ± 1.771
AlaPro: 0.0 ± 0.0
AlaGln: 0.951 ± 0.829
AlaArg: 3.804 ± 0.784
AlaSer: 2.853 ± 0.958
AlaThr: 4.28 ± 1.57
AlaVal: 3.329 ± 1.165
AlaTrp: 0.476 ± 0.413
AlaTyr: 3.804 ± 1.483
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.476 ± 0.443
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.476 ± 0.413
CysPhe: 0.0 ± 0.0
CysGly: 0.476 ± 0.413
CysHis: 0.0 ± 0.0
CysIle: 0.951 ± 0.52
CysLys: 0.0 ± 0.0
CysLeu: 0.476 ± 0.443
CysMet: 0.0 ± 0.0
CysAsn: 0.476 ± 0.473
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.476 ± 0.443
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.476 ± 0.443
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.951 ± 0.654
AspCys: 0.0 ± 0.0
AspAsp: 1.902 ± 1.251
AspGlu: 2.378 ± 1.56
AspPhe: 5.231 ± 1.551
AspGly: 2.378 ± 1.464
AspHis: 0.476 ± 0.414
AspIle: 7.133 ± 0.96
AspLys: 6.182 ± 1.425
AspLeu: 6.657 ± 1.352
AspMet: 0.951 ± 0.607
AspAsn: 3.804 ± 1.296
AspPro: 2.853 ± 0.838
AspGln: 1.902 ± 0.857
AspArg: 3.804 ± 1.473
AspSer: 3.329 ± 1.879
AspThr: 0.476 ± 0.413
AspVal: 0.951 ± 0.51
AspTrp: 0.951 ± 0.58
AspTyr: 4.28 ± 1.863
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.706 ± 1.34
GluCys: 0.951 ± 0.53
GluAsp: 4.755 ± 1.388
GluGlu: 7.133 ± 2.232
GluPhe: 2.853 ± 1.867
GluGly: 1.902 ± 0.91
GluHis: 0.476 ± 0.414
GluIle: 5.231 ± 1.669
GluLys: 11.888 ± 3.944
GluLeu: 8.559 ± 2.117
GluMet: 2.853 ± 1.212
GluAsn: 5.231 ± 0.685
GluPro: 0.476 ± 0.56
GluGln: 4.28 ± 1.087
GluArg: 5.706 ± 2.727
GluSer: 4.28 ± 2.049
GluThr: 6.182 ± 1.938
GluVal: 3.804 ± 2.126
GluTrp: 1.902 ± 0.899
GluTyr: 2.378 ± 0.98
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.902 ± 1.191
PheCys: 0.0 ± 0.0
PheAsp: 3.329 ± 1.392
PheGlu: 1.902 ± 0.722
PhePhe: 3.804 ± 1.245
PheGly: 2.378 ± 0.844
PheHis: 0.951 ± 0.51
PheIle: 1.427 ± 0.788
PheLys: 5.706 ± 1.681
PheLeu: 7.133 ± 1.547
PheMet: 0.951 ± 0.616
PheAsn: 2.853 ± 1.193
PhePro: 0.951 ± 0.58
PheGln: 1.902 ± 0.904
PheArg: 0.951 ± 0.53
PheSer: 2.378 ± 0.542
PheThr: 3.804 ± 1.43
PheVal: 4.755 ± 1.485
PheTrp: 0.476 ± 0.413
PheTyr: 0.476 ± 0.498
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.329 ± 0.853
GlyCys: 0.476 ± 0.443
GlyAsp: 3.329 ± 0.676
GlyGlu: 6.657 ± 1.232
GlyPhe: 1.427 ± 0.479
GlyGly: 2.378 ± 1.177
GlyHis: 1.427 ± 0.901
GlyIle: 5.706 ± 1.866
GlyLys: 7.608 ± 1.72
GlyLeu: 6.182 ± 1.457
GlyMet: 0.951 ± 0.481
GlyAsn: 0.951 ± 0.58
GlyPro: 0.0 ± 0.0
GlyGln: 2.853 ± 1.209
GlyArg: 1.902 ± 0.735
GlySer: 2.378 ± 0.844
GlyThr: 0.476 ± 0.443
GlyVal: 4.28 ± 1.208
GlyTrp: 1.427 ± 0.874
GlyTyr: 2.378 ± 0.806
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.951 ± 0.885
HisCys: 0.0 ± 0.0
HisAsp: 0.951 ± 0.654
HisGlu: 1.427 ± 0.81
HisPhe: 0.476 ± 0.443
HisGly: 0.476 ± 0.443
HisHis: 0.0 ± 0.0
HisIle: 0.476 ± 0.443
HisLys: 0.476 ± 0.498
HisLeu: 1.427 ± 0.79
HisMet: 0.476 ± 0.414
HisAsn: 0.476 ± 0.443
HisPro: 0.476 ± 0.414
HisGln: 0.0 ± 0.0
HisArg: 0.0 ± 0.0
HisSer: 1.427 ± 0.599
HisThr: 0.476 ± 0.443
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.427 ± 1.009
IleCys: 0.476 ± 0.473
IleAsp: 2.853 ± 0.734
IleGlu: 6.182 ± 1.875
IlePhe: 5.231 ± 1.481
IleGly: 2.853 ± 1.398
IleHis: 0.951 ± 0.52
IleIle: 5.706 ± 1.761
IleLys: 5.706 ± 2.286
IleLeu: 6.182 ± 1.774
IleMet: 0.951 ± 0.672
IleAsn: 3.804 ± 1.339
IlePro: 2.378 ± 0.923
IleGln: 3.804 ± 1.177
IleArg: 2.853 ± 1.068
IleSer: 7.608 ± 1.67
IleThr: 5.231 ± 1.158
IleVal: 3.804 ± 1.304
IleTrp: 1.902 ± 0.987
IleTyr: 0.951 ± 0.668
IleXaa: 0.0 ± 0.0

Lys
LysAla: 7.133 ± 2.412
LysCys: 0.0 ± 0.0
LysAsp: 4.28 ± 1.214
LysGlu: 10.461 ± 3.383
LysPhe: 4.755 ± 1.455
LysGly: 5.706 ± 1.175
LysHis: 0.0 ± 0.0
LysIle: 10.937 ± 3.032
LysLys: 10.937 ± 1.142
LysLeu: 8.559 ± 1.374
LysMet: 1.902 ± 1.173
LysAsn: 6.657 ± 1.507
LysPro: 2.378 ± 1.007
LysGln: 4.28 ± 0.796
LysArg: 2.853 ± 1.028
LysSer: 2.378 ± 1.26
LysThr: 6.182 ± 1.898
LysVal: 6.657 ± 2.074
LysTrp: 0.951 ± 0.568
LysTyr: 4.28 ± 1.708
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.804 ± 1.137
LeuCys: 0.0 ± 0.0
LeuAsp: 6.182 ± 0.528
LeuGlu: 12.363 ± 3.979
LeuPhe: 3.329 ± 1.195
LeuGly: 5.231 ± 1.755
LeuHis: 1.427 ± 0.607
LeuIle: 3.329 ± 1.989
LeuLys: 8.559 ± 1.337
LeuLeu: 8.559 ± 1.635
LeuMet: 0.476 ± 0.405
LeuAsn: 9.986 ± 1.595
LeuPro: 3.804 ± 1.382
LeuGln: 5.231 ± 1.794
LeuArg: 4.28 ± 1.045
LeuSer: 7.133 ± 2.815
LeuThr: 7.608 ± 2.324
LeuVal: 5.706 ± 1.894
LeuTrp: 0.951 ± 0.568
LeuTyr: 4.755 ± 1.532
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 0.951 ± 0.765
MetGlu: 1.902 ± 1.635
MetPhe: 0.0 ± 0.0
MetGly: 0.476 ± 0.527
MetHis: 0.0 ± 0.0
MetIle: 1.902 ± 1.147
MetLys: 1.427 ± 1.006
MetLeu: 0.951 ± 0.51
MetMet: 1.427 ± 1.3
MetAsn: 1.902 ± 0.874
MetPro: 0.0 ± 0.0
MetGln: 0.951 ± 0.53
MetArg: 0.476 ± 0.56
MetSer: 0.476 ± 0.413
MetThr: 1.427 ± 0.861
MetVal: 1.902 ± 1.185
MetTrp: 0.0 ± 0.0
MetTyr: 0.951 ± 0.58
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.329 ± 1.697
AsnCys: 0.476 ± 0.473
AsnAsp: 5.706 ± 1.642
AsnGlu: 1.902 ± 0.91
AsnPhe: 1.427 ± 0.81
AsnGly: 7.133 ± 1.149
AsnHis: 0.0 ± 0.0
AsnIle: 4.28 ± 1.52
AsnLys: 5.706 ± 1.101
AsnLeu: 8.084 ± 1.173
AsnMet: 0.951 ± 0.841
AsnAsn: 4.28 ± 1.702
AsnPro: 2.378 ± 0.921
AsnGln: 1.427 ± 0.479
AsnArg: 3.804 ± 1.432
AsnSer: 4.28 ± 1.548
AsnThr: 4.755 ± 1.869
AsnVal: 0.951 ± 0.589
AsnTrp: 1.902 ± 0.904
AsnTyr: 1.902 ± 1.043
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.951 ± 0.885
ProCys: 0.0 ± 0.0
ProAsp: 1.902 ± 0.873
ProGlu: 2.378 ± 0.754
ProPhe: 1.427 ± 0.607
ProGly: 0.951 ± 0.947
ProHis: 0.476 ± 0.414
ProIle: 2.378 ± 0.984
ProLys: 2.378 ± 0.844
ProLeu: 2.853 ± 0.839
ProMet: 0.476 ± 0.527
ProAsn: 2.378 ± 1.19
ProPro: 1.427 ± 1.238
ProGln: 0.0 ± 0.0
ProArg: 0.951 ± 0.51
ProSer: 2.378 ± 0.94
ProThr: 1.902 ± 0.873
ProVal: 0.951 ± 0.51
ProTrp: 0.0 ± 0.0
ProTyr: 1.427 ± 0.837
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.951 ± 0.51
GlnCys: 0.0 ± 0.0
GlnAsp: 1.902 ± 1.307
GlnGlu: 6.657 ± 1.396
GlnPhe: 0.951 ± 0.476
GlnGly: 3.804 ± 1.366
GlnHis: 0.476 ± 0.443
GlnIle: 1.427 ± 1.102
GlnLys: 3.329 ± 1.433
GlnLeu: 2.853 ± 1.792
GlnMet: 0.476 ± 0.443
GlnAsn: 2.378 ± 0.866
GlnPro: 0.476 ± 0.413
GlnGln: 2.378 ± 2.213
GlnArg: 1.427 ± 0.77
GlnSer: 0.951 ± 0.568
GlnThr: 2.853 ± 1.407
GlnVal: 2.853 ± 1.548
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.378 ± 1.391
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.329 ± 1.496
ArgCys: 0.0 ± 0.0
ArgAsp: 3.329 ± 1.675
ArgGlu: 3.329 ± 1.277
ArgPhe: 2.853 ± 1.356
ArgGly: 1.427 ± 0.636
ArgHis: 0.476 ± 0.443
ArgIle: 2.853 ± 1.063
ArgLys: 4.28 ± 1.144
ArgLeu: 6.182 ± 1.349
ArgMet: 1.427 ± 0.929
ArgAsn: 1.902 ± 0.997
ArgPro: 1.427 ± 0.949
ArgGln: 2.378 ± 0.658
ArgArg: 2.853 ± 2.007
ArgSer: 1.427 ± 0.76
ArgThr: 1.902 ± 0.569
ArgVal: 0.476 ± 0.413
ArgTrp: 0.476 ± 0.498
ArgTyr: 3.329 ± 0.592
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.902 ± 1.191
SerCys: 0.476 ± 0.413
SerAsp: 5.706 ± 1.309
SerGlu: 5.231 ± 2.0
SerPhe: 4.28 ± 0.961
SerGly: 3.804 ± 1.037
SerHis: 0.0 ± 0.0
SerIle: 1.902 ± 0.877
SerLys: 5.706 ± 1.283
SerLeu: 4.755 ± 1.258
SerMet: 0.476 ± 0.443
SerAsn: 3.329 ± 1.669
SerPro: 2.378 ± 1.264
SerGln: 1.902 ± 1.043
SerArg: 1.902 ± 0.497
SerSer: 5.231 ± 1.585
SerThr: 0.951 ± 0.589
SerVal: 3.804 ± 1.337
SerTrp: 0.0 ± 0.0
SerTyr: 2.853 ± 1.378
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.378 ± 1.265
ThrCys: 0.0 ± 0.0
ThrAsp: 4.28 ± 1.714
ThrGlu: 3.804 ± 1.465
ThrPhe: 2.378 ± 0.846
ThrGly: 4.28 ± 1.241
ThrHis: 0.951 ± 0.53
ThrIle: 4.755 ± 2.305
ThrLys: 4.28 ± 1.167
ThrLeu: 7.133 ± 1.717
ThrMet: 1.427 ± 0.938
ThrAsn: 3.329 ± 0.679
ThrPro: 3.329 ± 1.327
ThrGln: 1.902 ± 0.865
ThrArg: 0.951 ± 0.6
ThrSer: 1.902 ± 0.731
ThrThr: 3.804 ± 1.486
ThrVal: 2.853 ± 1.007
ThrTrp: 0.476 ± 0.414
ThrTyr: 2.378 ± 0.944
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.755 ± 1.652
ValCys: 0.0 ± 0.0
ValAsp: 2.378 ± 1.278
ValGlu: 4.755 ± 2.143
ValPhe: 1.902 ± 1.06
ValGly: 2.378 ± 0.844
ValHis: 0.476 ± 0.443
ValIle: 3.329 ± 0.87
ValLys: 3.804 ± 1.036
ValLeu: 3.329 ± 0.841
ValMet: 0.0 ± 0.0
ValAsn: 3.804 ± 0.958
ValPro: 1.902 ± 1.043
ValGln: 0.951 ± 0.616
ValArg: 3.329 ± 1.577
ValSer: 3.804 ± 1.498
ValThr: 3.804 ± 1.179
ValVal: 4.28 ± 1.221
ValTrp: 0.951 ± 1.408
ValTyr: 2.853 ± 1.037
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.951 ± 0.53
TrpCys: 0.476 ± 0.413
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.378 ± 0.955
TrpPhe: 0.0 ± 0.0
TrpGly: 1.427 ± 0.788
TrpHis: 0.476 ± 0.56
TrpIle: 0.476 ± 0.414
TrpLys: 0.951 ± 0.734
TrpLeu: 1.902 ± 1.226
TrpMet: 0.0 ± 0.0
TrpAsn: 1.427 ± 1.509
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 0.476 ± 0.413
TrpSer: 0.951 ± 0.568
TrpThr: 0.476 ± 0.413
TrpVal: 0.0 ± 0.0
TrpTrp: 0.476 ± 0.443
TrpTyr: 0.951 ± 0.765
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.378 ± 1.81
TyrCys: 0.476 ± 0.443
TyrAsp: 0.476 ± 0.414
TyrGlu: 4.28 ± 2.28
TyrPhe: 3.329 ± 1.325
TyrGly: 3.804 ± 1.033
TyrHis: 0.476 ± 0.498
TyrIle: 3.329 ± 1.715
TyrLys: 6.657 ± 1.358
TyrLeu: 4.755 ± 1.461
TyrMet: 0.0 ± 0.0
TyrAsn: 3.804 ± 1.611
TyrPro: 0.951 ± 0.476
TyrGln: 1.427 ± 0.599
TyrArg: 2.853 ± 1.422
TyrSer: 1.427 ± 0.817
TyrThr: 0.476 ± 0.473
TyrVal: 1.427 ± 0.987
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.427 ± 0.942
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 9 proteins (2104 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
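As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and reports the spread of the resulting dipeptide frequencies. The page does not say which spread measure or which random procedure was used, so the population standard deviation, the fixed seed, the function names (`dipeptide_fraction`, `bootstrap_error`) and the toy sequences are all assumptions made for this example. One-letter codes are used for brevity.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_fraction(resample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input, not the Javan373 proteome.
toy_proteins = ["MKKLLE", "MAEKK", "MKK"]
print(dipeptide_fraction(toy_proteins, "KK"), bootstrap_error(toy_proteins, "KK"))
print(bootstrap_error(toy_proteins, "WW"))  # absent dipeptide: no spread
```

A dipeptide that occurs in none of the proteins has a frequency of zero in every resample, which matches the 0.0 ± 0.0 entries in the table.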

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski