Amino acid dipepetide frequency for Streptococcus satellite phage Javan736

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.169AlaAla: 1.169 ± 0.635
0.292AlaCys: 0.292 ± 0.248
2.63AlaAsp: 2.63 ± 0.802
6.137AlaGlu: 6.137 ± 1.128
3.799AlaPhe: 3.799 ± 1.232
2.338AlaGly: 2.338 ± 0.671
0.584AlaHis: 0.584 ± 0.434
3.214AlaIle: 3.214 ± 0.8
4.968AlaLys: 4.968 ± 0.691
5.845AlaLeu: 5.845 ± 1.234
2.63AlaMet: 2.63 ± 1.14
2.63AlaAsn: 2.63 ± 0.882
1.169AlaPro: 1.169 ± 0.522
2.63AlaGln: 2.63 ± 0.852
4.968AlaArg: 4.968 ± 1.2
1.753AlaSer: 1.753 ± 0.607
3.507AlaThr: 3.507 ± 0.924
3.799AlaVal: 3.799 ± 0.726
0.584AlaTrp: 0.584 ± 0.421
2.046AlaTyr: 2.046 ± 0.812
0.0AlaXaa: 0.0 ± 0.0
Cys
0.584CysAla: 0.584 ± 0.384
0.0CysCys: 0.0 ± 0.0
0.584CysAsp: 0.584 ± 0.41
0.0CysGlu: 0.0 ± 0.0
0.292CysPhe: 0.292 ± 0.276
0.877CysGly: 0.877 ± 0.631
0.292CysHis: 0.292 ± 0.224
0.0CysIle: 0.0 ± 0.0
0.877CysLys: 0.877 ± 0.531
1.461CysLeu: 1.461 ± 0.638
0.0CysMet: 0.0 ± 0.0
0.584CysAsn: 0.584 ± 0.496
0.584CysPro: 0.584 ± 0.38
0.292CysGln: 0.292 ± 0.285
0.584CysArg: 0.584 ± 0.416
0.0CysSer: 0.0 ± 0.0
0.0CysThr: 0.0 ± 0.0
0.292CysVal: 0.292 ± 0.314
0.0CysTrp: 0.0 ± 0.0
0.584CysTyr: 0.584 ± 0.467
0.0CysXaa: 0.0 ± 0.0
Asp
0.877AspAla: 0.877 ± 0.449
0.877AspCys: 0.877 ± 0.62
3.507AspAsp: 3.507 ± 1.074
4.383AspGlu: 4.383 ± 1.121
3.507AspPhe: 3.507 ± 1.212
3.214AspGly: 3.214 ± 0.904
0.292AspHis: 0.292 ± 0.273
5.552AspIle: 5.552 ± 1.25
5.552AspLys: 5.552 ± 1.274
6.137AspLeu: 6.137 ± 1.879
1.461AspMet: 1.461 ± 0.559
2.046AspAsn: 2.046 ± 0.898
0.584AspPro: 0.584 ± 0.345
0.877AspGln: 0.877 ± 0.454
3.799AspArg: 3.799 ± 0.908
2.046AspSer: 2.046 ± 0.697
2.63AspThr: 2.63 ± 0.669
2.922AspVal: 2.922 ± 0.908
0.584AspTrp: 0.584 ± 0.334
4.383AspTyr: 4.383 ± 1.241
0.0AspXaa: 0.0 ± 0.0
Glu
5.26GluAla: 5.26 ± 1.268
1.169GluCys: 1.169 ± 0.542
3.799GluAsp: 3.799 ± 1.17
7.013GluGlu: 7.013 ± 1.18
2.63GluPhe: 2.63 ± 0.948
3.799GluGly: 3.799 ± 1.114
2.046GluHis: 2.046 ± 0.584
6.429GluIle: 6.429 ± 1.388
7.306GluLys: 7.306 ± 1.264
8.475GluLeu: 8.475 ± 1.477
2.338GluMet: 2.338 ± 0.796
6.721GluAsn: 6.721 ± 1.581
2.922GluPro: 2.922 ± 0.872
4.091GluGln: 4.091 ± 1.148
5.552GluArg: 5.552 ± 1.301
5.845GluSer: 5.845 ± 1.376
4.968GluThr: 4.968 ± 1.186
3.799GluVal: 3.799 ± 1.097
0.877GluTrp: 0.877 ± 0.51
4.383GluTyr: 4.383 ± 1.006
0.0GluXaa: 0.0 ± 0.0
Phe
1.461PheAla: 1.461 ± 0.739
0.0PheCys: 0.0 ± 0.0
3.214PheAsp: 3.214 ± 0.854
3.799PheGlu: 3.799 ± 1.331
2.046PhePhe: 2.046 ± 1.011
2.046PheGly: 2.046 ± 0.697
0.292PheHis: 0.292 ± 0.224
2.338PheIle: 2.338 ± 0.747
4.091PheLys: 4.091 ± 1.014
4.968PheLeu: 4.968 ± 1.145
0.292PheMet: 0.292 ± 0.287
3.507PheAsn: 3.507 ± 0.964
0.584PhePro: 0.584 ± 0.429
2.63PheGln: 2.63 ± 0.802
1.753PheArg: 1.753 ± 0.734
2.922PheSer: 2.922 ± 0.68
2.63PheThr: 2.63 ± 0.871
0.584PheVal: 0.584 ± 0.452
0.584PheTrp: 0.584 ± 0.355
2.046PheTyr: 2.046 ± 0.562
0.0PheXaa: 0.0 ± 0.0
Gly
2.046GlyAla: 2.046 ± 0.791
0.877GlyCys: 0.877 ± 0.391
3.214GlyAsp: 3.214 ± 1.045
4.383GlyGlu: 4.383 ± 1.201
1.461GlyPhe: 1.461 ± 0.612
2.922GlyGly: 2.922 ± 1.133
0.877GlyHis: 0.877 ± 0.424
4.968GlyIle: 4.968 ± 1.208
4.383GlyLys: 4.383 ± 1.091
6.429GlyLeu: 6.429 ± 1.021
1.169GlyMet: 1.169 ± 0.561
2.046GlyAsn: 2.046 ± 1.016
0.292GlyPro: 0.292 ± 0.224
1.753GlyGln: 1.753 ± 0.546
3.214GlyArg: 3.214 ± 1.001
1.461GlySer: 1.461 ± 0.603
3.799GlyThr: 3.799 ± 1.05
3.507GlyVal: 3.507 ± 1.3
0.877GlyTrp: 0.877 ± 0.542
2.63GlyTyr: 2.63 ± 0.858
0.0GlyXaa: 0.0 ± 0.0
His
1.753HisAla: 1.753 ± 0.743
0.0HisCys: 0.0 ± 0.0
0.877HisAsp: 0.877 ± 0.474
0.877HisGlu: 0.877 ± 0.519
0.877HisPhe: 0.877 ± 0.426
1.461HisGly: 1.461 ± 0.72
0.0HisHis: 0.0 ± 0.0
0.292HisIle: 0.292 ± 0.224
0.877HisLys: 0.877 ± 0.71
1.461HisLeu: 1.461 ± 0.558
0.0HisMet: 0.0 ± 0.0
0.584HisAsn: 0.584 ± 0.357
0.584HisPro: 0.584 ± 0.345
0.584HisGln: 0.584 ± 0.452
1.461HisArg: 1.461 ± 0.858
0.877HisSer: 0.877 ± 0.391
1.461HisThr: 1.461 ± 0.694
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.461HisTyr: 1.461 ± 0.501
0.0HisXaa: 0.0 ± 0.0
Ile
4.676IleAla: 4.676 ± 1.078
0.292IleCys: 0.292 ± 0.308
6.429IleAsp: 6.429 ± 2.332
7.598IleGlu: 7.598 ± 1.619
4.091IlePhe: 4.091 ± 0.974
3.214IleGly: 3.214 ± 0.789
0.877IleHis: 0.877 ± 0.662
3.799IleIle: 3.799 ± 0.683
6.137IleLys: 6.137 ± 1.197
4.383IleLeu: 4.383 ± 0.892
0.877IleMet: 0.877 ± 0.422
2.046IleAsn: 2.046 ± 0.818
2.63IlePro: 2.63 ± 0.809
2.046IleGln: 2.046 ± 0.822
2.63IleArg: 2.63 ± 0.887
4.968IleSer: 4.968 ± 1.118
3.799IleThr: 3.799 ± 0.7
2.338IleVal: 2.338 ± 0.594
0.0IleTrp: 0.0 ± 0.0
2.338IleTyr: 2.338 ± 0.789
0.0IleXaa: 0.0 ± 0.0
Lys
8.182LysAla: 8.182 ± 1.74
0.292LysCys: 0.292 ± 0.276
2.338LysAsp: 2.338 ± 0.961
7.598LysGlu: 7.598 ± 1.557
3.214LysPhe: 3.214 ± 1.129
5.26LysGly: 5.26 ± 1.267
4.383LysHis: 4.383 ± 1.017
4.676LysIle: 4.676 ± 0.854
6.137LysLys: 6.137 ± 1.114
9.351LysLeu: 9.351 ± 1.356
2.63LysMet: 2.63 ± 1.206
5.26LysAsn: 5.26 ± 1.227
3.799LysPro: 3.799 ± 1.297
4.091LysGln: 4.091 ± 1.049
6.721LysArg: 6.721 ± 1.344
3.507LysSer: 3.507 ± 0.905
6.721LysThr: 6.721 ± 1.354
5.845LysVal: 5.845 ± 0.689
1.169LysTrp: 1.169 ± 0.572
2.63LysTyr: 2.63 ± 0.922
0.0LysXaa: 0.0 ± 0.0
Leu
5.552LeuAla: 5.552 ± 1.148
0.877LeuCys: 0.877 ± 0.442
9.351LeuAsp: 9.351 ± 1.758
12.566LeuGlu: 12.566 ± 1.819
3.507LeuPhe: 3.507 ± 0.999
3.799LeuGly: 3.799 ± 1.035
1.169LeuHis: 1.169 ± 0.524
4.383LeuIle: 4.383 ± 1.171
9.059LeuLys: 9.059 ± 1.37
9.059LeuLeu: 9.059 ± 1.238
2.63LeuMet: 2.63 ± 0.76
7.013LeuAsn: 7.013 ± 1.381
3.507LeuPro: 3.507 ± 0.995
2.338LeuGln: 2.338 ± 0.556
4.383LeuArg: 4.383 ± 1.03
6.137LeuSer: 6.137 ± 0.97
7.013LeuThr: 7.013 ± 1.344
4.383LeuVal: 4.383 ± 0.947
0.292LeuTrp: 0.292 ± 0.289
3.799LeuTyr: 3.799 ± 0.911
0.0LeuXaa: 0.0 ± 0.0
Met
1.753MetAla: 1.753 ± 0.816
0.0MetCys: 0.0 ± 0.0
1.461MetAsp: 1.461 ± 0.637
3.799MetGlu: 3.799 ± 0.749
0.584MetPhe: 0.584 ± 0.4
2.046MetGly: 2.046 ± 0.589
0.0MetHis: 0.0 ± 0.0
1.753MetIle: 1.753 ± 0.581
2.338MetLys: 2.338 ± 1.036
1.169MetLeu: 1.169 ± 0.639
0.584MetMet: 0.584 ± 0.452
2.63MetAsn: 2.63 ± 1.046
0.584MetPro: 0.584 ± 0.345
0.584MetGln: 0.584 ± 0.407
2.046MetArg: 2.046 ± 0.838
1.753MetSer: 1.753 ± 0.586
3.214MetThr: 3.214 ± 1.105
0.584MetVal: 0.584 ± 0.367
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.338AsnAla: 2.338 ± 0.624
0.0AsnCys: 0.0 ± 0.0
2.63AsnAsp: 2.63 ± 1.082
2.922AsnGlu: 2.922 ± 0.828
1.169AsnPhe: 1.169 ± 0.521
5.26AsnGly: 5.26 ± 0.922
0.877AsnHis: 0.877 ± 0.585
3.507AsnIle: 3.507 ± 1.285
6.137AsnLys: 6.137 ± 0.86
4.968AsnLeu: 4.968 ± 0.868
2.046AsnMet: 2.046 ± 0.909
2.63AsnAsn: 2.63 ± 0.823
2.63AsnPro: 2.63 ± 0.938
2.338AsnGln: 2.338 ± 0.801
1.461AsnArg: 1.461 ± 0.618
4.091AsnSer: 4.091 ± 0.983
3.214AsnThr: 3.214 ± 0.982
2.046AsnVal: 2.046 ± 0.642
0.584AsnTrp: 0.584 ± 0.369
1.461AsnTyr: 1.461 ± 0.66
0.0AsnXaa: 0.0 ± 0.0
Pro
2.046ProAla: 2.046 ± 0.695
0.0ProCys: 0.0 ± 0.0
2.046ProAsp: 2.046 ± 0.691
3.214ProGlu: 3.214 ± 0.849
2.338ProPhe: 2.338 ± 1.003
0.877ProGly: 0.877 ± 0.71
0.0ProHis: 0.0 ± 0.0
1.461ProIle: 1.461 ± 0.63
3.799ProLys: 3.799 ± 1.158
1.461ProLeu: 1.461 ± 0.581
0.292ProMet: 0.292 ± 0.285
2.046ProAsn: 2.046 ± 0.98
1.461ProPro: 1.461 ± 0.634
0.877ProGln: 0.877 ± 0.52
3.799ProArg: 3.799 ± 1.089
1.461ProSer: 1.461 ± 0.603
0.877ProThr: 0.877 ± 0.383
2.046ProVal: 2.046 ± 0.55
0.292ProTrp: 0.292 ± 0.248
1.461ProTyr: 1.461 ± 0.477
0.0ProXaa: 0.0 ± 0.0
Gln
3.507GlnAla: 3.507 ± 1.228
0.0GlnCys: 0.0 ± 0.0
1.461GlnAsp: 1.461 ± 0.689
3.214GlnGlu: 3.214 ± 0.743
0.877GlnPhe: 0.877 ± 0.39
2.338GlnGly: 2.338 ± 1.253
0.0GlnHis: 0.0 ± 0.0
2.338GlnIle: 2.338 ± 0.659
3.799GlnLys: 3.799 ± 0.835
4.676GlnLeu: 4.676 ± 1.136
0.584GlnMet: 0.584 ± 0.364
0.877GlnAsn: 0.877 ± 0.58
0.584GlnPro: 0.584 ± 0.415
1.169GlnGln: 1.169 ± 0.543
2.046GlnArg: 2.046 ± 0.824
1.169GlnSer: 1.169 ± 0.495
1.753GlnThr: 1.753 ± 0.551
3.799GlnVal: 3.799 ± 1.083
0.584GlnTrp: 0.584 ± 0.359
1.169GlnTyr: 1.169 ± 0.559
0.0GlnXaa: 0.0 ± 0.0
Arg
4.091ArgAla: 4.091 ± 1.091
0.584ArgCys: 0.584 ± 0.403
1.753ArgAsp: 1.753 ± 0.608
5.552ArgGlu: 5.552 ± 1.262
2.63ArgPhe: 2.63 ± 0.915
3.507ArgGly: 3.507 ± 1.244
0.877ArgHis: 0.877 ± 0.377
6.137ArgIle: 6.137 ± 0.977
3.799ArgLys: 3.799 ± 1.095
6.429ArgLeu: 6.429 ± 1.628
2.046ArgMet: 2.046 ± 0.927
2.338ArgAsn: 2.338 ± 1.011
1.169ArgPro: 1.169 ± 0.604
2.338ArgGln: 2.338 ± 0.831
2.046ArgArg: 2.046 ± 0.901
2.338ArgSer: 2.338 ± 0.998
2.63ArgThr: 2.63 ± 0.907
3.214ArgVal: 3.214 ± 1.013
0.292ArgTrp: 0.292 ± 0.314
3.507ArgTyr: 3.507 ± 1.267
0.0ArgXaa: 0.0 ± 0.0
Ser
2.922SerAla: 2.922 ± 0.731
0.584SerCys: 0.584 ± 0.416
3.214SerAsp: 3.214 ± 0.947
4.383SerGlu: 4.383 ± 1.233
2.338SerPhe: 2.338 ± 0.737
3.214SerGly: 3.214 ± 0.702
0.584SerHis: 0.584 ± 0.368
4.676SerIle: 4.676 ± 1.093
4.968SerLys: 4.968 ± 1.114
4.383SerLeu: 4.383 ± 1.034
2.046SerMet: 2.046 ± 0.702
3.799SerAsn: 3.799 ± 1.089
1.753SerPro: 1.753 ± 0.552
1.461SerGln: 1.461 ± 0.59
1.753SerArg: 1.753 ± 0.787
2.338SerSer: 2.338 ± 0.989
2.63SerThr: 2.63 ± 0.763
1.753SerVal: 1.753 ± 0.76
0.292SerTrp: 0.292 ± 0.248
2.046SerTyr: 2.046 ± 0.863
0.0SerXaa: 0.0 ± 0.0
Thr
2.63ThrAla: 2.63 ± 0.779
0.584ThrCys: 0.584 ± 0.439
2.338ThrAsp: 2.338 ± 0.785
3.214ThrGlu: 3.214 ± 0.878
3.507ThrPhe: 3.507 ± 1.664
2.338ThrGly: 2.338 ± 0.877
0.877ThrHis: 0.877 ± 0.352
3.799ThrIle: 3.799 ± 0.96
7.013ThrLys: 7.013 ± 1.63
8.475ThrLeu: 8.475 ± 1.267
2.046ThrMet: 2.046 ± 0.731
0.584ThrAsn: 0.584 ± 0.475
2.046ThrPro: 2.046 ± 0.912
1.753ThrGln: 1.753 ± 0.707
2.338ThrArg: 2.338 ± 0.544
2.922ThrSer: 2.922 ± 0.796
4.383ThrThr: 4.383 ± 1.139
4.968ThrVal: 4.968 ± 1.378
0.584ThrTrp: 0.584 ± 0.422
4.968ThrTyr: 4.968 ± 1.349
0.0ThrXaa: 0.0 ± 0.0
Val
3.799ValAla: 3.799 ± 1.526
0.584ValCys: 0.584 ± 0.398
1.753ValAsp: 1.753 ± 0.789
4.968ValGlu: 4.968 ± 1.392
1.169ValPhe: 1.169 ± 0.422
2.046ValGly: 2.046 ± 0.96
0.0ValHis: 0.0 ± 0.0
2.046ValIle: 2.046 ± 0.669
5.845ValLys: 5.845 ± 1.128
5.26ValLeu: 5.26 ± 1.348
1.169ValMet: 1.169 ± 0.654
3.214ValAsn: 3.214 ± 0.952
2.63ValPro: 2.63 ± 1.11
0.877ValGln: 0.877 ± 0.557
3.507ValArg: 3.507 ± 1.195
2.63ValSer: 2.63 ± 1.03
3.799ValThr: 3.799 ± 0.812
3.507ValVal: 3.507 ± 1.252
0.0ValTrp: 0.0 ± 0.0
3.214ValTyr: 3.214 ± 0.782
0.0ValXaa: 0.0 ± 0.0
Trp
0.292TrpAla: 0.292 ± 0.224
0.0TrpCys: 0.0 ± 0.0
0.292TrpAsp: 0.292 ± 0.314
1.169TrpGlu: 1.169 ± 0.592
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.292TrpHis: 0.292 ± 0.248
0.877TrpIle: 0.877 ± 0.439
0.292TrpLys: 0.292 ± 0.314
0.877TrpLeu: 0.877 ± 0.413
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.292TrpPro: 0.292 ± 0.248
0.877TrpGln: 0.877 ± 0.432
0.292TrpArg: 0.292 ± 0.248
1.169TrpSer: 1.169 ± 0.498
0.0TrpThr: 0.0 ± 0.0
0.877TrpVal: 0.877 ± 0.383
0.0TrpTrp: 0.0 ± 0.0
0.292TrpTyr: 0.292 ± 0.248
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.753TyrAla: 1.753 ± 0.673
0.877TyrCys: 0.877 ± 0.435
2.338TyrAsp: 2.338 ± 0.795
2.922TyrGlu: 2.922 ± 1.075
1.753TyrPhe: 1.753 ± 0.884
1.753TyrGly: 1.753 ± 1.022
1.169TyrHis: 1.169 ± 0.444
2.922TyrIle: 2.922 ± 0.74
6.429TyrLys: 6.429 ± 1.889
5.552TyrLeu: 5.552 ± 1.291
1.753TyrMet: 1.753 ± 0.715
1.753TyrAsn: 1.753 ± 0.508
2.046TyrPro: 2.046 ± 0.917
2.338TyrGln: 2.338 ± 0.791
2.922TyrArg: 2.922 ± 0.687
1.753TyrSer: 1.753 ± 0.598
2.338TyrThr: 2.338 ± 0.755
1.753TyrVal: 1.753 ± 0.747
0.292TyrTrp: 0.292 ± 0.285
0.292TyrTyr: 0.292 ± 0.285
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3423 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski