Amino acid dipeptide frequency for Streptococcus satellite phage Javan462

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
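As an illustration of how such frequencies can be obtained, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is a minimal example under stated assumptions, not the Proteome-pI implementation: the function name `dipeptide_frequencies` is hypothetical, sequences are assumed to use one-letter codes, any non-standard letter is mapped to X (Xaa), and the counts are normalized by the total number of dipeptides, which may differ from the normalization used by the database.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (hypothetical helper)."""
    alphabet = STANDARD + "X"
    # Start every one of the 21 x 21 = 441 pairs at zero so absent pairs are reported too.
    counts = Counter({a + b: 0 for a, b in product(alphabet, repeat=2)})
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is treated as Xaa.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two different proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {dp: 0.0 for dp in counts}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKKLLE", "MKE"])
    print(round(freqs["MK"], 3), round(freqs["KK"], 3))
```

The result can be compared against the uniform baseline of 2.5 ‰ (0.25%) mentioned above.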

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
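The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and the classification rule is an assumption: the page does not state the exact threshold used for the red and blue marking, so this sketch simply compares each value with 2.5 ‰ (1/400).

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value):
    """Classify a per mille frequency against the uniform baseline (assumed rule)."""
    if value > UNIFORM_PER_MILLE:
        return "over"
    if value < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


if __name__ == "__main__":
    # LysLys is listed at 10.599 per mille in the table.
    print(per_mille_to_fraction(10.599), representation(10.599))
```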

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue; within each group the second residue follows the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa. Each entry is given as frequency ± error (‰).
Ala
AlaAla: 0.935 ± 0.449
AlaCys: 0.312 ± 0.295
AlaAsp: 1.87 ± 0.929
AlaGlu: 4.052 ± 1.16
AlaPhe: 3.741 ± 0.728
AlaGly: 2.805 ± 0.906
AlaHis: 0.312 ± 0.309
AlaIle: 4.364 ± 1.29
AlaLys: 4.676 ± 1.462
AlaLeu: 5.923 ± 1.245
AlaMet: 1.559 ± 0.589
AlaAsn: 3.741 ± 1.069
AlaPro: 0.935 ± 0.536
AlaGln: 1.87 ± 0.905
AlaArg: 2.182 ± 0.964
AlaSer: 1.559 ± 0.519
AlaThr: 5.923 ± 1.349
AlaVal: 2.805 ± 0.908
AlaTrp: 0.312 ± 0.312
AlaTyr: 3.117 ± 0.966
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.312 ± 0.338
CysCys: 0.0 ± 0.0
CysAsp: 0.623 ± 0.429
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.312 ± 0.331
CysHis: 0.312 ± 0.331
CysIle: 0.0 ± 0.0
CysLys: 0.935 ± 0.559
CysLeu: 0.623 ± 0.409
CysMet: 0.0 ± 0.0
CysAsn: 0.623 ± 0.612
CysPro: 1.247 ± 0.758
CysGln: 0.0 ± 0.0
CysArg: 0.623 ± 0.364
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.312 ± 0.309
CysTrp: 0.0 ± 0.0
CysTyr: 1.247 ± 1.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.494 ± 0.894
AspCys: 0.312 ± 0.295
AspAsp: 4.052 ± 1.187
AspGlu: 5.611 ± 1.789
AspPhe: 4.676 ± 1.554
AspGly: 2.805 ± 0.965
AspHis: 0.623 ± 0.454
AspIle: 5.299 ± 1.15
AspLys: 6.234 ± 1.279
AspLeu: 6.858 ± 1.245
AspMet: 0.312 ± 0.342
AspAsn: 2.494 ± 1.268
AspPro: 1.247 ± 0.674
AspGln: 1.559 ± 0.626
AspArg: 2.494 ± 0.954
AspSer: 2.182 ± 0.616
AspThr: 3.117 ± 1.103
AspVal: 1.87 ± 0.556
AspTrp: 1.247 ± 0.623
AspTyr: 2.805 ± 0.889
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.234 ± 1.328
GluCys: 0.935 ± 0.72
GluAsp: 4.364 ± 1.508
GluGlu: 5.299 ± 1.374
GluPhe: 4.052 ± 1.399
GluGly: 4.052 ± 0.945
GluHis: 1.559 ± 0.736
GluIle: 5.611 ± 1.837
GluLys: 9.975 ± 1.614
GluLeu: 9.04 ± 1.548
GluMet: 1.87 ± 0.882
GluAsn: 4.988 ± 1.245
GluPro: 1.247 ± 0.497
GluGln: 4.676 ± 1.343
GluArg: 4.676 ± 1.619
GluSer: 3.741 ± 0.915
GluThr: 4.052 ± 0.779
GluVal: 5.299 ± 1.473
GluTrp: 1.247 ± 0.535
GluTyr: 3.741 ± 0.812
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.494 ± 0.69
PheCys: 0.312 ± 0.309
PheAsp: 2.805 ± 0.907
PheGlu: 3.429 ± 0.866
PhePhe: 2.182 ± 0.825
PheGly: 0.623 ± 0.364
PheHis: 1.247 ± 0.638
PheIle: 1.87 ± 0.666
PheLys: 4.988 ± 1.241
PheLeu: 4.676 ± 1.084
PheMet: 1.87 ± 0.694
PheAsn: 3.429 ± 1.05
PhePro: 0.312 ± 0.283
PheGln: 1.247 ± 0.56
PheArg: 1.247 ± 0.51
PheSer: 3.117 ± 1.067
PheThr: 2.494 ± 0.859
PheVal: 2.182 ± 0.716
PheTrp: 0.0 ± 0.0
PheTyr: 1.87 ± 0.805
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.676 ± 0.847
GlyCys: 0.935 ± 0.513
GlyAsp: 2.182 ± 1.119
GlyGlu: 3.429 ± 0.627
GlyPhe: 1.559 ± 0.6
GlyGly: 2.182 ± 1.042
GlyHis: 0.935 ± 0.478
GlyIle: 3.741 ± 1.236
GlyLys: 4.364 ± 1.243
GlyLeu: 4.676 ± 1.107
GlyMet: 0.312 ± 0.309
GlyAsn: 1.87 ± 0.762
GlyPro: 1.247 ± 0.678
GlyGln: 1.559 ± 1.013
GlyArg: 1.559 ± 0.585
GlySer: 0.935 ± 0.466
GlyThr: 2.494 ± 0.656
GlyVal: 1.87 ± 1.193
GlyTrp: 1.247 ± 0.571
GlyTyr: 2.494 ± 1.255
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.247 ± 0.665
HisCys: 0.312 ± 0.437
HisAsp: 0.623 ± 0.7
HisGlu: 0.935 ± 0.64
HisPhe: 0.312 ± 0.295
HisGly: 0.935 ± 0.5
HisHis: 0.935 ± 0.5
HisIle: 1.87 ± 0.619
HisLys: 1.559 ± 0.608
HisLeu: 2.182 ± 1.319
HisMet: 0.0 ± 0.0
HisAsn: 0.312 ± 0.295
HisPro: 0.312 ± 0.283
HisGln: 1.559 ± 0.724
HisArg: 1.247 ± 0.857
HisSer: 0.623 ± 0.409
HisThr: 1.87 ± 0.718
HisVal: 0.312 ± 0.309
HisTrp: 0.312 ± 0.283
HisTyr: 1.247 ± 0.685
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.052 ± 1.222
IleCys: 1.247 ± 0.64
IleAsp: 4.052 ± 0.975
IleGlu: 9.663 ± 1.778
IlePhe: 1.87 ± 0.678
IleGly: 1.559 ± 0.588
IleHis: 1.559 ± 0.62
IleIle: 4.988 ± 1.347
IleLys: 11.222 ± 1.889
IleLeu: 5.299 ± 1.14
IleMet: 1.247 ± 0.667
IleAsn: 4.676 ± 0.956
IlePro: 1.559 ± 0.581
IleGln: 1.87 ± 0.618
IleArg: 1.559 ± 0.792
IleSer: 4.052 ± 1.093
IleThr: 4.364 ± 1.175
IleVal: 4.052 ± 1.176
IleTrp: 0.0 ± 0.0
IleTyr: 1.559 ± 0.665
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.988 ± 1.557
LysCys: 0.312 ± 0.283
LysAsp: 7.17 ± 1.363
LysGlu: 9.975 ± 1.884
LysPhe: 2.182 ± 0.74
LysGly: 4.676 ± 1.45
LysHis: 2.494 ± 1.059
LysIle: 6.546 ± 1.644
LysLys: 10.599 ± 2.186
LysLeu: 9.663 ± 1.862
LysMet: 1.559 ± 0.677
LysAsn: 5.923 ± 1.063
LysPro: 2.182 ± 0.901
LysGln: 5.299 ± 1.288
LysArg: 5.923 ± 1.373
LysSer: 7.17 ± 1.54
LysThr: 5.923 ± 1.035
LysVal: 6.858 ± 1.338
LysTrp: 0.312 ± 0.295
LysTyr: 2.805 ± 0.891
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.052 ± 1.162
LeuCys: 0.0 ± 0.0
LeuAsp: 6.234 ± 1.56
LeuGlu: 10.287 ± 2.356
LeuPhe: 3.741 ± 1.226
LeuGly: 6.546 ± 1.256
LeuHis: 2.182 ± 0.639
LeuIle: 4.676 ± 1.344
LeuLys: 8.416 ± 1.617
LeuLeu: 10.599 ± 1.436
LeuMet: 1.559 ± 0.521
LeuAsn: 7.481 ± 1.454
LeuPro: 2.494 ± 0.686
LeuGln: 3.741 ± 1.073
LeuArg: 3.741 ± 1.128
LeuSer: 6.858 ± 1.041
LeuThr: 4.988 ± 1.106
LeuVal: 4.676 ± 1.403
LeuTrp: 0.312 ± 0.295
LeuTyr: 4.676 ± 1.181
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.623 ± 0.403
MetCys: 0.0 ± 0.0
MetAsp: 1.247 ± 0.769
MetGlu: 1.559 ± 0.642
MetPhe: 0.312 ± 0.342
MetGly: 0.312 ± 0.283
MetHis: 0.0 ± 0.0
MetIle: 1.559 ± 0.731
MetLys: 1.559 ± 0.675
MetLeu: 2.494 ± 0.655
MetMet: 0.312 ± 0.283
MetAsn: 1.87 ± 0.711
MetPro: 0.0 ± 0.0
MetGln: 1.247 ± 0.629
MetArg: 0.935 ± 0.701
MetSer: 2.182 ± 0.844
MetThr: 1.247 ± 0.512
MetVal: 0.935 ± 0.661
MetTrp: 0.0 ± 0.0
MetTyr: 0.312 ± 0.323
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.052 ± 1.092
AsnCys: 0.623 ± 0.618
AsnAsp: 2.805 ± 0.829
AsnGlu: 4.676 ± 1.688
AsnPhe: 3.117 ± 0.855
AsnGly: 3.429 ± 0.888
AsnHis: 1.247 ± 0.583
AsnIle: 3.741 ± 1.241
AsnLys: 5.299 ± 1.258
AsnLeu: 4.988 ± 1.148
AsnMet: 0.935 ± 0.573
AsnAsn: 4.364 ± 1.344
AsnPro: 2.182 ± 1.005
AsnGln: 4.052 ± 1.202
AsnArg: 3.117 ± 1.003
AsnSer: 3.429 ± 0.771
AsnThr: 4.676 ± 1.235
AsnVal: 3.117 ± 1.28
AsnTrp: 0.935 ± 0.586
AsnTyr: 2.182 ± 0.879
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.623 ± 0.589
ProCys: 0.0 ± 0.0
ProAsp: 0.623 ± 0.428
ProGlu: 3.429 ± 1.019
ProPhe: 1.87 ± 0.53
ProGly: 0.623 ± 0.566
ProHis: 0.0 ± 0.0
ProIle: 1.87 ± 0.802
ProLys: 2.805 ± 0.986
ProLeu: 1.87 ± 0.811
ProMet: 1.247 ± 0.54
ProAsn: 1.559 ± 0.527
ProPro: 0.0 ± 0.0
ProGln: 0.935 ± 0.446
ProArg: 1.247 ± 0.535
ProSer: 2.182 ± 0.761
ProThr: 1.247 ± 0.573
ProVal: 0.935 ± 0.446
ProTrp: 0.0 ± 0.0
ProTyr: 1.559 ± 0.733
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.805 ± 1.265
GlnCys: 0.623 ± 0.545
GlnAsp: 2.494 ± 1.105
GlnGlu: 4.676 ± 1.14
GlnPhe: 1.559 ± 0.53
GlnGly: 1.247 ± 0.811
GlnHis: 0.623 ± 0.431
GlnIle: 3.429 ± 1.067
GlnLys: 4.052 ± 1.579
GlnLeu: 4.364 ± 1.174
GlnMet: 1.247 ± 0.665
GlnAsn: 2.805 ± 0.985
GlnPro: 1.247 ± 0.575
GlnGln: 3.117 ± 1.342
GlnArg: 2.494 ± 0.874
GlnSer: 1.87 ± 0.557
GlnThr: 1.247 ± 0.724
GlnVal: 3.741 ± 1.314
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.429 ± 1.22
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.117 ± 0.881
ArgCys: 0.0 ± 0.0
ArgAsp: 2.494 ± 0.827
ArgGlu: 2.805 ± 1.006
ArgPhe: 0.623 ± 0.456
ArgGly: 1.87 ± 0.797
ArgHis: 0.935 ± 0.509
ArgIle: 3.117 ± 0.702
ArgLys: 5.299 ± 1.051
ArgLeu: 4.364 ± 1.235
ArgMet: 0.935 ± 0.635
ArgAsn: 2.182 ± 1.005
ArgPro: 1.247 ± 0.597
ArgGln: 3.429 ± 1.121
ArgArg: 2.182 ± 0.736
ArgSer: 1.559 ± 0.547
ArgThr: 3.741 ± 0.839
ArgVal: 2.182 ± 0.845
ArgTrp: 0.312 ± 0.338
ArgTyr: 2.805 ± 1.053
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.117 ± 1.204
SerCys: 0.312 ± 0.331
SerAsp: 4.364 ± 1.118
SerGlu: 4.052 ± 1.482
SerPhe: 3.117 ± 0.922
SerGly: 1.247 ± 0.879
SerHis: 0.312 ± 0.309
SerIle: 3.741 ± 0.7
SerLys: 5.923 ± 1.235
SerLeu: 3.429 ± 1.084
SerMet: 0.935 ± 0.672
SerAsn: 3.741 ± 1.098
SerPro: 1.559 ± 0.936
SerGln: 1.87 ± 0.996
SerArg: 4.052 ± 0.898
SerSer: 2.494 ± 0.913
SerThr: 3.117 ± 1.055
SerVal: 3.741 ± 1.055
SerTrp: 0.935 ± 0.529
SerTyr: 1.87 ± 0.704
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.494 ± 0.963
ThrCys: 0.312 ± 0.32
ThrAsp: 3.429 ± 1.596
ThrGlu: 4.988 ± 1.642
ThrPhe: 3.429 ± 1.377
ThrGly: 2.494 ± 0.897
ThrHis: 1.247 ± 0.521
ThrIle: 5.923 ± 1.58
ThrLys: 4.364 ± 0.996
ThrLeu: 4.052 ± 0.944
ThrMet: 1.247 ± 0.515
ThrAsn: 3.429 ± 0.648
ThrPro: 3.429 ± 1.019
ThrGln: 4.052 ± 1.436
ThrArg: 0.935 ± 0.596
ThrSer: 3.429 ± 1.224
ThrThr: 2.494 ± 1.098
ThrVal: 4.052 ± 1.014
ThrTrp: 0.0 ± 0.0
ThrTyr: 3.429 ± 1.194
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.87 ± 0.684
ValCys: 0.312 ± 0.283
ValAsp: 3.741 ± 0.865
ValGlu: 4.052 ± 0.944
ValPhe: 2.182 ± 0.709
ValGly: 2.494 ± 0.812
ValHis: 1.247 ± 0.472
ValIle: 3.741 ± 0.899
ValLys: 4.988 ± 0.991
ValLeu: 6.234 ± 1.521
ValMet: 0.935 ± 0.576
ValAsn: 4.052 ± 0.9
ValPro: 1.247 ± 0.463
ValGln: 1.247 ± 0.782
ValArg: 1.87 ± 0.509
ValSer: 2.182 ± 0.73
ValThr: 4.988 ± 1.057
ValVal: 1.87 ± 0.629
ValTrp: 0.935 ± 0.512
ValTyr: 2.494 ± 0.678
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.623 ± 0.381
TrpCys: 0.0 ± 0.0
TrpAsp: 1.247 ± 0.67
TrpGlu: 1.559 ± 0.563
TrpPhe: 0.312 ± 0.283
TrpGly: 0.312 ± 0.35
TrpHis: 0.0 ± 0.0
TrpIle: 0.312 ± 0.306
TrpLys: 0.312 ± 0.323
TrpLeu: 0.935 ± 0.497
TrpMet: 0.0 ± 0.0
TrpAsn: 0.623 ± 0.424
TrpPro: 0.0 ± 0.0
TrpGln: 0.623 ± 0.434
TrpArg: 0.312 ± 0.312
TrpSer: 0.312 ± 0.295
TrpThr: 0.0 ± 0.0
TrpVal: 0.312 ± 0.32
TrpTrp: 0.312 ± 0.295
TrpTyr: 0.623 ± 0.454
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.182 ± 1.164
TyrCys: 0.312 ± 0.342
TyrAsp: 1.87 ± 0.829
TyrGlu: 2.182 ± 0.664
TyrPhe: 1.559 ± 0.754
TyrGly: 3.741 ± 1.092
TyrHis: 0.935 ± 0.535
TyrIle: 4.052 ± 1.214
TyrLys: 4.988 ± 1.335
TyrLeu: 4.988 ± 1.046
TyrMet: 0.312 ± 0.412
TyrAsn: 2.805 ± 0.798
TyrPro: 0.935 ± 0.502
TyrGln: 3.117 ± 0.884
TyrArg: 2.805 ± 0.899
TyrSer: 4.052 ± 1.275
TyrThr: 1.247 ± 0.564
TyrVal: 1.559 ± 0.807
TyrTrp: 0.312 ± 0.283
TyrTyr: 2.182 ± 0.937
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 23 proteins (3209 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
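A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled estimates is reported as the error. This is a minimal sketch under assumptions, not the database's actual procedure: the function names are hypothetical, the error is taken to be the population standard deviation across replicates, and frequencies are normalized by the total number of dipeptides.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all adjacent residue pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    # Assumption: the reported error is the standard deviation over replicates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    proteins = ["MKKLLE", "MKE", "AKKA"]
    print(dipeptide_per_mille(proteins, "KK"), bootstrap_error(proteins, "KK"))
```

A dipeptide that never occurs in any protein has zero frequency in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.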

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski