Amino acid dipeptide frequency for Streptococcus phage phiARI0131-2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
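As a minimal sketch of how such a frequency might be computed (the function name `dipeptide_frequencies` and the toy sequences are illustrative assumptions, not the Proteome-pI implementation), overlapping pairs of adjacent residues can be counted within each protein and normalised. The sketch divides by the total number of pairs; the database may use a different denominator.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Pairs overlap (ABC gives AB and BC) and never span two proteins.
    Normalising by the total pair count is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical one-letter-code sequences, for illustration only.
toy_proteins = ["MKLA", "AKL"]
freqs = dipeptide_frequencies(toy_proteins)
```

Here the toy input yields five pairs in total, two of which are `KL`.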

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
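The uniform expectation above can be sketched in a few lines. The helper names and the simple above/below comparison are assumptions made for illustration; the exact rule the page uses to colour cells is not stated here.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2

def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"

expected_20 = expected_uniform_per_mille(20)   # 400 possible dipeptides
expected_21 = expected_uniform_per_mille(21)   # 441 when Xaa is included

# Two values taken from the table: AlaLeu and AlaCys.
labels = {
    "AlaLeu": classify(8.015, expected_20),
    "AlaCys": classify(0.107, expected_20),
}
```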

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
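For instance, the unit conversion can be written as follows (a small sketch; the helper names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 4.275 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(4.275)
ala_ala_percent = per_mille_to_percent(4.275)
```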

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; entries within each row: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.275 ± 1.331
AlaCys: 0.107 ± 0.136
AlaAsp: 4.702 ± 0.781
AlaGlu: 5.985 ± 0.809
AlaPhe: 2.672 ± 0.469
AlaGly: 4.275 ± 0.945
AlaHis: 0.214 ± 0.155
AlaIle: 5.344 ± 1.303
AlaLys: 4.702 ± 0.606
AlaLeu: 8.015 ± 1.062
AlaMet: 2.565 ± 0.494
AlaAsn: 4.168 ± 1.067
AlaPro: 1.496 ± 0.361
AlaGln: 2.992 ± 0.873
AlaArg: 2.992 ± 0.543
AlaSer: 4.168 ± 0.704
AlaThr: 4.489 ± 0.731
AlaVal: 4.061 ± 0.599
AlaTrp: 0.962 ± 0.308
AlaTyr: 2.565 ± 0.704
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.107 ± 0.076
CysCys: 0.0 ± 0.0
CysAsp: 0.214 ± 0.144
CysGlu: 0.534 ± 0.236
CysPhe: 0.107 ± 0.076
CysGly: 0.107 ± 0.076
CysHis: 0.107 ± 0.101
CysIle: 0.427 ± 0.257
CysLys: 0.427 ± 0.215
CysLeu: 0.641 ± 0.3
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.214 ± 0.189
CysArg: 0.534 ± 0.25
CysSer: 0.321 ± 0.231
CysThr: 0.214 ± 0.176
CysVal: 0.321 ± 0.225
CysTrp: 0.107 ± 0.076
CysTyr: 0.641 ± 0.258
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.237 ± 0.73
AspCys: 0.214 ± 0.186
AspAsp: 3.741 ± 0.835
AspGlu: 4.702 ± 1.335
AspPhe: 2.565 ± 0.577
AspGly: 5.344 ± 0.96
AspHis: 0.855 ± 0.307
AspIle: 6.199 ± 0.691
AspLys: 5.237 ± 0.944
AspLeu: 5.557 ± 0.827
AspMet: 1.496 ± 0.404
AspAsn: 2.672 ± 0.626
AspPro: 1.496 ± 0.378
AspGln: 1.389 ± 0.323
AspArg: 1.71 ± 0.551
AspSer: 3.42 ± 0.669
AspThr: 2.351 ± 0.409
AspVal: 3.206 ± 0.61
AspTrp: 0.855 ± 0.213
AspTyr: 2.458 ± 0.562
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.771 ± 1.241
GluCys: 0.321 ± 0.179
GluAsp: 3.313 ± 0.782
GluGlu: 5.878 ± 0.933
GluPhe: 3.313 ± 0.645
GluGly: 3.42 ± 0.535
GluHis: 0.427 ± 0.274
GluIle: 6.305 ± 0.973
GluLys: 7.374 ± 1.141
GluLeu: 8.977 ± 0.979
GluMet: 2.779 ± 0.639
GluAsn: 5.237 ± 0.905
GluPro: 0.962 ± 0.469
GluGln: 3.527 ± 0.817
GluArg: 3.313 ± 0.62
GluSer: 3.954 ± 0.64
GluThr: 4.382 ± 0.879
GluVal: 4.916 ± 0.887
GluTrp: 1.069 ± 0.327
GluTyr: 2.137 ± 0.463
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.244 ± 0.662
PheCys: 0.107 ± 0.11
PheAsp: 3.847 ± 0.699
PheGlu: 2.351 ± 0.479
PhePhe: 2.458 ± 0.904
PheGly: 3.42 ± 0.731
PheHis: 0.641 ± 0.301
PheIle: 2.992 ± 0.703
PheLys: 4.809 ± 0.665
PheLeu: 2.886 ± 0.579
PheMet: 1.389 ± 0.516
PheAsn: 3.206 ± 0.583
PhePro: 1.069 ± 0.417
PheGln: 1.282 ± 0.433
PheArg: 1.817 ± 0.349
PheSer: 2.458 ± 0.63
PheThr: 2.992 ± 0.383
PheVal: 2.565 ± 0.612
PheTrp: 0.748 ± 0.247
PheTyr: 1.389 ± 0.378
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.061 ± 0.766
GlyCys: 0.214 ± 0.143
GlyAsp: 3.954 ± 0.723
GlyGlu: 3.313 ± 0.447
GlyPhe: 3.099 ± 0.615
GlyGly: 3.741 ± 0.557
GlyHis: 0.427 ± 0.176
GlyIle: 4.702 ± 0.87
GlyLys: 4.061 ± 0.477
GlyLeu: 4.595 ± 0.835
GlyMet: 2.137 ± 0.531
GlyAsn: 4.382 ± 0.679
GlyPro: 0.855 ± 0.282
GlyGln: 3.206 ± 0.565
GlyArg: 2.672 ± 0.59
GlySer: 3.313 ± 0.684
GlyThr: 3.313 ± 0.577
GlyVal: 4.275 ± 0.485
GlyTrp: 1.71 ± 0.673
GlyTyr: 2.351 ± 0.582
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.321 ± 0.151
HisCys: 0.214 ± 0.149
HisAsp: 0.748 ± 0.448
HisGlu: 0.641 ± 0.248
HisPhe: 1.069 ± 0.336
HisGly: 0.641 ± 0.308
HisHis: 0.214 ± 0.197
HisIle: 1.389 ± 0.316
HisLys: 0.748 ± 0.28
HisLeu: 1.176 ± 0.386
HisMet: 0.321 ± 0.165
HisAsn: 0.962 ± 0.296
HisPro: 0.427 ± 0.183
HisGln: 0.748 ± 0.429
HisArg: 0.534 ± 0.203
HisSer: 1.603 ± 0.607
HisThr: 0.748 ± 0.339
HisVal: 1.176 ± 0.421
HisTrp: 0.0 ± 0.0
HisTyr: 0.427 ± 0.201
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.664 ± 0.692
IleCys: 0.641 ± 0.263
IleAsp: 3.954 ± 0.834
IleGlu: 6.412 ± 1.006
IlePhe: 3.099 ± 0.745
IleGly: 3.847 ± 0.614
IleHis: 0.962 ± 0.35
IleIle: 3.206 ± 0.623
IleLys: 7.054 ± 0.841
IleLeu: 4.382 ± 0.756
IleMet: 0.855 ± 0.235
IleAsn: 3.527 ± 0.522
IlePro: 1.817 ± 0.332
IleGln: 2.672 ± 0.48
IleArg: 3.634 ± 0.637
IleSer: 5.664 ± 1.375
IleThr: 4.702 ± 0.722
IleVal: 3.847 ± 0.733
IleTrp: 0.427 ± 0.191
IleTyr: 2.031 ± 0.69
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.878 ± 0.911
LysCys: 0.321 ± 0.218
LysAsp: 5.45 ± 0.675
LysGlu: 6.626 ± 1.264
LysPhe: 2.351 ± 0.438
LysGly: 4.595 ± 0.805
LysHis: 2.244 ± 0.566
LysIle: 4.809 ± 0.89
LysLys: 6.84 ± 1.209
LysLeu: 5.237 ± 0.701
LysMet: 1.924 ± 0.512
LysAsn: 4.916 ± 0.648
LysPro: 2.992 ± 0.652
LysGln: 3.847 ± 0.588
LysArg: 4.275 ± 0.894
LysSer: 5.45 ± 0.619
LysThr: 5.664 ± 0.811
LysVal: 5.344 ± 0.832
LysTrp: 0.962 ± 0.34
LysTyr: 2.992 ± 0.475
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.985 ± 0.94
LeuCys: 0.321 ± 0.214
LeuAsp: 6.733 ± 0.85
LeuGlu: 6.412 ± 0.931
LeuPhe: 3.741 ± 0.542
LeuGly: 5.664 ± 0.714
LeuHis: 1.176 ± 0.374
LeuIle: 3.847 ± 0.614
LeuLys: 6.305 ± 0.744
LeuLeu: 5.13 ± 0.937
LeuMet: 1.71 ± 0.49
LeuAsn: 5.237 ± 0.743
LeuPro: 2.565 ± 0.72
LeuGln: 3.099 ± 0.945
LeuArg: 4.489 ± 0.731
LeuSer: 5.45 ± 1.031
LeuThr: 5.557 ± 0.673
LeuVal: 3.741 ± 0.666
LeuTrp: 0.748 ± 0.27
LeuTyr: 3.313 ± 0.58
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.71 ± 0.471
MetCys: 0.107 ± 0.106
MetAsp: 0.962 ± 0.283
MetGlu: 2.031 ± 0.5
MetPhe: 0.748 ± 0.422
MetGly: 1.069 ± 0.232
MetHis: 0.107 ± 0.105
MetIle: 1.603 ± 0.377
MetLys: 1.71 ± 0.468
MetLeu: 2.031 ± 0.571
MetMet: 0.534 ± 0.25
MetAsn: 2.244 ± 0.626
MetPro: 0.534 ± 0.199
MetGln: 1.069 ± 0.349
MetArg: 1.389 ± 0.476
MetSer: 1.71 ± 0.488
MetThr: 2.244 ± 0.482
MetVal: 1.496 ± 0.345
MetTrp: 0.0 ± 0.0
MetTyr: 0.427 ± 0.227
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.168 ± 0.987
AsnCys: 0.321 ± 0.191
AsnAsp: 2.779 ± 0.631
AsnGlu: 4.916 ± 0.575
AsnPhe: 2.992 ± 0.461
AsnGly: 4.275 ± 0.712
AsnHis: 0.855 ± 0.311
AsnIle: 3.206 ± 0.59
AsnLys: 5.557 ± 0.7
AsnLeu: 4.168 ± 0.939
AsnMet: 0.855 ± 0.401
AsnAsn: 2.565 ± 0.625
AsnPro: 2.458 ± 0.509
AsnGln: 4.595 ± 0.723
AsnArg: 2.244 ± 0.45
AsnSer: 4.168 ± 0.9
AsnThr: 3.099 ± 0.571
AsnVal: 4.702 ± 0.754
AsnTrp: 0.962 ± 0.334
AsnTyr: 1.069 ± 0.319
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.817 ± 0.521
ProCys: 0.214 ± 0.17
ProAsp: 2.244 ± 0.631
ProGlu: 2.031 ± 0.499
ProPhe: 0.748 ± 0.351
ProGly: 1.176 ± 0.546
ProHis: 0.748 ± 0.275
ProIle: 1.71 ± 0.593
ProLys: 2.351 ± 0.575
ProLeu: 2.137 ± 0.541
ProMet: 0.214 ± 0.165
ProAsn: 0.962 ± 0.363
ProPro: 0.214 ± 0.124
ProGln: 1.603 ± 0.618
ProArg: 1.389 ± 0.437
ProSer: 1.389 ± 0.412
ProThr: 1.71 ± 0.445
ProVal: 2.137 ± 0.47
ProTrp: 0.321 ± 0.184
ProTyr: 0.962 ± 0.384
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.023 ± 1.018
GlnCys: 0.107 ± 0.11
GlnAsp: 1.924 ± 0.537
GlnGlu: 4.061 ± 0.706
GlnPhe: 1.817 ± 0.374
GlnGly: 2.031 ± 0.378
GlnHis: 0.321 ± 0.23
GlnIle: 3.099 ± 0.667
GlnLys: 3.741 ± 0.65
GlnLeu: 3.847 ± 0.652
GlnMet: 1.282 ± 0.375
GlnAsn: 2.458 ± 0.477
GlnPro: 1.496 ± 0.4
GlnGln: 1.603 ± 0.484
GlnArg: 2.137 ± 0.61
GlnSer: 3.206 ± 0.493
GlnThr: 3.741 ± 0.509
GlnVal: 2.244 ± 0.505
GlnTrp: 0.107 ± 0.127
GlnTyr: 0.748 ± 0.266
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.42 ± 0.728
ArgCys: 0.214 ± 0.146
ArgAsp: 2.244 ± 0.532
ArgGlu: 3.634 ± 0.696
ArgPhe: 1.817 ± 0.586
ArgGly: 1.496 ± 0.359
ArgHis: 0.427 ± 0.243
ArgIle: 3.527 ± 0.807
ArgLys: 4.061 ± 0.861
ArgLeu: 3.954 ± 0.756
ArgMet: 1.176 ± 0.294
ArgAsn: 3.847 ± 0.715
ArgPro: 1.389 ± 0.479
ArgGln: 2.672 ± 0.658
ArgArg: 1.496 ± 0.468
ArgSer: 1.71 ± 0.368
ArgThr: 2.458 ± 0.808
ArgVal: 2.244 ± 0.553
ArgTrp: 0.641 ± 0.253
ArgTyr: 2.137 ± 0.59
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.023 ± 0.844
SerCys: 0.427 ± 0.241
SerAsp: 3.099 ± 0.738
SerGlu: 4.595 ± 0.815
SerPhe: 3.847 ± 0.599
SerGly: 4.489 ± 1.048
SerHis: 1.176 ± 0.348
SerIle: 3.954 ± 0.519
SerLys: 3.741 ± 0.787
SerLeu: 4.809 ± 0.645
SerMet: 1.176 ± 0.426
SerAsn: 3.954 ± 0.659
SerPro: 2.031 ± 0.416
SerGln: 2.458 ± 0.639
SerArg: 2.779 ± 0.582
SerSer: 3.527 ± 0.875
SerThr: 3.741 ± 0.845
SerVal: 3.847 ± 0.901
SerTrp: 0.855 ± 0.292
SerTyr: 2.351 ± 0.476
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.809 ± 1.118
ThrCys: 0.107 ± 0.101
ThrAsp: 5.13 ± 0.836
ThrGlu: 4.061 ± 0.7
ThrPhe: 3.099 ± 0.816
ThrGly: 4.061 ± 0.84
ThrHis: 1.069 ± 0.33
ThrIle: 4.382 ± 0.786
ThrLys: 5.237 ± 0.549
ThrLeu: 5.13 ± 0.645
ThrMet: 0.962 ± 0.415
ThrAsn: 2.886 ± 0.526
ThrPro: 1.176 ± 0.398
ThrGln: 3.42 ± 0.799
ThrArg: 2.351 ± 0.391
ThrSer: 3.527 ± 0.581
ThrThr: 4.168 ± 0.991
ThrVal: 5.023 ± 0.988
ThrTrp: 0.641 ± 0.275
ThrTyr: 2.244 ± 0.621
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.886 ± 0.537
ValCys: 0.321 ± 0.207
ValAsp: 3.741 ± 0.664
ValGlu: 5.664 ± 0.923
ValPhe: 2.992 ± 0.747
ValGly: 4.168 ± 0.662
ValHis: 1.176 ± 0.35
ValIle: 3.954 ± 0.625
ValLys: 5.771 ± 0.77
ValLeu: 4.275 ± 0.692
ValMet: 1.282 ± 0.309
ValAsn: 3.634 ± 0.555
ValPro: 1.603 ± 0.474
ValGln: 2.351 ± 0.473
ValArg: 2.244 ± 0.48
ValSer: 3.954 ± 0.711
ValThr: 5.237 ± 0.693
ValVal: 4.916 ± 0.661
ValTrp: 1.069 ± 0.457
ValTyr: 1.389 ± 0.527
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.641 ± 0.292
TrpCys: 0.107 ± 0.101
TrpAsp: 0.321 ± 0.199
TrpGlu: 0.962 ± 0.396
TrpPhe: 0.641 ± 0.321
TrpGly: 0.748 ± 0.257
TrpHis: 0.107 ± 0.114
TrpIle: 1.069 ± 0.401
TrpLys: 0.748 ± 0.307
TrpLeu: 0.855 ± 0.239
TrpMet: 0.427 ± 0.228
TrpAsn: 1.603 ± 0.373
TrpPro: 0.107 ± 0.093
TrpGln: 0.855 ± 0.265
TrpArg: 0.427 ± 0.249
TrpSer: 0.641 ± 0.239
TrpThr: 0.855 ± 0.286
TrpVal: 0.855 ± 0.35
TrpTrp: 0.214 ± 0.108
TrpTyr: 0.855 ± 0.757
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.71 ± 0.384
TyrCys: 0.641 ± 0.281
TyrAsp: 1.603 ± 0.362
TyrGlu: 2.565 ± 0.467
TyrPhe: 1.496 ± 0.556
TyrGly: 1.817 ± 0.467
TyrHis: 0.855 ± 0.246
TyrIle: 2.672 ± 0.485
TyrLys: 2.244 ± 0.401
TyrLeu: 3.42 ± 0.657
TyrMet: 0.534 ± 0.249
TyrAsn: 1.496 ± 0.462
TyrPro: 1.389 ± 0.404
TyrGln: 1.389 ± 0.404
TyrArg: 2.244 ± 0.635
TyrSer: 2.244 ± 0.523
TyrThr: 1.817 ± 0.394
TyrVal: 1.71 ± 0.408
TyrTrp: 0.641 ± 0.313
TyrTyr: 1.817 ± 0.831
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (9358 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
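A protein-level bootstrap of this kind might be sketched as follows. Whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function name `bootstrap_error`, the use of the standard deviation as the error, the fixed seed, and the normalisation by pair count are all assumptions of this sketch, not details confirmed by the database.

```python
import random
import statistics
from collections import Counter

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error (per mille) of one dipeptide frequency.

    Proteins, not residues, are the resampling unit. The error is taken
    to be the standard deviation across replicates (an assumption).
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.stdev(estimates)

# Hypothetical one-letter-code sequences, for illustration only.
toy_proteins = ["MKLA", "AKL", "GGKL", "KLKL"]
kl_error = bootstrap_error(toy_proteins, "KL")
```

Resampling whole proteins keeps the residues of each protein together, so the estimate reflects variation between proteins rather than between individual positions.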

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski