Amino acid dipeptide frequency for Escherichia phage phiv205-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the over-represented dipeptides are marked in red and the under-represented ones in blue in the table.

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
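As a small worked example of this unit conversion, the sketch below turns a per mille value into a fraction and a percentage and compares it with the uniform expectation of 2.5‰. The function names are hypothetical and chosen only for illustration.

```python
UNIFORM_EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / UNIFORM_EXPECTED_PER_MILLE


if __name__ == "__main__":
    ala_ala = 9.963  # AlaAla value from the table, in per mille
    print(per_mille_to_fraction(ala_ala))  # about 0.009963
    print(per_mille_to_percent(ala_ala))   # about 0.9963 (percent)
    print(fold_vs_uniform(ala_ala))        # roughly 4 times the uniform expectation
```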

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.963AlaAla: 9.963 ± 1.079
0.608AlaCys: 0.608 ± 0.31
6.084AlaAsp: 6.084 ± 0.65
7.985AlaGlu: 7.985 ± 1.513
3.422AlaPhe: 3.422 ± 0.534
6.693AlaGly: 6.693 ± 0.787
1.369AlaHis: 1.369 ± 0.341
6.388AlaIle: 6.388 ± 0.647
7.149AlaLys: 7.149 ± 1.082
7.681AlaLeu: 7.681 ± 0.949
3.194AlaMet: 3.194 ± 0.625
3.879AlaAsn: 3.879 ± 0.654
1.673AlaPro: 1.673 ± 0.39
4.563AlaGln: 4.563 ± 0.742
5.704AlaArg: 5.704 ± 0.586
4.335AlaSer: 4.335 ± 0.581
5.704AlaThr: 5.704 ± 0.608
4.107AlaVal: 4.107 ± 0.521
2.129AlaTrp: 2.129 ± 0.302
2.662AlaTyr: 2.662 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
1.141CysAla: 1.141 ± 0.27
0.304CysCys: 0.304 ± 0.184
0.684CysAsp: 0.684 ± 0.269
0.532CysGlu: 0.532 ± 0.237
0.456CysPhe: 0.456 ± 0.161
1.141CysGly: 1.141 ± 0.305
0.38CysHis: 0.38 ± 0.231
0.532CysIle: 0.532 ± 0.193
0.532CysLys: 0.532 ± 0.216
0.837CysLeu: 0.837 ± 0.263
0.38CysMet: 0.38 ± 0.16
0.761CysAsn: 0.761 ± 0.235
0.456CysPro: 0.456 ± 0.18
0.456CysGln: 0.456 ± 0.192
0.761CysArg: 0.761 ± 0.248
0.989CysSer: 0.989 ± 0.27
0.152CysThr: 0.152 ± 0.107
0.837CysVal: 0.837 ± 0.253
0.304CysTrp: 0.304 ± 0.16
0.304CysTyr: 0.304 ± 0.124
0.0CysXaa: 0.0 ± 0.0
Asp
6.16AspAla: 6.16 ± 0.613
0.913AspCys: 0.913 ± 0.291
5.019AspAsp: 5.019 ± 0.614
3.27AspGlu: 3.27 ± 0.467
2.205AspPhe: 2.205 ± 0.412
4.791AspGly: 4.791 ± 0.533
1.217AspHis: 1.217 ± 0.267
4.183AspIle: 4.183 ± 0.558
3.498AspLys: 3.498 ± 0.491
4.107AspLeu: 4.107 ± 0.55
1.369AspMet: 1.369 ± 0.334
3.27AspAsn: 3.27 ± 0.669
1.825AspPro: 1.825 ± 0.342
1.749AspGln: 1.749 ± 0.339
3.346AspArg: 3.346 ± 0.36
3.879AspSer: 3.879 ± 0.683
2.282AspThr: 2.282 ± 0.484
4.639AspVal: 4.639 ± 0.641
1.217AspTrp: 1.217 ± 0.269
2.662AspTyr: 2.662 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
7.453GluAla: 7.453 ± 1.282
0.684GluCys: 0.684 ± 0.193
2.814GluAsp: 2.814 ± 0.462
4.867GluGlu: 4.867 ± 0.916
1.521GluPhe: 1.521 ± 0.377
3.574GluGly: 3.574 ± 0.464
1.521GluHis: 1.521 ± 0.27
4.259GluIle: 4.259 ± 0.678
4.563GluLys: 4.563 ± 0.555
6.769GluLeu: 6.769 ± 0.601
2.129GluMet: 2.129 ± 0.373
3.27GluAsn: 3.27 ± 0.494
1.749GluPro: 1.749 ± 0.365
3.422GluGln: 3.422 ± 0.582
4.411GluArg: 4.411 ± 0.789
4.943GluSer: 4.943 ± 0.825
2.89GluThr: 2.89 ± 0.541
3.346GluVal: 3.346 ± 0.477
1.217GluTrp: 1.217 ± 0.369
2.282GluTyr: 2.282 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
3.118PheAla: 3.118 ± 0.453
0.38PheCys: 0.38 ± 0.17
2.282PheAsp: 2.282 ± 0.413
1.901PheGlu: 1.901 ± 0.384
0.532PhePhe: 0.532 ± 0.191
2.053PheGly: 2.053 ± 0.362
0.304PheHis: 0.304 ± 0.158
1.901PheIle: 1.901 ± 0.491
2.053PheLys: 2.053 ± 0.381
2.358PheLeu: 2.358 ± 0.402
0.684PheMet: 0.684 ± 0.241
2.129PheAsn: 2.129 ± 0.352
0.837PhePro: 0.837 ± 0.222
0.761PheGln: 0.761 ± 0.355
2.129PheArg: 2.129 ± 0.378
2.434PheSer: 2.434 ± 0.426
1.749PheThr: 1.749 ± 0.296
1.901PheVal: 1.901 ± 0.421
0.608PheTrp: 0.608 ± 0.21
1.293PheTyr: 1.293 ± 0.383
0.0PheXaa: 0.0 ± 0.0
Gly
5.704GlyAla: 5.704 ± 0.709
0.684GlyCys: 0.684 ± 0.211
4.487GlyAsp: 4.487 ± 0.73
5.171GlyGlu: 5.171 ± 0.561
2.586GlyPhe: 2.586 ± 0.399
4.487GlyGly: 4.487 ± 0.67
1.141GlyHis: 1.141 ± 0.284
4.411GlyIle: 4.411 ± 0.626
5.095GlyLys: 5.095 ± 0.688
4.411GlyLeu: 4.411 ± 0.561
2.282GlyMet: 2.282 ± 0.462
3.27GlyAsn: 3.27 ± 0.425
1.217GlyPro: 1.217 ± 0.233
3.498GlyGln: 3.498 ± 0.669
4.487GlyArg: 4.487 ± 0.617
5.248GlySer: 5.248 ± 0.633
3.65GlyThr: 3.65 ± 0.514
4.791GlyVal: 4.791 ± 0.661
1.217GlyTrp: 1.217 ± 0.24
2.51GlyTyr: 2.51 ± 0.414
0.0GlyXaa: 0.0 ± 0.0
His
1.825HisAla: 1.825 ± 0.443
0.456HisCys: 0.456 ± 0.169
0.913HisAsp: 0.913 ± 0.199
1.369HisGlu: 1.369 ± 0.279
0.837HisPhe: 0.837 ± 0.238
2.129HisGly: 2.129 ± 0.377
0.228HisHis: 0.228 ± 0.145
0.837HisIle: 0.837 ± 0.256
1.217HisLys: 1.217 ± 0.401
1.141HisLeu: 1.141 ± 0.28
0.456HisMet: 0.456 ± 0.157
0.38HisAsn: 0.38 ± 0.152
0.684HisPro: 0.684 ± 0.17
0.684HisGln: 0.684 ± 0.246
0.989HisArg: 0.989 ± 0.221
0.989HisSer: 0.989 ± 0.322
0.456HisThr: 0.456 ± 0.178
0.837HisVal: 0.837 ± 0.239
0.304HisTrp: 0.304 ± 0.146
0.456HisTyr: 0.456 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
5.248IleAla: 5.248 ± 0.546
0.761IleCys: 0.761 ± 0.25
4.031IleAsp: 4.031 ± 0.477
4.639IleGlu: 4.639 ± 0.587
1.749IlePhe: 1.749 ± 0.372
4.335IleGly: 4.335 ± 0.608
1.597IleHis: 1.597 ± 0.369
3.574IleIle: 3.574 ± 0.699
2.966IleLys: 2.966 ± 0.474
3.955IleLeu: 3.955 ± 0.454
1.065IleMet: 1.065 ± 0.253
2.738IleAsn: 2.738 ± 0.492
3.194IlePro: 3.194 ± 0.532
2.205IleGln: 2.205 ± 0.469
3.574IleArg: 3.574 ± 0.596
4.031IleSer: 4.031 ± 0.51
4.563IleThr: 4.563 ± 0.583
3.194IleVal: 3.194 ± 0.453
0.913IleTrp: 0.913 ± 0.269
1.597IleTyr: 1.597 ± 0.343
0.0IleXaa: 0.0 ± 0.0
Lys
6.312LysAla: 6.312 ± 1.059
0.761LysCys: 0.761 ± 0.269
3.498LysAsp: 3.498 ± 0.507
4.791LysGlu: 4.791 ± 0.738
1.673LysPhe: 1.673 ± 0.315
4.867LysGly: 4.867 ± 0.747
0.913LysHis: 0.913 ± 0.228
3.346LysIle: 3.346 ± 0.449
4.107LysLys: 4.107 ± 0.717
6.16LysLeu: 6.16 ± 0.778
2.129LysMet: 2.129 ± 0.398
2.738LysAsn: 2.738 ± 0.417
3.422LysPro: 3.422 ± 0.578
2.966LysGln: 2.966 ± 0.558
3.803LysArg: 3.803 ± 0.534
2.662LysSer: 2.662 ± 0.511
3.498LysThr: 3.498 ± 0.736
3.27LysVal: 3.27 ± 0.433
0.608LysTrp: 0.608 ± 0.232
2.434LysTyr: 2.434 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
7.909LeuAla: 7.909 ± 0.813
0.837LeuCys: 0.837 ± 0.228
4.639LeuAsp: 4.639 ± 0.574
5.628LeuGlu: 5.628 ± 0.667
2.434LeuPhe: 2.434 ± 0.417
4.487LeuGly: 4.487 ± 0.639
1.293LeuHis: 1.293 ± 0.359
4.487LeuIle: 4.487 ± 0.384
5.856LeuLys: 5.856 ± 0.845
7.377LeuLeu: 7.377 ± 0.99
2.662LeuMet: 2.662 ± 0.382
2.814LeuAsn: 2.814 ± 0.44
3.422LeuPro: 3.422 ± 0.502
2.89LeuGln: 2.89 ± 0.448
4.867LeuArg: 4.867 ± 0.602
5.324LeuSer: 5.324 ± 0.715
4.487LeuThr: 4.487 ± 0.654
3.955LeuVal: 3.955 ± 0.561
1.217LeuTrp: 1.217 ± 0.24
2.966LeuTyr: 2.966 ± 0.472
0.0LeuXaa: 0.0 ± 0.0
Met
4.107MetAla: 4.107 ± 0.569
0.38MetCys: 0.38 ± 0.15
1.445MetAsp: 1.445 ± 0.57
1.141MetGlu: 1.141 ± 0.253
0.837MetPhe: 0.837 ± 0.273
0.989MetGly: 0.989 ± 0.271
0.38MetHis: 0.38 ± 0.181
1.749MetIle: 1.749 ± 0.328
1.597MetLys: 1.597 ± 0.339
2.205MetLeu: 2.205 ± 0.342
0.532MetMet: 0.532 ± 0.191
1.217MetAsn: 1.217 ± 0.381
1.749MetPro: 1.749 ± 0.366
1.445MetGln: 1.445 ± 0.314
1.597MetArg: 1.597 ± 0.367
2.434MetSer: 2.434 ± 0.377
2.51MetThr: 2.51 ± 0.469
1.901MetVal: 1.901 ± 0.363
0.38MetTrp: 0.38 ± 0.184
0.684MetTyr: 0.684 ± 0.248
0.0MetXaa: 0.0 ± 0.0
Asn
4.487AsnAla: 4.487 ± 0.497
0.761AsnCys: 0.761 ± 0.218
2.205AsnAsp: 2.205 ± 0.457
2.586AsnGlu: 2.586 ± 0.451
0.913AsnPhe: 0.913 ± 0.213
4.259AsnGly: 4.259 ± 0.45
1.141AsnHis: 1.141 ± 0.312
2.89AsnIle: 2.89 ± 0.536
2.434AsnLys: 2.434 ± 0.394
2.662AsnLeu: 2.662 ± 0.373
0.989AsnMet: 0.989 ± 0.317
2.282AsnAsn: 2.282 ± 0.468
2.129AsnPro: 2.129 ± 0.536
2.129AsnGln: 2.129 ± 0.415
2.434AsnArg: 2.434 ± 0.538
2.358AsnSer: 2.358 ± 0.446
2.129AsnThr: 2.129 ± 0.356
2.434AsnVal: 2.434 ± 0.475
0.608AsnTrp: 0.608 ± 0.21
1.977AsnTyr: 1.977 ± 0.375
0.0AsnXaa: 0.0 ± 0.0
Pro
3.879ProAla: 3.879 ± 0.605
0.304ProCys: 0.304 ± 0.143
2.662ProAsp: 2.662 ± 0.406
3.574ProGlu: 3.574 ± 0.603
1.369ProPhe: 1.369 ± 0.328
2.282ProGly: 2.282 ± 0.411
0.38ProHis: 0.38 ± 0.184
1.369ProIle: 1.369 ± 0.316
2.662ProLys: 2.662 ± 0.425
3.042ProLeu: 3.042 ± 0.466
1.521ProMet: 1.521 ± 0.315
0.989ProAsn: 0.989 ± 0.247
1.445ProPro: 1.445 ± 0.33
1.445ProGln: 1.445 ± 0.339
1.369ProArg: 1.369 ± 0.291
3.042ProSer: 3.042 ± 0.414
1.901ProThr: 1.901 ± 0.337
3.498ProVal: 3.498 ± 0.496
0.532ProTrp: 0.532 ± 0.231
0.913ProTyr: 0.913 ± 0.233
0.0ProXaa: 0.0 ± 0.0
Gln
4.487GlnAla: 4.487 ± 0.707
0.38GlnCys: 0.38 ± 0.183
1.825GlnAsp: 1.825 ± 0.48
2.586GlnGlu: 2.586 ± 0.545
1.369GlnPhe: 1.369 ± 0.314
3.346GlnGly: 3.346 ± 0.587
0.456GlnHis: 0.456 ± 0.178
3.042GlnIle: 3.042 ± 0.431
3.042GlnLys: 3.042 ± 0.535
3.346GlnLeu: 3.346 ± 0.453
1.369GlnMet: 1.369 ± 0.294
1.977GlnAsn: 1.977 ± 0.377
2.053GlnPro: 2.053 ± 0.362
4.867GlnGln: 4.867 ± 0.968
3.194GlnArg: 3.194 ± 0.567
2.662GlnSer: 2.662 ± 0.531
1.901GlnThr: 1.901 ± 0.386
2.434GlnVal: 2.434 ± 0.448
0.913GlnTrp: 0.913 ± 0.286
1.977GlnTyr: 1.977 ± 0.365
0.0GlnXaa: 0.0 ± 0.0
Arg
4.943ArgAla: 4.943 ± 0.655
0.761ArgCys: 0.761 ± 0.254
3.422ArgAsp: 3.422 ± 0.52
4.715ArgGlu: 4.715 ± 0.686
2.129ArgPhe: 2.129 ± 0.384
3.727ArgGly: 3.727 ± 0.483
1.369ArgHis: 1.369 ± 0.297
3.879ArgIle: 3.879 ± 0.607
4.715ArgLys: 4.715 ± 0.739
5.4ArgLeu: 5.4 ± 0.653
2.053ArgMet: 2.053 ± 0.392
2.586ArgAsn: 2.586 ± 0.329
1.673ArgPro: 1.673 ± 0.383
2.738ArgGln: 2.738 ± 0.512
4.487ArgArg: 4.487 ± 0.72
3.042ArgSer: 3.042 ± 0.465
2.738ArgThr: 2.738 ± 0.384
3.118ArgVal: 3.118 ± 0.5
0.684ArgTrp: 0.684 ± 0.229
1.977ArgTyr: 1.977 ± 0.358
0.0ArgXaa: 0.0 ± 0.0
Ser
4.107SerAla: 4.107 ± 0.558
0.532SerCys: 0.532 ± 0.231
4.639SerAsp: 4.639 ± 0.49
3.955SerGlu: 3.955 ± 0.591
2.814SerPhe: 2.814 ± 0.481
5.78SerGly: 5.78 ± 0.678
0.761SerHis: 0.761 ± 0.231
3.042SerIle: 3.042 ± 0.489
3.194SerLys: 3.194 ± 0.564
6.008SerLeu: 6.008 ± 0.768
2.053SerMet: 2.053 ± 0.494
3.042SerAsn: 3.042 ± 0.415
2.966SerPro: 2.966 ± 0.549
3.194SerGln: 3.194 ± 0.628
4.107SerArg: 4.107 ± 0.702
3.498SerSer: 3.498 ± 0.509
2.814SerThr: 2.814 ± 0.5
3.27SerVal: 3.27 ± 0.604
0.837SerTrp: 0.837 ± 0.226
1.597SerTyr: 1.597 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
4.867ThrAla: 4.867 ± 0.644
0.684ThrCys: 0.684 ± 0.186
4.259ThrAsp: 4.259 ± 0.516
2.814ThrGlu: 2.814 ± 0.44
1.597ThrPhe: 1.597 ± 0.421
4.487ThrGly: 4.487 ± 0.551
0.684ThrHis: 0.684 ± 0.256
3.194ThrIle: 3.194 ± 0.562
3.498ThrLys: 3.498 ± 0.574
2.966ThrLeu: 2.966 ± 0.564
1.369ThrMet: 1.369 ± 0.323
1.293ThrAsn: 1.293 ± 0.278
3.498ThrPro: 3.498 ± 0.535
3.498ThrGln: 3.498 ± 0.672
1.749ThrArg: 1.749 ± 0.297
2.662ThrSer: 2.662 ± 0.526
2.738ThrThr: 2.738 ± 0.562
4.563ThrVal: 4.563 ± 0.561
0.684ThrTrp: 0.684 ± 0.199
1.065ThrTyr: 1.065 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
4.335ValAla: 4.335 ± 0.622
0.761ValCys: 0.761 ± 0.206
3.498ValAsp: 3.498 ± 0.63
4.183ValGlu: 4.183 ± 0.607
1.597ValPhe: 1.597 ± 0.326
4.259ValGly: 4.259 ± 0.52
1.141ValHis: 1.141 ± 0.286
4.411ValIle: 4.411 ± 0.519
3.118ValLys: 3.118 ± 0.483
4.639ValLeu: 4.639 ± 0.552
1.521ValMet: 1.521 ± 0.299
3.042ValAsn: 3.042 ± 0.616
1.977ValPro: 1.977 ± 0.401
1.901ValGln: 1.901 ± 0.298
3.194ValArg: 3.194 ± 0.471
4.411ValSer: 4.411 ± 0.461
4.031ValThr: 4.031 ± 0.629
3.803ValVal: 3.803 ± 0.591
0.684ValTrp: 0.684 ± 0.218
2.282ValTyr: 2.282 ± 0.511
0.0ValXaa: 0.0 ± 0.0
Trp
1.749TrpAla: 1.749 ± 0.321
0.304TrpCys: 0.304 ± 0.136
0.532TrpAsp: 0.532 ± 0.217
0.456TrpGlu: 0.456 ± 0.21
0.38TrpPhe: 0.38 ± 0.169
0.761TrpGly: 0.761 ± 0.225
0.228TrpHis: 0.228 ± 0.114
0.38TrpIle: 0.38 ± 0.259
0.761TrpLys: 0.761 ± 0.209
2.129TrpLeu: 2.129 ± 0.315
0.913TrpMet: 0.913 ± 0.281
0.608TrpAsn: 0.608 ± 0.232
0.608TrpPro: 0.608 ± 0.223
0.608TrpGln: 0.608 ± 0.218
1.445TrpArg: 1.445 ± 0.299
1.445TrpSer: 1.445 ± 0.318
0.761TrpThr: 0.761 ± 0.22
1.065TrpVal: 1.065 ± 0.272
0.304TrpTrp: 0.304 ± 0.163
0.456TrpTyr: 0.456 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.498TyrAla: 3.498 ± 0.684
0.684TyrCys: 0.684 ± 0.185
2.662TyrAsp: 2.662 ± 0.486
1.521TyrGlu: 1.521 ± 0.304
0.989TyrPhe: 0.989 ± 0.299
1.977TyrGly: 1.977 ± 0.397
0.684TyrHis: 0.684 ± 0.223
2.053TyrIle: 2.053 ± 0.426
1.901TyrLys: 1.901 ± 0.36
2.358TyrLeu: 2.358 ± 0.381
0.532TyrMet: 0.532 ± 0.179
1.521TyrAsn: 1.521 ± 0.337
1.597TyrPro: 1.597 ± 0.445
2.205TyrGln: 2.205 ± 0.572
2.51TyrArg: 2.51 ± 0.499
1.825TyrSer: 1.825 ± 0.386
1.217TyrThr: 1.217 ± 0.281
1.825TyrVal: 1.825 ± 0.395
0.532TyrTrp: 0.532 ± 0.169
1.293TyrTyr: 1.293 ± 0.275
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13150 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
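One way such a protein-level bootstrap could look is sketched below. It is an assumption-laden illustration rather than the procedure actually used by Proteome-pI: the function name `bootstrap_error` is hypothetical, whole proteins are resampled with replacement, 100 rounds are drawn, and the reported error is taken to be the sample standard deviation of the resampled frequencies.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Hypothetical helper: resamples whole proteins with replacement
    n_rounds times and returns the standard deviation of the frequency.
    """
    rng = random.Random(seed)
    # Pre-compute overlapping pair counts once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKALLA", "MKTAAAL", "MAAK", "MLLKA"]
    print(round(bootstrap_error(toy_proteome, "AA"), 3))
```

Resampling proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins.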

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski