Amino acid dipepetide frequency for Helicobacter phage FrANT170U

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.303AlaAla: 1.303 ± 0.506
0.869AlaCys: 0.869 ± 0.433
1.846AlaAsp: 1.846 ± 0.409
3.583AlaGlu: 3.583 ± 0.673
4.017AlaPhe: 4.017 ± 0.895
3.257AlaGly: 3.257 ± 0.899
0.651AlaHis: 0.651 ± 0.275
5.537AlaIle: 5.537 ± 1.174
8.577AlaLys: 8.577 ± 1.053
12.159AlaLeu: 12.159 ± 1.281
1.628AlaMet: 1.628 ± 0.492
7.274AlaAsn: 7.274 ± 1.123
1.086AlaPro: 1.086 ± 0.294
3.148AlaGln: 3.148 ± 0.519
2.823AlaArg: 2.823 ± 0.637
3.691AlaSer: 3.691 ± 0.667
2.714AlaThr: 2.714 ± 0.538
2.063AlaVal: 2.063 ± 0.515
0.217AlaTrp: 0.217 ± 0.147
2.388AlaTyr: 2.388 ± 0.566
0.0AlaXaa: 0.0 ± 0.0
Cys
0.434CysAla: 0.434 ± 0.219
0.109CysCys: 0.109 ± 0.108
0.76CysAsp: 0.76 ± 0.409
0.869CysGlu: 0.869 ± 0.381
0.869CysPhe: 0.869 ± 0.36
0.434CysGly: 0.434 ± 0.29
0.0CysHis: 0.0 ± 0.0
0.543CysIle: 0.543 ± 0.275
0.109CysLys: 0.109 ± 0.136
1.194CysLeu: 1.194 ± 0.506
0.109CysMet: 0.109 ± 0.125
0.217CysAsn: 0.217 ± 0.158
0.434CysPro: 0.434 ± 0.268
0.326CysGln: 0.326 ± 0.222
0.109CysArg: 0.109 ± 0.09
0.109CysSer: 0.109 ± 0.136
0.434CysThr: 0.434 ± 0.201
0.543CysVal: 0.543 ± 0.292
0.0CysTrp: 0.0 ± 0.0
0.109CysTyr: 0.109 ± 0.09
0.0CysXaa: 0.0 ± 0.0
Asp
2.063AspAla: 2.063 ± 0.478
0.326AspCys: 0.326 ± 0.251
1.411AspAsp: 1.411 ± 0.324
3.474AspGlu: 3.474 ± 0.494
3.366AspPhe: 3.366 ± 0.747
1.303AspGly: 1.303 ± 0.369
0.76AspHis: 0.76 ± 0.243
3.148AspIle: 3.148 ± 0.671
5.754AspLys: 5.754 ± 1.116
7.817AspLeu: 7.817 ± 1.029
1.086AspMet: 1.086 ± 0.493
4.234AspAsn: 4.234 ± 0.726
1.52AspPro: 1.52 ± 0.427
1.194AspGln: 1.194 ± 0.398
2.063AspArg: 2.063 ± 0.553
2.714AspSer: 2.714 ± 0.705
1.628AspThr: 1.628 ± 0.449
1.086AspVal: 1.086 ± 0.409
0.217AspTrp: 0.217 ± 0.157
3.474AspTyr: 3.474 ± 0.833
0.0AspXaa: 0.0 ± 0.0
Glu
6.623GluAla: 6.623 ± 1.268
0.434GluCys: 0.434 ± 0.248
2.388GluAsp: 2.388 ± 0.39
4.234GluGlu: 4.234 ± 0.878
3.8GluPhe: 3.8 ± 0.634
1.411GluGly: 1.411 ± 0.335
1.194GluHis: 1.194 ± 0.375
7.274GluIle: 7.274 ± 1.128
8.794GluLys: 8.794 ± 1.282
8.034GluLeu: 8.034 ± 0.842
2.063GluMet: 2.063 ± 0.5
6.297GluAsn: 6.297 ± 0.68
1.52GluPro: 1.52 ± 0.361
4.451GluGln: 4.451 ± 1.116
4.668GluArg: 4.668 ± 0.924
6.514GluSer: 6.514 ± 0.901
5.211GluThr: 5.211 ± 0.487
3.583GluVal: 3.583 ± 0.862
0.651GluTrp: 0.651 ± 0.254
2.606GluTyr: 2.606 ± 0.456
0.0GluXaa: 0.0 ± 0.0
Phe
1.52PheAla: 1.52 ± 0.42
0.543PheCys: 0.543 ± 0.249
2.497PheAsp: 2.497 ± 0.645
4.017PheGlu: 4.017 ± 0.662
3.691PhePhe: 3.691 ± 0.568
1.52PheGly: 1.52 ± 0.254
1.194PheHis: 1.194 ± 0.346
2.931PheIle: 2.931 ± 0.53
6.514PheLys: 6.514 ± 1.023
6.731PheLeu: 6.731 ± 1.113
0.434PheMet: 0.434 ± 0.259
3.366PheAsn: 3.366 ± 0.77
0.434PhePro: 0.434 ± 0.179
0.977PheGln: 0.977 ± 0.339
2.171PheArg: 2.171 ± 0.536
4.885PheSer: 4.885 ± 0.559
2.28PheThr: 2.28 ± 0.586
2.063PheVal: 2.063 ± 0.424
0.109PheTrp: 0.109 ± 0.114
2.063PheTyr: 2.063 ± 0.572
0.0PheXaa: 0.0 ± 0.0
Gly
3.04GlyAla: 3.04 ± 0.853
0.434GlyCys: 0.434 ± 0.229
1.628GlyAsp: 1.628 ± 0.461
2.28GlyGlu: 2.28 ± 0.43
3.148GlyPhe: 3.148 ± 0.662
3.148GlyGly: 3.148 ± 0.759
0.326GlyHis: 0.326 ± 0.254
2.063GlyIle: 2.063 ± 0.507
1.846GlyLys: 1.846 ± 0.4
6.08GlyLeu: 6.08 ± 0.76
1.194GlyMet: 1.194 ± 0.361
3.04GlyAsn: 3.04 ± 0.628
0.109GlyPro: 0.109 ± 0.119
0.977GlyGln: 0.977 ± 0.346
1.52GlyArg: 1.52 ± 0.408
3.474GlySer: 3.474 ± 0.642
1.194GlyThr: 1.194 ± 0.37
4.343GlyVal: 4.343 ± 1.206
0.0GlyTrp: 0.0 ± 0.0
2.063GlyTyr: 2.063 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
1.086HisAla: 1.086 ± 0.324
0.326HisCys: 0.326 ± 0.309
0.76HisAsp: 0.76 ± 0.403
1.628HisGlu: 1.628 ± 0.381
0.869HisPhe: 0.869 ± 0.34
0.217HisGly: 0.217 ± 0.165
0.0HisHis: 0.0 ± 0.0
1.194HisIle: 1.194 ± 0.428
1.846HisLys: 1.846 ± 0.484
1.303HisLeu: 1.303 ± 0.338
0.217HisMet: 0.217 ± 0.141
1.086HisAsn: 1.086 ± 0.377
0.217HisPro: 0.217 ± 0.141
0.651HisGln: 0.651 ± 0.338
0.651HisArg: 0.651 ± 0.294
1.086HisSer: 1.086 ± 0.328
0.869HisThr: 0.869 ± 0.283
0.217HisVal: 0.217 ± 0.178
0.0HisTrp: 0.0 ± 0.0
0.869HisTyr: 0.869 ± 0.263
0.0HisXaa: 0.0 ± 0.0
Ile
5.537IleAla: 5.537 ± 1.097
0.434IleCys: 0.434 ± 0.282
4.451IleAsp: 4.451 ± 0.914
5.103IleGlu: 5.103 ± 0.845
1.954IlePhe: 1.954 ± 0.513
1.737IleGly: 1.737 ± 0.625
0.869IleHis: 0.869 ± 0.346
4.451IleIle: 4.451 ± 0.729
8.034IleLys: 8.034 ± 1.149
6.948IleLeu: 6.948 ± 0.693
0.869IleMet: 0.869 ± 0.266
4.451IleAsn: 4.451 ± 0.991
1.303IlePro: 1.303 ± 0.361
3.691IleGln: 3.691 ± 0.726
2.931IleArg: 2.931 ± 0.624
4.126IleSer: 4.126 ± 0.698
3.366IleThr: 3.366 ± 0.495
3.148IleVal: 3.148 ± 0.79
0.217IleTrp: 0.217 ± 0.152
2.606IleTyr: 2.606 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
9.445LysAla: 9.445 ± 1.15
0.326LysCys: 0.326 ± 0.191
6.948LysAsp: 6.948 ± 1.027
12.159LysGlu: 12.159 ± 1.583
3.366LysPhe: 3.366 ± 0.602
3.8LysGly: 3.8 ± 0.729
2.606LysHis: 2.606 ± 0.754
7.925LysIle: 7.925 ± 1.09
8.794LysLys: 8.794 ± 1.663
9.011LysLeu: 9.011 ± 1.197
1.086LysMet: 1.086 ± 0.381
8.685LysAsn: 8.685 ± 1.142
3.691LysPro: 3.691 ± 0.673
5.103LysGln: 5.103 ± 0.778
5.32LysArg: 5.32 ± 1.03
5.754LysSer: 5.754 ± 0.859
6.514LysThr: 6.514 ± 1.114
4.017LysVal: 4.017 ± 0.697
0.76LysTrp: 0.76 ± 0.208
2.606LysTyr: 2.606 ± 0.558
0.0LysXaa: 0.0 ± 0.0
Leu
7.274LeuAla: 7.274 ± 1.154
1.628LeuCys: 1.628 ± 0.725
4.017LeuAsp: 4.017 ± 0.551
12.594LeuGlu: 12.594 ± 1.366
3.908LeuPhe: 3.908 ± 0.842
6.08LeuGly: 6.08 ± 0.713
0.543LeuHis: 0.543 ± 0.305
5.32LeuIle: 5.32 ± 0.743
18.239LeuLys: 18.239 ± 1.579
6.405LeuLeu: 6.405 ± 0.959
2.388LeuMet: 2.388 ± 0.528
11.617LeuAsn: 11.617 ± 1.323
1.737LeuPro: 1.737 ± 0.411
4.451LeuGln: 4.451 ± 0.944
3.691LeuArg: 3.691 ± 0.65
5.428LeuSer: 5.428 ± 0.991
5.103LeuThr: 5.103 ± 0.772
3.8LeuVal: 3.8 ± 0.734
0.76LeuTrp: 0.76 ± 0.322
2.063LeuTyr: 2.063 ± 0.436
0.0LeuXaa: 0.0 ± 0.0
Met
0.651MetAla: 0.651 ± 0.221
0.109MetCys: 0.109 ± 0.111
1.411MetAsp: 1.411 ± 0.403
0.977MetGlu: 0.977 ± 0.313
1.52MetPhe: 1.52 ± 0.509
1.086MetGly: 1.086 ± 0.458
0.434MetHis: 0.434 ± 0.3
0.869MetIle: 0.869 ± 0.267
1.737MetLys: 1.737 ± 0.451
1.954MetLeu: 1.954 ± 0.406
0.109MetMet: 0.109 ± 0.118
1.954MetAsn: 1.954 ± 0.486
1.411MetPro: 1.411 ± 0.342
2.063MetGln: 2.063 ± 0.422
0.977MetArg: 0.977 ± 0.342
1.303MetSer: 1.303 ± 0.367
0.326MetThr: 0.326 ± 0.198
0.651MetVal: 0.651 ± 0.239
0.326MetTrp: 0.326 ± 0.197
0.326MetTyr: 0.326 ± 0.184
0.0MetXaa: 0.0 ± 0.0
Asn
8.902AsnAla: 8.902 ± 1.527
0.217AsnCys: 0.217 ± 0.189
3.366AsnAsp: 3.366 ± 0.531
7.382AsnGlu: 7.382 ± 1.333
3.583AsnPhe: 3.583 ± 0.719
2.388AsnGly: 2.388 ± 0.376
1.628AsnHis: 1.628 ± 0.427
3.474AsnIle: 3.474 ± 0.704
7.491AsnLys: 7.491 ± 0.785
7.382AsnLeu: 7.382 ± 0.954
1.737AsnMet: 1.737 ± 0.366
6.405AsnAsn: 6.405 ± 1.131
2.497AsnPro: 2.497 ± 0.604
5.754AsnGln: 5.754 ± 1.17
2.823AsnArg: 2.823 ± 0.586
4.126AsnSer: 4.126 ± 0.588
3.583AsnThr: 3.583 ± 0.656
2.28AsnVal: 2.28 ± 0.652
0.217AsnTrp: 0.217 ± 0.14
3.908AsnTyr: 3.908 ± 0.664
0.0AsnXaa: 0.0 ± 0.0
Pro
0.434ProAla: 0.434 ± 0.205
0.0ProCys: 0.0 ± 0.0
0.76ProAsp: 0.76 ± 0.269
1.086ProGlu: 1.086 ± 0.335
1.628ProPhe: 1.628 ± 0.431
0.326ProGly: 0.326 ± 0.137
0.0ProHis: 0.0 ± 0.0
2.388ProIle: 2.388 ± 0.495
3.257ProLys: 3.257 ± 0.688
2.714ProLeu: 2.714 ± 0.579
0.977ProMet: 0.977 ± 0.421
1.954ProAsn: 1.954 ± 0.409
0.217ProPro: 0.217 ± 0.132
0.977ProGln: 0.977 ± 0.309
0.651ProArg: 0.651 ± 0.206
3.04ProSer: 3.04 ± 0.406
2.063ProThr: 2.063 ± 0.466
0.326ProVal: 0.326 ± 0.176
0.109ProTrp: 0.109 ± 0.107
1.086ProTyr: 1.086 ± 0.406
0.0ProXaa: 0.0 ± 0.0
Gln
5.103GlnAla: 5.103 ± 0.952
0.326GlnCys: 0.326 ± 0.219
2.063GlnAsp: 2.063 ± 0.402
4.885GlnGlu: 4.885 ± 0.846
1.411GlnPhe: 1.411 ± 0.316
2.063GlnGly: 2.063 ± 0.471
0.651GlnHis: 0.651 ± 0.276
2.714GlnIle: 2.714 ± 0.519
5.754GlnLys: 5.754 ± 0.934
3.04GlnLeu: 3.04 ± 0.584
0.869GlnMet: 0.869 ± 0.354
4.451GlnAsn: 4.451 ± 0.873
0.76GlnPro: 0.76 ± 0.296
2.714GlnGln: 2.714 ± 0.584
1.411GlnArg: 1.411 ± 0.449
3.691GlnSer: 3.691 ± 0.71
2.388GlnThr: 2.388 ± 0.47
2.171GlnVal: 2.171 ± 0.43
0.543GlnTrp: 0.543 ± 0.177
0.869GlnTyr: 0.869 ± 0.277
0.0GlnXaa: 0.0 ± 0.0
Arg
3.366ArgAla: 3.366 ± 0.641
0.109ArgCys: 0.109 ± 0.09
2.171ArgAsp: 2.171 ± 0.423
3.366ArgGlu: 3.366 ± 0.591
2.28ArgPhe: 2.28 ± 0.547
1.303ArgGly: 1.303 ± 0.289
0.543ArgHis: 0.543 ± 0.191
3.04ArgIle: 3.04 ± 0.695
3.148ArgLys: 3.148 ± 0.726
5.211ArgLeu: 5.211 ± 0.728
0.76ArgMet: 0.76 ± 0.339
1.411ArgAsn: 1.411 ± 0.433
1.303ArgPro: 1.303 ± 0.433
1.52ArgGln: 1.52 ± 0.432
1.086ArgArg: 1.086 ± 0.318
3.04ArgSer: 3.04 ± 0.85
1.628ArgThr: 1.628 ± 0.424
1.954ArgVal: 1.954 ± 0.427
0.109ArgTrp: 0.109 ± 0.085
1.954ArgTyr: 1.954 ± 0.538
0.0ArgXaa: 0.0 ± 0.0
Ser
4.451SerAla: 4.451 ± 0.666
0.434SerCys: 0.434 ± 0.258
4.994SerAsp: 4.994 ± 0.478
5.645SerGlu: 5.645 ± 0.889
3.691SerPhe: 3.691 ± 0.641
4.017SerGly: 4.017 ± 0.816
1.086SerHis: 1.086 ± 0.29
3.148SerIle: 3.148 ± 0.464
5.754SerLys: 5.754 ± 0.584
8.034SerLeu: 8.034 ± 1.19
1.628SerMet: 1.628 ± 0.414
3.908SerAsn: 3.908 ± 0.734
1.303SerPro: 1.303 ± 0.277
2.823SerGln: 2.823 ± 0.519
1.846SerArg: 1.846 ± 0.404
2.931SerSer: 2.931 ± 0.507
2.063SerThr: 2.063 ± 0.489
4.777SerVal: 4.777 ± 0.968
0.326SerTrp: 0.326 ± 0.191
2.823SerTyr: 2.823 ± 0.501
0.0SerXaa: 0.0 ± 0.0
Thr
2.388ThrAla: 2.388 ± 0.726
0.326ThrCys: 0.326 ± 0.189
2.497ThrAsp: 2.497 ± 0.531
2.931ThrGlu: 2.931 ± 0.593
1.628ThrPhe: 1.628 ± 0.603
2.714ThrGly: 2.714 ± 0.754
1.086ThrHis: 1.086 ± 0.287
3.04ThrIle: 3.04 ± 0.607
4.126ThrLys: 4.126 ± 0.865
5.103ThrLeu: 5.103 ± 0.87
0.76ThrMet: 0.76 ± 0.272
3.583ThrAsn: 3.583 ± 0.429
2.497ThrPro: 2.497 ± 0.524
3.366ThrGln: 3.366 ± 0.651
1.52ThrArg: 1.52 ± 0.462
2.823ThrSer: 2.823 ± 0.467
2.714ThrThr: 2.714 ± 0.405
1.086ThrVal: 1.086 ± 0.474
0.434ThrTrp: 0.434 ± 0.26
1.52ThrTyr: 1.52 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
3.04ValAla: 3.04 ± 0.709
0.326ValCys: 0.326 ± 0.204
2.714ValAsp: 2.714 ± 0.615
2.28ValGlu: 2.28 ± 0.491
3.148ValPhe: 3.148 ± 0.682
3.691ValGly: 3.691 ± 0.784
0.217ValHis: 0.217 ± 0.2
3.474ValIle: 3.474 ± 0.734
4.126ValLys: 4.126 ± 0.676
4.126ValLeu: 4.126 ± 0.768
1.303ValMet: 1.303 ± 0.433
2.171ValAsn: 2.171 ± 0.712
0.651ValPro: 0.651 ± 0.275
0.869ValGln: 0.869 ± 0.316
1.411ValArg: 1.411 ± 0.312
4.126ValSer: 4.126 ± 1.218
0.651ValThr: 0.651 ± 0.252
2.171ValVal: 2.171 ± 0.505
0.326ValTrp: 0.326 ± 0.193
0.76ValTyr: 0.76 ± 0.342
0.0ValXaa: 0.0 ± 0.0
Trp
0.109TrpAla: 0.109 ± 0.103
0.109TrpCys: 0.109 ± 0.126
0.109TrpAsp: 0.109 ± 0.085
0.543TrpGlu: 0.543 ± 0.271
0.0TrpPhe: 0.0 ± 0.0
0.543TrpGly: 0.543 ± 0.274
0.109TrpHis: 0.109 ± 0.09
0.326TrpIle: 0.326 ± 0.206
0.326TrpLys: 0.326 ± 0.176
0.217TrpLeu: 0.217 ± 0.17
0.217TrpMet: 0.217 ± 0.181
0.434TrpAsn: 0.434 ± 0.298
0.0TrpPro: 0.0 ± 0.0
0.326TrpGln: 0.326 ± 0.159
0.217TrpArg: 0.217 ± 0.14
0.434TrpSer: 0.434 ± 0.204
0.326TrpThr: 0.326 ± 0.154
0.76TrpVal: 0.76 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.217TrpTyr: 0.217 ± 0.237
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.063TyrAla: 2.063 ± 0.564
0.326TyrCys: 0.326 ± 0.274
1.846TyrAsp: 1.846 ± 0.349
2.714TyrGlu: 2.714 ± 0.483
2.063TyrPhe: 2.063 ± 0.688
0.869TyrGly: 0.869 ± 0.22
1.194TyrHis: 1.194 ± 0.291
3.366TyrIle: 3.366 ± 0.868
3.691TyrLys: 3.691 ± 0.734
3.583TyrLeu: 3.583 ± 0.697
0.651TyrMet: 0.651 ± 0.235
2.606TyrAsn: 2.606 ± 0.406
1.303TyrPro: 1.303 ± 0.33
2.497TyrGln: 2.497 ± 0.462
1.411TyrArg: 1.411 ± 0.349
2.497TyrSer: 2.497 ± 0.506
1.086TyrThr: 1.086 ± 0.398
0.543TyrVal: 0.543 ± 0.32
0.0TyrTrp: 0.0 ± 0.0
1.411TyrTyr: 1.411 ± 0.36
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 35 proteins (9212 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski