Amino acid dipepetide frequency for Helicobacter phage PtB92G

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.207AlaAla: 1.207 ± 0.501
0.804AlaCys: 0.804 ± 0.293
1.508AlaAsp: 1.508 ± 0.333
3.318AlaGlu: 3.318 ± 0.699
4.123AlaPhe: 4.123 ± 0.72
2.514AlaGly: 2.514 ± 0.658
1.207AlaHis: 1.207 ± 0.287
4.625AlaIle: 4.625 ± 0.721
9.05AlaLys: 9.05 ± 0.901
10.96AlaLeu: 10.96 ± 1.224
1.307AlaMet: 1.307 ± 0.397
6.234AlaAsn: 6.234 ± 0.805
1.307AlaPro: 1.307 ± 0.329
3.117AlaGln: 3.117 ± 0.542
3.017AlaArg: 3.017 ± 0.504
3.318AlaSer: 3.318 ± 0.754
3.318AlaThr: 3.318 ± 0.595
2.011AlaVal: 2.011 ± 0.588
0.302AlaTrp: 0.302 ± 0.175
1.911AlaTyr: 1.911 ± 0.386
0.0AlaXaa: 0.0 ± 0.0
Cys
0.503CysAla: 0.503 ± 0.265
0.201CysCys: 0.201 ± 0.136
0.603CysAsp: 0.603 ± 0.333
0.905CysGlu: 0.905 ± 0.293
0.804CysPhe: 0.804 ± 0.348
0.402CysGly: 0.402 ± 0.232
0.0CysHis: 0.0 ± 0.0
0.603CysIle: 0.603 ± 0.3
0.503CysLys: 0.503 ± 0.209
1.207CysLeu: 1.207 ± 0.375
0.0CysMet: 0.0 ± 0.0
0.402CysAsn: 0.402 ± 0.253
0.402CysPro: 0.402 ± 0.166
0.201CysGln: 0.201 ± 0.138
0.101CysArg: 0.101 ± 0.091
0.704CysSer: 0.704 ± 0.464
0.603CysThr: 0.603 ± 0.281
0.201CysVal: 0.201 ± 0.15
0.0CysTrp: 0.0 ± 0.0
0.302CysTyr: 0.302 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
2.715AspAla: 2.715 ± 0.812
0.302AspCys: 0.302 ± 0.206
1.911AspAsp: 1.911 ± 0.475
3.519AspGlu: 3.519 ± 0.75
3.72AspPhe: 3.72 ± 0.515
1.508AspGly: 1.508 ± 0.382
0.804AspHis: 0.804 ± 0.351
2.715AspIle: 2.715 ± 0.529
5.229AspLys: 5.229 ± 0.823
7.24AspLeu: 7.24 ± 0.875
0.503AspMet: 0.503 ± 0.213
3.318AspAsn: 3.318 ± 0.585
1.207AspPro: 1.207 ± 0.344
0.704AspGln: 0.704 ± 0.325
1.911AspArg: 1.911 ± 0.487
2.715AspSer: 2.715 ± 0.606
1.508AspThr: 1.508 ± 0.387
1.408AspVal: 1.408 ± 0.397
0.101AspTrp: 0.101 ± 0.111
3.419AspTyr: 3.419 ± 0.625
0.0AspXaa: 0.0 ± 0.0
Glu
6.938GluAla: 6.938 ± 1.014
0.704GluCys: 0.704 ± 0.228
1.81GluAsp: 1.81 ± 0.439
4.123GluGlu: 4.123 ± 0.883
3.318GluPhe: 3.318 ± 0.524
1.709GluGly: 1.709 ± 0.34
1.006GluHis: 1.006 ± 0.458
7.642GluIle: 7.642 ± 1.017
9.653GluLys: 9.653 ± 1.343
9.955GluLeu: 9.955 ± 1.208
1.106GluMet: 1.106 ± 0.297
7.039GluAsn: 7.039 ± 0.95
1.709GluPro: 1.709 ± 0.383
4.827GluGln: 4.827 ± 0.709
4.927GluArg: 4.927 ± 0.641
6.536GluSer: 6.536 ± 0.827
4.827GluThr: 4.827 ± 0.513
4.525GluVal: 4.525 ± 0.789
0.402GluTrp: 0.402 ± 0.181
2.514GluTyr: 2.514 ± 0.44
0.0GluXaa: 0.0 ± 0.0
Phe
1.81PheAla: 1.81 ± 0.436
0.704PheCys: 0.704 ± 0.327
2.815PheAsp: 2.815 ± 0.687
3.62PheGlu: 3.62 ± 0.737
3.218PhePhe: 3.218 ± 0.63
1.508PheGly: 1.508 ± 0.334
0.603PheHis: 0.603 ± 0.178
3.218PheIle: 3.218 ± 0.58
7.139PheLys: 7.139 ± 0.732
6.536PheLeu: 6.536 ± 0.701
0.503PheMet: 0.503 ± 0.268
3.318PheAsn: 3.318 ± 0.585
0.201PhePro: 0.201 ± 0.15
0.905PheGln: 0.905 ± 0.396
2.212PheArg: 2.212 ± 0.406
5.229PheSer: 5.229 ± 0.877
2.011PheThr: 2.011 ± 0.36
2.313PheVal: 2.313 ± 0.474
0.302PheTrp: 0.302 ± 0.216
2.112PheTyr: 2.112 ± 0.433
0.0PheXaa: 0.0 ± 0.0
Gly
2.715GlyAla: 2.715 ± 1.12
0.503GlyCys: 0.503 ± 0.215
1.307GlyAsp: 1.307 ± 0.451
2.413GlyGlu: 2.413 ± 0.514
3.017GlyPhe: 3.017 ± 0.557
2.916GlyGly: 2.916 ± 0.771
0.302GlyHis: 0.302 ± 0.197
3.218GlyIle: 3.218 ± 0.43
3.017GlyLys: 3.017 ± 0.83
5.329GlyLeu: 5.329 ± 0.599
1.207GlyMet: 1.207 ± 0.346
3.419GlyAsn: 3.419 ± 0.521
0.0GlyPro: 0.0 ± 0.0
0.905GlyGln: 0.905 ± 0.235
1.207GlyArg: 1.207 ± 0.299
2.815GlySer: 2.815 ± 0.66
0.704GlyThr: 0.704 ± 0.277
3.922GlyVal: 3.922 ± 0.814
0.0GlyTrp: 0.0 ± 0.0
1.709GlyTyr: 1.709 ± 0.353
0.0GlyXaa: 0.0 ± 0.0
His
0.905HisAla: 0.905 ± 0.394
0.0HisCys: 0.0 ± 0.0
1.006HisAsp: 1.006 ± 0.408
1.106HisGlu: 1.106 ± 0.326
0.804HisPhe: 0.804 ± 0.261
0.201HisGly: 0.201 ± 0.149
0.201HisHis: 0.201 ± 0.218
1.106HisIle: 1.106 ± 0.376
2.011HisLys: 2.011 ± 0.514
1.81HisLeu: 1.81 ± 0.452
0.201HisMet: 0.201 ± 0.137
1.307HisAsn: 1.307 ± 0.345
0.302HisPro: 0.302 ± 0.135
0.201HisGln: 0.201 ± 0.156
0.603HisArg: 0.603 ± 0.242
0.905HisSer: 0.905 ± 0.334
1.307HisThr: 1.307 ± 0.296
0.302HisVal: 0.302 ± 0.181
0.0HisTrp: 0.0 ± 0.0
1.106HisTyr: 1.106 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
4.625IleAla: 4.625 ± 0.621
0.905IleCys: 0.905 ± 0.357
4.123IleAsp: 4.123 ± 0.528
6.033IleGlu: 6.033 ± 0.936
2.514IlePhe: 2.514 ± 0.479
1.911IleGly: 1.911 ± 0.568
0.905IleHis: 0.905 ± 0.274
3.821IleIle: 3.821 ± 0.708
7.743IleLys: 7.743 ± 1.037
7.039IleLeu: 7.039 ± 0.842
0.704IleMet: 0.704 ± 0.313
5.329IleAsn: 5.329 ± 0.838
2.413IlePro: 2.413 ± 0.545
3.922IleGln: 3.922 ± 0.543
2.514IleArg: 2.514 ± 0.445
4.525IleSer: 4.525 ± 0.569
4.424IleThr: 4.424 ± 0.764
3.419IleVal: 3.419 ± 0.589
0.201IleTrp: 0.201 ± 0.137
2.212IleTyr: 2.212 ± 0.599
0.0IleXaa: 0.0 ± 0.0
Lys
8.648LysAla: 8.648 ± 1.061
0.603LysCys: 0.603 ± 0.283
6.335LysAsp: 6.335 ± 1.189
12.67LysGlu: 12.67 ± 1.142
4.324LysPhe: 4.324 ± 0.701
3.62LysGly: 3.62 ± 0.558
3.017LysHis: 3.017 ± 0.61
7.843LysIle: 7.843 ± 0.87
8.849LysLys: 8.849 ± 1.103
8.849LysLeu: 8.849 ± 0.859
1.508LysMet: 1.508 ± 0.305
8.849LysAsn: 8.849 ± 1.018
3.922LysPro: 3.922 ± 0.629
6.033LysGln: 6.033 ± 0.791
4.324LysArg: 4.324 ± 0.903
5.53LysSer: 5.53 ± 0.808
6.033LysThr: 6.033 ± 0.687
4.525LysVal: 4.525 ± 0.706
0.704LysTrp: 0.704 ± 0.288
2.715LysTyr: 2.715 ± 0.428
0.0LysXaa: 0.0 ± 0.0
Leu
4.927LeuAla: 4.927 ± 0.555
1.408LeuCys: 1.408 ± 0.457
4.525LeuAsp: 4.525 ± 0.622
13.072LeuGlu: 13.072 ± 1.304
3.62LeuPhe: 3.62 ± 0.626
6.033LeuGly: 6.033 ± 0.95
0.905LeuHis: 0.905 ± 0.413
6.033LeuIle: 6.033 ± 1.03
16.591LeuLys: 16.591 ± 1.62
7.642LeuLeu: 7.642 ± 0.797
2.212LeuMet: 2.212 ± 0.547
11.262LeuAsn: 11.262 ± 1.163
2.212LeuPro: 2.212 ± 0.409
4.625LeuGln: 4.625 ± 0.814
4.525LeuArg: 4.525 ± 0.643
6.335LeuSer: 6.335 ± 1.062
4.223LeuThr: 4.223 ± 0.916
4.726LeuVal: 4.726 ± 0.671
0.603LeuTrp: 0.603 ± 0.253
2.112LeuTyr: 2.112 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
0.603MetAla: 0.603 ± 0.219
0.101MetCys: 0.101 ± 0.105
1.106MetAsp: 1.106 ± 0.366
0.905MetGlu: 0.905 ± 0.287
0.804MetPhe: 0.804 ± 0.261
1.006MetGly: 1.006 ± 0.383
0.302MetHis: 0.302 ± 0.174
1.408MetIle: 1.408 ± 0.349
1.709MetLys: 1.709 ± 0.361
2.011MetLeu: 2.011 ± 0.426
0.201MetMet: 0.201 ± 0.146
1.408MetAsn: 1.408 ± 0.409
0.905MetPro: 0.905 ± 0.302
1.81MetGln: 1.81 ± 0.394
0.905MetArg: 0.905 ± 0.256
0.704MetSer: 0.704 ± 0.234
0.201MetThr: 0.201 ± 0.132
0.603MetVal: 0.603 ± 0.301
0.302MetTrp: 0.302 ± 0.243
0.201MetTyr: 0.201 ± 0.155
0.0MetXaa: 0.0 ± 0.0
Asn
9.452AsnAla: 9.452 ± 1.164
0.201AsnCys: 0.201 ± 0.15
3.62AsnAsp: 3.62 ± 0.694
8.949AsnGlu: 8.949 ± 1.093
3.519AsnPhe: 3.519 ± 0.544
3.017AsnGly: 3.017 ± 0.542
1.609AsnHis: 1.609 ± 0.363
4.726AsnIle: 4.726 ± 0.823
6.435AsnLys: 6.435 ± 0.706
6.838AsnLeu: 6.838 ± 0.959
1.307AsnMet: 1.307 ± 0.362
6.033AsnAsn: 6.033 ± 0.937
1.609AsnPro: 1.609 ± 0.358
5.229AsnGln: 5.229 ± 0.81
2.514AsnArg: 2.514 ± 0.418
4.927AsnSer: 4.927 ± 0.613
4.123AsnThr: 4.123 ± 0.858
1.609AsnVal: 1.609 ± 0.471
0.201AsnTrp: 0.201 ± 0.139
3.922AsnTyr: 3.922 ± 0.513
0.0AsnXaa: 0.0 ± 0.0
Pro
0.302ProAla: 0.302 ± 0.16
0.0ProCys: 0.0 ± 0.0
0.402ProAsp: 0.402 ± 0.18
1.81ProGlu: 1.81 ± 0.35
1.609ProPhe: 1.609 ± 0.383
0.201ProGly: 0.201 ± 0.136
0.201ProHis: 0.201 ± 0.126
2.112ProIle: 2.112 ± 0.417
3.62ProLys: 3.62 ± 0.641
2.313ProLeu: 2.313 ± 0.597
0.603ProMet: 0.603 ± 0.239
2.916ProAsn: 2.916 ± 0.443
0.804ProPro: 0.804 ± 0.23
0.603ProGln: 0.603 ± 0.205
0.905ProArg: 0.905 ± 0.287
2.815ProSer: 2.815 ± 0.379
2.212ProThr: 2.212 ± 0.414
0.503ProVal: 0.503 ± 0.195
0.0ProTrp: 0.0 ± 0.0
1.207ProTyr: 1.207 ± 0.33
0.0ProXaa: 0.0 ± 0.0
Gln
5.028GlnAla: 5.028 ± 0.83
0.302GlnCys: 0.302 ± 0.169
2.313GlnAsp: 2.313 ± 0.404
4.424GlnGlu: 4.424 ± 0.601
1.508GlnPhe: 1.508 ± 0.34
2.313GlnGly: 2.313 ± 0.51
0.402GlnHis: 0.402 ± 0.192
3.318GlnIle: 3.318 ± 0.55
5.53GlnLys: 5.53 ± 0.83
3.017GlnLeu: 3.017 ± 0.462
0.804GlnMet: 0.804 ± 0.348
3.821GlnAsn: 3.821 ± 0.731
0.704GlnPro: 0.704 ± 0.286
2.112GlnGln: 2.112 ± 0.583
1.81GlnArg: 1.81 ± 0.411
3.218GlnSer: 3.218 ± 0.695
2.413GlnThr: 2.413 ± 0.435
2.112GlnVal: 2.112 ± 0.445
0.101GlnTrp: 0.101 ± 0.1
1.106GlnTyr: 1.106 ± 0.388
0.0GlnXaa: 0.0 ± 0.0
Arg
3.218ArgAla: 3.218 ± 0.595
0.201ArgCys: 0.201 ± 0.132
2.514ArgAsp: 2.514 ± 0.508
3.922ArgGlu: 3.922 ± 0.915
2.715ArgPhe: 2.715 ± 0.493
0.905ArgGly: 0.905 ± 0.257
0.603ArgHis: 0.603 ± 0.209
2.313ArgIle: 2.313 ± 0.595
3.519ArgLys: 3.519 ± 0.556
4.927ArgLeu: 4.927 ± 0.601
0.905ArgMet: 0.905 ± 0.347
2.313ArgAsn: 2.313 ± 0.578
0.804ArgPro: 0.804 ± 0.311
1.609ArgGln: 1.609 ± 0.437
0.804ArgArg: 0.804 ± 0.249
2.815ArgSer: 2.815 ± 0.685
1.609ArgThr: 1.609 ± 0.493
1.609ArgVal: 1.609 ± 0.297
0.0ArgTrp: 0.0 ± 0.0
1.307ArgTyr: 1.307 ± 0.44
0.0ArgXaa: 0.0 ± 0.0
Ser
4.625SerAla: 4.625 ± 0.732
0.402SerCys: 0.402 ± 0.251
5.028SerAsp: 5.028 ± 0.509
5.631SerGlu: 5.631 ± 0.803
3.419SerPhe: 3.419 ± 0.673
4.223SerGly: 4.223 ± 0.798
0.704SerHis: 0.704 ± 0.277
3.72SerIle: 3.72 ± 0.598
6.335SerLys: 6.335 ± 0.939
7.642SerLeu: 7.642 ± 0.805
1.609SerMet: 1.609 ± 0.515
3.922SerAsn: 3.922 ± 0.678
1.307SerPro: 1.307 ± 0.333
2.815SerGln: 2.815 ± 0.465
1.207SerArg: 1.207 ± 0.337
3.218SerSer: 3.218 ± 0.555
1.81SerThr: 1.81 ± 0.461
4.927SerVal: 4.927 ± 0.974
0.603SerTrp: 0.603 ± 0.202
3.017SerTyr: 3.017 ± 0.485
0.0SerXaa: 0.0 ± 0.0
Thr
2.112ThrAla: 2.112 ± 0.67
0.302ThrCys: 0.302 ± 0.194
2.614ThrAsp: 2.614 ± 0.495
3.117ThrGlu: 3.117 ± 0.533
1.307ThrPhe: 1.307 ± 0.46
2.212ThrGly: 2.212 ± 0.518
1.006ThrHis: 1.006 ± 0.27
3.922ThrIle: 3.922 ± 0.525
3.72ThrLys: 3.72 ± 0.816
5.128ThrLeu: 5.128 ± 0.806
1.006ThrMet: 1.006 ± 0.355
3.519ThrAsn: 3.519 ± 0.641
3.017ThrPro: 3.017 ± 0.501
3.72ThrGln: 3.72 ± 0.52
1.81ThrArg: 1.81 ± 0.361
3.922ThrSer: 3.922 ± 0.696
2.614ThrThr: 2.614 ± 0.508
0.402ThrVal: 0.402 ± 0.256
0.503ThrTrp: 0.503 ± 0.179
1.508ThrTyr: 1.508 ± 0.443
0.0ThrXaa: 0.0 ± 0.0
Val
2.614ValAla: 2.614 ± 0.463
0.503ValCys: 0.503 ± 0.225
1.408ValAsp: 1.408 ± 0.381
2.212ValGlu: 2.212 ± 0.406
3.117ValPhe: 3.117 ± 0.49
2.815ValGly: 2.815 ± 0.608
0.302ValHis: 0.302 ± 0.201
3.72ValIle: 3.72 ± 0.596
4.022ValLys: 4.022 ± 0.548
5.631ValLeu: 5.631 ± 0.855
0.603ValMet: 0.603 ± 0.248
2.413ValAsn: 2.413 ± 0.683
0.603ValPro: 0.603 ± 0.282
1.006ValGln: 1.006 ± 0.326
1.81ValArg: 1.81 ± 0.375
3.72ValSer: 3.72 ± 0.61
1.709ValThr: 1.709 ± 0.349
1.911ValVal: 1.911 ± 0.414
0.503ValTrp: 0.503 ± 0.186
1.307ValTyr: 1.307 ± 0.482
0.0ValXaa: 0.0 ± 0.0
Trp
0.101TrpAla: 0.101 ± 0.106
0.0TrpCys: 0.0 ± 0.0
0.302TrpAsp: 0.302 ± 0.138
0.402TrpGlu: 0.402 ± 0.218
0.0TrpPhe: 0.0 ± 0.0
0.302TrpGly: 0.302 ± 0.158
0.302TrpHis: 0.302 ± 0.192
0.402TrpIle: 0.402 ± 0.227
0.302TrpLys: 0.302 ± 0.174
0.201TrpLeu: 0.201 ± 0.147
0.201TrpMet: 0.201 ± 0.148
0.503TrpAsn: 0.503 ± 0.302
0.0TrpPro: 0.0 ± 0.0
0.201TrpGln: 0.201 ± 0.145
0.302TrpArg: 0.302 ± 0.176
0.302TrpSer: 0.302 ± 0.212
0.201TrpThr: 0.201 ± 0.159
0.603TrpVal: 0.603 ± 0.264
0.0TrpTrp: 0.0 ± 0.0
0.201TrpTyr: 0.201 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.112TyrAla: 2.112 ± 0.401
0.503TyrCys: 0.503 ± 0.21
1.207TyrAsp: 1.207 ± 0.355
3.117TyrGlu: 3.117 ± 0.509
2.815TyrPhe: 2.815 ± 0.823
1.307TyrGly: 1.307 ± 0.272
1.006TyrHis: 1.006 ± 0.396
2.916TyrIle: 2.916 ± 0.685
3.922TyrLys: 3.922 ± 0.538
3.318TyrLeu: 3.318 ± 0.554
0.603TyrMet: 0.603 ± 0.261
2.514TyrAsn: 2.514 ± 0.573
1.709TyrPro: 1.709 ± 0.373
1.81TyrGln: 1.81 ± 0.368
1.207TyrArg: 1.207 ± 0.344
1.911TyrSer: 1.911 ± 0.486
1.609TyrThr: 1.609 ± 0.296
0.302TyrVal: 0.302 ± 0.202
0.0TyrTrp: 0.0 ± 0.0
1.006TyrTyr: 1.006 ± 0.322
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (9946 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski