Amino acid dipepetide frequency for Helicobacter phage KHP40

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.455AlaAla: 1.455 ± 0.421
0.97AlaCys: 0.97 ± 0.452
2.303AlaAsp: 2.303 ± 0.473
4.121AlaGlu: 4.121 ± 1.057
3.636AlaPhe: 3.636 ± 0.806
3.879AlaGly: 3.879 ± 0.772
0.727AlaHis: 0.727 ± 0.256
6.424AlaIle: 6.424 ± 0.701
8.0AlaLys: 8.0 ± 0.989
10.909AlaLeu: 10.909 ± 1.456
1.333AlaMet: 1.333 ± 0.503
5.939AlaAsn: 5.939 ± 1.218
1.697AlaPro: 1.697 ± 0.341
2.667AlaGln: 2.667 ± 0.49
3.515AlaArg: 3.515 ± 0.695
3.03AlaSer: 3.03 ± 0.575
2.667AlaThr: 2.667 ± 0.651
2.182AlaVal: 2.182 ± 0.582
0.242AlaTrp: 0.242 ± 0.163
2.303AlaTyr: 2.303 ± 0.58
0.0AlaXaa: 0.0 ± 0.0
Cys
0.364CysAla: 0.364 ± 0.195
0.121CysCys: 0.121 ± 0.13
0.485CysAsp: 0.485 ± 0.322
0.727CysGlu: 0.727 ± 0.287
0.848CysPhe: 0.848 ± 0.444
0.485CysGly: 0.485 ± 0.413
0.0CysHis: 0.0 ± 0.0
0.727CysIle: 0.727 ± 0.405
0.242CysLys: 0.242 ± 0.179
0.97CysLeu: 0.97 ± 0.376
0.0CysMet: 0.0 ± 0.0
0.485CysAsn: 0.485 ± 0.255
0.485CysPro: 0.485 ± 0.243
0.242CysGln: 0.242 ± 0.187
0.242CysArg: 0.242 ± 0.23
0.0CysSer: 0.0 ± 0.0
0.485CysThr: 0.485 ± 0.259
0.606CysVal: 0.606 ± 0.29
0.0CysTrp: 0.0 ± 0.0
0.364CysTyr: 0.364 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
2.424AspAla: 2.424 ± 0.48
0.485AspCys: 0.485 ± 0.279
2.545AspAsp: 2.545 ± 0.476
3.636AspGlu: 3.636 ± 0.544
4.364AspPhe: 4.364 ± 0.608
1.091AspGly: 1.091 ± 0.276
0.485AspHis: 0.485 ± 0.184
3.152AspIle: 3.152 ± 0.81
7.03AspLys: 7.03 ± 1.002
7.394AspLeu: 7.394 ± 1.092
0.97AspMet: 0.97 ± 0.42
4.848AspAsn: 4.848 ± 0.579
2.424AspPro: 2.424 ± 0.429
1.091AspGln: 1.091 ± 0.308
1.455AspArg: 1.455 ± 0.459
3.152AspSer: 3.152 ± 0.851
2.061AspThr: 2.061 ± 0.642
1.212AspVal: 1.212 ± 0.461
0.0AspTrp: 0.0 ± 0.0
3.273AspTyr: 3.273 ± 0.845
0.0AspXaa: 0.0 ± 0.0
Glu
6.061GluAla: 6.061 ± 0.874
0.121GluCys: 0.121 ± 0.115
1.939GluAsp: 1.939 ± 0.496
5.697GluGlu: 5.697 ± 0.732
3.394GluPhe: 3.394 ± 0.656
1.455GluGly: 1.455 ± 0.386
1.212GluHis: 1.212 ± 0.347
7.515GluIle: 7.515 ± 0.959
8.97GluLys: 8.97 ± 0.766
7.758GluLeu: 7.758 ± 0.825
1.212GluMet: 1.212 ± 0.271
5.939GluAsn: 5.939 ± 0.754
1.576GluPro: 1.576 ± 0.395
4.97GluGln: 4.97 ± 0.914
5.333GluArg: 5.333 ± 0.629
7.758GluSer: 7.758 ± 0.886
4.727GluThr: 4.727 ± 0.672
3.636GluVal: 3.636 ± 0.727
0.606GluTrp: 0.606 ± 0.201
2.182GluTyr: 2.182 ± 0.453
0.0GluXaa: 0.0 ± 0.0
Phe
0.97PheAla: 0.97 ± 0.309
0.606PheCys: 0.606 ± 0.285
3.03PheAsp: 3.03 ± 0.713
3.879PheGlu: 3.879 ± 0.692
3.636PhePhe: 3.636 ± 0.715
1.697PheGly: 1.697 ± 0.466
0.848PheHis: 0.848 ± 0.234
3.273PheIle: 3.273 ± 0.611
6.303PheLys: 6.303 ± 0.695
5.576PheLeu: 5.576 ± 0.641
0.848PheMet: 0.848 ± 0.383
2.667PheAsn: 2.667 ± 0.493
0.485PhePro: 0.485 ± 0.192
0.727PheGln: 0.727 ± 0.289
1.576PheArg: 1.576 ± 0.43
4.727PheSer: 4.727 ± 0.813
2.788PheThr: 2.788 ± 0.538
1.455PheVal: 1.455 ± 0.452
0.242PheTrp: 0.242 ± 0.165
1.818PheTyr: 1.818 ± 0.514
0.0PheXaa: 0.0 ± 0.0
Gly
2.667GlyAla: 2.667 ± 0.832
0.485GlyCys: 0.485 ± 0.265
1.576GlyAsp: 1.576 ± 0.424
2.545GlyGlu: 2.545 ± 0.541
2.545GlyPhe: 2.545 ± 0.539
3.03GlyGly: 3.03 ± 0.839
0.485GlyHis: 0.485 ± 0.272
3.152GlyIle: 3.152 ± 0.571
2.424GlyLys: 2.424 ± 0.387
4.606GlyLeu: 4.606 ± 0.656
1.576GlyMet: 1.576 ± 0.459
3.03GlyAsn: 3.03 ± 0.572
0.0GlyPro: 0.0 ± 0.0
0.727GlyGln: 0.727 ± 0.221
1.091GlyArg: 1.091 ± 0.367
3.394GlySer: 3.394 ± 0.615
0.97GlyThr: 0.97 ± 0.252
4.0GlyVal: 4.0 ± 0.889
0.121GlyTrp: 0.121 ± 0.115
2.061GlyTyr: 2.061 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
1.212HisAla: 1.212 ± 0.315
0.0HisCys: 0.0 ± 0.0
0.848HisAsp: 0.848 ± 0.556
1.091HisGlu: 1.091 ± 0.231
0.606HisPhe: 0.606 ± 0.232
0.364HisGly: 0.364 ± 0.233
0.121HisHis: 0.121 ± 0.11
1.333HisIle: 1.333 ± 0.521
1.939HisLys: 1.939 ± 0.541
1.576HisLeu: 1.576 ± 0.442
0.242HisMet: 0.242 ± 0.173
0.97HisAsn: 0.97 ± 0.373
0.485HisPro: 0.485 ± 0.232
0.242HisGln: 0.242 ± 0.162
0.848HisArg: 0.848 ± 0.284
0.97HisSer: 0.97 ± 0.309
0.848HisThr: 0.848 ± 0.383
0.242HisVal: 0.242 ± 0.197
0.0HisTrp: 0.0 ± 0.0
0.606HisTyr: 0.606 ± 0.269
0.0HisXaa: 0.0 ± 0.0
Ile
5.091IleAla: 5.091 ± 0.7
0.97IleCys: 0.97 ± 0.459
4.242IleAsp: 4.242 ± 0.765
5.455IleGlu: 5.455 ± 0.699
2.061IlePhe: 2.061 ± 0.438
1.576IleGly: 1.576 ± 0.485
0.848IleHis: 0.848 ± 0.318
3.758IleIle: 3.758 ± 0.797
9.818IleLys: 9.818 ± 1.084
7.152IleLeu: 7.152 ± 0.757
1.333IleMet: 1.333 ± 0.35
4.97IleAsn: 4.97 ± 1.014
1.212IlePro: 1.212 ± 0.391
3.879IleGln: 3.879 ± 0.717
3.636IleArg: 3.636 ± 0.605
4.606IleSer: 4.606 ± 0.614
3.394IleThr: 3.394 ± 0.584
2.909IleVal: 2.909 ± 0.655
0.242IleTrp: 0.242 ± 0.175
2.061IleTyr: 2.061 ± 0.429
0.0IleXaa: 0.0 ± 0.0
Lys
9.455LysAla: 9.455 ± 1.237
0.848LysCys: 0.848 ± 0.377
7.03LysAsp: 7.03 ± 1.239
13.455LysGlu: 13.455 ± 1.579
2.909LysPhe: 2.909 ± 0.455
2.545LysGly: 2.545 ± 0.517
2.788LysHis: 2.788 ± 0.867
8.242LysIle: 8.242 ± 1.061
8.727LysLys: 8.727 ± 1.485
8.364LysLeu: 8.364 ± 0.943
1.212LysMet: 1.212 ± 0.417
10.061LysAsn: 10.061 ± 1.387
4.0LysPro: 4.0 ± 0.771
5.697LysGln: 5.697 ± 1.071
4.242LysArg: 4.242 ± 0.693
6.061LysSer: 6.061 ± 1.186
5.212LysThr: 5.212 ± 1.148
5.091LysVal: 5.091 ± 0.671
0.364LysTrp: 0.364 ± 0.195
2.667LysTyr: 2.667 ± 0.484
0.0LysXaa: 0.0 ± 0.0
Leu
6.788LeuAla: 6.788 ± 0.867
1.455LeuCys: 1.455 ± 0.566
4.606LeuAsp: 4.606 ± 0.67
10.667LeuGlu: 10.667 ± 1.024
3.758LeuPhe: 3.758 ± 0.86
5.939LeuGly: 5.939 ± 0.728
0.606LeuHis: 0.606 ± 0.261
5.333LeuIle: 5.333 ± 0.761
16.485LeuLys: 16.485 ± 1.575
8.0LeuLeu: 8.0 ± 0.882
1.576LeuMet: 1.576 ± 0.372
11.879LeuAsn: 11.879 ± 1.431
2.545LeuPro: 2.545 ± 0.802
4.242LeuGln: 4.242 ± 1.013
3.879LeuArg: 3.879 ± 0.543
5.091LeuSer: 5.091 ± 0.608
4.606LeuThr: 4.606 ± 0.751
4.121LeuVal: 4.121 ± 0.778
0.97LeuTrp: 0.97 ± 0.368
2.424LeuTyr: 2.424 ± 0.491
0.0LeuXaa: 0.0 ± 0.0
Met
0.97MetAla: 0.97 ± 0.404
0.121MetCys: 0.121 ± 0.121
1.333MetAsp: 1.333 ± 0.393
1.091MetGlu: 1.091 ± 0.351
0.97MetPhe: 0.97 ± 0.365
1.212MetGly: 1.212 ± 0.476
0.364MetHis: 0.364 ± 0.222
0.97MetIle: 0.97 ± 0.441
1.455MetLys: 1.455 ± 0.457
1.818MetLeu: 1.818 ± 0.385
0.121MetMet: 0.121 ± 0.116
1.697MetAsn: 1.697 ± 0.472
0.606MetPro: 0.606 ± 0.209
1.818MetGln: 1.818 ± 0.478
0.727MetArg: 0.727 ± 0.312
0.97MetSer: 0.97 ± 0.281
0.485MetThr: 0.485 ± 0.253
0.364MetVal: 0.364 ± 0.207
0.242MetTrp: 0.242 ± 0.184
0.364MetTyr: 0.364 ± 0.225
0.0MetXaa: 0.0 ± 0.0
Asn
10.061AsnAla: 10.061 ± 1.472
0.121AsnCys: 0.121 ± 0.121
4.727AsnAsp: 4.727 ± 0.496
6.909AsnGlu: 6.909 ± 1.003
3.515AsnPhe: 3.515 ± 0.551
2.909AsnGly: 2.909 ± 0.496
1.697AsnHis: 1.697 ± 0.33
3.515AsnIle: 3.515 ± 0.603
7.273AsnLys: 7.273 ± 0.678
7.636AsnLeu: 7.636 ± 0.938
1.576AsnMet: 1.576 ± 0.282
7.394AsnAsn: 7.394 ± 1.207
1.939AsnPro: 1.939 ± 0.467
5.212AsnGln: 5.212 ± 1.092
2.909AsnArg: 2.909 ± 0.499
4.0AsnSer: 4.0 ± 0.597
3.394AsnThr: 3.394 ± 0.595
1.697AsnVal: 1.697 ± 0.608
0.242AsnTrp: 0.242 ± 0.153
5.212AsnTyr: 5.212 ± 0.915
0.0AsnXaa: 0.0 ± 0.0
Pro
0.727ProAla: 0.727 ± 0.333
0.0ProCys: 0.0 ± 0.0
1.333ProAsp: 1.333 ± 0.34
0.727ProGlu: 0.727 ± 0.23
1.939ProPhe: 1.939 ± 0.395
0.364ProGly: 0.364 ± 0.248
0.0ProHis: 0.0 ± 0.0
1.818ProIle: 1.818 ± 0.384
3.879ProLys: 3.879 ± 0.77
2.545ProLeu: 2.545 ± 0.418
0.485ProMet: 0.485 ± 0.242
2.424ProAsn: 2.424 ± 0.506
0.242ProPro: 0.242 ± 0.158
1.212ProGln: 1.212 ± 0.404
0.848ProArg: 0.848 ± 0.267
2.061ProSer: 2.061 ± 0.448
2.303ProThr: 2.303 ± 0.55
0.848ProVal: 0.848 ± 0.402
0.121ProTrp: 0.121 ± 0.115
0.97ProTyr: 0.97 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
5.576GlnAla: 5.576 ± 0.859
0.242GlnCys: 0.242 ± 0.18
2.182GlnAsp: 2.182 ± 0.415
4.364GlnGlu: 4.364 ± 0.985
1.455GlnPhe: 1.455 ± 0.334
2.424GlnGly: 2.424 ± 0.736
0.364GlnHis: 0.364 ± 0.226
2.909GlnIle: 2.909 ± 0.615
4.97GlnLys: 4.97 ± 0.742
2.909GlnLeu: 2.909 ± 0.46
0.727GlnMet: 0.727 ± 0.316
4.121GlnAsn: 4.121 ± 0.718
0.606GlnPro: 0.606 ± 0.304
2.667GlnGln: 2.667 ± 0.501
1.818GlnArg: 1.818 ± 0.444
2.909GlnSer: 2.909 ± 0.636
1.697GlnThr: 1.697 ± 0.404
2.061GlnVal: 2.061 ± 0.504
0.364GlnTrp: 0.364 ± 0.164
0.97GlnTyr: 0.97 ± 0.336
0.0GlnXaa: 0.0 ± 0.0
Arg
3.758ArgAla: 3.758 ± 0.653
0.121ArgCys: 0.121 ± 0.115
2.424ArgAsp: 2.424 ± 0.46
3.03ArgGlu: 3.03 ± 0.701
1.939ArgPhe: 1.939 ± 0.383
1.091ArgGly: 1.091 ± 0.314
0.97ArgHis: 0.97 ± 0.379
2.424ArgIle: 2.424 ± 0.646
3.152ArgLys: 3.152 ± 0.646
6.424ArgLeu: 6.424 ± 0.732
0.727ArgMet: 0.727 ± 0.323
2.545ArgAsn: 2.545 ± 0.489
1.091ArgPro: 1.091 ± 0.447
1.576ArgGln: 1.576 ± 0.522
1.212ArgArg: 1.212 ± 0.477
2.909ArgSer: 2.909 ± 0.698
1.697ArgThr: 1.697 ± 0.569
1.697ArgVal: 1.697 ± 0.405
0.242ArgTrp: 0.242 ± 0.126
1.697ArgTyr: 1.697 ± 0.463
0.0ArgXaa: 0.0 ± 0.0
Ser
3.515SerAla: 3.515 ± 0.683
0.364SerCys: 0.364 ± 0.289
5.576SerAsp: 5.576 ± 0.675
6.061SerGlu: 6.061 ± 1.042
3.879SerPhe: 3.879 ± 0.858
3.515SerGly: 3.515 ± 0.498
0.242SerHis: 0.242 ± 0.171
3.394SerIle: 3.394 ± 0.507
6.667SerLys: 6.667 ± 0.788
7.273SerLeu: 7.273 ± 0.927
1.455SerMet: 1.455 ± 0.454
3.636SerAsn: 3.636 ± 0.64
1.212SerPro: 1.212 ± 0.324
2.545SerGln: 2.545 ± 0.649
1.939SerArg: 1.939 ± 0.401
2.424SerSer: 2.424 ± 0.647
2.424SerThr: 2.424 ± 0.574
4.97SerVal: 4.97 ± 0.881
0.485SerTrp: 0.485 ± 0.235
3.152SerTyr: 3.152 ± 0.526
0.0SerXaa: 0.0 ± 0.0
Thr
2.182ThrAla: 2.182 ± 0.549
0.242ThrCys: 0.242 ± 0.175
2.545ThrAsp: 2.545 ± 0.591
2.545ThrGlu: 2.545 ± 0.415
1.091ThrPhe: 1.091 ± 0.429
2.667ThrGly: 2.667 ± 0.659
1.212ThrHis: 1.212 ± 0.325
4.606ThrIle: 4.606 ± 0.735
4.0ThrLys: 4.0 ± 0.862
4.364ThrLeu: 4.364 ± 0.715
0.97ThrMet: 0.97 ± 0.334
3.758ThrAsn: 3.758 ± 0.533
2.424ThrPro: 2.424 ± 0.376
3.152ThrGln: 3.152 ± 0.755
1.576ThrArg: 1.576 ± 0.491
3.273ThrSer: 3.273 ± 0.691
2.424ThrThr: 2.424 ± 0.508
0.727ThrVal: 0.727 ± 0.361
0.727ThrTrp: 0.727 ± 0.271
1.212ThrTyr: 1.212 ± 0.344
0.0ThrXaa: 0.0 ± 0.0
Val
2.424ValAla: 2.424 ± 0.497
0.364ValCys: 0.364 ± 0.249
2.424ValAsp: 2.424 ± 0.456
1.939ValGlu: 1.939 ± 0.41
2.545ValPhe: 2.545 ± 0.583
3.03ValGly: 3.03 ± 0.82
0.242ValHis: 0.242 ± 0.224
3.515ValIle: 3.515 ± 0.6
3.879ValLys: 3.879 ± 0.768
5.212ValLeu: 5.212 ± 0.88
0.364ValMet: 0.364 ± 0.221
1.939ValAsn: 1.939 ± 0.599
0.848ValPro: 0.848 ± 0.321
0.364ValGln: 0.364 ± 0.317
2.061ValArg: 2.061 ± 0.479
4.242ValSer: 4.242 ± 0.962
1.939ValThr: 1.939 ± 0.325
2.182ValVal: 2.182 ± 0.643
0.242ValTrp: 0.242 ± 0.153
1.091ValTyr: 1.091 ± 0.38
0.0ValXaa: 0.0 ± 0.0
Trp
0.242TrpAla: 0.242 ± 0.161
0.0TrpCys: 0.0 ± 0.0
0.121TrpAsp: 0.121 ± 0.127
0.848TrpGlu: 0.848 ± 0.292
0.0TrpPhe: 0.0 ± 0.0
0.364TrpGly: 0.364 ± 0.223
0.121TrpHis: 0.121 ± 0.115
0.242TrpIle: 0.242 ± 0.163
0.364TrpLys: 0.364 ± 0.191
0.242TrpLeu: 0.242 ± 0.182
0.242TrpMet: 0.242 ± 0.181
0.848TrpAsn: 0.848 ± 0.411
0.0TrpPro: 0.0 ± 0.0
0.121TrpGln: 0.121 ± 0.086
0.242TrpArg: 0.242 ± 0.153
0.485TrpSer: 0.485 ± 0.299
0.242TrpThr: 0.242 ± 0.131
0.485TrpVal: 0.485 ± 0.243
0.0TrpTrp: 0.0 ± 0.0
0.242TrpTyr: 0.242 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.061TyrAla: 2.061 ± 0.438
0.242TyrCys: 0.242 ± 0.178
1.818TyrAsp: 1.818 ± 0.393
3.273TyrGlu: 3.273 ± 0.638
2.061TyrPhe: 2.061 ± 0.563
0.848TyrGly: 0.848 ± 0.267
1.212TyrHis: 1.212 ± 0.343
2.909TyrIle: 2.909 ± 0.609
3.394TyrLys: 3.394 ± 0.604
4.242TyrLeu: 4.242 ± 0.585
0.606TyrMet: 0.606 ± 0.269
2.788TyrAsn: 2.788 ± 0.534
0.97TyrPro: 0.97 ± 0.269
2.303TyrGln: 2.303 ± 0.346
1.576TyrArg: 1.576 ± 0.52
2.667TyrSer: 2.667 ± 0.575
1.333TyrThr: 1.333 ± 0.459
0.364TyrVal: 0.364 ± 0.207
0.0TyrTrp: 0.0 ± 0.0
1.697TyrTyr: 1.697 ± 0.492
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 32 proteins (8251 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski