Amino acid dipeptide frequency for Helicobacter phage FrGC43A

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
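A minimal sketch of how such dipeptide frequencies could be computed is shown below. It is not the Proteome-pI implementation: the function name, the choice to count overlapping pairs within each protein only, and the normalisation by the total number of counted pairs are assumptions made for illustration.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: pairs are counted inside each protein (never across
    protein boundaries), and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        # Collapse non-standard or unknown residues into a single symbol.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2: positions (0,1), (1,2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the phage proteome.
freqs = dipeptide_per_mille(["AAK", "KA"])
```

With the toy input, three pairs are counted (AA, AK, KA), so each receives one third of 1000 ‰.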

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
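The comparison against the uniform expectation can be sketched as follows. The function names are hypothetical, and the simple greater-than or less-than rule is an assumption; the exact threshold behind the page's colour coding is not stated. The three example values are taken from the table on this page.

```python
def uniform_expectation_per_mille(n_residues=20):
    """Expected frequency (per mille) if all n_residues**2 dipeptides were equally likely."""
    return 1000.0 / (n_residues ** 2)


def classify(freq_per_mille, n_residues=20):
    """Label a dipeptide as 'over', 'under' or 'expected' (assumed rule)."""
    expected = uniform_expectation_per_mille(n_residues)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values in per mille, copied from the table.
examples = {"LeuLys": 15.506, "AlaGly": 2.432, "CysTrp": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

With 20 residues the expectation is 2.5 ‰; passing `n_residues=21` gives the slightly lower expectation for the 441-dipeptide alphabet that includes Xaa.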

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
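As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are illustrative only; the LeuLys value comes from the table.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert per mille to percent (divide by 10)."""
    return value / 10.0


leu_lys = 15.506  # LeuLys frequency in per mille, from the table
fraction = per_mille_to_fraction(leu_lys)  # about 0.015506
percent = per_mille_to_percent(leu_lys)    # about 1.5506 %
```

On this scale the uniform expectation of 0.25% corresponds to 2.5 ‰, which is the figure to compare table entries against.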

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Second residues are listed in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.216 ± 0.397
AlaCys: 0.912 ± 0.335
AlaAsp: 2.027 ± 0.394
AlaGlu: 3.446 ± 0.7
AlaPhe: 3.142 ± 0.551
AlaGly: 2.432 ± 0.534
AlaHis: 1.115 ± 0.35
AlaIle: 6.182 ± 0.713
AlaLys: 8.311 ± 0.97
AlaLeu: 11.047 ± 0.99
AlaMet: 1.723 ± 0.412
AlaAsn: 6.081 ± 0.759
AlaPro: 1.419 ± 0.361
AlaGln: 2.736 ± 0.453
AlaArg: 3.142 ± 0.532
AlaSer: 3.142 ± 0.659
AlaThr: 1.926 ± 0.514
AlaVal: 2.23 ± 0.612
AlaTrp: 0.304 ± 0.177
AlaTyr: 1.723 ± 0.366
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.608 ± 0.328
CysCys: 0.304 ± 0.208
CysAsp: 0.608 ± 0.308
CysGlu: 0.811 ± 0.288
CysPhe: 0.709 ± 0.332
CysGly: 0.507 ± 0.251
CysHis: 0.101 ± 0.112
CysIle: 0.608 ± 0.225
CysLys: 0.608 ± 0.231
CysLeu: 0.608 ± 0.263
CysMet: 0.101 ± 0.113
CysAsn: 0.405 ± 0.204
CysPro: 0.507 ± 0.217
CysGln: 0.203 ± 0.127
CysArg: 0.101 ± 0.081
CysSer: 0.608 ± 0.417
CysThr: 0.811 ± 0.283
CysVal: 0.608 ± 0.246
CysTrp: 0.0 ± 0.0
CysTyr: 0.203 ± 0.131
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.344 ± 0.565
AspCys: 0.304 ± 0.187
AspAsp: 1.52 ± 0.409
AspGlu: 4.358 ± 0.538
AspPhe: 4.358 ± 0.859
AspGly: 2.128 ± 0.436
AspHis: 0.405 ± 0.22
AspIle: 1.52 ± 0.457
AspLys: 5.777 ± 0.791
AspLeu: 6.892 ± 0.863
AspMet: 0.912 ± 0.282
AspAsn: 4.561 ± 0.586
AspPro: 1.216 ± 0.3
AspGln: 0.811 ± 0.32
AspArg: 1.622 ± 0.404
AspSer: 2.534 ± 0.547
AspThr: 2.331 ± 0.519
AspVal: 1.115 ± 0.405
AspTrp: 0.101 ± 0.096
AspTyr: 3.04 ± 0.517
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.169 ± 0.755
GluCys: 0.405 ± 0.183
GluAsp: 1.926 ± 0.359
GluGlu: 3.953 ± 0.659
GluPhe: 3.344 ± 0.481
GluGly: 1.926 ± 0.367
GluHis: 0.811 ± 0.465
GluIle: 7.5 ± 0.955
GluLys: 9.831 ± 1.073
GluLeu: 9.831 ± 1.333
GluMet: 1.318 ± 0.324
GluAsn: 6.892 ± 0.829
GluPro: 1.318 ± 0.376
GluGln: 5.067 ± 0.808
GluArg: 4.662 ± 0.714
GluSer: 8.006 ± 0.938
GluThr: 5.777 ± 0.746
GluVal: 3.953 ± 0.819
GluTrp: 0.405 ± 0.178
GluTyr: 2.838 ± 0.489
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.723 ± 0.364
PheCys: 0.912 ± 0.371
PheAsp: 2.736 ± 0.755
PheGlu: 3.04 ± 0.526
PhePhe: 3.851 ± 0.607
PheGly: 1.318 ± 0.291
PheHis: 0.608 ± 0.199
PheIle: 3.142 ± 0.577
PheLys: 6.892 ± 0.748
PheLeu: 6.486 ± 0.55
PheMet: 0.405 ± 0.233
PheAsn: 3.243 ± 0.496
PhePro: 0.507 ± 0.206
PheGln: 0.608 ± 0.255
PheArg: 1.419 ± 0.374
PheSer: 4.966 ± 0.838
PheThr: 2.635 ± 0.439
PheVal: 1.824 ± 0.419
PheTrp: 0.203 ± 0.205
PheTyr: 2.23 ± 0.464
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.939 ± 0.831
GlyCys: 0.405 ± 0.182
GlyAsp: 1.318 ± 0.369
GlyGlu: 2.23 ± 0.384
GlyPhe: 3.446 ± 0.68
GlyGly: 2.331 ± 0.638
GlyHis: 0.405 ± 0.171
GlyIle: 3.142 ± 0.543
GlyLys: 3.142 ± 0.673
GlyLeu: 5.371 ± 0.565
GlyMet: 1.419 ± 0.325
GlyAsn: 3.142 ± 0.507
GlyPro: 0.0 ± 0.0
GlyGln: 0.912 ± 0.269
GlyArg: 1.723 ± 0.383
GlySer: 3.142 ± 0.643
GlyThr: 0.912 ± 0.309
GlyVal: 3.953 ± 0.907
GlyTrp: 0.101 ± 0.088
GlyTyr: 1.824 ± 0.277
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.405 ± 0.156
HisCys: 0.101 ± 0.096
HisAsp: 0.811 ± 0.452
HisGlu: 1.013 ± 0.267
HisPhe: 0.608 ± 0.278
HisGly: 0.304 ± 0.174
HisHis: 0.203 ± 0.163
HisIle: 1.115 ± 0.344
HisLys: 1.622 ± 0.444
HisLeu: 1.419 ± 0.458
HisMet: 0.203 ± 0.163
HisAsn: 1.115 ± 0.282
HisPro: 0.203 ± 0.112
HisGln: 0.101 ± 0.103
HisArg: 0.608 ± 0.277
HisSer: 0.709 ± 0.203
HisThr: 1.318 ± 0.288
HisVal: 0.507 ± 0.228
HisTrp: 0.0 ± 0.0
HisTyr: 1.115 ± 0.303
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.763 ± 0.687
IleCys: 0.405 ± 0.201
IleAsp: 4.155 ± 0.492
IleGlu: 5.169 ± 0.72
IlePhe: 2.534 ± 0.396
IleGly: 2.534 ± 0.581
IleHis: 0.811 ± 0.336
IleIle: 4.155 ± 0.78
IleLys: 8.817 ± 0.967
IleLeu: 7.804 ± 0.87
IleMet: 0.709 ± 0.25
IleAsn: 5.574 ± 1.221
IlePro: 1.926 ± 0.564
IleGln: 4.155 ± 0.863
IleArg: 3.04 ± 0.522
IleSer: 4.459 ± 0.63
IleThr: 5.473 ± 1.099
IleVal: 3.649 ± 0.44
IleTrp: 0.203 ± 0.137
IleTyr: 1.723 ± 0.348
IleXaa: 0.0 ± 0.0

Lys
LysAla: 7.804 ± 0.896
LysCys: 0.811 ± 0.312
LysAsp: 7.297 ± 1.186
LysGlu: 13.783 ± 1.177
LysPhe: 4.155 ± 0.674
LysGly: 3.953 ± 0.609
LysHis: 2.635 ± 0.545
LysIle: 8.716 ± 0.821
LysLys: 9.527 ± 1.276
LysLeu: 7.804 ± 0.834
LysMet: 1.926 ± 0.419
LysAsn: 8.919 ± 0.892
LysPro: 2.736 ± 0.454
LysGln: 6.993 ± 1.045
LysArg: 4.054 ± 0.703
LysSer: 5.169 ± 0.914
LysThr: 4.966 ± 0.623
LysVal: 4.763 ± 0.778
LysTrp: 0.608 ± 0.241
LysTyr: 3.04 ± 0.384
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.27 ± 0.613
LeuCys: 1.318 ± 0.455
LeuAsp: 4.257 ± 0.503
LeuGlu: 13.378 ± 1.16
LeuPhe: 3.547 ± 0.589
LeuGly: 6.284 ± 0.919
LeuHis: 0.608 ± 0.325
LeuIle: 6.385 ± 0.809
LeuLys: 15.506 ± 1.549
LeuLeu: 6.689 ± 0.827
LeuMet: 2.128 ± 0.441
LeuAsn: 10.844 ± 1.039
LeuPro: 1.622 ± 0.518
LeuGln: 4.662 ± 0.812
LeuArg: 4.054 ± 0.781
LeuSer: 6.588 ± 0.752
LeuThr: 3.649 ± 0.55
LeuVal: 4.257 ± 0.454
LeuTrp: 0.507 ± 0.244
LeuTyr: 2.23 ± 0.481
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.013 ± 0.292
MetCys: 0.101 ± 0.109
MetAsp: 1.115 ± 0.361
MetGlu: 0.709 ± 0.264
MetPhe: 1.013 ± 0.349
MetGly: 1.318 ± 0.377
MetHis: 0.101 ± 0.096
MetIle: 1.52 ± 0.363
MetLys: 1.115 ± 0.313
MetLeu: 2.432 ± 0.456
MetMet: 0.203 ± 0.136
MetAsn: 2.128 ± 0.505
MetPro: 1.013 ± 0.257
MetGln: 1.419 ± 0.455
MetArg: 1.115 ± 0.384
MetSer: 0.912 ± 0.212
MetThr: 0.507 ± 0.191
MetVal: 0.405 ± 0.217
MetTrp: 0.203 ± 0.157
MetTyr: 0.405 ± 0.199
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 9.932 ± 1.396
AsnCys: 0.101 ± 0.115
AsnAsp: 4.459 ± 0.732
AsnGlu: 7.702 ± 1.329
AsnPhe: 3.243 ± 0.554
AsnGly: 2.736 ± 0.394
AsnHis: 1.013 ± 0.444
AsnIle: 5.067 ± 0.854
AsnLys: 7.5 ± 0.994
AsnLeu: 6.79 ± 0.977
AsnMet: 1.318 ± 0.318
AsnAsn: 7.297 ± 1.273
AsnPro: 2.128 ± 0.446
AsnGln: 4.763 ± 0.756
AsnArg: 2.736 ± 0.569
AsnSer: 4.865 ± 0.686
AsnThr: 4.054 ± 0.784
AsnVal: 1.52 ± 0.399
AsnTrp: 0.304 ± 0.151
AsnTyr: 4.358 ± 0.618
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 0.405 ± 0.239
ProCys: 0.0 ± 0.0
ProAsp: 0.507 ± 0.24
ProGlu: 1.318 ± 0.276
ProPhe: 1.824 ± 0.457
ProGly: 0.304 ± 0.231
ProHis: 0.203 ± 0.193
ProIle: 2.635 ± 0.453
ProLys: 3.04 ± 0.542
ProLeu: 1.926 ± 0.483
ProMet: 0.304 ± 0.179
ProAsn: 1.926 ± 0.358
ProPro: 0.608 ± 0.195
ProGln: 0.507 ± 0.209
ProArg: 0.811 ± 0.289
ProSer: 2.23 ± 0.362
ProThr: 1.622 ± 0.395
ProVal: 0.507 ± 0.251
ProTrp: 0.203 ± 0.152
ProTyr: 1.318 ± 0.332
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.777 ± 0.845
GlnCys: 0.101 ± 0.093
GlnAsp: 1.926 ± 0.432
GlnGlu: 4.865 ± 0.842
GlnPhe: 1.216 ± 0.32
GlnGly: 2.331 ± 0.5
GlnHis: 0.405 ± 0.235
GlnIle: 3.243 ± 0.638
GlnLys: 4.966 ± 0.783
GlnLeu: 3.75 ± 0.56
GlnMet: 0.709 ± 0.211
GlnAsn: 3.446 ± 0.795
GlnPro: 0.507 ± 0.246
GlnGln: 1.52 ± 0.436
GlnArg: 1.622 ± 0.397
GlnSer: 2.939 ± 0.573
GlnThr: 1.419 ± 0.321
GlnVal: 2.432 ± 0.448
GlnTrp: 0.203 ± 0.128
GlnTyr: 1.216 ± 0.326
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.04 ± 0.571
ArgCys: 0.203 ± 0.142
ArgAsp: 2.635 ± 0.495
ArgGlu: 3.851 ± 0.86
ArgPhe: 2.23 ± 0.333
ArgGly: 1.926 ± 0.466
ArgHis: 0.912 ± 0.332
ArgIle: 2.939 ± 0.799
ArgLys: 3.547 ± 0.654
ArgLeu: 4.662 ± 0.642
ArgMet: 0.912 ± 0.373
ArgAsn: 2.23 ± 0.388
ArgPro: 0.811 ± 0.305
ArgGln: 1.622 ± 0.466
ArgArg: 0.811 ± 0.354
ArgSer: 2.23 ± 0.521
ArgThr: 1.824 ± 0.325
ArgVal: 1.115 ± 0.417
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.216 ± 0.336
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.459 ± 0.662
SerCys: 0.709 ± 0.291
SerAsp: 4.966 ± 0.607
SerGlu: 6.892 ± 0.928
SerPhe: 3.446 ± 0.605
SerGly: 3.75 ± 0.651
SerHis: 0.811 ± 0.234
SerIle: 3.649 ± 0.641
SerLys: 6.284 ± 0.869
SerLeu: 6.892 ± 0.916
SerMet: 1.622 ± 0.459
SerAsn: 3.953 ± 0.695
SerPro: 1.419 ± 0.38
SerGln: 2.736 ± 0.414
SerArg: 1.419 ± 0.346
SerSer: 3.547 ± 0.455
SerThr: 1.622 ± 0.413
SerVal: 4.966 ± 0.738
SerTrp: 0.507 ± 0.187
SerTyr: 3.04 ± 0.55
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.23 ± 0.459
ThrCys: 0.507 ± 0.179
ThrAsp: 2.23 ± 0.577
ThrGlu: 3.04 ± 0.669
ThrPhe: 1.013 ± 0.234
ThrGly: 1.52 ± 0.396
ThrHis: 1.013 ± 0.318
ThrIle: 3.851 ± 0.607
ThrLys: 4.966 ± 0.622
ThrLeu: 4.257 ± 0.614
ThrMet: 1.013 ± 0.243
ThrAsn: 3.851 ± 0.651
ThrPro: 2.331 ± 0.549
ThrGln: 3.04 ± 0.828
ThrArg: 1.926 ± 0.401
ThrSer: 3.953 ± 0.729
ThrThr: 3.547 ± 0.647
ThrVal: 0.507 ± 0.212
ThrTrp: 0.304 ± 0.156
ThrTyr: 2.027 ± 0.402
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.23 ± 0.515
ValCys: 1.013 ± 0.3
ValAsp: 1.824 ± 0.452
ValGlu: 2.23 ± 0.426
ValPhe: 2.736 ± 0.5
ValGly: 2.635 ± 0.584
ValHis: 0.0 ± 0.0
ValIle: 3.649 ± 0.484
ValLys: 4.459 ± 0.737
ValLeu: 4.763 ± 0.635
ValMet: 1.013 ± 0.346
ValAsn: 2.736 ± 0.643
ValPro: 0.608 ± 0.259
ValGln: 0.912 ± 0.34
ValArg: 1.52 ± 0.432
ValSer: 4.358 ± 0.724
ValThr: 1.318 ± 0.322
ValVal: 2.23 ± 0.566
ValTrp: 0.405 ± 0.181
ValTyr: 1.216 ± 0.501
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.101 ± 0.109
TrpGlu: 0.608 ± 0.309
TrpPhe: 0.101 ± 0.076
TrpGly: 0.304 ± 0.123
TrpHis: 0.203 ± 0.13
TrpIle: 0.203 ± 0.143
TrpLys: 0.405 ± 0.184
TrpLeu: 0.304 ± 0.192
TrpMet: 0.203 ± 0.16
TrpAsn: 0.507 ± 0.296
TrpPro: 0.0 ± 0.0
TrpGln: 0.101 ± 0.103
TrpArg: 0.304 ± 0.156
TrpSer: 0.405 ± 0.227
TrpThr: 0.101 ± 0.102
TrpVal: 0.709 ± 0.309
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.203 ± 0.191
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.926 ± 0.478
TyrCys: 0.405 ± 0.186
TyrAsp: 2.432 ± 0.44
TyrGlu: 2.534 ± 0.456
TyrPhe: 2.331 ± 0.567
TyrGly: 1.318 ± 0.314
TyrHis: 1.115 ± 0.321
TyrIle: 2.838 ± 0.532
TyrLys: 3.344 ± 0.582
TyrLeu: 4.054 ± 0.612
TyrMet: 0.507 ± 0.252
TyrAsn: 3.04 ± 0.528
TyrPro: 1.318 ± 0.316
TyrGln: 1.723 ± 0.376
TyrArg: 2.027 ± 0.403
TyrSer: 1.926 ± 0.485
TyrThr: 1.318 ± 0.368
TyrVal: 0.608 ± 0.258
TyrTrp: 0.203 ± 0.126
TyrTyr: 1.824 ± 0.507
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 37 proteins (9868 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski