Amino acid dipepetide frequency for Cellulophaga phage phi12:3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.694AlaAla: 4.694 ± 1.266
0.636AlaCys: 0.636 ± 0.274
3.103AlaAsp: 3.103 ± 0.453
3.739AlaGlu: 3.739 ± 0.67
2.148AlaPhe: 2.148 ± 0.507
3.501AlaGly: 3.501 ± 0.548
0.716AlaHis: 0.716 ± 0.273
4.455AlaIle: 4.455 ± 0.749
5.728AlaLys: 5.728 ± 0.776
5.012AlaLeu: 5.012 ± 0.78
1.432AlaMet: 1.432 ± 0.383
2.387AlaAsn: 2.387 ± 0.436
2.069AlaPro: 2.069 ± 0.552
2.944AlaGln: 2.944 ± 0.657
2.785AlaArg: 2.785 ± 0.451
4.774AlaSer: 4.774 ± 0.67
3.58AlaThr: 3.58 ± 0.518
4.058AlaVal: 4.058 ± 0.678
1.193AlaTrp: 1.193 ± 0.256
2.705AlaTyr: 2.705 ± 0.446
0.0AlaXaa: 0.0 ± 0.0
Cys
0.477CysAla: 0.477 ± 0.166
0.159CysCys: 0.159 ± 0.123
0.875CysAsp: 0.875 ± 0.258
0.796CysGlu: 0.796 ± 0.247
0.716CysPhe: 0.716 ± 0.23
0.796CysGly: 0.796 ± 0.332
0.08CysHis: 0.08 ± 0.078
0.636CysIle: 0.636 ± 0.253
1.193CysLys: 1.193 ± 0.31
0.796CysLeu: 0.796 ± 0.306
0.159CysMet: 0.159 ± 0.108
0.398CysAsn: 0.398 ± 0.182
0.716CysPro: 0.716 ± 0.226
0.398CysGln: 0.398 ± 0.18
0.875CysArg: 0.875 ± 0.261
0.875CysSer: 0.875 ± 0.259
0.239CysThr: 0.239 ± 0.131
0.239CysVal: 0.239 ± 0.177
0.08CysTrp: 0.08 ± 0.076
0.318CysTyr: 0.318 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
4.376AspAla: 4.376 ± 0.515
0.796AspCys: 0.796 ± 0.276
3.421AspAsp: 3.421 ± 0.673
3.739AspGlu: 3.739 ± 0.574
3.898AspPhe: 3.898 ± 0.572
4.933AspGly: 4.933 ± 0.879
1.034AspHis: 1.034 ± 0.259
5.728AspIle: 5.728 ± 0.682
5.012AspLys: 5.012 ± 0.641
6.683AspLeu: 6.683 ± 0.679
1.512AspMet: 1.512 ± 0.469
3.501AspAsn: 3.501 ± 0.49
2.944AspPro: 2.944 ± 0.578
2.626AspGln: 2.626 ± 0.457
2.546AspArg: 2.546 ± 0.482
4.137AspSer: 4.137 ± 0.68
3.182AspThr: 3.182 ± 0.652
3.262AspVal: 3.262 ± 0.497
1.114AspTrp: 1.114 ± 0.257
2.228AspTyr: 2.228 ± 0.449
0.0AspXaa: 0.0 ± 0.0
Glu
5.171GluAla: 5.171 ± 0.735
0.318GluCys: 0.318 ± 0.164
4.058GluAsp: 4.058 ± 0.59
4.774GluGlu: 4.774 ± 0.859
2.705GluPhe: 2.705 ± 0.473
3.262GluGly: 3.262 ± 0.695
0.716GluHis: 0.716 ± 0.26
6.285GluIle: 6.285 ± 0.646
4.933GluLys: 4.933 ± 0.678
6.604GluLeu: 6.604 ± 0.706
1.989GluMet: 1.989 ± 0.34
3.421GluAsn: 3.421 ± 0.435
1.75GluPro: 1.75 ± 0.449
2.864GluGln: 2.864 ± 0.647
2.307GluArg: 2.307 ± 0.387
3.103GluSer: 3.103 ± 0.621
3.103GluThr: 3.103 ± 0.516
4.694GluVal: 4.694 ± 0.527
0.875GluTrp: 0.875 ± 0.256
2.466GluTyr: 2.466 ± 0.442
0.0GluXaa: 0.0 ± 0.0
Phe
2.228PheAla: 2.228 ± 0.363
0.239PheCys: 0.239 ± 0.124
4.615PheAsp: 4.615 ± 0.662
2.785PheGlu: 2.785 ± 0.477
2.944PhePhe: 2.944 ± 0.506
2.944PheGly: 2.944 ± 0.537
0.636PheHis: 0.636 ± 0.219
3.58PheIle: 3.58 ± 0.53
4.455PheLys: 4.455 ± 0.47
3.66PheLeu: 3.66 ± 0.511
1.114PheMet: 1.114 ± 0.298
4.058PheAsn: 4.058 ± 0.462
1.273PhePro: 1.273 ± 0.411
0.955PheGln: 0.955 ± 0.298
1.114PheArg: 1.114 ± 0.359
2.626PheSer: 2.626 ± 0.451
2.546PheThr: 2.546 ± 0.374
2.466PheVal: 2.466 ± 0.449
0.716PheTrp: 0.716 ± 0.226
1.83PheTyr: 1.83 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
4.455GlyAla: 4.455 ± 0.785
0.716GlyCys: 0.716 ± 0.291
4.217GlyAsp: 4.217 ± 0.692
3.66GlyGlu: 3.66 ± 0.987
4.455GlyPhe: 4.455 ± 0.589
4.058GlyGly: 4.058 ± 0.774
0.636GlyHis: 0.636 ± 0.209
4.217GlyIle: 4.217 ± 0.573
4.058GlyLys: 4.058 ± 0.633
5.888GlyLeu: 5.888 ± 0.982
1.273GlyMet: 1.273 ± 0.359
2.785GlyAsn: 2.785 ± 0.533
1.75GlyPro: 1.75 ± 0.517
2.228GlyGln: 2.228 ± 0.433
1.671GlyArg: 1.671 ± 0.326
4.853GlySer: 4.853 ± 0.693
4.535GlyThr: 4.535 ± 0.827
4.137GlyVal: 4.137 ± 0.569
0.875GlyTrp: 0.875 ± 0.24
3.182GlyTyr: 3.182 ± 0.615
0.0GlyXaa: 0.0 ± 0.0
His
0.875HisAla: 0.875 ± 0.285
0.318HisCys: 0.318 ± 0.155
0.477HisAsp: 0.477 ± 0.188
0.477HisGlu: 0.477 ± 0.172
0.716HisPhe: 0.716 ± 0.262
0.796HisGly: 0.796 ± 0.275
0.08HisHis: 0.08 ± 0.087
1.273HisIle: 1.273 ± 0.372
1.353HisLys: 1.353 ± 0.451
1.193HisLeu: 1.193 ± 0.31
0.477HisMet: 0.477 ± 0.203
0.716HisAsn: 0.716 ± 0.234
0.716HisPro: 0.716 ± 0.292
0.318HisGln: 0.318 ± 0.143
0.955HisArg: 0.955 ± 0.338
0.796HisSer: 0.796 ± 0.216
0.318HisThr: 0.318 ± 0.19
0.557HisVal: 0.557 ± 0.206
0.239HisTrp: 0.239 ± 0.148
0.636HisTyr: 0.636 ± 0.225
0.0HisXaa: 0.0 ± 0.0
Ile
4.774IleAla: 4.774 ± 0.864
0.875IleCys: 0.875 ± 0.272
6.444IleAsp: 6.444 ± 0.723
7.638IleGlu: 7.638 ± 0.832
3.023IlePhe: 3.023 ± 0.56
4.137IleGly: 4.137 ± 0.487
0.796IleHis: 0.796 ± 0.217
5.171IleIle: 5.171 ± 0.677
7.081IleLys: 7.081 ± 1.045
7.32IleLeu: 7.32 ± 0.825
1.353IleMet: 1.353 ± 0.351
5.331IleAsn: 5.331 ± 0.599
2.307IlePro: 2.307 ± 0.525
2.148IleGln: 2.148 ± 0.508
2.466IleArg: 2.466 ± 0.515
6.047IleSer: 6.047 ± 0.58
3.262IleThr: 3.262 ± 0.475
4.455IleVal: 4.455 ± 0.544
0.955IleTrp: 0.955 ± 0.232
2.466IleTyr: 2.466 ± 0.567
0.0IleXaa: 0.0 ± 0.0
Lys
5.41LysAla: 5.41 ± 0.831
0.477LysCys: 0.477 ± 0.191
5.967LysAsp: 5.967 ± 0.815
7.001LysGlu: 7.001 ± 0.997
3.023LysPhe: 3.023 ± 0.514
5.331LysGly: 5.331 ± 0.842
1.432LysHis: 1.432 ± 0.415
6.922LysIle: 6.922 ± 0.925
9.149LysLys: 9.149 ± 1.107
7.001LysLeu: 7.001 ± 1.071
2.785LysMet: 2.785 ± 0.528
5.49LysAsn: 5.49 ± 0.759
3.262LysPro: 3.262 ± 0.537
2.626LysGln: 2.626 ± 0.445
3.898LysArg: 3.898 ± 0.631
5.808LysSer: 5.808 ± 0.675
4.217LysThr: 4.217 ± 0.538
4.376LysVal: 4.376 ± 0.682
1.273LysTrp: 1.273 ± 0.337
4.137LysTyr: 4.137 ± 0.663
0.0LysXaa: 0.0 ± 0.0
Leu
3.342LeuAla: 3.342 ± 0.658
1.193LeuCys: 1.193 ± 0.345
6.444LeuAsp: 6.444 ± 0.766
5.171LeuGlu: 5.171 ± 0.648
3.978LeuPhe: 3.978 ± 0.721
4.058LeuGly: 4.058 ± 0.507
1.114LeuHis: 1.114 ± 0.362
6.763LeuIle: 6.763 ± 0.848
6.842LeuLys: 6.842 ± 1.11
6.047LeuLeu: 6.047 ± 0.709
2.785LeuMet: 2.785 ± 0.484
5.49LeuAsn: 5.49 ± 0.519
3.103LeuPro: 3.103 ± 0.431
3.182LeuGln: 3.182 ± 0.507
3.182LeuArg: 3.182 ± 0.552
7.399LeuSer: 7.399 ± 0.651
5.092LeuThr: 5.092 ± 0.683
3.898LeuVal: 3.898 ± 0.59
0.875LeuTrp: 0.875 ± 0.273
2.387LeuTyr: 2.387 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
1.75MetAla: 1.75 ± 0.395
0.239MetCys: 0.239 ± 0.112
1.193MetAsp: 1.193 ± 0.311
1.353MetGlu: 1.353 ± 0.309
0.796MetPhe: 0.796 ± 0.246
1.193MetGly: 1.193 ± 0.253
0.477MetHis: 0.477 ± 0.219
2.069MetIle: 2.069 ± 0.402
2.466MetLys: 2.466 ± 0.418
1.273MetLeu: 1.273 ± 0.304
0.557MetMet: 0.557 ± 0.294
2.466MetAsn: 2.466 ± 0.326
1.114MetPro: 1.114 ± 0.349
1.432MetGln: 1.432 ± 0.368
1.114MetArg: 1.114 ± 0.316
2.387MetSer: 2.387 ± 0.519
1.353MetThr: 1.353 ± 0.344
1.75MetVal: 1.75 ± 0.328
0.08MetTrp: 0.08 ± 0.08
0.716MetTyr: 0.716 ± 0.284
0.0MetXaa: 0.0 ± 0.0
Asn
3.819AsnAla: 3.819 ± 0.583
1.034AsnCys: 1.034 ± 0.248
3.501AsnAsp: 3.501 ± 0.549
3.103AsnGlu: 3.103 ± 0.47
2.466AsnPhe: 2.466 ± 0.457
3.421AsnGly: 3.421 ± 0.571
1.114AsnHis: 1.114 ± 0.27
4.296AsnIle: 4.296 ± 0.482
6.922AsnLys: 6.922 ± 0.686
3.501AsnLeu: 3.501 ± 0.501
1.83AsnMet: 1.83 ± 0.439
3.58AsnAsn: 3.58 ± 0.754
2.148AsnPro: 2.148 ± 0.349
1.83AsnGln: 1.83 ± 0.308
2.785AsnArg: 2.785 ± 0.49
3.501AsnSer: 3.501 ± 0.572
3.819AsnThr: 3.819 ± 0.716
2.148AsnVal: 2.148 ± 0.398
0.398AsnTrp: 0.398 ± 0.199
3.103AsnTyr: 3.103 ± 0.692
0.0AsnXaa: 0.0 ± 0.0
Pro
2.546ProAla: 2.546 ± 0.551
0.477ProCys: 0.477 ± 0.203
1.353ProAsp: 1.353 ± 0.362
2.148ProGlu: 2.148 ± 0.509
1.83ProPhe: 1.83 ± 0.463
2.864ProGly: 2.864 ± 0.448
0.636ProHis: 0.636 ± 0.195
2.307ProIle: 2.307 ± 0.406
3.58ProLys: 3.58 ± 0.666
2.387ProLeu: 2.387 ± 0.367
0.716ProMet: 0.716 ± 0.295
1.909ProAsn: 1.909 ± 0.538
1.193ProPro: 1.193 ± 0.303
1.75ProGln: 1.75 ± 0.578
0.716ProArg: 0.716 ± 0.193
2.069ProSer: 2.069 ± 0.437
1.273ProThr: 1.273 ± 0.333
1.75ProVal: 1.75 ± 0.39
0.318ProTrp: 0.318 ± 0.147
1.512ProTyr: 1.512 ± 0.275
0.0ProXaa: 0.0 ± 0.0
Gln
2.069GlnAla: 2.069 ± 0.566
0.159GlnCys: 0.159 ± 0.122
2.466GlnAsp: 2.466 ± 0.447
2.228GlnGlu: 2.228 ± 0.345
1.512GlnPhe: 1.512 ± 0.299
3.501GlnGly: 3.501 ± 1.757
0.239GlnHis: 0.239 ± 0.187
3.182GlnIle: 3.182 ± 0.443
2.785GlnLys: 2.785 ± 0.503
4.137GlnLeu: 4.137 ± 0.496
1.193GlnMet: 1.193 ± 0.303
2.228GlnAsn: 2.228 ± 0.461
0.318GlnPro: 0.318 ± 0.144
0.796GlnGln: 0.796 ± 0.378
1.83GlnArg: 1.83 ± 0.345
1.512GlnSer: 1.512 ± 0.298
2.148GlnThr: 2.148 ± 0.381
1.512GlnVal: 1.512 ± 0.406
0.477GlnTrp: 0.477 ± 0.176
1.034GlnTyr: 1.034 ± 0.271
0.0GlnXaa: 0.0 ± 0.0
Arg
1.909ArgAla: 1.909 ± 0.449
0.716ArgCys: 0.716 ± 0.222
2.466ArgAsp: 2.466 ± 0.423
2.705ArgGlu: 2.705 ± 0.365
0.796ArgPhe: 0.796 ± 0.263
2.307ArgGly: 2.307 ± 0.458
0.318ArgHis: 0.318 ± 0.156
3.819ArgIle: 3.819 ± 0.646
4.137ArgLys: 4.137 ± 0.517
3.421ArgLeu: 3.421 ± 0.43
1.591ArgMet: 1.591 ± 0.393
1.432ArgAsn: 1.432 ± 0.356
0.716ArgPro: 0.716 ± 0.265
1.83ArgGln: 1.83 ± 0.265
0.875ArgArg: 0.875 ± 0.279
1.909ArgSer: 1.909 ± 0.355
1.75ArgThr: 1.75 ± 0.45
2.148ArgVal: 2.148 ± 0.453
0.636ArgTrp: 0.636 ± 0.189
1.75ArgTyr: 1.75 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
3.342SerAla: 3.342 ± 0.462
1.034SerCys: 1.034 ± 0.281
5.171SerAsp: 5.171 ± 0.716
3.739SerGlu: 3.739 ± 0.43
3.66SerPhe: 3.66 ± 0.401
5.092SerGly: 5.092 ± 0.704
1.193SerHis: 1.193 ± 0.308
5.649SerIle: 5.649 ± 0.77
5.967SerLys: 5.967 ± 1.024
5.569SerLeu: 5.569 ± 0.627
1.512SerMet: 1.512 ± 0.338
3.739SerAsn: 3.739 ± 0.469
1.989SerPro: 1.989 ± 0.416
2.626SerGln: 2.626 ± 0.462
2.387SerArg: 2.387 ± 0.421
5.331SerSer: 5.331 ± 0.793
4.535SerThr: 4.535 ± 0.824
3.023SerVal: 3.023 ± 0.61
0.636SerTrp: 0.636 ± 0.212
3.103SerTyr: 3.103 ± 0.54
0.0SerXaa: 0.0 ± 0.0
Thr
4.058ThrAla: 4.058 ± 0.479
0.239ThrCys: 0.239 ± 0.126
3.342ThrAsp: 3.342 ± 0.584
2.785ThrGlu: 2.785 ± 0.497
2.705ThrPhe: 2.705 ± 0.624
4.774ThrGly: 4.774 ± 0.658
0.716ThrHis: 0.716 ± 0.211
3.819ThrIle: 3.819 ± 0.653
4.137ThrLys: 4.137 ± 0.742
3.739ThrLeu: 3.739 ± 0.556
0.955ThrMet: 0.955 ± 0.32
3.182ThrAsn: 3.182 ± 0.696
2.148ThrPro: 2.148 ± 0.444
1.83ThrGln: 1.83 ± 0.349
1.75ThrArg: 1.75 ± 0.42
4.137ThrSer: 4.137 ± 0.796
3.103ThrThr: 3.103 ± 0.646
2.944ThrVal: 2.944 ± 0.394
0.955ThrTrp: 0.955 ± 0.276
1.75ThrTyr: 1.75 ± 0.417
0.0ThrXaa: 0.0 ± 0.0
Val
2.228ValAla: 2.228 ± 0.436
0.318ValCys: 0.318 ± 0.171
4.694ValAsp: 4.694 ± 0.466
3.898ValGlu: 3.898 ± 0.503
2.307ValPhe: 2.307 ± 0.438
3.262ValGly: 3.262 ± 0.476
0.557ValHis: 0.557 ± 0.186
3.739ValIle: 3.739 ± 0.46
5.171ValLys: 5.171 ± 0.658
3.739ValLeu: 3.739 ± 0.656
1.034ValMet: 1.034 ± 0.297
3.262ValAsn: 3.262 ± 0.537
2.387ValPro: 2.387 ± 0.442
1.273ValGln: 1.273 ± 0.401
2.228ValArg: 2.228 ± 0.399
4.933ValSer: 4.933 ± 0.653
2.387ValThr: 2.387 ± 0.385
3.103ValVal: 3.103 ± 0.614
1.114ValTrp: 1.114 ± 0.291
2.785ValTyr: 2.785 ± 0.732
0.0ValXaa: 0.0 ± 0.0
Trp
1.591TrpAla: 1.591 ± 0.414
0.159TrpCys: 0.159 ± 0.114
0.875TrpAsp: 0.875 ± 0.228
1.512TrpGlu: 1.512 ± 0.348
0.716TrpPhe: 0.716 ± 0.231
0.875TrpGly: 0.875 ± 0.264
0.159TrpHis: 0.159 ± 0.119
1.273TrpIle: 1.273 ± 0.285
0.477TrpLys: 0.477 ± 0.231
0.796TrpLeu: 0.796 ± 0.288
0.318TrpMet: 0.318 ± 0.153
0.398TrpAsn: 0.398 ± 0.241
0.0TrpPro: 0.0 ± 0.0
0.398TrpGln: 0.398 ± 0.174
0.557TrpArg: 0.557 ± 0.217
0.875TrpSer: 0.875 ± 0.236
1.034TrpThr: 1.034 ± 0.282
1.193TrpVal: 1.193 ± 0.316
0.159TrpTrp: 0.159 ± 0.11
0.557TrpTyr: 0.557 ± 0.214
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.546TyrAla: 2.546 ± 0.431
0.716TyrCys: 0.716 ± 0.24
1.989TyrAsp: 1.989 ± 0.308
2.228TyrGlu: 2.228 ± 0.387
2.387TyrPhe: 2.387 ± 0.594
2.546TyrGly: 2.546 ± 0.533
0.636TyrHis: 0.636 ± 0.232
2.864TyrIle: 2.864 ± 0.407
4.137TyrLys: 4.137 ± 0.641
3.58TyrLeu: 3.58 ± 0.604
1.114TyrMet: 1.114 ± 0.26
2.546TyrAsn: 2.546 ± 0.363
1.432TyrPro: 1.432 ± 0.248
1.273TyrGln: 1.273 ± 0.364
1.193TyrArg: 1.193 ± 0.298
2.228TyrSer: 2.228 ± 0.392
1.591TyrThr: 1.591 ± 0.269
2.705TyrVal: 2.705 ± 0.596
0.955TyrTrp: 0.955 ± 0.363
2.387TyrTyr: 2.387 ± 0.633
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12570 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski