Amino acid dipeptide frequency for Cellulophaga phage phi19:1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
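As an illustration, dipeptide frequencies could be computed from a list of protein sequences roughly as follows. This is a minimal sketch, not the database's own code: the function names are hypothetical, the input is assumed to be one-letter sequences, any unrecognised letter is mapped to Xaa, and normalising by the total number of dipeptides is an assumption (the database may normalise differently).

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
RESIDUES = list(ONE_TO_THREE.values()) + ["Xaa"]


def to_three(residue):
    """Map a one-letter code to its three-letter code; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of all 441 ordered dipeptides.

    Assumption: frequencies are normalised by the total number of dipeptides.
    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        codes = [to_three(r) for r in seq]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(RESIDUES, repeat=2)
    }


# Toy input, not taken from the phage proteome.
freqs = dipeptide_per_mille(["MKKLA", "MKXT"])
print(freqs["MetLys"], freqs["LysXaa"])
```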

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
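The uniform expectation and the over/under-representation call can be sketched as below. The function names are hypothetical, and comparing each value directly against the uniform expectation is an assumption about how the colouring works; the database may apply a different rule.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every ordered dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(observed_per_mille, alphabet_size=20):
    """Classify a dipeptide as 'over', 'under' or 'expected' (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"    # would be marked in red
    if observed_per_mille < expected:
        return "under"   # would be marked in blue
    return "expected"


print(uniform_expectation_per_mille(20))  # 2.5 per mille, i.e. 0.25%
print(representation(9.151))              # IleLys value from the table
print(representation(0.568))              # AlaCys value from the table
```

With Xaa included (alphabet size 21) the expectation drops to about 2.27‰.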

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
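For example, the unit conversion can be written as the following small helpers (the function names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# The IleLys entry of the table, 9.151 per mille, as a fraction and in percent.
print(per_mille_to_fraction(9.151))
print(per_mille_to_percent(9.151))
```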

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.615 ± 0.995
AlaCys: 0.568 ± 0.225
AlaAsp: 2.615 ± 0.496
AlaGlu: 2.728 ± 0.535
AlaPhe: 2.785 ± 0.462
AlaGly: 2.558 ± 0.451
AlaHis: 0.682 ± 0.195
AlaIle: 5.343 ± 0.535
AlaLys: 3.524 ± 0.443
AlaLeu: 5.116 ± 0.678
AlaMet: 0.966 ± 0.231
AlaAsn: 4.093 ± 0.379
AlaPro: 0.682 ± 0.207
AlaGln: 2.046 ± 0.455
AlaArg: 1.478 ± 0.298
AlaSer: 3.751 ± 0.439
AlaThr: 3.979 ± 0.502
AlaVal: 2.899 ± 0.464
AlaTrp: 0.512 ± 0.156
AlaTyr: 2.103 ± 0.273
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.398 ± 0.135
CysCys: 0.512 ± 0.218
CysAsp: 1.478 ± 0.332
CysGlu: 0.682 ± 0.196
CysPhe: 0.398 ± 0.176
CysGly: 1.023 ± 0.286
CysHis: 0.057 ± 0.055
CysIle: 0.853 ± 0.206
CysLys: 0.853 ± 0.237
CysLeu: 0.739 ± 0.234
CysMet: 0.284 ± 0.113
CysAsn: 0.909 ± 0.241
CysPro: 0.171 ± 0.087
CysGln: 0.227 ± 0.107
CysArg: 0.512 ± 0.178
CysSer: 0.909 ± 0.215
CysThr: 0.739 ± 0.188
CysVal: 0.625 ± 0.17
CysTrp: 0.057 ± 0.041
CysTyr: 0.512 ± 0.166
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.808 ± 0.579
AspCys: 1.194 ± 0.269
AspAsp: 4.036 ± 0.565
AspGlu: 4.718 ± 0.524
AspPhe: 4.263 ± 0.543
AspGly: 4.093 ± 0.552
AspHis: 0.512 ± 0.16
AspIle: 3.524 ± 0.465
AspLys: 4.434 ± 0.476
AspLeu: 6.48 ± 0.546
AspMet: 0.853 ± 0.214
AspAsn: 4.604 ± 0.524
AspPro: 1.25 ± 0.264
AspGln: 0.853 ± 0.198
AspArg: 1.592 ± 0.262
AspSer: 4.888 ± 0.471
AspThr: 3.695 ± 0.427
AspVal: 3.751 ± 0.443
AspTrp: 1.023 ± 0.194
AspTyr: 3.126 ± 0.376
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.069 ± 0.563
GluCys: 0.625 ± 0.181
GluAsp: 2.956 ± 0.411
GluGlu: 5.798 ± 0.584
GluPhe: 2.842 ± 0.367
GluGly: 3.581 ± 0.454
GluHis: 1.25 ± 0.226
GluIle: 6.48 ± 0.612
GluLys: 6.878 ± 0.903
GluLeu: 7.048 ± 0.645
GluMet: 1.478 ± 0.301
GluAsn: 4.263 ± 0.454
GluPro: 1.762 ± 0.251
GluGln: 2.672 ± 0.38
GluArg: 2.785 ± 0.408
GluSer: 5.627 ± 0.521
GluThr: 3.24 ± 0.545
GluVal: 4.206 ± 0.49
GluTrp: 0.966 ± 0.22
GluTyr: 3.126 ± 0.456
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.274 ± 0.371
PheCys: 0.625 ± 0.179
PheAsp: 3.41 ± 0.451
PheGlu: 2.956 ± 0.379
PhePhe: 1.421 ± 0.267
PheGly: 2.672 ± 0.327
PheHis: 0.568 ± 0.173
PheIle: 3.695 ± 0.349
PheLys: 3.865 ± 0.538
PheLeu: 3.695 ± 0.5
PheMet: 0.853 ± 0.207
PheAsn: 4.718 ± 0.544
PhePro: 0.966 ± 0.221
PheGln: 1.25 ± 0.241
PheArg: 1.478 ± 0.308
PheSer: 2.956 ± 0.409
PheThr: 2.387 ± 0.296
PheVal: 2.956 ± 0.479
PheTrp: 0.455 ± 0.196
PheTyr: 2.274 ± 0.296
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.615 ± 0.394
GlyCys: 0.568 ± 0.18
GlyAsp: 3.581 ± 0.387
GlyGlu: 3.013 ± 0.426
GlyPhe: 2.785 ± 0.325
GlyGly: 3.183 ± 0.514
GlyHis: 0.625 ± 0.196
GlyIle: 3.808 ± 0.439
GlyLys: 3.979 ± 0.407
GlyLeu: 5.173 ± 0.52
GlyMet: 1.137 ± 0.23
GlyAsn: 2.899 ± 0.415
GlyPro: 0.0 ± 0.0
GlyGln: 1.08 ± 0.243
GlyArg: 2.274 ± 0.326
GlySer: 4.547 ± 0.755
GlyThr: 4.149 ± 0.638
GlyVal: 5.457 ± 0.63
GlyTrp: 1.023 ± 0.21
GlyTyr: 3.013 ± 0.462
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.455 ± 0.159
HisCys: 0.057 ± 0.054
HisAsp: 0.796 ± 0.21
HisGlu: 0.966 ± 0.209
HisPhe: 0.739 ± 0.218
HisGly: 0.682 ± 0.19
HisHis: 0.284 ± 0.208
HisIle: 1.137 ± 0.256
HisLys: 1.08 ± 0.235
HisLeu: 1.819 ± 0.31
HisMet: 0.284 ± 0.115
HisAsn: 0.853 ± 0.194
HisPro: 0.341 ± 0.133
HisGln: 0.398 ± 0.152
HisArg: 0.341 ± 0.141
HisSer: 1.08 ± 0.316
HisThr: 0.625 ± 0.191
HisVal: 0.909 ± 0.224
HisTrp: 0.171 ± 0.085
HisTyr: 0.568 ± 0.18
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.002 ± 0.857
IleCys: 1.023 ± 0.229
IleAsp: 6.139 ± 0.674
IleGlu: 6.594 ± 0.55
IlePhe: 2.842 ± 0.419
IleGly: 3.808 ± 0.491
IleHis: 1.478 ± 0.39
IleIle: 4.888 ± 0.67
IleLys: 9.151 ± 0.716
IleLeu: 6.594 ± 0.59
IleMet: 1.762 ± 0.327
IleAsn: 6.366 ± 0.564
IlePro: 2.444 ± 0.377
IleGln: 2.387 ± 0.354
IleArg: 2.728 ± 0.309
IleSer: 5.286 ± 0.574
IleThr: 4.661 ± 0.562
IleVal: 4.49 ± 0.553
IleTrp: 0.682 ± 0.169
IleTyr: 2.33 ± 0.309
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.775 ± 0.621
LysCys: 0.966 ± 0.224
LysAsp: 5.343 ± 0.484
LysGlu: 8.924 ± 0.937
LysPhe: 3.013 ± 0.417
LysGly: 5.173 ± 0.504
LysHis: 1.478 ± 0.285
LysIle: 6.764 ± 0.6
LysLys: 6.594 ± 0.849
LysLeu: 8.128 ± 0.583
LysMet: 2.558 ± 0.452
LysAsn: 6.139 ± 0.748
LysPro: 2.444 ± 0.403
LysGln: 3.638 ± 0.586
LysArg: 3.069 ± 0.45
LysSer: 5.627 ± 0.655
LysThr: 6.082 ± 0.512
LysVal: 5.059 ± 0.571
LysTrp: 1.421 ± 0.291
LysTyr: 4.32 ± 0.673
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.036 ± 0.745
LeuCys: 0.341 ± 0.114
LeuAsp: 6.196 ± 0.587
LeuGlu: 6.423 ± 0.607
LeuPhe: 4.036 ± 0.546
LeuGly: 5.002 ± 0.488
LeuHis: 0.966 ± 0.226
LeuIle: 7.844 ± 0.576
LeuLys: 7.73 ± 0.691
LeuLeu: 6.991 ± 0.648
LeuMet: 1.819 ± 0.286
LeuAsn: 6.366 ± 0.591
LeuPro: 2.33 ± 0.4
LeuGln: 2.103 ± 0.327
LeuArg: 3.467 ± 0.457
LeuSer: 6.423 ± 0.495
LeuThr: 6.309 ± 0.595
LeuVal: 5.514 ± 0.496
LeuTrp: 0.398 ± 0.146
LeuTyr: 3.297 ± 0.455
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.25 ± 0.27
MetCys: 0.171 ± 0.089
MetAsp: 1.421 ± 0.3
MetGlu: 1.478 ± 0.275
MetPhe: 0.398 ± 0.162
MetGly: 0.455 ± 0.135
MetHis: 0.171 ± 0.098
MetIle: 1.989 ± 0.33
MetLys: 3.524 ± 0.512
MetLeu: 1.364 ± 0.227
MetMet: 0.455 ± 0.133
MetAsn: 1.592 ± 0.32
MetPro: 0.568 ± 0.155
MetGln: 0.853 ± 0.208
MetArg: 0.739 ± 0.22
MetSer: 1.648 ± 0.264
MetThr: 1.307 ± 0.264
MetVal: 1.023 ± 0.254
MetTrp: 0.114 ± 0.078
MetTyr: 0.796 ± 0.203
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.638 ± 0.442
AsnCys: 0.966 ± 0.317
AsnAsp: 3.865 ± 0.585
AsnGlu: 4.49 ± 0.549
AsnPhe: 2.728 ± 0.42
AsnGly: 4.775 ± 0.506
AsnHis: 1.307 ± 0.221
AsnIle: 5.627 ± 0.573
AsnLys: 8.981 ± 0.705
AsnLeu: 5.002 ± 0.509
AsnMet: 2.046 ± 0.313
AsnAsn: 4.604 ± 0.73
AsnPro: 1.989 ± 0.354
AsnGln: 2.274 ± 0.29
AsnArg: 2.33 ± 0.36
AsnSer: 4.377 ± 0.514
AsnThr: 3.751 ± 0.461
AsnVal: 4.661 ± 0.502
AsnTrp: 1.25 ± 0.278
AsnTyr: 3.524 ± 0.52
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 0.966 ± 0.222
ProCys: 0.284 ± 0.111
ProAsp: 0.796 ± 0.197
ProGlu: 2.103 ± 0.408
ProPhe: 1.705 ± 0.294
ProGly: 0.0 ± 0.0
ProHis: 0.398 ± 0.148
ProIle: 1.819 ± 0.29
ProLys: 2.103 ± 0.369
ProLeu: 1.762 ± 0.352
ProMet: 0.398 ± 0.148
ProAsn: 1.762 ± 0.32
ProPro: 0.568 ± 0.278
ProGln: 0.568 ± 0.165
ProArg: 0.739 ± 0.169
ProSer: 2.274 ± 0.368
ProThr: 1.478 ± 0.257
ProVal: 1.876 ± 0.332
ProTrp: 0.114 ± 0.075
ProTyr: 1.648 ± 0.3
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.535 ± 0.431
GlnCys: 0.114 ± 0.076
GlnAsp: 1.592 ± 0.315
GlnGlu: 1.819 ± 0.325
GlnPhe: 1.137 ± 0.225
GlnGly: 1.648 ± 0.315
GlnHis: 0.398 ± 0.148
GlnIle: 2.615 ± 0.457
GlnLys: 2.444 ± 0.405
GlnLeu: 2.615 ± 0.439
GlnMet: 0.853 ± 0.223
GlnAsn: 1.762 ± 0.296
GlnPro: 0.796 ± 0.202
GlnGln: 1.023 ± 0.242
GlnArg: 1.421 ± 0.333
GlnSer: 2.274 ± 0.29
GlnThr: 1.989 ± 0.306
GlnVal: 1.535 ± 0.297
GlnTrp: 0.171 ± 0.101
GlnTyr: 1.08 ± 0.221
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.25 ± 0.231
ArgCys: 0.398 ± 0.159
ArgAsp: 2.103 ± 0.413
ArgGlu: 2.046 ± 0.401
ArgPhe: 1.478 ± 0.25
ArgGly: 1.194 ± 0.262
ArgHis: 0.171 ± 0.098
ArgIle: 3.751 ± 0.423
ArgLys: 3.183 ± 0.438
ArgLeu: 3.865 ± 0.472
ArgMet: 0.853 ± 0.226
ArgAsn: 2.899 ± 0.414
ArgPro: 0.966 ± 0.194
ArgGln: 1.421 ± 0.244
ArgArg: 1.137 ± 0.231
ArgSer: 1.421 ± 0.278
ArgThr: 2.274 ± 0.383
ArgVal: 2.444 ± 0.374
ArgTrp: 0.227 ± 0.102
ArgTyr: 1.648 ± 0.273
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.093 ± 0.58
SerCys: 0.909 ± 0.225
SerAsp: 5.343 ± 0.656
SerGlu: 4.945 ± 0.507
SerPhe: 3.808 ± 0.482
SerGly: 4.718 ± 0.596
SerHis: 0.796 ± 0.22
SerIle: 5.286 ± 0.619
SerLys: 6.309 ± 0.528
SerLeu: 6.423 ± 0.514
SerMet: 1.023 ± 0.234
SerAsn: 5.57 ± 0.647
SerPro: 1.478 ± 0.292
SerGln: 1.535 ± 0.233
SerArg: 2.672 ± 0.374
SerSer: 4.149 ± 0.558
SerThr: 5.798 ± 0.607
SerVal: 3.126 ± 0.429
SerTrp: 0.455 ± 0.149
SerTyr: 2.615 ± 0.377
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.922 ± 0.486
ThrCys: 0.739 ± 0.203
ThrAsp: 3.695 ± 0.493
ThrGlu: 3.979 ± 0.541
ThrPhe: 3.126 ± 0.363
ThrGly: 3.808 ± 0.457
ThrHis: 0.796 ± 0.203
ThrIle: 5.286 ± 0.557
ThrLys: 5.343 ± 0.578
ThrLeu: 5.059 ± 0.512
ThrMet: 1.137 ± 0.257
ThrAsn: 4.093 ± 0.408
ThrPro: 2.103 ± 0.332
ThrGln: 1.989 ± 0.298
ThrArg: 1.762 ± 0.324
ThrSer: 4.831 ± 0.581
ThrThr: 4.547 ± 0.838
ThrVal: 3.354 ± 0.447
ThrTrp: 0.966 ± 0.208
ThrTyr: 2.615 ± 0.46
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.069 ± 0.452
ValCys: 0.853 ± 0.24
ValAsp: 3.524 ± 0.439
ValGlu: 3.922 ± 0.408
ValPhe: 2.956 ± 0.421
ValGly: 3.581 ± 0.482
ValHis: 0.568 ± 0.179
ValIle: 5.4 ± 0.594
ValLys: 5.855 ± 0.579
ValLeu: 4.945 ± 0.488
ValMet: 1.25 ± 0.296
ValAsn: 4.32 ± 0.461
ValPro: 1.364 ± 0.324
ValGln: 1.25 ± 0.243
ValArg: 2.217 ± 0.313
ValSer: 5.286 ± 0.698
ValThr: 2.728 ± 0.484
ValVal: 3.24 ± 0.413
ValTrp: 0.398 ± 0.151
ValTyr: 3.126 ± 0.461
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.284 ± 0.117
TrpCys: 0.227 ± 0.115
TrpAsp: 0.796 ± 0.194
TrpGlu: 0.512 ± 0.136
TrpPhe: 0.568 ± 0.195
TrpGly: 0.284 ± 0.121
TrpHis: 0.171 ± 0.107
TrpIle: 1.137 ± 0.251
TrpLys: 1.25 ± 0.244
TrpLeu: 1.023 ± 0.268
TrpMet: 0.455 ± 0.141
TrpAsn: 1.307 ± 0.248
TrpPro: 0.0 ± 0.0
TrpGln: 0.284 ± 0.124
TrpArg: 0.512 ± 0.187
TrpSer: 0.682 ± 0.199
TrpThr: 0.739 ± 0.202
TrpVal: 0.682 ± 0.19
TrpTrp: 0.227 ± 0.133
TrpTyr: 0.455 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.989 ± 0.351
TyrCys: 0.796 ± 0.216
TyrAsp: 3.069 ± 0.419
TyrGlu: 2.558 ± 0.379
TyrPhe: 2.444 ± 0.379
TyrGly: 2.672 ± 0.403
TyrHis: 0.909 ± 0.203
TyrIle: 3.41 ± 0.363
TyrLys: 4.434 ± 0.461
TyrLeu: 3.751 ± 0.415
TyrMet: 0.796 ± 0.196
TyrAsn: 3.069 ± 0.496
TyrPro: 1.137 ± 0.216
TyrGln: 0.966 ± 0.169
TyrArg: 1.478 ± 0.328
TyrSer: 3.013 ± 0.433
TyrThr: 2.672 ± 0.383
TyrVal: 2.046 ± 0.43
TyrTrp: 0.966 ± 0.252
TyrTyr: 1.705 ± 0.287
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 113 proteins (17594 amino acids)
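The rows of the table above can be read into a lookup structure with a short parser. This is only a sketch: the function name and the returned layout are hypothetical, and the pattern assumes each entry looks like "IleLys: 9.151 ± 0.716", optionally preceded by a repeated copy of the value.

```python
import re

# One table entry: optional repeated value, two three-letter codes, value ± error.
ROW = re.compile(
    r"^\s*[\d.]*"                       # optional duplicated value before the name
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"   # first and second residue
    r":\s*([\d.]+)\s*±\s*([\d.]+)\s*$"  # frequency and bootstrap error (per mille)
)


def parse_rows(lines):
    """Map (first, second) residue pairs to (frequency, error); skip other lines."""
    table = {}
    for line in lines:
        match = ROW.match(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


rows = parse_rows(["Ile", "IleLys: 9.151 ± 0.716", "IleLeu: 6.594 ± 0.59"])
print(rows[("Ile", "Lys")])
```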

Note: The error has been estimated by bootstrapping (×100) at the protein level.
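A protein-level bootstrap of this kind might look like the sketch below. It is not the database's implementation: the function names, the fixed seed, the use of one-letter sequences, and reporting the standard deviation across replicates as the error are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per-mille frequency of each observed dipeptide (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()} if total else {}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling whole proteins.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the phage proteome.
toy = ["MKKLA", "MKT", "AKKK", "MLLS"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.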

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski