Amino acid dipepetide frequency for Cellulophaga phage phi19:3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.846AlaAla: 2.846 ± 0.56
0.552AlaCys: 0.552 ± 0.158
2.931AlaAsp: 2.931 ± 0.268
3.058AlaGlu: 3.058 ± 0.433
1.699AlaPhe: 1.699 ± 0.29
2.081AlaGly: 2.081 ± 0.371
0.467AlaHis: 0.467 ± 0.138
3.823AlaIle: 3.823 ± 0.398
4.97AlaLys: 4.97 ± 0.523
4.63AlaLeu: 4.63 ± 0.633
1.232AlaMet: 1.232 ± 0.252
4.375AlaAsn: 4.375 ± 0.422
0.85AlaPro: 0.85 ± 0.246
2.124AlaGln: 2.124 ± 0.331
1.402AlaArg: 1.402 ± 0.31
3.526AlaSer: 3.526 ± 0.469
3.058AlaThr: 3.058 ± 0.326
2.889AlaVal: 2.889 ± 0.31
0.637AlaTrp: 0.637 ± 0.192
2.294AlaTyr: 2.294 ± 0.274
0.0AlaXaa: 0.0 ± 0.0
Cys
0.212CysAla: 0.212 ± 0.1
0.085CysCys: 0.085 ± 0.062
1.019CysAsp: 1.019 ± 0.281
0.85CysGlu: 0.85 ± 0.201
0.51CysPhe: 0.51 ± 0.162
0.765CysGly: 0.765 ± 0.179
0.085CysHis: 0.085 ± 0.058
0.722CysIle: 0.722 ± 0.187
1.317CysLys: 1.317 ± 0.318
0.722CysLeu: 0.722 ± 0.175
0.212CysMet: 0.212 ± 0.11
0.595CysAsn: 0.595 ± 0.185
0.34CysPro: 0.34 ± 0.128
0.17CysGln: 0.17 ± 0.086
0.255CysArg: 0.255 ± 0.092
0.68CysSer: 0.68 ± 0.181
0.85CysThr: 0.85 ± 0.268
0.51CysVal: 0.51 ± 0.172
0.127CysTrp: 0.127 ± 0.07
0.552CysTyr: 0.552 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
3.356AspAla: 3.356 ± 0.458
0.722AspCys: 0.722 ± 0.192
3.483AspAsp: 3.483 ± 0.373
4.205AspGlu: 4.205 ± 0.369
4.163AspPhe: 4.163 ± 0.469
5.182AspGly: 5.182 ± 0.519
0.467AspHis: 0.467 ± 0.141
5.013AspIle: 5.013 ± 0.409
5.14AspLys: 5.14 ± 0.47
6.372AspLeu: 6.372 ± 0.443
1.359AspMet: 1.359 ± 0.296
4.163AspAsn: 4.163 ± 0.536
1.869AspPro: 1.869 ± 0.337
1.189AspGln: 1.189 ± 0.24
2.676AspArg: 2.676 ± 0.291
4.375AspSer: 4.375 ± 0.348
3.696AspThr: 3.696 ± 0.434
3.441AspVal: 3.441 ± 0.359
0.85AspTrp: 0.85 ± 0.191
2.761AspTyr: 2.761 ± 0.307
0.0AspXaa: 0.0 ± 0.0
Glu
4.248GluAla: 4.248 ± 0.685
0.637GluCys: 0.637 ± 0.179
4.545GluAsp: 4.545 ± 0.64
6.712GluGlu: 6.712 ± 0.594
2.931GluPhe: 2.931 ± 0.332
3.866GluGly: 3.866 ± 0.399
0.892GluHis: 0.892 ± 0.221
6.117GluIle: 6.117 ± 0.609
6.627GluLys: 6.627 ± 0.682
7.179GluLeu: 7.179 ± 0.62
1.487GluMet: 1.487 ± 0.275
5.522GluAsn: 5.522 ± 0.405
1.189GluPro: 1.189 ± 0.211
2.549GluGln: 2.549 ± 0.375
2.421GluArg: 2.421 ± 0.357
5.055GluSer: 5.055 ± 0.594
5.48GluThr: 5.48 ± 0.564
4.418GluVal: 4.418 ± 0.503
0.977GluTrp: 0.977 ± 0.23
3.313GluTyr: 3.313 ± 0.444
0.0GluXaa: 0.0 ± 0.0
Phe
1.827PheAla: 1.827 ± 0.266
0.807PheCys: 0.807 ± 0.184
3.228PheAsp: 3.228 ± 0.333
2.931PheGlu: 2.931 ± 0.406
2.081PhePhe: 2.081 ± 0.317
2.889PheGly: 2.889 ± 0.366
0.637PheHis: 0.637 ± 0.185
3.611PheIle: 3.611 ± 0.434
3.908PheLys: 3.908 ± 0.394
3.058PheLeu: 3.058 ± 0.364
0.935PheMet: 0.935 ± 0.179
3.398PheAsn: 3.398 ± 0.518
1.317PhePro: 1.317 ± 0.267
0.85PheGln: 0.85 ± 0.206
1.317PheArg: 1.317 ± 0.26
3.611PheSer: 3.611 ± 0.462
2.676PheThr: 2.676 ± 0.311
2.251PheVal: 2.251 ± 0.298
0.34PheTrp: 0.34 ± 0.137
1.657PheTyr: 1.657 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
2.676GlyAla: 2.676 ± 0.309
0.637GlyCys: 0.637 ± 0.181
3.951GlyAsp: 3.951 ± 0.404
3.568GlyGlu: 3.568 ± 0.426
2.761GlyPhe: 2.761 ± 0.376
3.653GlyGly: 3.653 ± 0.484
0.382GlyHis: 0.382 ± 0.15
5.225GlyIle: 5.225 ± 0.587
4.545GlyLys: 4.545 ± 0.416
4.843GlyLeu: 4.843 ± 0.465
1.317GlyMet: 1.317 ± 0.241
3.143GlyAsn: 3.143 ± 0.496
0.85GlyPro: 0.85 ± 0.208
1.274GlyGln: 1.274 ± 0.191
2.464GlyArg: 2.464 ± 0.328
5.14GlySer: 5.14 ± 0.607
4.46GlyThr: 4.46 ± 0.604
4.588GlyVal: 4.588 ± 0.4
0.935GlyTrp: 0.935 ± 0.211
2.846GlyTyr: 2.846 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.595HisAla: 0.595 ± 0.14
0.17HisCys: 0.17 ± 0.083
0.51HisAsp: 0.51 ± 0.141
1.189HisGlu: 1.189 ± 0.257
0.595HisPhe: 0.595 ± 0.176
0.552HisGly: 0.552 ± 0.177
0.127HisHis: 0.127 ± 0.078
1.487HisIle: 1.487 ± 0.273
0.935HisLys: 0.935 ± 0.229
1.189HisLeu: 1.189 ± 0.301
0.382HisMet: 0.382 ± 0.135
0.977HisAsn: 0.977 ± 0.214
0.595HisPro: 0.595 ± 0.176
0.467HisGln: 0.467 ± 0.135
0.637HisArg: 0.637 ± 0.15
0.68HisSer: 0.68 ± 0.167
0.892HisThr: 0.892 ± 0.223
0.425HisVal: 0.425 ± 0.137
0.255HisTrp: 0.255 ± 0.091
0.892HisTyr: 0.892 ± 0.187
0.0HisXaa: 0.0 ± 0.0
Ile
4.12IleAla: 4.12 ± 0.612
1.019IleCys: 1.019 ± 0.23
5.99IleAsp: 5.99 ± 0.425
6.669IleGlu: 6.669 ± 0.577
2.166IlePhe: 2.166 ± 0.39
4.418IleGly: 4.418 ± 0.456
0.85IleHis: 0.85 ± 0.212
5.097IleIle: 5.097 ± 0.572
7.986IleLys: 7.986 ± 0.616
5.565IleLeu: 5.565 ± 0.59
1.827IleMet: 1.827 ± 0.295
6.329IleAsn: 6.329 ± 0.533
2.846IlePro: 2.846 ± 0.31
2.506IleGln: 2.506 ± 0.285
2.379IleArg: 2.379 ± 0.361
6.159IleSer: 6.159 ± 0.526
5.607IleThr: 5.607 ± 0.756
4.163IleVal: 4.163 ± 0.413
0.552IleTrp: 0.552 ± 0.118
2.719IleTyr: 2.719 ± 0.281
0.0IleXaa: 0.0 ± 0.0
Lys
3.908LysAla: 3.908 ± 0.413
0.935LysCys: 0.935 ± 0.269
6.244LysAsp: 6.244 ± 0.526
9.091LysGlu: 9.091 ± 0.872
3.228LysPhe: 3.228 ± 0.425
4.673LysGly: 4.673 ± 0.438
1.529LysHis: 1.529 ± 0.291
6.032LysIle: 6.032 ± 0.514
8.708LysLys: 8.708 ± 0.78
7.774LysLeu: 7.774 ± 0.574
2.931LysMet: 2.931 ± 0.39
5.947LysAsn: 5.947 ± 0.518
2.549LysPro: 2.549 ± 0.281
3.568LysGln: 3.568 ± 0.419
3.611LysArg: 3.611 ± 0.493
5.522LysSer: 5.522 ± 0.641
5.692LysThr: 5.692 ± 0.536
4.885LysVal: 4.885 ± 0.422
1.019LysTrp: 1.019 ± 0.21
3.738LysTyr: 3.738 ± 0.425
0.0LysXaa: 0.0 ± 0.0
Leu
3.866LeuAla: 3.866 ± 0.431
0.595LeuCys: 0.595 ± 0.154
5.097LeuAsp: 5.097 ± 0.487
6.202LeuGlu: 6.202 ± 0.506
3.611LeuPhe: 3.611 ± 0.416
4.248LeuGly: 4.248 ± 0.511
1.444LeuHis: 1.444 ± 0.336
6.372LeuIle: 6.372 ± 0.542
7.859LeuLys: 7.859 ± 0.748
6.414LeuLeu: 6.414 ± 0.655
1.742LeuMet: 1.742 ± 0.247
6.372LeuAsn: 6.372 ± 0.428
1.827LeuPro: 1.827 ± 0.304
2.974LeuGln: 2.974 ± 0.38
3.356LeuArg: 3.356 ± 0.456
6.075LeuSer: 6.075 ± 0.503
4.97LeuThr: 4.97 ± 0.513
4.078LeuVal: 4.078 ± 0.44
0.892LeuTrp: 0.892 ± 0.194
3.696LeuTyr: 3.696 ± 0.544
0.0LeuXaa: 0.0 ± 0.0
Met
1.487MetAla: 1.487 ± 0.262
0.51MetCys: 0.51 ± 0.149
1.742MetAsp: 1.742 ± 0.342
1.402MetGlu: 1.402 ± 0.274
0.68MetPhe: 0.68 ± 0.185
1.189MetGly: 1.189 ± 0.192
0.297MetHis: 0.297 ± 0.143
1.359MetIle: 1.359 ± 0.242
3.398MetLys: 3.398 ± 0.444
1.359MetLeu: 1.359 ± 0.256
0.255MetMet: 0.255 ± 0.083
1.784MetAsn: 1.784 ± 0.223
0.637MetPro: 0.637 ± 0.142
1.232MetGln: 1.232 ± 0.262
0.807MetArg: 0.807 ± 0.245
1.954MetSer: 1.954 ± 0.276
1.062MetThr: 1.062 ± 0.22
0.892MetVal: 0.892 ± 0.186
0.085MetTrp: 0.085 ± 0.058
0.552MetTyr: 0.552 ± 0.146
0.0MetXaa: 0.0 ± 0.0
Asn
4.205AsnAla: 4.205 ± 0.497
0.807AsnCys: 0.807 ± 0.273
4.163AsnAsp: 4.163 ± 0.354
4.078AsnGlu: 4.078 ± 0.453
2.676AsnPhe: 2.676 ± 0.35
4.205AsnGly: 4.205 ± 0.539
1.529AsnHis: 1.529 ± 0.267
6.117AsnIle: 6.117 ± 0.601
5.777AsnLys: 5.777 ± 0.455
5.352AsnLeu: 5.352 ± 0.542
2.039AsnMet: 2.039 ± 0.282
4.29AsnAsn: 4.29 ± 0.498
2.506AsnPro: 2.506 ± 0.302
2.421AsnGln: 2.421 ± 0.317
2.931AsnArg: 2.931 ± 0.383
4.46AsnSer: 4.46 ± 0.487
4.418AsnThr: 4.418 ± 0.44
3.568AsnVal: 3.568 ± 0.323
0.637AsnTrp: 0.637 ± 0.173
3.016AsnTyr: 3.016 ± 0.369
0.0AsnXaa: 0.0 ± 0.0
Pro
1.062ProAla: 1.062 ± 0.236
0.17ProCys: 0.17 ± 0.087
1.742ProAsp: 1.742 ± 0.298
2.591ProGlu: 2.591 ± 0.296
1.317ProPhe: 1.317 ± 0.259
0.935ProGly: 0.935 ± 0.191
0.297ProHis: 0.297 ± 0.111
2.294ProIle: 2.294 ± 0.339
3.016ProLys: 3.016 ± 0.317
2.294ProLeu: 2.294 ± 0.316
0.382ProMet: 0.382 ± 0.135
1.402ProAsn: 1.402 ± 0.21
0.51ProPro: 0.51 ± 0.147
0.935ProGln: 0.935 ± 0.198
0.51ProArg: 0.51 ± 0.142
1.657ProSer: 1.657 ± 0.231
2.336ProThr: 2.336 ± 0.337
1.699ProVal: 1.699 ± 0.268
0.127ProTrp: 0.127 ± 0.072
1.189ProTyr: 1.189 ± 0.186
0.0ProXaa: 0.0 ± 0.0
Gln
1.614GlnAla: 1.614 ± 0.31
0.212GlnCys: 0.212 ± 0.1
1.784GlnAsp: 1.784 ± 0.306
2.931GlnGlu: 2.931 ± 0.658
1.359GlnPhe: 1.359 ± 0.276
2.251GlnGly: 2.251 ± 0.363
0.68GlnHis: 0.68 ± 0.192
2.294GlnIle: 2.294 ± 0.328
2.889GlnLys: 2.889 ± 0.485
2.719GlnLeu: 2.719 ± 0.369
0.935GlnMet: 0.935 ± 0.236
2.166GlnAsn: 2.166 ± 0.296
0.977GlnPro: 0.977 ± 0.245
1.444GlnGln: 1.444 ± 0.272
0.807GlnArg: 0.807 ± 0.221
2.379GlnSer: 2.379 ± 0.315
1.869GlnThr: 1.869 ± 0.249
1.529GlnVal: 1.529 ± 0.225
0.255GlnTrp: 0.255 ± 0.097
1.402GlnTyr: 1.402 ± 0.244
0.0GlnXaa: 0.0 ± 0.0
Arg
1.147ArgAla: 1.147 ± 0.272
0.34ArgCys: 0.34 ± 0.121
2.294ArgAsp: 2.294 ± 0.286
3.186ArgGlu: 3.186 ± 0.451
1.784ArgPhe: 1.784 ± 0.245
1.529ArgGly: 1.529 ± 0.334
0.552ArgHis: 0.552 ± 0.138
3.271ArgIle: 3.271 ± 0.393
2.974ArgLys: 2.974 ± 0.363
3.186ArgLeu: 3.186 ± 0.342
0.935ArgMet: 0.935 ± 0.184
2.039ArgAsn: 2.039 ± 0.328
0.85ArgPro: 0.85 ± 0.178
1.402ArgGln: 1.402 ± 0.243
1.317ArgArg: 1.317 ± 0.207
2.251ArgSer: 2.251 ± 0.294
2.209ArgThr: 2.209 ± 0.215
1.954ArgVal: 1.954 ± 0.26
0.34ArgTrp: 0.34 ± 0.113
1.742ArgTyr: 1.742 ± 0.295
0.0ArgXaa: 0.0 ± 0.0
Ser
3.271SerAla: 3.271 ± 0.423
0.722SerCys: 0.722 ± 0.174
4.97SerAsp: 4.97 ± 0.441
5.055SerGlu: 5.055 ± 0.461
3.356SerPhe: 3.356 ± 0.361
6.202SerGly: 6.202 ± 0.663
0.935SerHis: 0.935 ± 0.208
6.075SerIle: 6.075 ± 0.56
6.712SerLys: 6.712 ± 0.692
5.31SerLeu: 5.31 ± 0.516
0.977SerMet: 0.977 ± 0.206
4.375SerAsn: 4.375 ± 0.587
1.317SerPro: 1.317 ± 0.24
2.634SerGln: 2.634 ± 0.355
2.464SerArg: 2.464 ± 0.271
4.503SerSer: 4.503 ± 0.492
5.225SerThr: 5.225 ± 0.523
2.974SerVal: 2.974 ± 0.382
0.51SerTrp: 0.51 ± 0.148
2.931SerTyr: 2.931 ± 0.325
0.0SerXaa: 0.0 ± 0.0
Thr
3.653ThrAla: 3.653 ± 0.453
0.382ThrCys: 0.382 ± 0.136
4.248ThrAsp: 4.248 ± 0.411
4.715ThrGlu: 4.715 ± 0.409
2.931ThrPhe: 2.931 ± 0.411
3.951ThrGly: 3.951 ± 0.436
0.892ThrHis: 0.892 ± 0.199
6.202ThrIle: 6.202 ± 0.551
5.055ThrLys: 5.055 ± 0.484
4.63ThrLeu: 4.63 ± 0.395
1.232ThrMet: 1.232 ± 0.212
3.993ThrAsn: 3.993 ± 0.407
2.336ThrPro: 2.336 ± 0.327
1.954ThrGln: 1.954 ± 0.311
2.336ThrArg: 2.336 ± 0.244
4.29ThrSer: 4.29 ± 0.475
4.715ThrThr: 4.715 ± 0.649
4.375ThrVal: 4.375 ± 0.564
0.51ThrTrp: 0.51 ± 0.126
2.846ThrTyr: 2.846 ± 0.341
0.0ThrXaa: 0.0 ± 0.0
Val
2.591ValAla: 2.591 ± 0.315
0.51ValCys: 0.51 ± 0.155
3.611ValAsp: 3.611 ± 0.339
4.163ValGlu: 4.163 ± 0.376
3.186ValPhe: 3.186 ± 0.304
3.186ValGly: 3.186 ± 0.394
0.807ValHis: 0.807 ± 0.194
3.526ValIle: 3.526 ± 0.411
5.097ValLys: 5.097 ± 0.547
4.418ValLeu: 4.418 ± 0.58
1.147ValMet: 1.147 ± 0.226
3.738ValAsn: 3.738 ± 0.422
1.869ValPro: 1.869 ± 0.235
1.317ValGln: 1.317 ± 0.222
1.657ValArg: 1.657 ± 0.217
4.97ValSer: 4.97 ± 0.457
3.313ValThr: 3.313 ± 0.388
3.908ValVal: 3.908 ± 0.494
0.467ValTrp: 0.467 ± 0.126
2.209ValTyr: 2.209 ± 0.292
0.0ValXaa: 0.0 ± 0.0
Trp
0.637TrpAla: 0.637 ± 0.146
0.127TrpCys: 0.127 ± 0.11
0.467TrpAsp: 0.467 ± 0.146
0.595TrpGlu: 0.595 ± 0.149
0.382TrpPhe: 0.382 ± 0.117
0.595TrpGly: 0.595 ± 0.156
0.297TrpHis: 0.297 ± 0.12
1.147TrpIle: 1.147 ± 0.223
0.722TrpLys: 0.722 ± 0.18
0.85TrpLeu: 0.85 ± 0.177
0.085TrpMet: 0.085 ± 0.046
1.062TrpAsn: 1.062 ± 0.188
0.042TrpPro: 0.042 ± 0.044
0.255TrpGln: 0.255 ± 0.101
0.637TrpArg: 0.637 ± 0.216
0.552TrpSer: 0.552 ± 0.165
0.765TrpThr: 0.765 ± 0.163
0.595TrpVal: 0.595 ± 0.134
0.212TrpTrp: 0.212 ± 0.087
0.51TrpTyr: 0.51 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.124TyrAla: 2.124 ± 0.332
0.637TyrCys: 0.637 ± 0.202
2.464TyrAsp: 2.464 ± 0.313
2.974TyrGlu: 2.974 ± 0.378
1.912TyrPhe: 1.912 ± 0.299
2.889TyrGly: 2.889 ± 0.352
0.382TyrHis: 0.382 ± 0.112
3.186TyrIle: 3.186 ± 0.326
3.993TyrLys: 3.993 ± 0.419
3.823TyrLeu: 3.823 ± 0.444
1.232TyrMet: 1.232 ± 0.239
3.526TyrAsn: 3.526 ± 0.369
1.189TyrPro: 1.189 ± 0.219
1.232TyrGln: 1.232 ± 0.228
1.317TyrArg: 1.317 ± 0.26
2.676TyrSer: 2.676 ± 0.242
1.954TyrThr: 1.954 ± 0.265
2.549TyrVal: 2.549 ± 0.37
0.807TyrTrp: 0.807 ± 0.2
1.954TyrTyr: 1.954 ± 0.268
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 129 proteins (23542 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski