Amino acid dipepetide frequency for Erysipelothrix phage phi1605

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.826AlaAla: 3.826 ± 0.557
0.563AlaCys: 0.563 ± 0.154
3.976AlaAsp: 3.976 ± 0.47
4.989AlaGlu: 4.989 ± 0.509
2.626AlaPhe: 2.626 ± 0.339
3.713AlaGly: 3.713 ± 0.386
0.975AlaHis: 0.975 ± 0.183
5.101AlaIle: 5.101 ± 0.62
4.126AlaLys: 4.126 ± 0.393
6.226AlaLeu: 6.226 ± 0.738
1.65AlaMet: 1.65 ± 0.256
3.563AlaAsn: 3.563 ± 0.267
1.388AlaPro: 1.388 ± 0.24
2.138AlaGln: 2.138 ± 0.241
3.113AlaArg: 3.113 ± 0.307
3.376AlaSer: 3.376 ± 0.378
3.226AlaThr: 3.226 ± 0.379
4.163AlaVal: 4.163 ± 0.461
0.375AlaTrp: 0.375 ± 0.153
3.263AlaTyr: 3.263 ± 0.402
0.0AlaXaa: 0.0 ± 0.0
Cys
0.525CysAla: 0.525 ± 0.149
0.075CysCys: 0.075 ± 0.053
0.675CysAsp: 0.675 ± 0.15
0.9CysGlu: 0.9 ± 0.198
0.45CysPhe: 0.45 ± 0.12
0.938CysGly: 0.938 ± 0.236
0.188CysHis: 0.188 ± 0.083
0.825CysIle: 0.825 ± 0.169
0.638CysLys: 0.638 ± 0.167
0.825CysLeu: 0.825 ± 0.175
0.3CysMet: 0.3 ± 0.105
0.375CysAsn: 0.375 ± 0.094
0.413CysPro: 0.413 ± 0.133
0.188CysGln: 0.188 ± 0.081
0.45CysArg: 0.45 ± 0.145
0.563CysSer: 0.563 ± 0.16
0.525CysThr: 0.525 ± 0.177
0.6CysVal: 0.6 ± 0.149
0.038CysTrp: 0.038 ± 0.037
0.225CysTyr: 0.225 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
4.088AspAla: 4.088 ± 0.362
0.938AspCys: 0.938 ± 0.195
4.726AspAsp: 4.726 ± 0.461
5.326AspGlu: 5.326 ± 0.465
3.226AspPhe: 3.226 ± 0.263
4.989AspGly: 4.989 ± 0.307
0.75AspHis: 0.75 ± 0.134
5.026AspIle: 5.026 ± 0.406
4.388AspLys: 4.388 ± 0.376
5.851AspLeu: 5.851 ± 0.7
2.063AspMet: 2.063 ± 0.268
3.638AspAsn: 3.638 ± 0.401
1.913AspPro: 1.913 ± 0.278
1.238AspGln: 1.238 ± 0.185
2.626AspArg: 2.626 ± 0.31
4.276AspSer: 4.276 ± 0.39
3.676AspThr: 3.676 ± 0.443
3.488AspVal: 3.488 ± 0.376
0.9AspTrp: 0.9 ± 0.194
3.301AspTyr: 3.301 ± 0.385
0.0AspXaa: 0.0 ± 0.0
Glu
4.463GluAla: 4.463 ± 0.45
1.088GluCys: 1.088 ± 0.158
3.901GluAsp: 3.901 ± 0.333
6.376GluGlu: 6.376 ± 0.558
2.588GluPhe: 2.588 ± 0.407
4.013GluGly: 4.013 ± 0.342
0.975GluHis: 0.975 ± 0.169
6.301GluIle: 6.301 ± 0.456
7.464GluLys: 7.464 ± 0.457
6.601GluLeu: 6.601 ± 0.45
2.325GluMet: 2.325 ± 0.277
4.951GluAsn: 4.951 ± 0.42
1.763GluPro: 1.763 ± 0.271
3.001GluGln: 3.001 ± 0.313
3.376GluArg: 3.376 ± 0.341
4.538GluSer: 4.538 ± 0.399
3.976GluThr: 3.976 ± 0.429
4.651GluVal: 4.651 ± 0.402
0.825GluTrp: 0.825 ± 0.181
3.376GluTyr: 3.376 ± 0.449
0.0GluXaa: 0.0 ± 0.0
Phe
2.438PheAla: 2.438 ± 0.367
0.375PheCys: 0.375 ± 0.113
2.363PheAsp: 2.363 ± 0.294
2.701PheGlu: 2.701 ± 0.327
1.688PhePhe: 1.688 ± 0.247
2.325PheGly: 2.325 ± 0.271
0.675PheHis: 0.675 ± 0.12
3.151PheIle: 3.151 ± 0.311
3.526PheLys: 3.526 ± 0.349
2.813PheLeu: 2.813 ± 0.341
0.788PheMet: 0.788 ± 0.165
2.551PheAsn: 2.551 ± 0.33
1.238PhePro: 1.238 ± 0.194
1.088PheGln: 1.088 ± 0.191
1.5PheArg: 1.5 ± 0.301
2.888PheSer: 2.888 ± 0.295
2.738PheThr: 2.738 ± 0.27
2.401PheVal: 2.401 ± 0.278
0.638PheTrp: 0.638 ± 0.192
1.875PheTyr: 1.875 ± 0.278
0.0PheXaa: 0.0 ± 0.0
Gly
3.938GlyAla: 3.938 ± 0.387
0.525GlyCys: 0.525 ± 0.139
4.163GlyAsp: 4.163 ± 0.462
3.976GlyGlu: 3.976 ± 0.286
3.263GlyPhe: 3.263 ± 0.33
3.976GlyGly: 3.976 ± 0.39
0.938GlyHis: 0.938 ± 0.163
4.688GlyIle: 4.688 ± 0.385
5.439GlyLys: 5.439 ± 0.389
4.876GlyLeu: 4.876 ± 0.418
1.2GlyMet: 1.2 ± 0.196
3.601GlyAsn: 3.601 ± 0.288
0.675GlyPro: 0.675 ± 0.16
1.988GlyGln: 1.988 ± 0.26
3.263GlyArg: 3.263 ± 0.289
3.676GlySer: 3.676 ± 0.337
3.901GlyThr: 3.901 ± 0.437
3.976GlyVal: 3.976 ± 0.397
0.713GlyTrp: 0.713 ± 0.131
2.738GlyTyr: 2.738 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
0.713HisAla: 0.713 ± 0.118
0.113HisCys: 0.113 ± 0.065
1.088HisAsp: 1.088 ± 0.207
0.9HisGlu: 0.9 ± 0.178
0.675HisPhe: 0.675 ± 0.143
0.9HisGly: 0.9 ± 0.17
0.338HisHis: 0.338 ± 0.122
1.238HisIle: 1.238 ± 0.177
1.05HisLys: 1.05 ± 0.2
1.088HisLeu: 1.088 ± 0.179
0.675HisMet: 0.675 ± 0.18
0.863HisAsn: 0.863 ± 0.205
0.9HisPro: 0.9 ± 0.16
0.525HisGln: 0.525 ± 0.125
0.563HisArg: 0.563 ± 0.133
0.938HisSer: 0.938 ± 0.149
0.675HisThr: 0.675 ± 0.124
0.9HisVal: 0.9 ± 0.163
0.263HisTrp: 0.263 ± 0.083
0.75HisTyr: 0.75 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
4.276IleAla: 4.276 ± 0.413
0.488IleCys: 0.488 ± 0.12
6.339IleAsp: 6.339 ± 0.452
6.076IleGlu: 6.076 ± 0.438
2.701IlePhe: 2.701 ± 0.3
4.051IleGly: 4.051 ± 0.376
1.163IleHis: 1.163 ± 0.239
4.951IleIle: 4.951 ± 0.514
5.814IleLys: 5.814 ± 0.5
5.889IleLeu: 5.889 ± 0.39
1.988IleMet: 1.988 ± 0.295
3.938IleAsn: 3.938 ± 0.342
3.263IlePro: 3.263 ± 0.354
2.701IleGln: 2.701 ± 0.423
4.201IleArg: 4.201 ± 0.402
5.251IleSer: 5.251 ± 0.435
5.026IleThr: 5.026 ± 0.443
4.914IleVal: 4.914 ± 0.523
1.05IleTrp: 1.05 ± 0.177
2.663IleTyr: 2.663 ± 0.272
0.0IleXaa: 0.0 ± 0.0
Lys
5.176LysAla: 5.176 ± 0.456
0.45LysCys: 0.45 ± 0.137
5.439LysAsp: 5.439 ± 0.54
7.539LysGlu: 7.539 ± 0.689
2.438LysPhe: 2.438 ± 0.349
4.388LysGly: 4.388 ± 0.35
1.2LysHis: 1.2 ± 0.244
5.926LysIle: 5.926 ± 0.577
6.451LysLys: 6.451 ± 0.671
7.652LysLeu: 7.652 ± 0.601
1.575LysMet: 1.575 ± 0.243
4.013LysAsn: 4.013 ± 0.408
2.363LysPro: 2.363 ± 0.278
2.401LysGln: 2.401 ± 0.366
4.163LysArg: 4.163 ± 0.47
5.289LysSer: 5.289 ± 0.462
4.276LysThr: 4.276 ± 0.425
4.726LysVal: 4.726 ± 0.361
0.788LysTrp: 0.788 ± 0.169
3.226LysTyr: 3.226 ± 0.36
0.0LysXaa: 0.0 ± 0.0
Leu
5.439LeuAla: 5.439 ± 0.364
0.713LeuCys: 0.713 ± 0.189
5.551LeuAsp: 5.551 ± 0.522
7.502LeuGlu: 7.502 ± 0.608
3.301LeuPhe: 3.301 ± 0.317
5.176LeuGly: 5.176 ± 0.444
1.238LeuHis: 1.238 ± 0.213
6.151LeuIle: 6.151 ± 0.573
7.014LeuLys: 7.014 ± 0.524
7.427LeuLeu: 7.427 ± 0.654
2.175LeuMet: 2.175 ± 0.342
4.651LeuAsn: 4.651 ± 0.393
3.638LeuPro: 3.638 ± 0.494
3.001LeuGln: 3.001 ± 0.35
3.713LeuArg: 3.713 ± 0.36
6.601LeuSer: 6.601 ± 0.502
4.501LeuThr: 4.501 ± 0.29
5.251LeuVal: 5.251 ± 0.464
0.9LeuTrp: 0.9 ± 0.158
3.376LeuTyr: 3.376 ± 0.347
0.0LeuXaa: 0.0 ± 0.0
Met
1.913MetAla: 1.913 ± 0.268
0.263MetCys: 0.263 ± 0.09
2.063MetAsp: 2.063 ± 0.245
2.025MetGlu: 2.025 ± 0.251
0.75MetPhe: 0.75 ± 0.145
1.763MetGly: 1.763 ± 0.277
0.113MetHis: 0.113 ± 0.062
1.088MetIle: 1.088 ± 0.194
2.701MetLys: 2.701 ± 0.275
2.363MetLeu: 2.363 ± 0.255
0.338MetMet: 0.338 ± 0.109
1.388MetAsn: 1.388 ± 0.209
1.088MetPro: 1.088 ± 0.215
0.375MetGln: 0.375 ± 0.11
0.788MetArg: 0.788 ± 0.137
1.35MetSer: 1.35 ± 0.261
1.575MetThr: 1.575 ± 0.408
1.425MetVal: 1.425 ± 0.192
0.075MetTrp: 0.075 ± 0.049
0.675MetTyr: 0.675 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
3.451AsnAla: 3.451 ± 0.544
0.375AsnCys: 0.375 ± 0.105
2.663AsnAsp: 2.663 ± 0.282
3.788AsnGlu: 3.788 ± 0.439
1.988AsnPhe: 1.988 ± 0.225
4.313AsnGly: 4.313 ± 0.345
1.238AsnHis: 1.238 ± 0.217
4.313AsnIle: 4.313 ± 0.404
4.613AsnLys: 4.613 ± 0.491
4.501AsnLeu: 4.501 ± 0.363
1.163AsnMet: 1.163 ± 0.202
2.776AsnAsn: 2.776 ± 0.394
2.776AsnPro: 2.776 ± 0.232
1.763AsnGln: 1.763 ± 0.254
2.776AsnArg: 2.776 ± 0.274
3.488AsnSer: 3.488 ± 0.332
3.076AsnThr: 3.076 ± 0.345
3.301AsnVal: 3.301 ± 0.397
0.563AsnTrp: 0.563 ± 0.132
2.213AsnTyr: 2.213 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
1.988ProAla: 1.988 ± 0.287
0.225ProCys: 0.225 ± 0.087
1.988ProAsp: 1.988 ± 0.227
2.588ProGlu: 2.588 ± 0.294
1.238ProPhe: 1.238 ± 0.186
1.388ProGly: 1.388 ± 0.219
0.338ProHis: 0.338 ± 0.095
2.513ProIle: 2.513 ± 0.339
1.95ProLys: 1.95 ± 0.271
2.588ProLeu: 2.588 ± 0.292
0.563ProMet: 0.563 ± 0.14
2.1ProAsn: 2.1 ± 0.283
0.975ProPro: 0.975 ± 0.173
1.463ProGln: 1.463 ± 0.286
1.163ProArg: 1.163 ± 0.209
1.763ProSer: 1.763 ± 0.299
2.138ProThr: 2.138 ± 0.287
2.138ProVal: 2.138 ± 0.301
0.488ProTrp: 0.488 ± 0.143
2.025ProTyr: 2.025 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
1.988GlnAla: 1.988 ± 0.254
0.338GlnCys: 0.338 ± 0.104
1.2GlnAsp: 1.2 ± 0.182
2.588GlnGlu: 2.588 ± 0.315
1.35GlnPhe: 1.35 ± 0.233
1.763GlnGly: 1.763 ± 0.317
0.413GlnHis: 0.413 ± 0.144
3.713GlnIle: 3.713 ± 0.344
2.813GlnLys: 2.813 ± 0.323
2.738GlnLeu: 2.738 ± 0.432
0.9GlnMet: 0.9 ± 0.19
1.913GlnAsn: 1.913 ± 0.281
0.713GlnPro: 0.713 ± 0.157
1.088GlnGln: 1.088 ± 0.222
1.688GlnArg: 1.688 ± 0.368
1.538GlnSer: 1.538 ± 0.229
1.988GlnThr: 1.988 ± 0.272
2.138GlnVal: 2.138 ± 0.21
0.338GlnTrp: 0.338 ± 0.113
1.275GlnTyr: 1.275 ± 0.217
0.0GlnXaa: 0.0 ± 0.0
Arg
2.438ArgAla: 2.438 ± 0.321
0.338ArgCys: 0.338 ± 0.112
2.888ArgAsp: 2.888 ± 0.349
2.738ArgGlu: 2.738 ± 0.283
2.025ArgPhe: 2.025 ± 0.252
2.025ArgGly: 2.025 ± 0.29
0.675ArgHis: 0.675 ± 0.178
3.826ArgIle: 3.826 ± 0.369
4.201ArgLys: 4.201 ± 0.429
4.688ArgLeu: 4.688 ± 0.34
1.313ArgMet: 1.313 ± 0.261
2.851ArgAsn: 2.851 ± 0.332
1.163ArgPro: 1.163 ± 0.223
1.5ArgGln: 1.5 ± 0.224
1.913ArgArg: 1.913 ± 0.232
1.838ArgSer: 1.838 ± 0.308
2.175ArgThr: 2.175 ± 0.288
3.188ArgVal: 3.188 ± 0.277
0.563ArgTrp: 0.563 ± 0.167
2.851ArgTyr: 2.851 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
3.901SerAla: 3.901 ± 0.508
0.638SerCys: 0.638 ± 0.169
4.201SerAsp: 4.201 ± 0.404
4.088SerGlu: 4.088 ± 0.393
2.401SerPhe: 2.401 ± 0.265
4.914SerGly: 4.914 ± 0.402
1.238SerHis: 1.238 ± 0.153
5.139SerIle: 5.139 ± 0.44
4.351SerLys: 4.351 ± 0.459
5.664SerLeu: 5.664 ± 0.441
1.2SerMet: 1.2 ± 0.227
3.263SerAsn: 3.263 ± 0.387
1.763SerPro: 1.763 ± 0.233
1.613SerGln: 1.613 ± 0.202
2.663SerArg: 2.663 ± 0.396
4.238SerSer: 4.238 ± 0.405
3.638SerThr: 3.638 ± 0.341
5.514SerVal: 5.514 ± 0.404
0.525SerTrp: 0.525 ± 0.146
2.438SerTyr: 2.438 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
3.638ThrAla: 3.638 ± 0.363
0.45ThrCys: 0.45 ± 0.111
4.051ThrAsp: 4.051 ± 0.36
4.276ThrGlu: 4.276 ± 0.424
2.138ThrPhe: 2.138 ± 0.271
4.276ThrGly: 4.276 ± 0.339
0.825ThrHis: 0.825 ± 0.19
4.538ThrIle: 4.538 ± 0.476
4.764ThrLys: 4.764 ± 0.438
5.589ThrLeu: 5.589 ± 0.377
1.05ThrMet: 1.05 ± 0.171
2.438ThrAsn: 2.438 ± 0.266
1.95ThrPro: 1.95 ± 0.27
1.388ThrGln: 1.388 ± 0.238
2.25ThrArg: 2.25 ± 0.286
3.601ThrSer: 3.601 ± 0.386
2.926ThrThr: 2.926 ± 0.399
3.638ThrVal: 3.638 ± 0.369
0.675ThrTrp: 0.675 ± 0.147
1.988ThrTyr: 1.988 ± 0.321
0.0ThrXaa: 0.0 ± 0.0
Val
4.388ValAla: 4.388 ± 0.539
0.9ValCys: 0.9 ± 0.196
4.989ValAsp: 4.989 ± 0.472
4.238ValGlu: 4.238 ± 0.418
2.401ValPhe: 2.401 ± 0.309
3.301ValGly: 3.301 ± 0.233
0.975ValHis: 0.975 ± 0.154
5.064ValIle: 5.064 ± 0.429
4.501ValLys: 4.501 ± 0.497
5.626ValLeu: 5.626 ± 0.458
1.538ValMet: 1.538 ± 0.244
3.038ValAsn: 3.038 ± 0.287
2.213ValPro: 2.213 ± 0.216
2.025ValGln: 2.025 ± 0.349
2.438ValArg: 2.438 ± 0.183
4.088ValSer: 4.088 ± 0.379
3.451ValThr: 3.451 ± 0.461
3.226ValVal: 3.226 ± 0.456
0.713ValTrp: 0.713 ± 0.19
3.038ValTyr: 3.038 ± 0.33
0.0ValXaa: 0.0 ± 0.0
Trp
0.788TrpAla: 0.788 ± 0.183
0.188TrpCys: 0.188 ± 0.095
1.088TrpAsp: 1.088 ± 0.192
0.75TrpGlu: 0.75 ± 0.15
0.6TrpPhe: 0.6 ± 0.145
0.3TrpGly: 0.3 ± 0.098
0.075TrpHis: 0.075 ± 0.05
0.563TrpIle: 0.563 ± 0.122
0.863TrpLys: 0.863 ± 0.179
0.713TrpLeu: 0.713 ± 0.183
0.413TrpMet: 0.413 ± 0.126
1.013TrpAsn: 1.013 ± 0.19
0.225TrpPro: 0.225 ± 0.081
0.713TrpGln: 0.713 ± 0.177
0.563TrpArg: 0.563 ± 0.167
0.788TrpSer: 0.788 ± 0.205
0.45TrpThr: 0.45 ± 0.131
0.6TrpVal: 0.6 ± 0.149
0.15TrpTrp: 0.15 ± 0.077
0.413TrpTyr: 0.413 ± 0.124
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.001TyrAla: 3.001 ± 0.292
0.675TyrCys: 0.675 ± 0.219
3.076TyrAsp: 3.076 ± 0.361
3.188TyrGlu: 3.188 ± 0.336
1.913TyrPhe: 1.913 ± 0.292
3.001TyrGly: 3.001 ± 0.342
0.825TyrHis: 0.825 ± 0.156
2.626TyrIle: 2.626 ± 0.33
2.851TyrLys: 2.851 ± 0.374
3.601TyrLeu: 3.601 ± 0.478
0.863TyrMet: 0.863 ± 0.148
2.213TyrAsn: 2.213 ± 0.265
1.2TyrPro: 1.2 ± 0.172
2.288TyrGln: 2.288 ± 0.256
1.988TyrArg: 1.988 ± 0.278
3.301TyrSer: 3.301 ± 0.329
2.513TyrThr: 2.513 ± 0.25
1.913TyrVal: 1.913 ± 0.227
0.675TyrTrp: 0.675 ± 0.214
1.463TyrTyr: 1.463 ± 0.265
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 84 proteins (26662 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski