Amino acid dipeptide frequency for Pseudomonas phage VCM

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred with equal probability in proteins, each would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
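The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalise by the total number of overlapping pairs are hypothetical and may differ from how the database computes its values.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Assumption: normalise by the total number of pairs counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not the phage proteome).
freqs = dipeptide_per_mille(["MAAL", "AAK"])
uniform = 1000.0 / 400  # 2.5 per mille if all 400 dipeptides were equally likely
for pair, f in sorted(freqs.items()):
    label = "over" if f > uniform else "under"
    print(pair, round(f, 1), label)
```

With only a handful of residues every observed pair is far above 2.5‰; the comparison becomes meaningful only for a full proteome.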

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
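As a small worked example of the unit conversion, the sketch below turns two values from the table (AlaAla and CysTrp) into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration.

```python
UNIFORM_EXPECTATION = 1 / 400  # 0.25% = 2.5 per mille

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def fold_vs_uniform(value_per_mille):
    """Ratio of the observed frequency to the uniform expectation."""
    return per_mille_to_fraction(value_per_mille) / UNIFORM_EXPECTATION

# Values taken from the table: AlaAla 8.57 per mille, CysTrp 0.137 per mille.
for name, value in [("AlaAla", 8.57), ("CysTrp", 0.137)]:
    status = "overrepresented" if fold_vs_uniform(value) > 1 else "underrepresented"
    print(name, per_mille_to_fraction(value), round(fold_vs_uniform(value), 2), status)
```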

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.57 ± 0.698
AlaCys: 1.097 ± 0.18
AlaAsp: 4.354 ± 0.348
AlaGlu: 5.314 ± 0.449
AlaPhe: 3.565 ± 0.328
AlaGly: 5.862 ± 0.53
AlaHis: 1.68 ± 0.231
AlaIle: 5.074 ± 0.494
AlaLys: 5.451 ± 0.39
AlaLeu: 8.056 ± 0.602
AlaMet: 2.983 ± 0.277
AlaAsn: 4.011 ± 0.39
AlaPro: 3.188 ± 0.471
AlaGln: 3.84 ± 0.279
AlaArg: 4.697 ± 0.448
AlaSer: 4.148 ± 0.356
AlaThr: 6.479 ± 0.597
AlaVal: 5.794 ± 0.466
AlaTrp: 1.097 ± 0.22
AlaTyr: 2.674 ± 0.325
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.754 ± 0.15
CysCys: 0.309 ± 0.107
CysAsp: 0.994 ± 0.22
CysGlu: 0.788 ± 0.157
CysPhe: 0.411 ± 0.105
CysGly: 1.028 ± 0.196
CysHis: 0.411 ± 0.142
CysIle: 0.617 ± 0.135
CysLys: 0.617 ± 0.174
CysLeu: 0.754 ± 0.183
CysMet: 0.514 ± 0.129
CysAsn: 0.583 ± 0.152
CysPro: 0.754 ± 0.199
CysGln: 0.514 ± 0.127
CysArg: 0.549 ± 0.156
CysSer: 0.514 ± 0.149
CysThr: 0.926 ± 0.174
CysVal: 0.823 ± 0.155
CysTrp: 0.137 ± 0.069
CysTyr: 0.686 ± 0.142
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.519 ± 0.399
AspCys: 0.994 ± 0.164
AspAsp: 4.148 ± 0.43
AspGlu: 3.805 ± 0.428
AspPhe: 2.845 ± 0.33
AspGly: 4.697 ± 0.394
AspHis: 1.337 ± 0.205
AspIle: 3.394 ± 0.31
AspLys: 4.08 ± 0.394
AspLeu: 4.799 ± 0.422
AspMet: 2.16 ± 0.293
AspAsn: 2.88 ± 0.292
AspPro: 1.92 ± 0.261
AspGln: 2.16 ± 0.258
AspArg: 2.708 ± 0.323
AspSer: 3.12 ± 0.299
AspThr: 3.702 ± 0.333
AspVal: 3.908 ± 0.381
AspTrp: 1.44 ± 0.245
AspTyr: 1.885 ± 0.27
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.582 ± 0.55
GluCys: 0.754 ± 0.19
GluAsp: 4.388 ± 0.453
GluGlu: 4.491 ± 0.465
GluPhe: 2.983 ± 0.332
GluGly: 3.977 ± 0.381
GluHis: 1.337 ± 0.228
GluIle: 3.565 ± 0.363
GluLys: 3.565 ± 0.384
GluLeu: 6.102 ± 0.499
GluMet: 2.263 ± 0.291
GluAsn: 2.468 ± 0.304
GluPro: 1.234 ± 0.21
GluGln: 2.948 ± 0.292
GluArg: 3.634 ± 0.362
GluSer: 3.085 ± 0.364
GluThr: 3.188 ± 0.351
GluVal: 5.279 ± 0.38
GluTrp: 0.994 ± 0.188
GluTyr: 2.537 ± 0.285
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.428 ± 0.415
PheCys: 0.48 ± 0.135
PheAsp: 2.4 ± 0.284
PheGlu: 3.085 ± 0.388
PhePhe: 1.44 ± 0.235
PheGly: 2.674 ± 0.294
PheHis: 0.549 ± 0.132
PheIle: 2.091 ± 0.246
PheLys: 2.434 ± 0.31
PheLeu: 2.914 ± 0.295
PheMet: 0.788 ± 0.144
PheAsn: 1.988 ± 0.294
PhePro: 1.337 ± 0.225
PheGln: 1.611 ± 0.233
PheArg: 1.954 ± 0.242
PheSer: 2.194 ± 0.225
PheThr: 2.88 ± 0.305
PheVal: 2.331 ± 0.3
PheTrp: 0.309 ± 0.126
PheTyr: 1.131 ± 0.211
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.211 ± 0.504
GlyCys: 1.097 ± 0.216
GlyAsp: 4.422 ± 0.38
GlyGlu: 3.737 ± 0.341
GlyPhe: 3.257 ± 0.286
GlyGly: 5.759 ± 0.564
GlyHis: 1.68 ± 0.215
GlyIle: 3.805 ± 0.363
GlyLys: 4.697 ± 0.463
GlyLeu: 6.205 ± 0.44
GlyMet: 2.16 ± 0.286
GlyAsn: 3.668 ± 0.517
GlyPro: 1.303 ± 0.196
GlyGln: 2.194 ± 0.265
GlyArg: 3.497 ± 0.396
GlySer: 4.731 ± 0.349
GlyThr: 4.354 ± 0.422
GlyVal: 5.828 ± 0.47
GlyTrp: 1.303 ± 0.221
GlyTyr: 3.497 ± 0.439
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.303 ± 0.207
HisCys: 0.48 ± 0.144
HisAsp: 1.406 ± 0.218
HisGlu: 1.303 ± 0.195
HisPhe: 0.926 ± 0.185
HisGly: 1.44 ± 0.209
HisHis: 0.411 ± 0.13
HisIle: 1.268 ± 0.219
HisLys: 1.406 ± 0.228
HisLeu: 1.166 ± 0.2
HisMet: 0.617 ± 0.153
HisAsn: 0.994 ± 0.173
HisPro: 0.857 ± 0.174
HisGln: 0.514 ± 0.13
HisArg: 0.96 ± 0.176
HisSer: 0.72 ± 0.147
HisThr: 1.371 ± 0.192
HisVal: 1.44 ± 0.221
HisTrp: 0.48 ± 0.124
HisTyr: 0.857 ± 0.189
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.868 ± 0.484
IleCys: 0.857 ± 0.176
IleAsp: 4.32 ± 0.397
IleGlu: 4.251 ± 0.327
IlePhe: 1.474 ± 0.268
IleGly: 3.634 ± 0.297
IleHis: 1.268 ± 0.235
IleIle: 3.051 ± 0.327
IleLys: 4.114 ± 0.36
IleLeu: 4.045 ± 0.387
IleMet: 1.303 ± 0.209
IleAsn: 3.222 ± 0.377
IlePro: 2.125 ± 0.238
IleGln: 2.228 ± 0.223
IleArg: 2.88 ± 0.257
IleSer: 2.743 ± 0.326
IleThr: 3.668 ± 0.362
IleVal: 3.36 ± 0.387
IleTrp: 0.788 ± 0.169
IleTyr: 1.611 ± 0.207
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.034 ± 0.62
LysCys: 0.617 ± 0.143
LysAsp: 3.668 ± 0.369
LysGlu: 4.559 ± 0.444
LysPhe: 2.228 ± 0.274
LysGly: 4.594 ± 0.428
LysHis: 1.131 ± 0.188
LysIle: 3.6 ± 0.364
LysLys: 3.257 ± 0.385
LysLeu: 5.177 ± 0.466
LysMet: 2.743 ± 0.315
LysAsn: 1.851 ± 0.277
LysPro: 2.64 ± 0.381
LysGln: 2.811 ± 0.278
LysArg: 3.017 ± 0.379
LysSer: 3.12 ± 0.303
LysThr: 3.497 ± 0.333
LysVal: 4.937 ± 0.451
LysTrp: 0.823 ± 0.174
LysTyr: 2.4 ± 0.295
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.993 ± 0.534
LeuCys: 0.823 ± 0.169
LeuAsp: 5.828 ± 0.441
LeuGlu: 6.205 ± 0.474
LeuPhe: 2.605 ± 0.332
LeuGly: 5.451 ± 0.475
LeuHis: 1.303 ± 0.21
LeuIle: 4.731 ± 0.379
LeuLys: 5.588 ± 0.441
LeuLeu: 5.382 ± 0.537
LeuMet: 2.605 ± 0.305
LeuAsn: 3.428 ± 0.324
LeuPro: 3.771 ± 0.308
LeuGln: 3.462 ± 0.311
LeuArg: 3.771 ± 0.376
LeuSer: 4.32 ± 0.301
LeuThr: 5.554 ± 0.495
LeuVal: 4.457 ± 0.372
LeuTrp: 0.994 ± 0.156
LeuTyr: 2.537 ± 0.274
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.017 ± 0.301
MetCys: 0.377 ± 0.117
MetAsp: 1.234 ± 0.213
MetGlu: 1.885 ± 0.248
MetPhe: 1.028 ± 0.172
MetGly: 1.611 ± 0.251
MetHis: 0.651 ± 0.169
MetIle: 1.988 ± 0.241
MetLys: 2.708 ± 0.325
MetLeu: 2.263 ± 0.245
MetMet: 0.617 ± 0.161
MetAsn: 1.577 ± 0.247
MetPro: 0.891 ± 0.18
MetGln: 1.2 ± 0.176
MetArg: 1.268 ± 0.179
MetSer: 2.468 ± 0.238
MetThr: 2.537 ± 0.303
MetVal: 2.194 ± 0.273
MetTrp: 0.583 ± 0.132
MetTyr: 0.788 ± 0.185
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.702 ± 0.337
AsnCys: 0.446 ± 0.131
AsnAsp: 2.16 ± 0.292
AsnGlu: 2.468 ± 0.279
AsnPhe: 1.2 ± 0.189
AsnGly: 4.011 ± 0.475
AsnHis: 0.754 ± 0.138
AsnIle: 2.948 ± 0.351
AsnLys: 2.845 ± 0.304
AsnLeu: 4.08 ± 0.529
AsnMet: 0.857 ± 0.162
AsnAsn: 2.297 ± 0.294
AsnPro: 2.434 ± 0.31
AsnGln: 1.028 ± 0.182
AsnArg: 2.091 ± 0.276
AsnSer: 2.845 ± 0.353
AsnThr: 3.222 ± 0.408
AsnVal: 3.462 ± 0.281
AsnTrp: 0.583 ± 0.125
AsnTyr: 1.92 ± 0.255
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.257 ± 0.41
ProCys: 0.549 ± 0.14
ProAsp: 2.228 ± 0.291
ProGlu: 3.257 ± 0.288
ProPhe: 1.337 ± 0.181
ProGly: 1.988 ± 0.288
ProHis: 0.96 ± 0.162
ProIle: 1.783 ± 0.23
ProLys: 1.954 ± 0.315
ProLeu: 2.503 ± 0.242
ProMet: 0.994 ± 0.178
ProAsn: 1.611 ± 0.24
ProPro: 0.686 ± 0.165
ProGln: 1.646 ± 0.255
ProArg: 1.611 ± 0.246
ProSer: 2.023 ± 0.299
ProThr: 2.571 ± 0.316
ProVal: 3.6 ± 0.341
ProTrp: 0.411 ± 0.111
ProTyr: 1.303 ± 0.198
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.114 ± 0.367
GlnCys: 0.446 ± 0.11
GlnAsp: 1.714 ± 0.264
GlnGlu: 2.365 ± 0.3
GlnPhe: 1.714 ± 0.253
GlnGly: 2.811 ± 0.285
GlnHis: 0.754 ± 0.172
GlnIle: 2.434 ± 0.255
GlnLys: 2.297 ± 0.289
GlnLeu: 3.737 ± 0.351
GlnMet: 1.44 ± 0.279
GlnAsn: 1.268 ± 0.174
GlnPro: 1.2 ± 0.223
GlnGln: 1.508 ± 0.224
GlnArg: 1.954 ± 0.254
GlnSer: 2.125 ± 0.277
GlnThr: 2.228 ± 0.291
GlnVal: 3.051 ± 0.322
GlnTrp: 0.651 ± 0.126
GlnTyr: 1.474 ± 0.252
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.702 ± 0.355
ArgCys: 0.411 ± 0.12
ArgAsp: 3.257 ± 0.44
ArgGlu: 3.394 ± 0.292
ArgPhe: 1.577 ± 0.194
ArgGly: 3.702 ± 0.423
ArgHis: 1.063 ± 0.227
ArgIle: 2.948 ± 0.317
ArgLys: 3.257 ± 0.38
ArgLeu: 3.771 ± 0.326
ArgMet: 1.714 ± 0.209
ArgAsn: 2.263 ± 0.26
ArgPro: 1.92 ± 0.235
ArgGln: 2.057 ± 0.284
ArgArg: 2.4 ± 0.346
ArgSer: 2.194 ± 0.283
ArgThr: 2.743 ± 0.321
ArgVal: 3.394 ± 0.32
ArgTrp: 0.891 ± 0.176
ArgTyr: 1.783 ± 0.208
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.765 ± 0.456
SerCys: 0.651 ± 0.14
SerAsp: 2.605 ± 0.242
SerGlu: 2.914 ± 0.313
SerPhe: 2.125 ± 0.273
SerGly: 4.525 ± 0.506
SerHis: 1.063 ± 0.168
SerIle: 2.64 ± 0.307
SerLys: 2.983 ± 0.316
SerLeu: 4.662 ± 0.345
SerMet: 1.337 ± 0.18
SerAsn: 2.537 ± 0.325
SerPro: 2.365 ± 0.286
SerGln: 2.365 ± 0.266
SerArg: 2.88 ± 0.363
SerSer: 3.497 ± 0.601
SerThr: 3.874 ± 0.627
SerVal: 4.011 ± 0.427
SerTrp: 1.063 ± 0.227
SerTyr: 2.023 ± 0.217
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.068 ± 0.51
ThrCys: 0.583 ± 0.144
ThrAsp: 3.771 ± 0.409
ThrGlu: 3.565 ± 0.349
ThrPhe: 2.708 ± 0.275
ThrGly: 5.588 ± 0.486
ThrHis: 1.166 ± 0.217
ThrIle: 3.702 ± 0.335
ThrLys: 2.983 ± 0.388
ThrLeu: 5.211 ± 0.521
ThrMet: 2.057 ± 0.248
ThrAsn: 2.88 ± 0.384
ThrPro: 3.36 ± 0.332
ThrGln: 2.605 ± 0.316
ThrArg: 2.674 ± 0.256
ThrSer: 3.531 ± 0.397
ThrThr: 4.422 ± 0.505
ThrVal: 5.656 ± 0.473
ThrTrp: 0.617 ± 0.146
ThrTyr: 2.64 ± 0.264
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.136 ± 0.575
ValCys: 1.131 ± 0.23
ValAsp: 4.662 ± 0.412
ValGlu: 4.697 ± 0.579
ValPhe: 2.777 ± 0.297
ValGly: 4.868 ± 0.388
ValHis: 1.131 ± 0.22
ValIle: 3.36 ± 0.333
ValLys: 4.937 ± 0.466
ValLeu: 5.108 ± 0.426
ValMet: 2.331 ± 0.226
ValAsn: 2.983 ± 0.293
ValPro: 2.503 ± 0.326
ValGln: 2.64 ± 0.282
ValArg: 3.462 ± 0.368
ValSer: 4.662 ± 0.429
ValThr: 5.142 ± 0.467
ValVal: 5.828 ± 0.505
ValTrp: 0.926 ± 0.192
ValTyr: 2.88 ± 0.299
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.028 ± 0.191
TrpCys: 0.171 ± 0.075
TrpAsp: 1.097 ± 0.147
TrpGlu: 1.268 ± 0.244
TrpPhe: 0.686 ± 0.125
TrpGly: 0.891 ± 0.201
TrpHis: 0.549 ± 0.137
TrpIle: 0.994 ± 0.18
TrpLys: 0.788 ± 0.141
TrpLeu: 1.371 ± 0.216
TrpMet: 0.377 ± 0.123
TrpAsn: 0.754 ± 0.183
TrpPro: 0.514 ± 0.144
TrpGln: 0.754 ± 0.17
TrpArg: 0.583 ± 0.149
TrpSer: 0.549 ± 0.137
TrpThr: 0.72 ± 0.197
TrpVal: 0.96 ± 0.199
TrpTrp: 0.206 ± 0.092
TrpTyr: 0.446 ± 0.114
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.88 ± 0.347
TyrCys: 0.411 ± 0.111
TyrAsp: 2.845 ± 0.313
TyrGlu: 2.023 ± 0.303
TyrPhe: 1.131 ± 0.18
TyrGly: 3.291 ± 0.318
TyrHis: 0.72 ± 0.145
TyrIle: 1.817 ± 0.237
TyrLys: 2.743 ± 0.253
TyrLeu: 2.503 ± 0.328
TyrMet: 0.994 ± 0.213
TyrAsn: 2.091 ± 0.298
TyrPro: 1.268 ± 0.198
TyrGln: 1.2 ± 0.217
TyrArg: 1.851 ± 0.255
TyrSer: 2.297 ± 0.319
TyrThr: 2.708 ± 0.34
TyrVal: 1.851 ± 0.227
TyrTrp: 0.411 ± 0.108
TyrTyr: 1.028 ± 0.222
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 172 proteins (29171 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
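A protein-level bootstrap of this kind might look like the sketch below. The source states only that 100 bootstrap replicates were drawn at the protein level; that the reported ± is a standard deviation, the normalisation by pair count, and the function names `pair_counts` and `bootstrap_error` are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide's frequency,
    resampling whole proteins with replacement."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Toy input (hypothetical sequences, not the phage proteome).
mean, err = bootstrap_error(["MAAL", "AAK", "GGAAG", "MKV"], "AA")
print(f"AA: {mean:.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so the error reflects variation between proteins.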
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski