Amino acid dipeptide frequency for Vibrio phage SHOU24

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
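As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping adjacent residue pairs within each protein and reports them in per mille. It is a minimal sketch, not the database's own pipeline: the function names, the toy sequences, and the choice to map every non-standard one-letter code to X (Xaa) are assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequences):
    """Count overlapping adjacent residue pairs within each sequence.

    Pairs are never formed across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    return counts


def dipeptide_per_mille(sequences):
    """Return dipeptide frequencies in per mille of all counted pairs."""
    counts = dipeptide_counts(sequences)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input; "B" is not a standard residue and is mapped to X.
toy_proteins = ["MKKLA", "AKLB"]
toy_counts = dipeptide_counts(toy_proteins)
toy_freqs = dipeptide_per_mille(toy_proteins)
```

With real data, the input would be the protein sequences of the proteome, for example read from a FASTA file.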

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
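The arithmetic behind the expected value can be sketched as follows. The uniform expectation of 1/400 comes from the text; the simple greater-than or less-than comparison used for classification is an assumption for illustration, since the page does not state the exact rule behind the red and blue marking. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform expectation: 1/400 = 0.25% = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison, ignoring the bootstrap error.
    """
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 6.508, "CysCys": 0.209, "TrpTrp": 0.042}
labels = {pair: classify(value) for pair, value in examples.items()}
```

A stricter rule could require the difference from 2.5‰ to exceed the reported error before calling a dipeptide over- or underrepresented.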

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.508 ± 1.271
AlaCys: 0.918 ± 0.221
AlaAsp: 5.382 ± 0.512
AlaGlu: 6.049 ± 0.832
AlaPhe: 2.753 ± 0.373
AlaGly: 5.131 ± 0.474
AlaHis: 1.502 ± 0.318
AlaIle: 4.714 ± 0.397
AlaLys: 6.341 ± 0.687
AlaLeu: 5.757 ± 0.478
AlaMet: 2.253 ± 0.32
AlaAsn: 3.796 ± 0.522
AlaPro: 2.336 ± 0.303
AlaGln: 3.671 ± 0.711
AlaArg: 3.504 ± 0.422
AlaSer: 6.425 ± 0.754
AlaThr: 4.339 ± 0.495
AlaVal: 5.173 ± 0.573
AlaTrp: 0.876 ± 0.185
AlaTyr: 2.545 ± 0.322
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.542 ± 0.179
CysCys: 0.209 ± 0.089
CysAsp: 1.001 ± 0.221
CysGlu: 0.918 ± 0.208
CysPhe: 0.459 ± 0.146
CysGly: 0.626 ± 0.185
CysHis: 0.292 ± 0.117
CysIle: 0.668 ± 0.156
CysLys: 1.043 ± 0.245
CysLeu: 1.21 ± 0.239
CysMet: 0.459 ± 0.134
CysAsn: 0.668 ± 0.164
CysPro: 0.542 ± 0.177
CysGln: 0.417 ± 0.143
CysArg: 0.751 ± 0.204
CysSer: 0.751 ± 0.218
CysThr: 0.542 ± 0.148
CysVal: 0.751 ± 0.181
CysTrp: 0.167 ± 0.098
CysTyr: 0.375 ± 0.117
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.297 ± 0.394
AspCys: 0.709 ± 0.167
AspAsp: 3.922 ± 0.497
AspGlu: 4.589 ± 0.562
AspPhe: 2.753 ± 0.29
AspGly: 4.756 ± 0.717
AspHis: 1.126 ± 0.223
AspIle: 4.172 ± 0.431
AspLys: 5.215 ± 0.486
AspLeu: 5.757 ± 0.521
AspMet: 1.627 ± 0.279
AspAsn: 2.837 ± 0.258
AspPro: 2.003 ± 0.258
AspGln: 2.169 ± 0.345
AspArg: 2.42 ± 0.346
AspSer: 3.838 ± 0.402
AspThr: 4.047 ± 0.519
AspVal: 3.963 ± 0.495
AspTrp: 0.96 ± 0.213
AspTyr: 2.587 ± 0.301
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.8 ± 0.765
GluCys: 1.001 ± 0.248
GluAsp: 4.172 ± 0.431
GluGlu: 6.967 ± 0.924
GluPhe: 2.962 ± 0.325
GluGly: 4.005 ± 0.395
GluHis: 1.544 ± 0.26
GluIle: 4.38 ± 0.419
GluLys: 4.422 ± 0.468
GluLeu: 7.05 ± 0.511
GluMet: 2.295 ± 0.313
GluAsn: 2.837 ± 0.313
GluPro: 1.877 ± 0.312
GluGln: 3.212 ± 0.393
GluArg: 3.63 ± 0.512
GluSer: 3.922 ± 0.439
GluThr: 4.088 ± 0.682
GluVal: 5.966 ± 0.627
GluTrp: 0.459 ± 0.135
GluTyr: 1.919 ± 0.286
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.461 ± 0.302
PheCys: 0.918 ± 0.219
PheAsp: 3.338 ± 0.348
PheGlu: 2.879 ± 0.333
PhePhe: 1.752 ± 0.287
PheGly: 2.378 ± 0.27
PheHis: 0.876 ± 0.184
PheIle: 2.295 ± 0.266
PheLys: 2.628 ± 0.343
PheLeu: 3.004 ± 0.351
PheMet: 1.21 ± 0.202
PheAsn: 2.044 ± 0.241
PhePro: 0.918 ± 0.186
PheGln: 0.793 ± 0.18
PheArg: 1.168 ± 0.178
PheSer: 3.296 ± 0.368
PheThr: 2.295 ± 0.283
PheVal: 2.587 ± 0.295
PheTrp: 0.501 ± 0.146
PheTyr: 1.377 ± 0.225
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.839 ± 0.44
GlyCys: 1.126 ± 0.29
GlyAsp: 3.963 ± 0.453
GlyGlu: 4.13 ± 0.385
GlyPhe: 2.879 ± 0.404
GlyGly: 4.172 ± 0.442
GlyHis: 1.335 ± 0.236
GlyIle: 3.379 ± 0.361
GlyLys: 5.173 ± 0.495
GlyLeu: 5.423 ± 0.457
GlyMet: 1.71 ± 0.21
GlyAsn: 2.753 ± 0.311
GlyPro: 1.627 ± 0.272
GlyGln: 2.545 ± 0.296
GlyArg: 2.587 ± 0.359
GlySer: 4.673 ± 0.545
GlyThr: 4.088 ± 0.423
GlyVal: 4.547 ± 0.48
GlyTrp: 1.293 ± 0.256
GlyTyr: 2.253 ± 0.368
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.335 ± 0.216
HisCys: 0.334 ± 0.109
HisAsp: 1.043 ± 0.194
HisGlu: 1.21 ± 0.226
HisPhe: 1.168 ± 0.275
HisGly: 1.335 ± 0.246
HisHis: 0.584 ± 0.164
HisIle: 1.168 ± 0.265
HisLys: 1.46 ± 0.345
HisLeu: 2.128 ± 0.293
HisMet: 0.209 ± 0.084
HisAsn: 1.126 ± 0.18
HisPro: 1.085 ± 0.242
HisGln: 0.626 ± 0.165
HisArg: 1.001 ± 0.257
HisSer: 1.46 ± 0.265
HisThr: 0.96 ± 0.205
HisVal: 1.168 ± 0.26
HisTrp: 0.292 ± 0.115
HisTyr: 0.709 ± 0.206
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.297 ± 0.473
IleCys: 0.751 ± 0.201
IleAsp: 4.339 ± 0.443
IleGlu: 4.255 ± 0.417
IlePhe: 1.335 ± 0.212
IleGly: 3.588 ± 0.382
IleHis: 0.751 ± 0.181
IleIle: 3.63 ± 0.35
IleLys: 3.922 ± 0.403
IleLeu: 4.214 ± 0.474
IleMet: 1.46 ± 0.281
IleAsn: 3.88 ± 0.389
IlePro: 2.128 ± 0.3
IleGln: 1.46 ± 0.22
IleArg: 2.587 ± 0.333
IleSer: 4.631 ± 0.38
IleThr: 3.713 ± 0.426
IleVal: 3.963 ± 0.417
IleTrp: 0.292 ± 0.12
IleTyr: 2.461 ± 0.332
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.799 ± 0.663
LysCys: 0.668 ± 0.182
LysAsp: 4.172 ± 0.445
LysGlu: 4.923 ± 0.578
LysPhe: 2.461 ± 0.277
LysGly: 4.214 ± 0.409
LysHis: 1.752 ± 0.315
LysIle: 4.13 ± 0.432
LysLys: 3.463 ± 0.446
LysLeu: 7.05 ± 0.746
LysMet: 2.211 ± 0.297
LysAsn: 2.503 ± 0.372
LysPro: 2.753 ± 0.342
LysGln: 3.004 ± 0.406
LysArg: 3.212 ± 0.381
LysSer: 5.382 ± 0.446
LysThr: 4.13 ± 0.476
LysVal: 5.382 ± 0.44
LysTrp: 0.709 ± 0.154
LysTyr: 2.545 ± 0.311
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.508 ± 0.512
LeuCys: 0.876 ± 0.17
LeuAsp: 5.882 ± 0.527
LeuGlu: 6.258 ± 0.565
LeuPhe: 2.587 ± 0.401
LeuGly: 5.215 ± 0.484
LeuHis: 1.544 ± 0.286
LeuIle: 4.13 ± 0.434
LeuLys: 6.174 ± 0.54
LeuLeu: 6.091 ± 0.612
LeuMet: 2.211 ± 0.333
LeuAsn: 4.38 ± 0.44
LeuPro: 3.296 ± 0.345
LeuGln: 2.753 ± 0.352
LeuArg: 3.504 ± 0.34
LeuSer: 6.466 ± 0.673
LeuThr: 5.257 ± 0.465
LeuVal: 5.006 ± 0.407
LeuTrp: 0.709 ± 0.149
LeuTyr: 2.378 ± 0.312
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.753 ± 0.399
MetCys: 0.209 ± 0.098
MetAsp: 1.335 ± 0.224
MetGlu: 1.502 ± 0.309
MetPhe: 1.043 ± 0.233
MetGly: 1.585 ± 0.227
MetHis: 0.375 ± 0.132
MetIle: 1.252 ± 0.269
MetLys: 1.877 ± 0.244
MetLeu: 2.295 ± 0.281
MetMet: 0.709 ± 0.156
MetAsn: 1.46 ± 0.233
MetPro: 1.001 ± 0.17
MetGln: 1.001 ± 0.244
MetArg: 1.168 ± 0.239
MetSer: 2.128 ± 0.318
MetThr: 2.128 ± 0.27
MetVal: 1.752 ± 0.233
MetTrp: 0.25 ± 0.12
MetTyr: 0.793 ± 0.161
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.713 ± 0.485
AsnCys: 0.459 ± 0.141
AsnAsp: 2.795 ± 0.379
AsnGlu: 3.588 ± 0.458
AsnPhe: 2.169 ± 0.281
AsnGly: 3.254 ± 0.335
AsnHis: 0.96 ± 0.182
AsnIle: 2.753 ± 0.422
AsnLys: 3.212 ± 0.381
AsnLeu: 4.005 ± 0.34
AsnMet: 1.168 ± 0.222
AsnAsn: 2.753 ± 0.301
AsnPro: 2.587 ± 0.365
AsnGln: 1.752 ± 0.272
AsnArg: 1.877 ± 0.26
AsnSer: 3.796 ± 0.428
AsnThr: 2.879 ± 0.407
AsnVal: 3.254 ± 0.335
AsnTrp: 0.626 ± 0.198
AsnTyr: 1.168 ± 0.217
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.92 ± 0.354
ProCys: 0.459 ± 0.131
ProAsp: 2.67 ± 0.501
ProGlu: 2.962 ± 0.376
ProPhe: 1.627 ± 0.28
ProGly: 2.503 ± 0.342
ProHis: 0.709 ± 0.166
ProIle: 1.794 ± 0.246
ProLys: 2.879 ± 0.319
ProLeu: 1.836 ± 0.296
ProMet: 0.876 ± 0.185
ProAsn: 1.71 ± 0.263
ProPro: 1.252 ± 0.257
ProGln: 0.793 ± 0.183
ProArg: 1.418 ± 0.271
ProSer: 2.587 ± 0.536
ProThr: 2.378 ± 0.381
ProVal: 3.338 ± 0.398
ProTrp: 0.417 ± 0.116
ProTyr: 1.627 ± 0.309
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.588 ± 0.438
GlnCys: 0.25 ± 0.131
GlnAsp: 2.128 ± 0.347
GlnGlu: 2.253 ± 0.322
GlnPhe: 1.919 ± 0.325
GlnGly: 2.169 ± 0.331
GlnHis: 0.834 ± 0.199
GlnIle: 2.628 ± 0.328
GlnLys: 2.003 ± 0.383
GlnLeu: 2.753 ± 0.481
GlnMet: 1.168 ± 0.25
GlnAsn: 1.46 ± 0.254
GlnPro: 1.627 ± 0.248
GlnGln: 1.418 ± 0.266
GlnArg: 1.377 ± 0.205
GlnSer: 2.169 ± 0.377
GlnThr: 1.961 ± 0.27
GlnVal: 2.461 ± 0.373
GlnTrp: 0.459 ± 0.153
GlnTyr: 0.918 ± 0.179
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.588 ± 0.395
ArgCys: 0.584 ± 0.191
ArgAsp: 2.545 ± 0.33
ArgGlu: 3.296 ± 0.412
ArgPhe: 1.836 ± 0.308
ArgGly: 2.461 ± 0.338
ArgHis: 0.96 ± 0.192
ArgIle: 3.254 ± 0.396
ArgLys: 2.587 ± 0.367
ArgLeu: 4.297 ± 0.425
ArgMet: 1.043 ± 0.261
ArgAsn: 1.418 ± 0.293
ArgPro: 1.877 ± 0.307
ArgGln: 1.71 ± 0.327
ArgArg: 2.211 ± 0.316
ArgSer: 2.753 ± 0.351
ArgThr: 2.169 ± 0.295
ArgVal: 2.712 ± 0.422
ArgTrp: 0.584 ± 0.181
ArgTyr: 1.544 ± 0.237
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.757 ± 0.581
SerCys: 0.459 ± 0.141
SerAsp: 4.38 ± 0.456
SerGlu: 5.465 ± 0.487
SerPhe: 2.378 ± 0.282
SerGly: 5.507 ± 0.527
SerHis: 1.377 ± 0.219
SerIle: 3.755 ± 0.349
SerLys: 5.423 ± 0.526
SerLeu: 5.757 ± 0.495
SerMet: 2.211 ± 0.316
SerAsn: 3.713 ± 0.456
SerPro: 2.628 ± 0.422
SerGln: 2.169 ± 0.332
SerArg: 3.171 ± 0.413
SerSer: 4.214 ± 0.525
SerThr: 4.214 ± 0.458
SerVal: 5.006 ± 0.465
SerTrp: 0.584 ± 0.152
SerTyr: 2.461 ± 0.318
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.34 ± 0.667
ThrCys: 0.501 ± 0.149
ThrAsp: 3.63 ± 0.402
ThrGlu: 4.172 ± 0.559
ThrPhe: 2.378 ± 0.342
ThrGly: 4.255 ± 0.466
ThrHis: 1.085 ± 0.226
ThrIle: 2.962 ± 0.385
ThrLys: 3.963 ± 0.392
ThrLeu: 4.506 ± 0.561
ThrMet: 1.293 ± 0.236
ThrAsn: 2.545 ± 0.359
ThrPro: 3.171 ± 0.447
ThrGln: 2.587 ± 0.367
ThrArg: 1.877 ± 0.287
ThrSer: 4.714 ± 0.479
ThrThr: 3.338 ± 0.407
ThrVal: 5.09 ± 0.524
ThrTrp: 0.584 ± 0.18
ThrTyr: 2.211 ± 0.349
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.049 ± 0.609
ValCys: 1.043 ± 0.208
ValAsp: 3.838 ± 0.438
ValGlu: 5.048 ± 0.507
ValPhe: 2.253 ± 0.306
ValGly: 4.38 ± 0.52
ValHis: 1.877 ± 0.33
ValIle: 4.255 ± 0.517
ValLys: 5.799 ± 0.493
ValLeu: 3.922 ± 0.415
ValMet: 1.585 ± 0.272
ValAsn: 4.255 ± 0.492
ValPro: 2.336 ± 0.369
ValGln: 1.877 ± 0.279
ValArg: 3.922 ± 0.389
ValSer: 4.965 ± 0.552
ValThr: 4.923 ± 0.719
ValVal: 4.714 ± 0.611
ValTrp: 1.043 ± 0.199
ValTyr: 2.503 ± 0.307
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.709 ± 0.15
TrpCys: 0.125 ± 0.077
TrpAsp: 0.668 ± 0.149
TrpGlu: 0.626 ± 0.148
TrpPhe: 0.709 ± 0.161
TrpGly: 0.334 ± 0.12
TrpHis: 0.167 ± 0.08
TrpIle: 0.626 ± 0.178
TrpLys: 0.709 ± 0.155
TrpLeu: 1.126 ± 0.209
TrpMet: 0.167 ± 0.08
TrpAsn: 0.501 ± 0.147
TrpPro: 0.501 ± 0.153
TrpGln: 0.584 ± 0.132
TrpArg: 0.876 ± 0.172
TrpSer: 0.668 ± 0.145
TrpThr: 0.751 ± 0.192
TrpVal: 0.96 ± 0.186
TrpTrp: 0.042 ± 0.042
TrpTyr: 0.334 ± 0.137
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.169 ± 0.261
TyrCys: 0.793 ± 0.195
TyrAsp: 2.545 ± 0.362
TyrGlu: 2.42 ± 0.304
TyrPhe: 1.293 ± 0.244
TyrGly: 2.628 ± 0.306
TyrHis: 0.751 ± 0.215
TyrIle: 1.71 ± 0.3
TyrLys: 2.169 ± 0.262
TyrLeu: 3.045 ± 0.349
TyrMet: 0.709 ± 0.14
TyrAsn: 2.128 ± 0.292
TyrPro: 1.21 ± 0.273
TyrGln: 1.001 ± 0.211
TyrArg: 1.21 ± 0.205
TyrSer: 1.794 ± 0.29
TyrThr: 2.003 ± 0.33
TyrVal: 2.795 ± 0.405
TyrTrp: 0.334 ± 0.12
TyrTyr: 1.001 ± 0.24
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 96 proteins (23971 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
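A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This is only a sketch under stated assumptions. The source gives the number of rounds (100) and the resampling unit (proteins), while the use of the sample standard deviation, the fixed seed, the function names, and the toy sequences are choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille of all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over the rounds.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes).
toy = ["MKKLAEEL", "AKLLSA", "GGKLV", "MAEELK"]
value = pair_per_mille(toy, "KL")
error = bootstrap_error(toy, "KL")
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein together, so the error reflects variation between proteins.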

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski