Amino acid dipepetide frequency for Vibrio phage JSF12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.781AlaAla: 7.781 ± 1.581
0.861AlaCys: 0.861 ± 0.165
4.018AlaAsp: 4.018 ± 0.362
5.772AlaGlu: 5.772 ± 0.593
3.029AlaPhe: 3.029 ± 0.347
4.624AlaGly: 4.624 ± 0.377
1.116AlaHis: 1.116 ± 0.195
4.018AlaIle: 4.018 ± 0.376
6.633AlaLys: 6.633 ± 0.487
7.621AlaLeu: 7.621 ± 0.538
1.722AlaMet: 1.722 ± 0.282
3.827AlaAsn: 3.827 ± 0.603
2.806AlaPro: 2.806 ± 0.285
2.679AlaGln: 2.679 ± 0.401
3.444AlaArg: 3.444 ± 0.354
6.027AlaSer: 6.027 ± 0.619
4.241AlaThr: 4.241 ± 0.676
5.166AlaVal: 5.166 ± 0.4
0.925AlaTrp: 0.925 ± 0.173
3.253AlaTyr: 3.253 ± 0.315
0.0AlaXaa: 0.0 ± 0.0
Cys
0.702CysAla: 0.702 ± 0.139
0.159CysCys: 0.159 ± 0.077
0.733CysAsp: 0.733 ± 0.17
0.829CysGlu: 0.829 ± 0.148
0.542CysPhe: 0.542 ± 0.145
0.893CysGly: 0.893 ± 0.217
0.351CysHis: 0.351 ± 0.131
0.702CysIle: 0.702 ± 0.199
1.148CysLys: 1.148 ± 0.178
0.606CysLeu: 0.606 ± 0.134
0.191CysMet: 0.191 ± 0.079
0.415CysAsn: 0.415 ± 0.116
0.733CysPro: 0.733 ± 0.18
0.574CysGln: 0.574 ± 0.148
0.606CysArg: 0.606 ± 0.147
1.02CysSer: 1.02 ± 0.207
0.765CysThr: 0.765 ± 0.164
0.893CysVal: 0.893 ± 0.183
0.191CysTrp: 0.191 ± 0.078
0.446CysTyr: 0.446 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
5.325AspAla: 5.325 ± 0.423
0.606AspCys: 0.606 ± 0.16
2.36AspAsp: 2.36 ± 0.284
4.337AspGlu: 4.337 ± 0.423
3.093AspPhe: 3.093 ± 0.316
3.093AspGly: 3.093 ± 0.322
0.989AspHis: 0.989 ± 0.177
4.369AspIle: 4.369 ± 0.448
4.018AspLys: 4.018 ± 0.403
5.517AspLeu: 5.517 ± 0.429
1.371AspMet: 1.371 ± 0.206
2.296AspAsn: 2.296 ± 0.302
1.945AspPro: 1.945 ± 0.278
1.881AspGln: 1.881 ± 0.255
2.583AspArg: 2.583 ± 0.341
4.528AspSer: 4.528 ± 0.425
2.902AspThr: 2.902 ± 0.299
3.667AspVal: 3.667 ± 0.346
0.797AspTrp: 0.797 ± 0.149
2.806AspTyr: 2.806 ± 0.296
0.0AspXaa: 0.0 ± 0.0
Glu
6.76GluAla: 6.76 ± 0.635
0.989GluCys: 0.989 ± 0.222
4.72GluAsp: 4.72 ± 0.407
5.166GluGlu: 5.166 ± 0.524
2.615GluPhe: 2.615 ± 0.339
3.986GluGly: 3.986 ± 0.435
1.084GluHis: 1.084 ± 0.171
4.433GluIle: 4.433 ± 0.367
4.496GluLys: 4.496 ± 0.441
6.792GluLeu: 6.792 ± 0.536
1.977GluMet: 1.977 ± 0.284
3.316GluAsn: 3.316 ± 0.349
1.531GluPro: 1.531 ± 0.208
2.615GluGln: 2.615 ± 0.327
2.583GluArg: 2.583 ± 0.399
3.189GluSer: 3.189 ± 0.256
3.221GluThr: 3.221 ± 0.301
5.581GluVal: 5.581 ± 0.511
0.765GluTrp: 0.765 ± 0.211
3.603GluTyr: 3.603 ± 0.341
0.0GluXaa: 0.0 ± 0.0
Phe
2.296PheAla: 2.296 ± 0.282
0.638PheCys: 0.638 ± 0.163
2.838PheAsp: 2.838 ± 0.321
2.87PheGlu: 2.87 ± 0.336
1.467PhePhe: 1.467 ± 0.231
2.487PheGly: 2.487 ± 0.275
0.733PheHis: 0.733 ± 0.153
2.2PheIle: 2.2 ± 0.347
2.966PheLys: 2.966 ± 0.292
3.316PheLeu: 3.316 ± 0.304
0.925PheMet: 0.925 ± 0.182
2.742PheAsn: 2.742 ± 0.31
1.371PhePro: 1.371 ± 0.229
0.861PheGln: 0.861 ± 0.15
1.754PheArg: 1.754 ± 0.214
3.508PheSer: 3.508 ± 0.303
2.519PheThr: 2.519 ± 0.324
2.392PheVal: 2.392 ± 0.272
0.446PheTrp: 0.446 ± 0.128
1.626PheTyr: 1.626 ± 0.269
0.0PheXaa: 0.0 ± 0.0
Gly
4.656GlyAla: 4.656 ± 0.44
0.829GlyCys: 0.829 ± 0.172
3.253GlyAsp: 3.253 ± 0.319
3.795GlyGlu: 3.795 ± 0.427
2.679GlyPhe: 2.679 ± 0.286
3.253GlyGly: 3.253 ± 0.384
0.797GlyHis: 0.797 ± 0.17
3.859GlyIle: 3.859 ± 0.454
4.943GlyLys: 4.943 ± 0.371
5.868GlyLeu: 5.868 ± 0.474
1.371GlyMet: 1.371 ± 0.218
3.827GlyAsn: 3.827 ± 0.329
0.542GlyPro: 0.542 ± 0.161
1.881GlyGln: 1.881 ± 0.269
3.285GlyArg: 3.285 ± 0.328
4.656GlySer: 4.656 ± 0.51
4.847GlyThr: 4.847 ± 0.49
4.401GlyVal: 4.401 ± 0.357
0.765GlyTrp: 0.765 ± 0.162
2.296GlyTyr: 2.296 ± 0.307
0.0GlyXaa: 0.0 ± 0.0
His
1.18HisAla: 1.18 ± 0.217
0.351HisCys: 0.351 ± 0.094
1.212HisAsp: 1.212 ± 0.195
1.148HisGlu: 1.148 ± 0.181
0.861HisPhe: 0.861 ± 0.157
1.244HisGly: 1.244 ± 0.196
0.351HisHis: 0.351 ± 0.097
1.339HisIle: 1.339 ± 0.188
1.18HisLys: 1.18 ± 0.226
1.818HisLeu: 1.818 ± 0.28
0.542HisMet: 0.542 ± 0.142
0.861HisAsn: 0.861 ± 0.158
0.893HisPro: 0.893 ± 0.159
0.351HisGln: 0.351 ± 0.114
0.893HisArg: 0.893 ± 0.181
0.957HisSer: 0.957 ± 0.202
0.893HisThr: 0.893 ± 0.173
1.18HisVal: 1.18 ± 0.188
0.223HisTrp: 0.223 ± 0.083
0.861HisTyr: 0.861 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.464IleAla: 4.464 ± 0.394
0.638IleCys: 0.638 ± 0.147
4.528IleAsp: 4.528 ± 0.352
4.656IleGlu: 4.656 ± 0.505
1.531IlePhe: 1.531 ± 0.275
3.253IleGly: 3.253 ± 0.29
1.276IleHis: 1.276 ± 0.201
2.902IleIle: 2.902 ± 0.342
3.827IleLys: 3.827 ± 0.353
4.911IleLeu: 4.911 ± 0.421
1.148IleMet: 1.148 ± 0.184
3.508IleAsn: 3.508 ± 0.393
2.487IlePro: 2.487 ± 0.239
1.913IleGln: 1.913 ± 0.231
3.316IleArg: 3.316 ± 0.316
3.603IleSer: 3.603 ± 0.415
3.986IleThr: 3.986 ± 0.45
3.253IleVal: 3.253 ± 0.313
0.574IleTrp: 0.574 ± 0.17
1.913IleTyr: 1.913 ± 0.245
0.0IleXaa: 0.0 ± 0.0
Lys
5.995LysAla: 5.995 ± 0.586
0.478LysCys: 0.478 ± 0.177
3.922LysAsp: 3.922 ± 0.335
5.74LysGlu: 5.74 ± 0.561
2.519LysPhe: 2.519 ± 0.299
3.731LysGly: 3.731 ± 0.395
1.276LysHis: 1.276 ± 0.208
4.209LysIle: 4.209 ± 0.395
4.337LysLys: 4.337 ± 0.382
7.143LysLeu: 7.143 ± 0.501
1.945LysMet: 1.945 ± 0.306
3.348LysAsn: 3.348 ± 0.308
2.774LysPro: 2.774 ± 0.339
2.774LysGln: 2.774 ± 0.302
2.934LysArg: 2.934 ± 0.321
3.89LysSer: 3.89 ± 0.331
4.241LysThr: 4.241 ± 0.376
5.804LysVal: 5.804 ± 0.508
0.606LysTrp: 0.606 ± 0.16
3.157LysTyr: 3.157 ± 0.337
0.0LysXaa: 0.0 ± 0.0
Leu
7.111LeuAla: 7.111 ± 0.554
0.989LeuCys: 0.989 ± 0.182
6.346LeuAsp: 6.346 ± 0.453
7.781LeuGlu: 7.781 ± 0.602
2.742LeuPhe: 2.742 ± 0.313
6.25LeuGly: 6.25 ± 0.49
1.85LeuHis: 1.85 ± 0.228
4.656LeuIle: 4.656 ± 0.397
6.218LeuLys: 6.218 ± 0.547
7.781LeuLeu: 7.781 ± 0.534
2.009LeuMet: 2.009 ± 0.24
4.273LeuAsn: 4.273 ± 0.338
3.859LeuPro: 3.859 ± 0.306
2.838LeuGln: 2.838 ± 0.29
4.209LeuArg: 4.209 ± 0.333
5.517LeuSer: 5.517 ± 0.442
5.453LeuThr: 5.453 ± 0.456
7.398LeuVal: 7.398 ± 0.466
0.638LeuTrp: 0.638 ± 0.175
2.838LeuTyr: 2.838 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
2.073MetAla: 2.073 ± 0.272
0.287MetCys: 0.287 ± 0.074
1.18MetAsp: 1.18 ± 0.172
1.307MetGlu: 1.307 ± 0.21
0.989MetPhe: 0.989 ± 0.186
1.658MetGly: 1.658 ± 0.188
0.51MetHis: 0.51 ± 0.133
1.18MetIle: 1.18 ± 0.211
1.913MetLys: 1.913 ± 0.291
1.977MetLeu: 1.977 ± 0.26
0.351MetMet: 0.351 ± 0.114
1.148MetAsn: 1.148 ± 0.177
0.797MetPro: 0.797 ± 0.148
0.765MetGln: 0.765 ± 0.169
0.861MetArg: 0.861 ± 0.179
1.945MetSer: 1.945 ± 0.289
1.977MetThr: 1.977 ± 0.275
1.594MetVal: 1.594 ± 0.245
0.287MetTrp: 0.287 ± 0.084
1.084MetTyr: 1.084 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
3.572AsnAla: 3.572 ± 0.448
0.67AsnCys: 0.67 ± 0.153
1.658AsnAsp: 1.658 ± 0.241
2.998AsnGlu: 2.998 ± 0.324
1.85AsnPhe: 1.85 ± 0.288
3.316AsnGly: 3.316 ± 0.381
0.957AsnHis: 0.957 ± 0.179
3.54AsnIle: 3.54 ± 0.313
3.731AsnLys: 3.731 ± 0.344
4.624AsnLeu: 4.624 ± 0.352
1.467AsnMet: 1.467 ± 0.197
2.2AsnAsn: 2.2 ± 0.299
2.328AsnPro: 2.328 ± 0.266
1.244AsnGln: 1.244 ± 0.188
2.583AsnArg: 2.583 ± 0.335
4.018AsnSer: 4.018 ± 0.452
3.125AsnThr: 3.125 ± 0.277
2.966AsnVal: 2.966 ± 0.342
0.893AsnTrp: 0.893 ± 0.179
2.455AsnTyr: 2.455 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
2.296ProAla: 2.296 ± 0.295
0.446ProCys: 0.446 ± 0.102
2.137ProAsp: 2.137 ± 0.261
2.392ProGlu: 2.392 ± 0.279
1.244ProPhe: 1.244 ± 0.189
1.913ProGly: 1.913 ± 0.256
0.797ProHis: 0.797 ± 0.14
2.264ProIle: 2.264 ± 0.253
2.137ProLys: 2.137 ± 0.325
2.934ProLeu: 2.934 ± 0.301
0.638ProMet: 0.638 ± 0.153
2.264ProAsn: 2.264 ± 0.258
1.084ProPro: 1.084 ± 0.226
0.67ProGln: 0.67 ± 0.151
1.307ProArg: 1.307 ± 0.21
2.519ProSer: 2.519 ± 0.333
2.328ProThr: 2.328 ± 0.349
3.731ProVal: 3.731 ± 0.352
0.351ProTrp: 0.351 ± 0.11
1.307ProTyr: 1.307 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
2.711GlnAla: 2.711 ± 0.281
0.638GlnCys: 0.638 ± 0.128
1.69GlnAsp: 1.69 ± 0.188
2.36GlnGlu: 2.36 ± 0.288
1.084GlnPhe: 1.084 ± 0.17
2.168GlnGly: 2.168 ± 0.271
0.542GlnHis: 0.542 ± 0.141
1.754GlnIle: 1.754 ± 0.267
2.168GlnLys: 2.168 ± 0.256
3.189GlnLeu: 3.189 ± 0.293
1.116GlnMet: 1.116 ± 0.204
1.307GlnAsn: 1.307 ± 0.165
0.989GlnPro: 0.989 ± 0.172
0.829GlnGln: 0.829 ± 0.159
1.307GlnArg: 1.307 ± 0.227
1.977GlnSer: 1.977 ± 0.289
1.531GlnThr: 1.531 ± 0.215
2.583GlnVal: 2.583 ± 0.274
0.51GlnTrp: 0.51 ± 0.142
1.244GlnTyr: 1.244 ± 0.216
0.0GlnXaa: 0.0 ± 0.0
Arg
3.572ArgAla: 3.572 ± 0.392
0.638ArgCys: 0.638 ± 0.153
2.615ArgAsp: 2.615 ± 0.252
2.902ArgGlu: 2.902 ± 0.306
2.264ArgPhe: 2.264 ± 0.271
2.966ArgGly: 2.966 ± 0.31
1.084ArgHis: 1.084 ± 0.208
2.902ArgIle: 2.902 ± 0.289
2.934ArgLys: 2.934 ± 0.307
4.114ArgLeu: 4.114 ± 0.359
1.403ArgMet: 1.403 ± 0.231
2.137ArgAsn: 2.137 ± 0.308
1.403ArgPro: 1.403 ± 0.213
1.786ArgGln: 1.786 ± 0.257
2.583ArgArg: 2.583 ± 0.354
2.711ArgSer: 2.711 ± 0.307
1.594ArgThr: 1.594 ± 0.223
3.38ArgVal: 3.38 ± 0.359
0.415ArgTrp: 0.415 ± 0.104
1.626ArgTyr: 1.626 ± 0.244
0.0ArgXaa: 0.0 ± 0.0
Ser
5.772SerAla: 5.772 ± 0.871
0.733SerCys: 0.733 ± 0.169
3.89SerAsp: 3.89 ± 0.385
3.508SerGlu: 3.508 ± 0.348
3.221SerPhe: 3.221 ± 0.334
5.581SerGly: 5.581 ± 0.594
1.244SerHis: 1.244 ± 0.218
3.603SerIle: 3.603 ± 0.435
5.453SerLys: 5.453 ± 0.528
6.378SerLeu: 6.378 ± 0.489
1.658SerMet: 1.658 ± 0.252
3.508SerAsn: 3.508 ± 0.39
2.328SerPro: 2.328 ± 0.271
2.105SerGln: 2.105 ± 0.226
2.583SerArg: 2.583 ± 0.244
5.549SerSer: 5.549 ± 0.703
3.954SerThr: 3.954 ± 0.445
4.847SerVal: 4.847 ± 0.408
0.765SerTrp: 0.765 ± 0.16
2.551SerTyr: 2.551 ± 0.336
0.0SerXaa: 0.0 ± 0.0
Thr
4.783ThrAla: 4.783 ± 0.596
0.733ThrCys: 0.733 ± 0.168
3.444ThrAsp: 3.444 ± 0.328
3.54ThrGlu: 3.54 ± 0.369
2.774ThrPhe: 2.774 ± 0.331
4.018ThrGly: 4.018 ± 0.389
1.052ThrHis: 1.052 ± 0.213
3.348ThrIle: 3.348 ± 0.36
4.114ThrLys: 4.114 ± 0.36
5.134ThrLeu: 5.134 ± 0.369
0.861ThrMet: 0.861 ± 0.163
3.221ThrAsn: 3.221 ± 0.328
2.647ThrPro: 2.647 ± 0.401
1.658ThrGln: 1.658 ± 0.241
2.551ThrArg: 2.551 ± 0.298
4.05ThrSer: 4.05 ± 0.436
3.38ThrThr: 3.38 ± 0.365
4.815ThrVal: 4.815 ± 0.444
0.765ThrTrp: 0.765 ± 0.172
2.902ThrTyr: 2.902 ± 0.34
0.0ThrXaa: 0.0 ± 0.0
Val
5.676ValAla: 5.676 ± 0.389
0.861ValCys: 0.861 ± 0.167
4.305ValAsp: 4.305 ± 0.402
5.007ValGlu: 5.007 ± 0.418
3.093ValPhe: 3.093 ± 0.301
4.496ValGly: 4.496 ± 0.444
0.957ValHis: 0.957 ± 0.198
3.795ValIle: 3.795 ± 0.37
4.656ValLys: 4.656 ± 0.382
5.772ValLeu: 5.772 ± 0.482
1.881ValMet: 1.881 ± 0.247
3.285ValAsn: 3.285 ± 0.464
2.551ValPro: 2.551 ± 0.363
2.424ValGln: 2.424 ± 0.259
3.189ValArg: 3.189 ± 0.345
5.517ValSer: 5.517 ± 0.496
5.389ValThr: 5.389 ± 0.437
5.804ValVal: 5.804 ± 0.521
1.18ValTrp: 1.18 ± 0.285
3.157ValTyr: 3.157 ± 0.286
0.0ValXaa: 0.0 ± 0.0
Trp
0.733TrpAla: 0.733 ± 0.183
0.223TrpCys: 0.223 ± 0.078
1.116TrpAsp: 1.116 ± 0.216
0.733TrpGlu: 0.733 ± 0.146
0.415TrpPhe: 0.415 ± 0.118
0.606TrpGly: 0.606 ± 0.111
0.351TrpHis: 0.351 ± 0.118
0.383TrpIle: 0.383 ± 0.11
0.893TrpLys: 0.893 ± 0.196
1.371TrpLeu: 1.371 ± 0.221
0.351TrpMet: 0.351 ± 0.142
0.765TrpAsn: 0.765 ± 0.16
0.191TrpPro: 0.191 ± 0.072
0.287TrpGln: 0.287 ± 0.09
0.446TrpArg: 0.446 ± 0.107
0.702TrpSer: 0.702 ± 0.192
0.765TrpThr: 0.765 ± 0.138
1.084TrpVal: 1.084 ± 0.188
0.223TrpTrp: 0.223 ± 0.083
0.351TrpTyr: 0.351 ± 0.126
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.105TyrAla: 2.105 ± 0.25
0.733TyrCys: 0.733 ± 0.156
2.519TyrAsp: 2.519 ± 0.283
2.583TyrGlu: 2.583 ± 0.278
1.945TyrPhe: 1.945 ± 0.262
2.264TyrGly: 2.264 ± 0.292
1.02TyrHis: 1.02 ± 0.193
2.232TyrIle: 2.232 ± 0.315
3.061TyrLys: 3.061 ± 0.308
3.89TyrLeu: 3.89 ± 0.405
0.925TyrMet: 0.925 ± 0.177
1.945TyrAsn: 1.945 ± 0.295
1.499TyrPro: 1.499 ± 0.26
1.563TyrGln: 1.563 ± 0.24
2.009TyrArg: 2.009 ± 0.26
3.348TyrSer: 3.348 ± 0.373
2.679TyrThr: 2.679 ± 0.349
2.455TyrVal: 2.455 ± 0.284
0.67TyrTrp: 0.67 ± 0.124
1.658TyrTyr: 1.658 ± 0.231
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 152 proteins (31360 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski