Amino acid dipepetide frequency for Vibrio phage VspSw_1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.264AlaAla: 7.264 ± 0.811
0.952AlaCys: 0.952 ± 0.199
4.372AlaAsp: 4.372 ± 0.347
6.841AlaGlu: 6.841 ± 0.53
2.539AlaPhe: 2.539 ± 0.276
4.443AlaGly: 4.443 ± 0.469
1.128AlaHis: 1.128 ± 0.205
4.372AlaIle: 4.372 ± 0.398
5.818AlaLys: 5.818 ± 0.47
7.581AlaLeu: 7.581 ± 0.621
2.116AlaMet: 2.116 ± 0.307
3.491AlaAsn: 3.491 ± 0.391
2.468AlaPro: 2.468 ± 0.329
2.715AlaGln: 2.715 ± 0.292
3.456AlaArg: 3.456 ± 0.386
4.69AlaSer: 4.69 ± 0.466
3.984AlaThr: 3.984 ± 0.373
4.937AlaVal: 4.937 ± 0.421
0.882AlaTrp: 0.882 ± 0.177
3.138AlaTyr: 3.138 ± 0.333
0.0AlaXaa: 0.0 ± 0.0
Cys
0.811CysAla: 0.811 ± 0.159
0.247CysCys: 0.247 ± 0.09
0.705CysAsp: 0.705 ± 0.178
0.917CysGlu: 0.917 ± 0.175
0.494CysPhe: 0.494 ± 0.13
0.846CysGly: 0.846 ± 0.197
0.212CysHis: 0.212 ± 0.077
0.635CysIle: 0.635 ± 0.157
0.952CysLys: 0.952 ± 0.198
0.705CysLeu: 0.705 ± 0.181
0.282CysMet: 0.282 ± 0.111
0.635CysAsn: 0.635 ± 0.143
0.564CysPro: 0.564 ± 0.182
0.317CysGln: 0.317 ± 0.101
0.494CysArg: 0.494 ± 0.133
0.882CysSer: 0.882 ± 0.174
0.705CysThr: 0.705 ± 0.19
0.811CysVal: 0.811 ± 0.179
0.176CysTrp: 0.176 ± 0.074
0.458CysTyr: 0.458 ± 0.134
0.0CysXaa: 0.0 ± 0.0
Asp
4.725AspAla: 4.725 ± 0.499
0.705AspCys: 0.705 ± 0.174
3.244AspAsp: 3.244 ± 0.388
5.042AspGlu: 5.042 ± 0.472
2.609AspPhe: 2.609 ± 0.354
4.443AspGly: 4.443 ± 0.513
0.67AspHis: 0.67 ± 0.137
3.738AspIle: 3.738 ± 0.348
3.667AspLys: 3.667 ± 0.353
6.629AspLeu: 6.629 ± 0.5
1.975AspMet: 1.975 ± 0.263
2.891AspAsn: 2.891 ± 0.32
2.468AspPro: 2.468 ± 0.287
1.481AspGln: 1.481 ± 0.245
2.962AspArg: 2.962 ± 0.317
4.513AspSer: 4.513 ± 0.438
3.667AspThr: 3.667 ± 0.327
4.196AspVal: 4.196 ± 0.419
1.481AspTrp: 1.481 ± 0.219
2.645AspTyr: 2.645 ± 0.312
0.0AspXaa: 0.0 ± 0.0
Glu
6.594GluAla: 6.594 ± 0.684
0.564GluCys: 0.564 ± 0.158
5.501GluAsp: 5.501 ± 0.493
5.536GluGlu: 5.536 ± 0.505
2.962GluPhe: 2.962 ± 0.325
4.196GluGly: 4.196 ± 0.337
1.446GluHis: 1.446 ± 0.245
4.337GluIle: 4.337 ± 0.501
5.042GluLys: 5.042 ± 0.475
7.616GluLeu: 7.616 ± 0.702
2.75GluMet: 2.75 ± 0.307
3.385GluAsn: 3.385 ± 0.383
1.939GluPro: 1.939 ± 0.27
3.315GluGln: 3.315 ± 0.43
3.702GluArg: 3.702 ± 0.378
3.738GluSer: 3.738 ± 0.329
3.279GluThr: 3.279 ± 0.268
5.712GluVal: 5.712 ± 0.389
0.987GluTrp: 0.987 ± 0.236
3.173GluTyr: 3.173 ± 0.331
0.0GluXaa: 0.0 ± 0.0
Phe
2.08PheAla: 2.08 ± 0.259
0.564PheCys: 0.564 ± 0.134
2.786PheAsp: 2.786 ± 0.308
3.103PheGlu: 3.103 ± 0.356
1.234PhePhe: 1.234 ± 0.25
2.574PheGly: 2.574 ± 0.302
0.705PheHis: 0.705 ± 0.16
1.798PheIle: 1.798 ± 0.215
2.504PheLys: 2.504 ± 0.245
3.173PheLeu: 3.173 ± 0.353
1.058PheMet: 1.058 ± 0.213
1.728PheAsn: 1.728 ± 0.212
1.481PhePro: 1.481 ± 0.197
1.093PheGln: 1.093 ± 0.233
1.657PheArg: 1.657 ± 0.275
2.821PheSer: 2.821 ± 0.335
2.715PheThr: 2.715 ± 0.316
1.975PheVal: 1.975 ± 0.264
0.494PheTrp: 0.494 ± 0.139
1.587PheTyr: 1.587 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
4.654GlyAla: 4.654 ± 0.412
1.058GlyCys: 1.058 ± 0.225
3.843GlyAsp: 3.843 ± 0.384
4.443GlyGlu: 4.443 ± 0.388
2.856GlyPhe: 2.856 ± 0.344
4.126GlyGly: 4.126 ± 0.467
1.199GlyHis: 1.199 ± 0.215
4.549GlyIle: 4.549 ± 0.396
3.949GlyLys: 3.949 ± 0.417
5.113GlyLeu: 5.113 ± 0.393
1.093GlyMet: 1.093 ± 0.191
3.032GlyAsn: 3.032 ± 0.454
1.093GlyPro: 1.093 ± 0.208
2.151GlyGln: 2.151 ± 0.211
3.279GlyArg: 3.279 ± 0.287
4.302GlySer: 4.302 ± 0.521
4.549GlyThr: 4.549 ± 0.572
5.219GlyVal: 5.219 ± 0.5
0.917GlyTrp: 0.917 ± 0.199
2.609GlyTyr: 2.609 ± 0.316
0.0GlyXaa: 0.0 ± 0.0
His
1.093HisAla: 1.093 ± 0.158
0.317HisCys: 0.317 ± 0.146
1.234HisAsp: 1.234 ± 0.184
1.093HisGlu: 1.093 ± 0.208
0.846HisPhe: 0.846 ± 0.186
1.164HisGly: 1.164 ± 0.181
0.635HisHis: 0.635 ± 0.177
1.34HisIle: 1.34 ± 0.241
1.375HisLys: 1.375 ± 0.235
1.622HisLeu: 1.622 ± 0.277
0.564HisMet: 0.564 ± 0.174
0.811HisAsn: 0.811 ± 0.153
0.952HisPro: 0.952 ± 0.184
0.494HisGln: 0.494 ± 0.141
1.305HisArg: 1.305 ± 0.209
0.952HisSer: 0.952 ± 0.221
1.269HisThr: 1.269 ± 0.206
1.023HisVal: 1.023 ± 0.157
0.176HisTrp: 0.176 ± 0.068
0.917HisTyr: 0.917 ± 0.201
0.0HisXaa: 0.0 ± 0.0
Ile
3.42IleAla: 3.42 ± 0.344
0.635IleCys: 0.635 ± 0.15
4.09IleAsp: 4.09 ± 0.397
4.372IleGlu: 4.372 ± 0.4
1.622IlePhe: 1.622 ± 0.284
2.891IleGly: 2.891 ± 0.313
0.846IleHis: 0.846 ± 0.156
2.68IleIle: 2.68 ± 0.306
3.526IleLys: 3.526 ± 0.353
4.478IleLeu: 4.478 ± 0.459
1.375IleMet: 1.375 ± 0.19
2.997IleAsn: 2.997 ± 0.35
2.504IlePro: 2.504 ± 0.294
2.68IleGln: 2.68 ± 0.336
3.279IleArg: 3.279 ± 0.336
4.372IleSer: 4.372 ± 0.451
4.126IleThr: 4.126 ± 0.355
4.443IleVal: 4.443 ± 0.345
0.529IleTrp: 0.529 ± 0.124
2.327IleTyr: 2.327 ± 0.311
0.0IleXaa: 0.0 ± 0.0
Lys
6.241LysAla: 6.241 ± 0.54
0.67LysCys: 0.67 ± 0.162
3.632LysAsp: 3.632 ± 0.326
5.219LysGlu: 5.219 ± 0.573
2.116LysPhe: 2.116 ± 0.303
3.491LysGly: 3.491 ± 0.367
1.622LysHis: 1.622 ± 0.224
3.526LysIle: 3.526 ± 0.385
3.42LysLys: 3.42 ± 0.489
6.77LysLeu: 6.77 ± 0.438
2.116LysMet: 2.116 ± 0.316
2.927LysAsn: 2.927 ± 0.342
2.468LysPro: 2.468 ± 0.45
2.151LysGln: 2.151 ± 0.288
2.75LysArg: 2.75 ± 0.359
3.561LysSer: 3.561 ± 0.431
3.667LysThr: 3.667 ± 0.464
4.831LysVal: 4.831 ± 0.403
0.67LysTrp: 0.67 ± 0.159
3.209LysTyr: 3.209 ± 0.337
0.0LysXaa: 0.0 ± 0.0
Leu
7.017LeuAla: 7.017 ± 0.557
0.811LeuCys: 0.811 ± 0.165
6.523LeuAsp: 6.523 ± 0.45
7.264LeuGlu: 7.264 ± 0.46
2.539LeuPhe: 2.539 ± 0.283
6.065LeuGly: 6.065 ± 0.455
1.939LeuHis: 1.939 ± 0.321
4.408LeuIle: 4.408 ± 0.369
5.712LeuLys: 5.712 ± 0.488
8.956LeuLeu: 8.956 ± 0.654
1.904LeuMet: 1.904 ± 0.291
4.231LeuAsn: 4.231 ± 0.379
3.843LeuPro: 3.843 ± 0.343
3.561LeuGln: 3.561 ± 0.391
4.937LeuArg: 4.937 ± 0.453
5.959LeuSer: 5.959 ± 0.543
5.324LeuThr: 5.324 ± 0.421
6.03LeuVal: 6.03 ± 0.404
0.564LeuTrp: 0.564 ± 0.131
3.032LeuTyr: 3.032 ± 0.39
0.0LeuXaa: 0.0 ± 0.0
Met
2.045MetAla: 2.045 ± 0.27
0.212MetCys: 0.212 ± 0.088
1.234MetAsp: 1.234 ± 0.219
1.904MetGlu: 1.904 ± 0.301
0.811MetPhe: 0.811 ± 0.178
1.693MetGly: 1.693 ± 0.191
0.599MetHis: 0.599 ± 0.151
1.234MetIle: 1.234 ± 0.214
1.728MetLys: 1.728 ± 0.277
2.433MetLeu: 2.433 ± 0.258
0.388MetMet: 0.388 ± 0.122
0.917MetAsn: 0.917 ± 0.188
0.846MetPro: 0.846 ± 0.178
1.023MetGln: 1.023 ± 0.229
1.199MetArg: 1.199 ± 0.26
2.257MetSer: 2.257 ± 0.303
1.657MetThr: 1.657 ± 0.292
1.657MetVal: 1.657 ± 0.192
0.176MetTrp: 0.176 ± 0.077
1.058MetTyr: 1.058 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
3.808AsnAla: 3.808 ± 0.384
0.494AsnCys: 0.494 ± 0.158
1.763AsnAsp: 1.763 ± 0.283
2.821AsnGlu: 2.821 ± 0.39
1.587AsnPhe: 1.587 ± 0.219
3.279AsnGly: 3.279 ± 0.416
0.67AsnHis: 0.67 ± 0.167
2.609AsnIle: 2.609 ± 0.341
3.138AsnLys: 3.138 ± 0.324
3.773AsnLeu: 3.773 ± 0.335
1.41AsnMet: 1.41 ± 0.23
2.045AsnAsn: 2.045 ± 0.386
2.786AsnPro: 2.786 ± 0.325
1.693AsnGln: 1.693 ± 0.282
2.151AsnArg: 2.151 ± 0.293
3.456AsnSer: 3.456 ± 0.394
2.821AsnThr: 2.821 ± 0.323
2.609AsnVal: 2.609 ± 0.315
0.811AsnTrp: 0.811 ± 0.174
1.798AsnTyr: 1.798 ± 0.269
0.0AsnXaa: 0.0 ± 0.0
Pro
2.08ProAla: 2.08 ± 0.251
0.458ProCys: 0.458 ± 0.13
2.116ProAsp: 2.116 ± 0.322
3.42ProGlu: 3.42 ± 0.308
1.269ProPhe: 1.269 ± 0.185
2.362ProGly: 2.362 ± 0.248
0.564ProHis: 0.564 ± 0.171
2.856ProIle: 2.856 ± 0.344
2.468ProLys: 2.468 ± 0.34
2.856ProLeu: 2.856 ± 0.336
0.635ProMet: 0.635 ± 0.143
1.622ProAsn: 1.622 ± 0.256
0.882ProPro: 0.882 ± 0.232
1.093ProGln: 1.093 ± 0.179
1.587ProArg: 1.587 ± 0.26
2.398ProSer: 2.398 ± 0.392
2.715ProThr: 2.715 ± 0.356
3.068ProVal: 3.068 ± 0.294
0.353ProTrp: 0.353 ± 0.118
1.164ProTyr: 1.164 ± 0.169
0.0ProXaa: 0.0 ± 0.0
Gln
2.539GlnAla: 2.539 ± 0.305
0.388GlnCys: 0.388 ± 0.128
2.292GlnAsp: 2.292 ± 0.269
2.997GlnGlu: 2.997 ± 0.421
1.481GlnPhe: 1.481 ± 0.232
2.574GlnGly: 2.574 ± 0.247
0.882GlnHis: 0.882 ± 0.155
1.869GlnIle: 1.869 ± 0.215
2.468GlnLys: 2.468 ± 0.308
3.315GlnLeu: 3.315 ± 0.311
0.987GlnMet: 0.987 ± 0.175
1.446GlnAsn: 1.446 ± 0.208
1.058GlnPro: 1.058 ± 0.212
1.269GlnGln: 1.269 ± 0.216
1.587GlnArg: 1.587 ± 0.255
1.798GlnSer: 1.798 ± 0.276
1.869GlnThr: 1.869 ± 0.239
2.362GlnVal: 2.362 ± 0.262
0.458GlnTrp: 0.458 ± 0.126
1.481GlnTyr: 1.481 ± 0.185
0.0GlnXaa: 0.0 ± 0.0
Arg
3.914ArgAla: 3.914 ± 0.394
0.529ArgCys: 0.529 ± 0.148
2.821ArgAsp: 2.821 ± 0.368
3.561ArgGlu: 3.561 ± 0.359
1.939ArgPhe: 1.939 ± 0.253
3.209ArgGly: 3.209 ± 0.327
0.811ArgHis: 0.811 ± 0.163
2.856ArgIle: 2.856 ± 0.313
3.526ArgLys: 3.526 ± 0.401
4.901ArgLeu: 4.901 ± 0.521
1.023ArgMet: 1.023 ± 0.19
2.362ArgAsn: 2.362 ± 0.241
1.551ArgPro: 1.551 ± 0.26
1.763ArgGln: 1.763 ± 0.284
2.362ArgArg: 2.362 ± 0.326
2.609ArgSer: 2.609 ± 0.306
2.962ArgThr: 2.962 ± 0.366
3.526ArgVal: 3.526 ± 0.325
0.529ArgTrp: 0.529 ± 0.134
1.657ArgTyr: 1.657 ± 0.207
0.0ArgXaa: 0.0 ± 0.0
Ser
5.007SerAla: 5.007 ± 0.582
0.917SerCys: 0.917 ± 0.203
4.443SerAsp: 4.443 ± 0.387
4.478SerGlu: 4.478 ± 0.461
2.856SerPhe: 2.856 ± 0.349
4.478SerGly: 4.478 ± 0.496
0.952SerHis: 0.952 ± 0.221
3.738SerIle: 3.738 ± 0.388
4.126SerLys: 4.126 ± 0.38
5.042SerLeu: 5.042 ± 0.395
1.904SerMet: 1.904 ± 0.269
3.138SerAsn: 3.138 ± 0.408
2.186SerPro: 2.186 ± 0.266
1.904SerGln: 1.904 ± 0.34
3.173SerArg: 3.173 ± 0.343
4.972SerSer: 4.972 ± 0.691
4.302SerThr: 4.302 ± 0.513
4.02SerVal: 4.02 ± 0.408
1.058SerTrp: 1.058 ± 0.226
2.01SerTyr: 2.01 ± 0.29
0.0SerXaa: 0.0 ± 0.0
Thr
4.619ThrAla: 4.619 ± 0.615
0.67ThrCys: 0.67 ± 0.223
3.879ThrAsp: 3.879 ± 0.414
4.126ThrGlu: 4.126 ± 0.435
2.539ThrPhe: 2.539 ± 0.258
4.972ThrGly: 4.972 ± 0.483
1.446ThrHis: 1.446 ± 0.245
3.879ThrIle: 3.879 ± 0.344
3.949ThrLys: 3.949 ± 0.396
4.901ThrLeu: 4.901 ± 0.425
0.811ThrMet: 0.811 ± 0.158
2.645ThrAsn: 2.645 ± 0.35
2.68ThrPro: 2.68 ± 0.274
2.292ThrGln: 2.292 ± 0.264
2.68ThrArg: 2.68 ± 0.316
3.738ThrSer: 3.738 ± 0.519
3.773ThrThr: 3.773 ± 0.442
4.372ThrVal: 4.372 ± 0.408
0.776ThrTrp: 0.776 ± 0.216
2.645ThrTyr: 2.645 ± 0.301
0.0ThrXaa: 0.0 ± 0.0
Val
5.43ValAla: 5.43 ± 0.458
0.705ValCys: 0.705 ± 0.143
5.219ValAsp: 5.219 ± 0.428
5.324ValGlu: 5.324 ± 0.36
2.962ValPhe: 2.962 ± 0.264
4.126ValGly: 4.126 ± 0.376
1.516ValHis: 1.516 ± 0.248
3.914ValIle: 3.914 ± 0.432
4.302ValLys: 4.302 ± 0.411
6.206ValLeu: 6.206 ± 0.46
1.093ValMet: 1.093 ± 0.177
2.75ValAsn: 2.75 ± 0.288
2.609ValPro: 2.609 ± 0.327
2.08ValGln: 2.08 ± 0.27
3.173ValArg: 3.173 ± 0.342
4.478ValSer: 4.478 ± 0.391
4.866ValThr: 4.866 ± 0.467
5.501ValVal: 5.501 ± 0.476
0.776ValTrp: 0.776 ± 0.213
2.786ValTyr: 2.786 ± 0.249
0.0ValXaa: 0.0 ± 0.0
Trp
0.882TrpAla: 0.882 ± 0.201
0.176TrpCys: 0.176 ± 0.078
1.023TrpAsp: 1.023 ± 0.238
1.023TrpGlu: 1.023 ± 0.208
0.494TrpPhe: 0.494 ± 0.137
0.776TrpGly: 0.776 ± 0.181
0.176TrpHis: 0.176 ± 0.077
0.564TrpIle: 0.564 ± 0.113
0.67TrpLys: 0.67 ± 0.163
1.199TrpLeu: 1.199 ± 0.23
0.67TrpMet: 0.67 ± 0.158
0.846TrpAsn: 0.846 ± 0.171
0.106TrpPro: 0.106 ± 0.061
0.317TrpGln: 0.317 ± 0.108
0.494TrpArg: 0.494 ± 0.146
0.776TrpSer: 0.776 ± 0.201
0.74TrpThr: 0.74 ± 0.178
0.952TrpVal: 0.952 ± 0.149
0.106TrpTrp: 0.106 ± 0.063
0.458TrpTyr: 0.458 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.962TyrAla: 2.962 ± 0.322
0.776TyrCys: 0.776 ± 0.17
3.032TyrAsp: 3.032 ± 0.31
2.292TyrGlu: 2.292 ± 0.297
1.375TyrPhe: 1.375 ± 0.207
2.327TyrGly: 2.327 ± 0.265
1.164TyrHis: 1.164 ± 0.201
2.292TyrIle: 2.292 ± 0.271
2.715TyrLys: 2.715 ± 0.256
3.35TyrLeu: 3.35 ± 0.322
0.882TyrMet: 0.882 ± 0.188
1.622TyrAsn: 1.622 ± 0.217
1.481TyrPro: 1.481 ± 0.27
1.728TyrGln: 1.728 ± 0.294
2.186TyrArg: 2.186 ± 0.291
2.433TyrSer: 2.433 ± 0.301
2.398TyrThr: 2.398 ± 0.289
2.609TyrVal: 2.609 ± 0.327
0.529TyrTrp: 0.529 ± 0.13
1.622TyrTyr: 1.622 ± 0.262
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 151 proteins (28361 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski