Amino acid dipepetide frequency for Vibrio phage Va_90-11-287_p41_Ba35

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.698AlaAla: 6.698 ± 0.68
0.533AlaCys: 0.533 ± 0.174
4.92AlaAsp: 4.92 ± 0.684
4.209AlaGlu: 4.209 ± 0.546
2.964AlaPhe: 2.964 ± 0.484
5.453AlaGly: 5.453 ± 0.671
1.008AlaHis: 1.008 ± 0.246
6.876AlaIle: 6.876 ± 0.726
6.461AlaLys: 6.461 ± 0.569
6.758AlaLeu: 6.758 ± 0.659
2.964AlaMet: 2.964 ± 0.508
3.972AlaAsn: 3.972 ± 0.363
1.956AlaPro: 1.956 ± 0.359
3.023AlaGln: 3.023 ± 0.455
3.26AlaArg: 3.26 ± 0.408
5.75AlaSer: 5.75 ± 0.809
3.794AlaThr: 3.794 ± 0.441
4.979AlaVal: 4.979 ± 0.548
1.126AlaTrp: 1.126 ± 0.271
2.549AlaTyr: 2.549 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.186
0.356CysCys: 0.356 ± 0.146
1.126CysAsp: 1.126 ± 0.275
1.067CysGlu: 1.067 ± 0.309
0.296CysPhe: 0.296 ± 0.126
1.067CysGly: 1.067 ± 0.305
0.474CysHis: 0.474 ± 0.158
0.593CysIle: 0.593 ± 0.186
0.593CysLys: 0.593 ± 0.185
1.126CysLeu: 1.126 ± 0.305
0.474CysMet: 0.474 ± 0.171
0.415CysAsn: 0.415 ± 0.145
0.415CysPro: 0.415 ± 0.166
0.415CysGln: 0.415 ± 0.173
0.771CysArg: 0.771 ± 0.24
1.008CysSer: 1.008 ± 0.25
1.008CysThr: 1.008 ± 0.298
0.948CysVal: 0.948 ± 0.228
0.711CysTrp: 0.711 ± 0.205
0.771CysTyr: 0.771 ± 0.199
0.0CysXaa: 0.0 ± 0.0
Asp
4.564AspAla: 4.564 ± 0.587
0.771AspCys: 0.771 ± 0.265
3.794AspAsp: 3.794 ± 0.474
4.031AspGlu: 4.031 ± 0.569
2.549AspPhe: 2.549 ± 0.434
5.335AspGly: 5.335 ± 0.692
0.83AspHis: 0.83 ± 0.225
3.675AspIle: 3.675 ± 0.414
3.201AspLys: 3.201 ± 0.467
4.031AspLeu: 4.031 ± 0.383
1.363AspMet: 1.363 ± 0.254
2.845AspAsn: 2.845 ± 0.419
2.015AspPro: 2.015 ± 0.371
2.253AspGln: 2.253 ± 0.4
2.49AspArg: 2.49 ± 0.422
4.031AspSer: 4.031 ± 0.566
2.43AspThr: 2.43 ± 0.457
2.608AspVal: 2.608 ± 0.335
1.186AspTrp: 1.186 ± 0.245
2.371AspTyr: 2.371 ± 0.377
0.0AspXaa: 0.0 ± 0.0
Glu
4.327GluAla: 4.327 ± 0.623
0.83GluCys: 0.83 ± 0.219
2.727GluAsp: 2.727 ± 0.412
3.616GluGlu: 3.616 ± 0.532
3.023GluPhe: 3.023 ± 0.425
3.082GluGly: 3.082 ± 0.424
1.363GluHis: 1.363 ± 0.301
4.801GluIle: 4.801 ± 0.484
3.379GluLys: 3.379 ± 0.608
7.113GluLeu: 7.113 ± 0.718
2.43GluMet: 2.43 ± 0.455
2.964GluAsn: 2.964 ± 0.441
2.786GluPro: 2.786 ± 0.762
4.031GluGln: 4.031 ± 0.622
3.023GluArg: 3.023 ± 0.436
5.809GluSer: 5.809 ± 0.618
2.371GluThr: 2.371 ± 0.372
4.09GluVal: 4.09 ± 0.603
1.186GluTrp: 1.186 ± 0.253
1.482GluTyr: 1.482 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
2.549PheAla: 2.549 ± 0.381
0.593PheCys: 0.593 ± 0.173
2.667PheAsp: 2.667 ± 0.41
2.253PheGlu: 2.253 ± 0.356
1.423PhePhe: 1.423 ± 0.329
3.201PheGly: 3.201 ± 0.493
0.474PheHis: 0.474 ± 0.155
1.778PheIle: 1.778 ± 0.355
2.845PheLys: 2.845 ± 0.37
3.142PheLeu: 3.142 ± 0.441
0.652PheMet: 0.652 ± 0.207
2.015PheAsn: 2.015 ± 0.288
1.067PhePro: 1.067 ± 0.292
0.83PheGln: 0.83 ± 0.257
1.778PheArg: 1.778 ± 0.295
2.905PheSer: 2.905 ± 0.46
2.43PheThr: 2.43 ± 0.426
2.193PheVal: 2.193 ± 0.333
0.711PheTrp: 0.711 ± 0.241
1.245PheTyr: 1.245 ± 0.262
0.0PheXaa: 0.0 ± 0.0
Gly
5.987GlyAla: 5.987 ± 0.65
1.008GlyCys: 1.008 ± 0.277
3.734GlyAsp: 3.734 ± 0.725
4.327GlyGlu: 4.327 ± 0.498
2.845GlyPhe: 2.845 ± 0.37
5.631GlyGly: 5.631 ± 0.582
0.889GlyHis: 0.889 ± 0.206
4.683GlyIle: 4.683 ± 0.589
4.505GlyLys: 4.505 ± 0.622
5.276GlyLeu: 5.276 ± 0.593
2.015GlyMet: 2.015 ± 0.375
3.082GlyAsn: 3.082 ± 0.397
1.186GlyPro: 1.186 ± 0.23
2.312GlyGln: 2.312 ± 0.369
2.667GlyArg: 2.667 ± 0.385
5.039GlySer: 5.039 ± 0.607
4.446GlyThr: 4.446 ± 0.619
5.987GlyVal: 5.987 ± 0.608
1.423GlyTrp: 1.423 ± 0.25
2.845GlyTyr: 2.845 ± 0.507
0.0GlyXaa: 0.0 ± 0.0
His
1.008HisAla: 1.008 ± 0.325
0.474HisCys: 0.474 ± 0.174
1.245HisAsp: 1.245 ± 0.301
1.363HisGlu: 1.363 ± 0.307
0.771HisPhe: 0.771 ± 0.206
1.245HisGly: 1.245 ± 0.279
0.711HisHis: 0.711 ± 0.286
1.186HisIle: 1.186 ± 0.243
1.186HisLys: 1.186 ± 0.302
1.482HisLeu: 1.482 ± 0.26
0.474HisMet: 0.474 ± 0.189
0.652HisAsn: 0.652 ± 0.205
0.889HisPro: 0.889 ± 0.265
0.711HisGln: 0.711 ± 0.17
0.889HisArg: 0.889 ± 0.334
1.067HisSer: 1.067 ± 0.222
0.83HisThr: 0.83 ± 0.225
1.304HisVal: 1.304 ± 0.288
0.296HisTrp: 0.296 ± 0.13
0.474HisTyr: 0.474 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
6.402IleAla: 6.402 ± 0.605
1.186IleCys: 1.186 ± 0.262
4.149IleAsp: 4.149 ± 0.51
5.039IleGlu: 5.039 ± 0.525
1.719IlePhe: 1.719 ± 0.34
5.216IleGly: 5.216 ± 0.581
1.067IleHis: 1.067 ± 0.279
3.201IleIle: 3.201 ± 0.393
5.513IleLys: 5.513 ± 0.527
3.438IleLeu: 3.438 ± 0.473
1.067IleMet: 1.067 ± 0.274
4.209IleAsn: 4.209 ± 0.506
2.43IlePro: 2.43 ± 0.356
2.312IleGln: 2.312 ± 0.42
3.082IleArg: 3.082 ± 0.409
6.046IleSer: 6.046 ± 0.698
3.794IleThr: 3.794 ± 0.366
3.438IleVal: 3.438 ± 0.342
0.652IleTrp: 0.652 ± 0.162
2.371IleTyr: 2.371 ± 0.374
0.0IleXaa: 0.0 ± 0.0
Lys
7.232LysAla: 7.232 ± 0.901
1.186LysCys: 1.186 ± 0.33
2.727LysAsp: 2.727 ± 0.405
4.861LysGlu: 4.861 ± 0.703
2.253LysPhe: 2.253 ± 0.361
3.912LysGly: 3.912 ± 0.556
1.363LysHis: 1.363 ± 0.28
3.082LysIle: 3.082 ± 0.399
4.564LysLys: 4.564 ± 0.713
5.157LysLeu: 5.157 ± 0.646
2.015LysMet: 2.015 ± 0.342
2.905LysAsn: 2.905 ± 0.328
3.379LysPro: 3.379 ± 0.631
3.853LysGln: 3.853 ± 0.553
3.972LysArg: 3.972 ± 0.656
4.801LysSer: 4.801 ± 0.61
3.912LysThr: 3.912 ± 0.468
3.379LysVal: 3.379 ± 0.475
0.948LysTrp: 0.948 ± 0.238
1.719LysTyr: 1.719 ± 0.257
0.0LysXaa: 0.0 ± 0.0
Leu
6.58LeuAla: 6.58 ± 0.668
1.008LeuCys: 1.008 ± 0.244
3.972LeuAsp: 3.972 ± 0.399
5.157LeuGlu: 5.157 ± 0.626
2.253LeuPhe: 2.253 ± 0.364
4.149LeuGly: 4.149 ± 0.512
1.482LeuHis: 1.482 ± 0.291
5.631LeuIle: 5.631 ± 0.602
5.394LeuLys: 5.394 ± 0.563
5.276LeuLeu: 5.276 ± 0.53
2.312LeuMet: 2.312 ± 0.33
3.972LeuAsn: 3.972 ± 0.558
3.438LeuPro: 3.438 ± 0.392
2.727LeuGln: 2.727 ± 0.409
3.497LeuArg: 3.497 ± 0.485
6.817LeuSer: 6.817 ± 0.647
5.098LeuThr: 5.098 ± 0.438
5.276LeuVal: 5.276 ± 0.629
1.067LeuTrp: 1.067 ± 0.199
2.786LeuTyr: 2.786 ± 0.427
0.0LeuXaa: 0.0 ± 0.0
Met
2.193MetAla: 2.193 ± 0.368
0.296MetCys: 0.296 ± 0.117
1.363MetAsp: 1.363 ± 0.31
1.6MetGlu: 1.6 ± 0.356
0.83MetPhe: 0.83 ± 0.229
0.948MetGly: 0.948 ± 0.197
0.356MetHis: 0.356 ± 0.119
1.719MetIle: 1.719 ± 0.296
2.371MetLys: 2.371 ± 0.495
2.49MetLeu: 2.49 ± 0.368
0.83MetMet: 0.83 ± 0.234
1.423MetAsn: 1.423 ± 0.279
1.719MetPro: 1.719 ± 0.361
1.304MetGln: 1.304 ± 0.242
1.423MetArg: 1.423 ± 0.261
2.845MetSer: 2.845 ± 0.39
1.6MetThr: 1.6 ± 0.311
1.66MetVal: 1.66 ± 0.288
0.415MetTrp: 0.415 ± 0.178
0.474MetTyr: 0.474 ± 0.154
0.0MetXaa: 0.0 ± 0.0
Asn
4.268AsnAla: 4.268 ± 0.52
0.83AsnCys: 0.83 ± 0.29
2.608AsnAsp: 2.608 ± 0.505
2.549AsnGlu: 2.549 ± 0.338
1.245AsnPhe: 1.245 ± 0.233
4.505AsnGly: 4.505 ± 0.561
1.363AsnHis: 1.363 ± 0.326
2.845AsnIle: 2.845 ± 0.452
4.268AsnLys: 4.268 ± 0.513
3.32AsnLeu: 3.32 ± 0.388
1.304AsnMet: 1.304 ± 0.291
3.379AsnAsn: 3.379 ± 0.522
1.956AsnPro: 1.956 ± 0.368
2.49AsnGln: 2.49 ± 0.378
2.786AsnArg: 2.786 ± 0.389
2.727AsnSer: 2.727 ± 0.446
1.6AsnThr: 1.6 ± 0.33
3.675AsnVal: 3.675 ± 0.458
0.948AsnTrp: 0.948 ± 0.208
1.719AsnTyr: 1.719 ± 0.274
0.0AsnXaa: 0.0 ± 0.0
Pro
2.905ProAla: 2.905 ± 0.366
0.652ProCys: 0.652 ± 0.197
2.312ProAsp: 2.312 ± 0.333
3.794ProGlu: 3.794 ± 0.879
1.067ProPhe: 1.067 ± 0.208
1.541ProGly: 1.541 ± 0.301
0.889ProHis: 0.889 ± 0.228
2.253ProIle: 2.253 ± 0.45
1.838ProLys: 1.838 ± 0.372
2.727ProLeu: 2.727 ± 0.475
1.304ProMet: 1.304 ± 0.289
1.778ProAsn: 1.778 ± 0.343
1.541ProPro: 1.541 ± 0.431
1.363ProGln: 1.363 ± 0.283
1.897ProArg: 1.897 ± 0.409
3.734ProSer: 3.734 ± 0.659
2.43ProThr: 2.43 ± 0.429
2.49ProVal: 2.49 ± 0.35
0.296ProTrp: 0.296 ± 0.112
1.008ProTyr: 1.008 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
3.26GlnAla: 3.26 ± 0.472
0.356GlnCys: 0.356 ± 0.157
1.245GlnAsp: 1.245 ± 0.264
2.49GlnGlu: 2.49 ± 0.475
1.838GlnPhe: 1.838 ± 0.334
2.253GlnGly: 2.253 ± 0.405
0.474GlnHis: 0.474 ± 0.135
3.023GlnIle: 3.023 ± 0.501
2.43GlnLys: 2.43 ± 0.349
4.386GlnLeu: 4.386 ± 0.662
1.482GlnMet: 1.482 ± 0.284
2.075GlnAsn: 2.075 ± 0.408
1.66GlnPro: 1.66 ± 0.346
2.371GlnGln: 2.371 ± 0.603
1.66GlnArg: 1.66 ± 0.288
3.497GlnSer: 3.497 ± 0.438
1.838GlnThr: 1.838 ± 0.313
2.134GlnVal: 2.134 ± 0.345
0.415GlnTrp: 0.415 ± 0.16
1.897GlnTyr: 1.897 ± 0.35
0.0GlnXaa: 0.0 ± 0.0
Arg
3.26ArgAla: 3.26 ± 0.383
0.296ArgCys: 0.296 ± 0.153
2.312ArgAsp: 2.312 ± 0.346
3.26ArgGlu: 3.26 ± 0.594
2.43ArgPhe: 2.43 ± 0.397
2.312ArgGly: 2.312 ± 0.36
0.889ArgHis: 0.889 ± 0.237
4.149ArgIle: 4.149 ± 0.496
3.438ArgLys: 3.438 ± 0.522
4.742ArgLeu: 4.742 ± 0.547
1.66ArgMet: 1.66 ± 0.292
2.371ArgAsn: 2.371 ± 0.384
1.482ArgPro: 1.482 ± 0.439
1.245ArgGln: 1.245 ± 0.34
2.905ArgArg: 2.905 ± 0.525
3.972ArgSer: 3.972 ± 0.43
2.193ArgThr: 2.193 ± 0.338
3.142ArgVal: 3.142 ± 0.477
0.474ArgTrp: 0.474 ± 0.153
1.838ArgTyr: 1.838 ± 0.377
0.0ArgXaa: 0.0 ± 0.0
Ser
5.868SerAla: 5.868 ± 0.666
0.948SerCys: 0.948 ± 0.251
4.327SerAsp: 4.327 ± 0.563
5.394SerGlu: 5.394 ± 0.501
3.734SerPhe: 3.734 ± 0.479
7.35SerGly: 7.35 ± 0.764
1.304SerHis: 1.304 ± 0.309
6.046SerIle: 6.046 ± 0.884
4.564SerLys: 4.564 ± 0.675
5.335SerLeu: 5.335 ± 0.499
1.423SerMet: 1.423 ± 0.229
3.972SerAsn: 3.972 ± 0.446
2.964SerPro: 2.964 ± 0.548
3.023SerGln: 3.023 ± 0.444
3.616SerArg: 3.616 ± 0.541
5.868SerSer: 5.868 ± 0.704
3.675SerThr: 3.675 ± 0.537
4.92SerVal: 4.92 ± 0.605
0.771SerTrp: 0.771 ± 0.22
3.023SerTyr: 3.023 ± 0.407
0.0SerXaa: 0.0 ± 0.0
Thr
4.268ThrAla: 4.268 ± 0.536
0.711ThrCys: 0.711 ± 0.208
3.675ThrAsp: 3.675 ± 0.432
2.549ThrGlu: 2.549 ± 0.362
1.719ThrPhe: 1.719 ± 0.425
5.157ThrGly: 5.157 ± 0.501
1.008ThrHis: 1.008 ± 0.264
3.557ThrIle: 3.557 ± 0.486
3.082ThrLys: 3.082 ± 0.533
4.327ThrLeu: 4.327 ± 0.524
1.482ThrMet: 1.482 ± 0.305
2.667ThrAsn: 2.667 ± 0.497
2.727ThrPro: 2.727 ± 0.434
2.253ThrGln: 2.253 ± 0.416
2.371ThrArg: 2.371 ± 0.453
3.201ThrSer: 3.201 ± 0.446
2.964ThrThr: 2.964 ± 0.485
3.794ThrVal: 3.794 ± 0.613
0.474ThrTrp: 0.474 ± 0.167
1.897ThrTyr: 1.897 ± 0.281
0.0ThrXaa: 0.0 ± 0.0
Val
4.209ValAla: 4.209 ± 0.471
0.771ValCys: 0.771 ± 0.227
4.564ValAsp: 4.564 ± 0.646
3.972ValGlu: 3.972 ± 0.477
2.134ValPhe: 2.134 ± 0.298
4.624ValGly: 4.624 ± 0.648
1.126ValHis: 1.126 ± 0.281
3.794ValIle: 3.794 ± 0.501
4.861ValLys: 4.861 ± 0.588
3.853ValLeu: 3.853 ± 0.378
1.719ValMet: 1.719 ± 0.326
3.734ValAsn: 3.734 ± 0.374
2.371ValPro: 2.371 ± 0.345
1.838ValGln: 1.838 ± 0.34
3.201ValArg: 3.201 ± 0.417
4.801ValSer: 4.801 ± 0.519
4.386ValThr: 4.386 ± 0.581
3.675ValVal: 3.675 ± 0.543
0.948ValTrp: 0.948 ± 0.238
2.371ValTyr: 2.371 ± 0.291
0.0ValXaa: 0.0 ± 0.0
Trp
0.83TrpAla: 0.83 ± 0.194
0.474TrpCys: 0.474 ± 0.153
0.711TrpAsp: 0.711 ± 0.202
1.245TrpGlu: 1.245 ± 0.278
0.593TrpPhe: 0.593 ± 0.189
0.771TrpGly: 0.771 ± 0.248
0.356TrpHis: 0.356 ± 0.15
0.948TrpIle: 0.948 ± 0.231
1.186TrpLys: 1.186 ± 0.329
1.304TrpLeu: 1.304 ± 0.294
0.178TrpMet: 0.178 ± 0.087
0.593TrpAsn: 0.593 ± 0.181
0.711TrpPro: 0.711 ± 0.191
0.771TrpGln: 0.771 ± 0.175
0.948TrpArg: 0.948 ± 0.178
0.889TrpSer: 0.889 ± 0.205
0.771TrpThr: 0.771 ± 0.267
0.948TrpVal: 0.948 ± 0.271
0.533TrpTrp: 0.533 ± 0.204
0.415TrpTyr: 0.415 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.193TyrAla: 2.193 ± 0.394
1.008TyrCys: 1.008 ± 0.227
2.193TyrAsp: 2.193 ± 0.329
1.66TyrGlu: 1.66 ± 0.325
1.126TyrPhe: 1.126 ± 0.285
2.134TyrGly: 2.134 ± 0.391
0.83TyrHis: 0.83 ± 0.234
2.49TyrIle: 2.49 ± 0.381
1.541TyrLys: 1.541 ± 0.302
2.608TyrLeu: 2.608 ± 0.323
0.652TyrMet: 0.652 ± 0.213
1.363TyrAsn: 1.363 ± 0.275
1.126TyrPro: 1.126 ± 0.265
1.66TyrGln: 1.66 ± 0.272
2.134TyrArg: 2.134 ± 0.398
3.32TyrSer: 3.32 ± 0.473
2.312TyrThr: 2.312 ± 0.442
2.312TyrVal: 2.312 ± 0.446
0.593TyrTrp: 0.593 ± 0.18
0.711TyrTyr: 0.711 ± 0.192
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (16871 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski