Amino acid dipepetide frequency for Vibrio phage ICP2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.602AlaAla: 6.602 ± 1.147
0.712AlaCys: 0.712 ± 0.237
4.013AlaAsp: 4.013 ± 0.598
5.761AlaGlu: 5.761 ± 0.661
2.33AlaPhe: 2.33 ± 0.407
5.955AlaGly: 5.955 ± 0.722
1.23AlaHis: 1.23 ± 0.251
3.625AlaIle: 3.625 ± 0.404
4.531AlaLys: 4.531 ± 0.59
5.437AlaLeu: 5.437 ± 0.632
2.913AlaMet: 2.913 ± 0.443
3.366AlaAsn: 3.366 ± 0.543
2.654AlaPro: 2.654 ± 0.494
3.689AlaGln: 3.689 ± 0.706
4.078AlaArg: 4.078 ± 0.532
4.272AlaSer: 4.272 ± 0.48
6.019AlaThr: 6.019 ± 0.56
5.307AlaVal: 5.307 ± 0.623
1.1AlaTrp: 1.1 ± 0.248
3.301AlaTyr: 3.301 ± 0.548
0.0AlaXaa: 0.0 ± 0.0
Cys
0.583CysAla: 0.583 ± 0.215
0.129CysCys: 0.129 ± 0.083
0.583CysAsp: 0.583 ± 0.24
0.906CysGlu: 0.906 ± 0.315
0.194CysPhe: 0.194 ± 0.11
0.906CysGly: 0.906 ± 0.209
0.194CysHis: 0.194 ± 0.154
0.518CysIle: 0.518 ± 0.202
0.841CysLys: 0.841 ± 0.244
0.841CysLeu: 0.841 ± 0.229
0.259CysMet: 0.259 ± 0.152
0.388CysAsn: 0.388 ± 0.159
0.388CysPro: 0.388 ± 0.154
0.194CysGln: 0.194 ± 0.114
0.777CysArg: 0.777 ± 0.24
0.518CysSer: 0.518 ± 0.175
0.453CysThr: 0.453 ± 0.171
0.906CysVal: 0.906 ± 0.246
0.194CysTrp: 0.194 ± 0.115
0.647CysTyr: 0.647 ± 0.226
0.0CysXaa: 0.0 ± 0.0
Asp
4.854AspAla: 4.854 ± 0.888
1.1AspCys: 1.1 ± 0.267
3.43AspAsp: 3.43 ± 0.43
4.272AspGlu: 4.272 ± 0.494
2.395AspPhe: 2.395 ± 0.383
4.854AspGly: 4.854 ± 0.671
1.036AspHis: 1.036 ± 0.248
3.689AspIle: 3.689 ± 0.649
4.401AspLys: 4.401 ± 0.514
4.142AspLeu: 4.142 ± 0.536
1.748AspMet: 1.748 ± 0.352
3.236AspAsn: 3.236 ± 0.433
2.395AspPro: 2.395 ± 0.359
1.165AspGln: 1.165 ± 0.227
2.395AspArg: 2.395 ± 0.471
3.883AspSer: 3.883 ± 0.487
3.948AspThr: 3.948 ± 0.496
4.272AspVal: 4.272 ± 0.412
0.841AspTrp: 0.841 ± 0.229
2.654AspTyr: 2.654 ± 0.335
0.0AspXaa: 0.0 ± 0.0
Glu
4.984GluAla: 4.984 ± 0.635
0.777GluCys: 0.777 ± 0.223
4.013GluAsp: 4.013 ± 0.526
5.307GluGlu: 5.307 ± 0.82
3.301GluPhe: 3.301 ± 0.428
5.243GluGly: 5.243 ± 0.653
1.23GluHis: 1.23 ± 0.287
4.725GluIle: 4.725 ± 0.668
3.689GluLys: 3.689 ± 0.617
5.372GluLeu: 5.372 ± 0.473
2.071GluMet: 2.071 ± 0.374
3.107GluAsn: 3.107 ± 0.462
1.424GluPro: 1.424 ± 0.288
3.301GluGln: 3.301 ± 0.511
3.754GluArg: 3.754 ± 0.585
4.207GluSer: 4.207 ± 0.48
2.589GluThr: 2.589 ± 0.357
4.66GluVal: 4.66 ± 0.409
1.359GluTrp: 1.359 ± 0.323
2.848GluTyr: 2.848 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
3.495PheAla: 3.495 ± 0.511
0.388PheCys: 0.388 ± 0.142
2.654PheAsp: 2.654 ± 0.492
2.33PheGlu: 2.33 ± 0.348
1.294PhePhe: 1.294 ± 0.254
2.589PheGly: 2.589 ± 0.413
0.647PheHis: 0.647 ± 0.219
2.071PheIle: 2.071 ± 0.444
2.848PheLys: 2.848 ± 0.288
2.395PheLeu: 2.395 ± 0.319
1.036PheMet: 1.036 ± 0.282
1.23PheAsn: 1.23 ± 0.218
1.294PhePro: 1.294 ± 0.396
2.006PheGln: 2.006 ± 0.353
2.136PheArg: 2.136 ± 0.333
2.524PheSer: 2.524 ± 0.457
2.46PheThr: 2.46 ± 0.385
2.46PheVal: 2.46 ± 0.383
0.453PheTrp: 0.453 ± 0.173
1.424PheTyr: 1.424 ± 0.286
0.0PheXaa: 0.0 ± 0.0
Gly
5.566GlyAla: 5.566 ± 0.746
0.777GlyCys: 0.777 ± 0.253
4.725GlyAsp: 4.725 ± 0.704
4.66GlyGlu: 4.66 ± 0.533
3.172GlyPhe: 3.172 ± 0.536
6.926GlyGly: 6.926 ± 1.374
0.841GlyHis: 0.841 ± 0.206
4.725GlyIle: 4.725 ± 0.634
4.401GlyLys: 4.401 ± 0.63
4.984GlyLeu: 4.984 ± 0.548
1.812GlyMet: 1.812 ± 0.355
3.625GlyAsn: 3.625 ± 0.623
4.078GlyPro: 4.078 ± 2.893
2.524GlyGln: 2.524 ± 0.425
3.172GlyArg: 3.172 ± 0.4
5.437GlySer: 5.437 ± 0.68
5.178GlyThr: 5.178 ± 0.697
5.307GlyVal: 5.307 ± 0.824
1.165GlyTrp: 1.165 ± 0.274
3.236GlyTyr: 3.236 ± 0.484
0.0GlyXaa: 0.0 ± 0.0
His
0.518HisAla: 0.518 ± 0.193
0.388HisCys: 0.388 ± 0.155
0.777HisAsp: 0.777 ± 0.226
1.036HisGlu: 1.036 ± 0.237
0.712HisPhe: 0.712 ± 0.144
0.777HisGly: 0.777 ± 0.268
0.259HisHis: 0.259 ± 0.116
0.777HisIle: 0.777 ± 0.229
1.359HisLys: 1.359 ± 0.27
0.906HisLeu: 0.906 ± 0.252
0.518HisMet: 0.518 ± 0.161
0.906HisAsn: 0.906 ± 0.223
0.841HisPro: 0.841 ± 0.368
0.647HisGln: 0.647 ± 0.192
0.647HisArg: 0.647 ± 0.18
1.359HisSer: 1.359 ± 0.31
1.036HisThr: 1.036 ± 0.232
1.036HisVal: 1.036 ± 0.304
0.259HisTrp: 0.259 ± 0.136
0.518HisTyr: 0.518 ± 0.193
0.0HisXaa: 0.0 ± 0.0
Ile
3.495IleAla: 3.495 ± 0.599
0.518IleCys: 0.518 ± 0.206
3.689IleAsp: 3.689 ± 0.428
3.495IleGlu: 3.495 ± 0.473
1.489IlePhe: 1.489 ± 0.341
4.337IleGly: 4.337 ± 0.551
0.583IleHis: 0.583 ± 0.208
2.977IleIle: 2.977 ± 0.406
4.272IleLys: 4.272 ± 0.53
3.43IleLeu: 3.43 ± 0.49
1.359IleMet: 1.359 ± 0.263
4.466IleAsn: 4.466 ± 0.637
1.748IlePro: 1.748 ± 0.295
2.848IleGln: 2.848 ± 0.514
3.625IleArg: 3.625 ± 0.446
3.689IleSer: 3.689 ± 0.485
3.495IleThr: 3.495 ± 0.574
4.207IleVal: 4.207 ± 0.501
0.388IleTrp: 0.388 ± 0.155
2.265IleTyr: 2.265 ± 0.384
0.0IleXaa: 0.0 ± 0.0
Lys
5.825LysAla: 5.825 ± 0.679
0.518LysCys: 0.518 ± 0.215
3.754LysAsp: 3.754 ± 0.623
5.178LysGlu: 5.178 ± 0.631
2.783LysPhe: 2.783 ± 0.443
5.113LysGly: 5.113 ± 0.652
0.712LysHis: 0.712 ± 0.22
3.819LysIle: 3.819 ± 0.496
2.46LysLys: 2.46 ± 0.526
4.66LysLeu: 4.66 ± 0.44
1.618LysMet: 1.618 ± 0.381
1.877LysAsn: 1.877 ± 0.43
2.071LysPro: 2.071 ± 0.448
2.265LysGln: 2.265 ± 0.425
3.625LysArg: 3.625 ± 0.551
3.56LysSer: 3.56 ± 0.39
3.689LysThr: 3.689 ± 0.446
5.566LysVal: 5.566 ± 0.55
0.906LysTrp: 0.906 ± 0.238
2.718LysTyr: 2.718 ± 0.445
0.0LysXaa: 0.0 ± 0.0
Leu
6.537LeuAla: 6.537 ± 0.654
0.971LeuCys: 0.971 ± 0.336
4.854LeuAsp: 4.854 ± 0.569
6.214LeuGlu: 6.214 ± 0.544
2.654LeuPhe: 2.654 ± 0.383
4.272LeuGly: 4.272 ± 0.5
1.294LeuHis: 1.294 ± 0.244
4.595LeuIle: 4.595 ± 0.569
4.401LeuLys: 4.401 ± 0.471
6.019LeuLeu: 6.019 ± 0.623
2.783LeuMet: 2.783 ± 0.373
3.625LeuAsn: 3.625 ± 0.389
2.524LeuPro: 2.524 ± 0.399
2.913LeuGln: 2.913 ± 0.394
4.142LeuArg: 4.142 ± 0.484
4.466LeuSer: 4.466 ± 0.488
5.437LeuThr: 5.437 ± 0.558
4.078LeuVal: 4.078 ± 0.481
1.165LeuTrp: 1.165 ± 0.287
2.136LeuTyr: 2.136 ± 0.343
0.0LeuXaa: 0.0 ± 0.0
Met
2.783MetAla: 2.783 ± 0.436
0.324MetCys: 0.324 ± 0.149
1.618MetAsp: 1.618 ± 0.319
2.006MetGlu: 2.006 ± 0.448
0.906MetPhe: 0.906 ± 0.194
2.395MetGly: 2.395 ± 0.419
0.324MetHis: 0.324 ± 0.15
0.971MetIle: 0.971 ± 0.243
2.201MetLys: 2.201 ± 0.396
2.395MetLeu: 2.395 ± 0.415
1.1MetMet: 1.1 ± 0.286
1.036MetAsn: 1.036 ± 0.252
1.036MetPro: 1.036 ± 0.281
0.841MetGln: 0.841 ± 0.242
1.165MetArg: 1.165 ± 0.277
2.136MetSer: 2.136 ± 0.404
2.201MetThr: 2.201 ± 0.409
1.812MetVal: 1.812 ± 0.316
0.388MetTrp: 0.388 ± 0.191
1.23MetTyr: 1.23 ± 0.281
0.0MetXaa: 0.0 ± 0.0
Asn
3.366AsnAla: 3.366 ± 0.429
0.583AsnCys: 0.583 ± 0.252
2.395AsnAsp: 2.395 ± 0.467
2.136AsnGlu: 2.136 ± 0.3
1.424AsnPhe: 1.424 ± 0.294
3.689AsnGly: 3.689 ± 0.676
0.841AsnHis: 0.841 ± 0.249
2.848AsnIle: 2.848 ± 0.404
3.107AsnLys: 3.107 ± 0.359
4.919AsnLeu: 4.919 ± 0.572
1.489AsnMet: 1.489 ± 0.372
2.201AsnAsn: 2.201 ± 0.429
2.718AsnPro: 2.718 ± 0.346
2.395AsnGln: 2.395 ± 0.306
2.006AsnArg: 2.006 ± 0.346
2.977AsnSer: 2.977 ± 0.421
2.848AsnThr: 2.848 ± 0.396
4.142AsnVal: 4.142 ± 0.527
0.777AsnTrp: 0.777 ± 0.244
1.877AsnTyr: 1.877 ± 0.324
0.0AsnXaa: 0.0 ± 0.0
Pro
2.718ProAla: 2.718 ± 0.41
0.259ProCys: 0.259 ± 0.108
2.654ProAsp: 2.654 ± 0.387
4.078ProGlu: 4.078 ± 0.541
1.618ProPhe: 1.618 ± 0.452
0.324ProGly: 0.324 ± 0.143
0.777ProHis: 0.777 ± 0.266
2.265ProIle: 2.265 ± 0.4
2.265ProLys: 2.265 ± 0.377
2.524ProLeu: 2.524 ± 0.353
0.906ProMet: 0.906 ± 0.221
2.718ProAsn: 2.718 ± 0.397
0.712ProPro: 0.712 ± 0.224
2.848ProGln: 2.848 ± 1.466
1.748ProArg: 1.748 ± 0.413
2.395ProSer: 2.395 ± 0.457
4.078ProThr: 4.078 ± 0.796
2.718ProVal: 2.718 ± 0.372
0.583ProTrp: 0.583 ± 0.182
0.841ProTyr: 0.841 ± 0.246
0.0ProXaa: 0.0 ± 0.0
Gln
3.883GlnAla: 3.883 ± 0.786
0.453GlnCys: 0.453 ± 0.175
2.201GlnAsp: 2.201 ± 0.348
2.071GlnGlu: 2.071 ± 0.391
1.877GlnPhe: 1.877 ± 0.325
4.725GlnGly: 4.725 ± 2.493
0.712GlnHis: 0.712 ± 0.172
2.33GlnIle: 2.33 ± 0.488
1.877GlnLys: 1.877 ± 0.314
3.625GlnLeu: 3.625 ± 0.558
1.23GlnMet: 1.23 ± 0.242
1.812GlnAsn: 1.812 ± 0.359
1.165GlnPro: 1.165 ± 0.314
1.942GlnGln: 1.942 ± 0.438
1.942GlnArg: 1.942 ± 0.309
2.395GlnSer: 2.395 ± 0.456
2.201GlnThr: 2.201 ± 0.479
2.589GlnVal: 2.589 ± 0.307
0.518GlnTrp: 0.518 ± 0.194
1.877GlnTyr: 1.877 ± 0.345
0.0GlnXaa: 0.0 ± 0.0
Arg
3.43ArgAla: 3.43 ± 0.535
0.453ArgCys: 0.453 ± 0.172
3.366ArgAsp: 3.366 ± 0.389
4.272ArgGlu: 4.272 ± 0.585
1.748ArgPhe: 1.748 ± 0.309
3.754ArgGly: 3.754 ± 0.48
0.712ArgHis: 0.712 ± 0.157
3.366ArgIle: 3.366 ± 0.568
3.43ArgLys: 3.43 ± 0.509
3.301ArgLeu: 3.301 ± 0.47
1.877ArgMet: 1.877 ± 0.346
2.783ArgAsn: 2.783 ± 0.433
1.489ArgPro: 1.489 ± 0.301
2.395ArgGln: 2.395 ± 0.418
2.848ArgArg: 2.848 ± 0.543
2.201ArgSer: 2.201 ± 0.357
2.783ArgThr: 2.783 ± 0.363
3.43ArgVal: 3.43 ± 0.464
1.036ArgTrp: 1.036 ± 0.196
1.877ArgTyr: 1.877 ± 0.325
0.0ArgXaa: 0.0 ± 0.0
Ser
4.725SerAla: 4.725 ± 0.617
0.583SerCys: 0.583 ± 0.21
4.531SerAsp: 4.531 ± 0.742
3.042SerGlu: 3.042 ± 0.367
2.265SerPhe: 2.265 ± 0.458
5.307SerGly: 5.307 ± 0.729
0.647SerHis: 0.647 ± 0.218
3.107SerIle: 3.107 ± 0.413
4.595SerLys: 4.595 ± 0.658
4.79SerLeu: 4.79 ± 0.572
1.359SerMet: 1.359 ± 0.262
3.042SerAsn: 3.042 ± 0.431
3.625SerPro: 3.625 ± 0.452
2.913SerGln: 2.913 ± 0.421
3.301SerArg: 3.301 ± 0.449
4.919SerSer: 4.919 ± 0.755
4.466SerThr: 4.466 ± 0.607
4.013SerVal: 4.013 ± 0.601
1.165SerTrp: 1.165 ± 0.231
1.812SerTyr: 1.812 ± 0.346
0.0SerXaa: 0.0 ± 0.0
Thr
4.919ThrAla: 4.919 ± 0.608
0.259ThrCys: 0.259 ± 0.121
2.977ThrAsp: 2.977 ± 0.434
3.495ThrGlu: 3.495 ± 0.445
2.46ThrPhe: 2.46 ± 0.353
5.955ThrGly: 5.955 ± 0.854
1.036ThrHis: 1.036 ± 0.287
3.883ThrIle: 3.883 ± 0.493
3.948ThrLys: 3.948 ± 0.439
5.113ThrLeu: 5.113 ± 0.482
1.294ThrMet: 1.294 ± 0.305
3.56ThrAsn: 3.56 ± 0.543
4.142ThrPro: 4.142 ± 0.476
2.071ThrGln: 2.071 ± 0.343
2.654ThrArg: 2.654 ± 0.371
4.466ThrSer: 4.466 ± 0.542
5.178ThrThr: 5.178 ± 0.595
3.883ThrVal: 3.883 ± 0.473
0.841ThrTrp: 0.841 ± 0.26
2.46ThrTyr: 2.46 ± 0.363
0.0ThrXaa: 0.0 ± 0.0
Val
5.178ValAla: 5.178 ± 0.737
0.647ValCys: 0.647 ± 0.199
5.178ValAsp: 5.178 ± 0.683
4.013ValGlu: 4.013 ± 0.447
2.718ValPhe: 2.718 ± 0.505
5.437ValGly: 5.437 ± 0.876
1.23ValHis: 1.23 ± 0.311
3.689ValIle: 3.689 ± 0.413
4.79ValLys: 4.79 ± 0.528
5.825ValLeu: 5.825 ± 0.579
2.071ValMet: 2.071 ± 0.303
2.913ValAsn: 2.913 ± 0.484
2.654ValPro: 2.654 ± 0.345
2.33ValGln: 2.33 ± 0.4
3.689ValArg: 3.689 ± 0.484
4.919ValSer: 4.919 ± 0.55
4.142ValThr: 4.142 ± 0.453
5.243ValVal: 5.243 ± 0.613
1.23ValTrp: 1.23 ± 0.271
2.265ValTyr: 2.265 ± 0.351
0.0ValXaa: 0.0 ± 0.0
Trp
1.165TrpAla: 1.165 ± 0.261
0.129TrpCys: 0.129 ± 0.093
0.777TrpAsp: 0.777 ± 0.17
0.971TrpGlu: 0.971 ± 0.244
1.036TrpPhe: 1.036 ± 0.256
0.841TrpGly: 0.841 ± 0.208
0.259TrpHis: 0.259 ± 0.154
0.453TrpIle: 0.453 ± 0.149
1.359TrpLys: 1.359 ± 0.351
1.23TrpLeu: 1.23 ± 0.283
0.647TrpMet: 0.647 ± 0.188
0.518TrpAsn: 0.518 ± 0.211
0.453TrpPro: 0.453 ± 0.171
1.1TrpGln: 1.1 ± 0.207
0.388TrpArg: 0.388 ± 0.161
1.165TrpSer: 1.165 ± 0.282
0.777TrpThr: 0.777 ± 0.221
0.906TrpVal: 0.906 ± 0.187
0.453TrpTrp: 0.453 ± 0.207
0.583TrpTyr: 0.583 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.265TyrAla: 2.265 ± 0.302
0.453TyrCys: 0.453 ± 0.15
2.524TyrAsp: 2.524 ± 0.401
2.718TyrGlu: 2.718 ± 0.4
1.359TyrPhe: 1.359 ± 0.307
2.913TyrGly: 2.913 ± 0.369
0.712TyrHis: 0.712 ± 0.176
1.942TyrIle: 1.942 ± 0.443
1.942TyrLys: 1.942 ± 0.511
3.236TyrLeu: 3.236 ± 0.509
0.712TyrMet: 0.712 ± 0.244
2.33TyrAsn: 2.33 ± 0.307
1.618TyrPro: 1.618 ± 0.358
1.036TyrGln: 1.036 ± 0.242
2.46TyrArg: 2.46 ± 0.343
2.718TyrSer: 2.718 ± 0.348
1.618TyrThr: 1.618 ± 0.29
3.625TyrVal: 3.625 ± 0.458
0.388TyrTrp: 0.388 ± 0.171
1.165TyrTyr: 1.165 ± 0.255
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (15451 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski