Amino acid dipepetide frequency for Vibrio phage CHOED

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.273AlaAla: 9.273 ± 1.38
0.681AlaCys: 0.681 ± 0.217
5.029AlaAsp: 5.029 ± 0.51
4.872AlaGlu: 4.872 ± 0.685
2.41AlaPhe: 2.41 ± 0.312
6.444AlaGly: 6.444 ± 0.816
1.834AlaHis: 1.834 ± 0.387
5.029AlaIle: 5.029 ± 0.551
5.396AlaLys: 5.396 ± 0.637
8.435AlaLeu: 8.435 ± 0.622
2.41AlaMet: 2.41 ± 0.388
4.244AlaAsn: 4.244 ± 0.67
4.086AlaPro: 4.086 ± 0.453
6.025AlaGln: 6.025 ± 1.155
3.353AlaArg: 3.353 ± 0.461
5.501AlaSer: 5.501 ± 0.654
5.71AlaThr: 5.71 ± 0.571
5.71AlaVal: 5.71 ± 0.52
0.943AlaTrp: 0.943 ± 0.209
3.143AlaTyr: 3.143 ± 0.356
0.0AlaXaa: 0.0 ± 0.0
Cys
0.786CysAla: 0.786 ± 0.25
0.105CysCys: 0.105 ± 0.101
0.681CysAsp: 0.681 ± 0.213
0.681CysGlu: 0.681 ± 0.224
0.419CysPhe: 0.419 ± 0.159
0.681CysGly: 0.681 ± 0.22
0.105CysHis: 0.105 ± 0.07
0.472CysIle: 0.472 ± 0.148
0.629CysLys: 0.629 ± 0.194
0.472CysLeu: 0.472 ± 0.172
0.157CysMet: 0.157 ± 0.097
0.367CysAsn: 0.367 ± 0.137
0.367CysPro: 0.367 ± 0.166
0.419CysGln: 0.419 ± 0.188
0.524CysArg: 0.524 ± 0.187
0.786CysSer: 0.786 ± 0.231
0.262CysThr: 0.262 ± 0.113
0.419CysVal: 0.419 ± 0.145
0.105CysTrp: 0.105 ± 0.063
0.472CysTyr: 0.472 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
5.396AspAla: 5.396 ± 0.492
0.524AspCys: 0.524 ± 0.196
3.51AspAsp: 3.51 ± 0.522
4.558AspGlu: 4.558 ± 0.459
2.41AspPhe: 2.41 ± 0.326
4.191AspGly: 4.191 ± 0.527
0.838AspHis: 0.838 ± 0.216
3.824AspIle: 3.824 ± 0.483
2.934AspLys: 2.934 ± 0.332
4.296AspLeu: 4.296 ± 0.427
2.305AspMet: 2.305 ± 0.35
2.672AspAsn: 2.672 ± 0.404
2.462AspPro: 2.462 ± 0.304
1.729AspGln: 1.729 ± 0.241
2.41AspArg: 2.41 ± 0.44
4.244AspSer: 4.244 ± 0.46
3.72AspThr: 3.72 ± 0.439
5.029AspVal: 5.029 ± 0.619
1.048AspTrp: 1.048 ± 0.223
2.2AspTyr: 2.2 ± 0.306
0.0AspXaa: 0.0 ± 0.0
Glu
8.33GluAla: 8.33 ± 0.791
0.524GluCys: 0.524 ± 0.224
4.244GluAsp: 4.244 ± 0.634
4.401GluGlu: 4.401 ± 0.606
2.829GluPhe: 2.829 ± 0.342
5.029GluGly: 5.029 ± 0.459
1.257GluHis: 1.257 ± 0.251
3.615GluIle: 3.615 ± 0.46
2.777GluLys: 2.777 ± 0.406
6.339GluLeu: 6.339 ± 0.549
1.467GluMet: 1.467 ± 0.282
2.305GluAsn: 2.305 ± 0.411
2.515GluPro: 2.515 ± 0.369
2.462GluGln: 2.462 ± 0.372
3.562GluArg: 3.562 ± 0.449
3.091GluSer: 3.091 ± 0.413
3.143GluThr: 3.143 ± 0.455
5.134GluVal: 5.134 ± 0.637
0.786GluTrp: 0.786 ± 0.218
1.991GluTyr: 1.991 ± 0.39
0.0GluXaa: 0.0 ± 0.0
Phe
2.253PheAla: 2.253 ± 0.368
0.314PheCys: 0.314 ± 0.235
2.619PheAsp: 2.619 ± 0.365
1.886PheGlu: 1.886 ± 0.275
0.629PhePhe: 0.629 ± 0.191
2.462PheGly: 2.462 ± 0.323
0.943PheHis: 0.943 ± 0.228
1.834PheIle: 1.834 ± 0.31
2.253PheLys: 2.253 ± 0.329
2.567PheLeu: 2.567 ± 0.302
1.31PheMet: 1.31 ± 0.298
2.41PheAsn: 2.41 ± 0.466
1.886PhePro: 1.886 ± 0.349
0.995PheGln: 0.995 ± 0.2
1.834PheArg: 1.834 ± 0.308
1.991PheSer: 1.991 ± 0.338
2.462PheThr: 2.462 ± 0.414
1.676PheVal: 1.676 ± 0.279
0.367PheTrp: 0.367 ± 0.167
1.1PheTyr: 1.1 ± 0.234
0.0PheXaa: 0.0 ± 0.0
Gly
8.12GlyAla: 8.12 ± 0.953
0.838GlyCys: 0.838 ± 0.259
4.453GlyAsp: 4.453 ± 0.534
4.925GlyGlu: 4.925 ± 0.515
2.462GlyPhe: 2.462 ± 0.372
6.444GlyGly: 6.444 ± 1.665
1.467GlyHis: 1.467 ± 0.282
4.401GlyIle: 4.401 ± 0.476
5.291GlyLys: 5.291 ± 0.593
5.658GlyLeu: 5.658 ± 0.5
2.358GlyMet: 2.358 ± 0.382
3.982GlyAsn: 3.982 ± 0.52
2.2GlyPro: 2.2 ± 0.316
2.777GlyGln: 2.777 ± 0.414
2.724GlyArg: 2.724 ± 0.399
4.558GlySer: 4.558 ± 0.809
5.868GlyThr: 5.868 ± 0.638
5.187GlyVal: 5.187 ± 0.612
0.733GlyTrp: 0.733 ± 0.262
3.824GlyTyr: 3.824 ± 0.463
0.0GlyXaa: 0.0 ± 0.0
His
1.1HisAla: 1.1 ± 0.244
0.314HisCys: 0.314 ± 0.13
0.995HisAsp: 0.995 ± 0.288
1.467HisGlu: 1.467 ± 0.286
0.419HisPhe: 0.419 ± 0.174
1.624HisGly: 1.624 ± 0.391
0.419HisHis: 0.419 ± 0.181
0.786HisIle: 0.786 ± 0.202
1.1HisLys: 1.1 ± 0.268
1.886HisLeu: 1.886 ± 0.403
0.576HisMet: 0.576 ± 0.169
0.786HisAsn: 0.786 ± 0.266
1.153HisPro: 1.153 ± 0.255
0.367HisGln: 0.367 ± 0.136
1.048HisArg: 1.048 ± 0.22
1.048HisSer: 1.048 ± 0.267
0.943HisThr: 0.943 ± 0.227
1.415HisVal: 1.415 ± 0.291
0.524HisTrp: 0.524 ± 0.194
0.733HisTyr: 0.733 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
4.244IleAla: 4.244 ± 0.457
0.629IleCys: 0.629 ± 0.177
3.667IleAsp: 3.667 ± 0.427
3.615IleGlu: 3.615 ± 0.415
1.624IlePhe: 1.624 ± 0.303
3.615IleGly: 3.615 ± 0.56
1.624IleHis: 1.624 ± 0.304
2.934IleIle: 2.934 ± 0.441
4.086IleLys: 4.086 ± 0.475
4.086IleLeu: 4.086 ± 0.451
1.31IleMet: 1.31 ± 0.274
3.143IleAsn: 3.143 ± 0.505
3.353IlePro: 3.353 ± 0.45
2.253IleGln: 2.253 ± 0.271
2.358IleArg: 2.358 ± 0.355
3.51IleSer: 3.51 ± 0.473
2.881IleThr: 2.881 ± 0.422
3.562IleVal: 3.562 ± 0.46
0.786IleTrp: 0.786 ± 0.177
2.096IleTyr: 2.096 ± 0.382
0.0IleXaa: 0.0 ± 0.0
Lys
6.653LysAla: 6.653 ± 0.482
0.629LysCys: 0.629 ± 0.186
3.196LysAsp: 3.196 ± 0.383
4.925LysGlu: 4.925 ± 0.537
1.938LysPhe: 1.938 ± 0.297
5.187LysGly: 5.187 ± 0.572
0.995LysHis: 0.995 ± 0.239
2.41LysIle: 2.41 ± 0.329
2.358LysLys: 2.358 ± 0.32
4.872LysLeu: 4.872 ± 0.562
2.043LysMet: 2.043 ± 0.353
2.096LysAsn: 2.096 ± 0.32
2.724LysPro: 2.724 ± 0.379
2.934LysGln: 2.934 ± 0.474
2.567LysArg: 2.567 ± 0.371
2.934LysSer: 2.934 ± 0.437
3.877LysThr: 3.877 ± 0.444
4.453LysVal: 4.453 ± 0.431
0.786LysTrp: 0.786 ± 0.197
1.938LysTyr: 1.938 ± 0.272
0.0LysXaa: 0.0 ± 0.0
Leu
6.758LeuAla: 6.758 ± 0.684
0.733LeuCys: 0.733 ± 0.209
4.558LeuAsp: 4.558 ± 0.516
6.444LeuGlu: 6.444 ± 0.602
2.829LeuPhe: 2.829 ± 0.442
6.496LeuGly: 6.496 ± 0.515
1.153LeuHis: 1.153 ± 0.288
3.982LeuIle: 3.982 ± 0.415
6.077LeuLys: 6.077 ± 0.522
5.344LeuLeu: 5.344 ± 0.614
2.672LeuMet: 2.672 ± 0.346
3.72LeuAsn: 3.72 ± 0.416
2.043LeuPro: 2.043 ± 0.32
3.405LeuGln: 3.405 ± 0.487
4.086LeuArg: 4.086 ± 0.472
5.396LeuSer: 5.396 ± 0.44
5.658LeuThr: 5.658 ± 0.465
5.448LeuVal: 5.448 ± 0.613
0.943LeuTrp: 0.943 ± 0.222
2.096LeuTyr: 2.096 ± 0.289
0.0LeuXaa: 0.0 ± 0.0
Met
3.091MetAla: 3.091 ± 0.469
0.524MetCys: 0.524 ± 0.153
1.991MetAsp: 1.991 ± 0.374
1.572MetGlu: 1.572 ± 0.385
1.048MetPhe: 1.048 ± 0.251
2.2MetGly: 2.2 ± 0.308
0.576MetHis: 0.576 ± 0.175
1.519MetIle: 1.519 ± 0.318
1.938MetLys: 1.938 ± 0.334
2.253MetLeu: 2.253 ± 0.308
0.995MetMet: 0.995 ± 0.245
1.362MetAsn: 1.362 ± 0.27
0.838MetPro: 0.838 ± 0.168
1.781MetGln: 1.781 ± 0.427
1.362MetArg: 1.362 ± 0.265
2.619MetSer: 2.619 ± 0.465
1.676MetThr: 1.676 ± 0.288
1.991MetVal: 1.991 ± 0.303
0.262MetTrp: 0.262 ± 0.124
0.629MetTyr: 0.629 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
3.405AsnAla: 3.405 ± 0.554
0.105AsnCys: 0.105 ± 0.066
2.829AsnAsp: 2.829 ± 0.381
3.143AsnGlu: 3.143 ± 0.372
1.519AsnPhe: 1.519 ± 0.257
3.458AsnGly: 3.458 ± 0.501
0.576AsnHis: 0.576 ± 0.174
3.196AsnIle: 3.196 ± 0.387
2.777AsnLys: 2.777 ± 0.412
3.615AsnLeu: 3.615 ± 0.438
1.624AsnMet: 1.624 ± 0.304
1.519AsnAsn: 1.519 ± 0.276
3.091AsnPro: 3.091 ± 0.336
2.043AsnGln: 2.043 ± 0.476
1.886AsnArg: 1.886 ± 0.345
2.881AsnSer: 2.881 ± 0.399
2.567AsnThr: 2.567 ± 0.422
3.039AsnVal: 3.039 ± 0.388
0.681AsnTrp: 0.681 ± 0.196
1.415AsnTyr: 1.415 ± 0.237
0.0AsnXaa: 0.0 ± 0.0
Pro
2.934ProAla: 2.934 ± 0.388
0.419ProCys: 0.419 ± 0.176
3.143ProAsp: 3.143 ± 0.441
3.51ProGlu: 3.51 ± 0.496
1.257ProPhe: 1.257 ± 0.282
1.991ProGly: 1.991 ± 0.371
0.891ProHis: 0.891 ± 0.256
2.096ProIle: 2.096 ± 0.395
2.829ProLys: 2.829 ± 0.459
3.196ProLeu: 3.196 ± 0.413
1.31ProMet: 1.31 ± 0.265
2.148ProAsn: 2.148 ± 0.374
0.838ProPro: 0.838 ± 0.332
1.781ProGln: 1.781 ± 0.408
1.467ProArg: 1.467 ± 0.236
2.881ProSer: 2.881 ± 0.501
3.091ProThr: 3.091 ± 0.542
3.248ProVal: 3.248 ± 0.431
0.576ProTrp: 0.576 ± 0.191
1.415ProTyr: 1.415 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
4.925GlnAla: 4.925 ± 0.963
0.262GlnCys: 0.262 ± 0.111
2.41GlnAsp: 2.41 ± 0.343
3.615GlnGlu: 3.615 ± 0.517
2.043GlnPhe: 2.043 ± 0.394
3.301GlnGly: 3.301 ± 0.538
1.1GlnHis: 1.1 ± 0.228
1.467GlnIle: 1.467 ± 0.25
1.729GlnLys: 1.729 ± 0.283
3.824GlnLeu: 3.824 ± 0.529
1.153GlnMet: 1.153 ± 0.363
1.153GlnAsn: 1.153 ± 0.539
1.205GlnPro: 1.205 ± 0.316
2.148GlnGln: 2.148 ± 0.437
1.415GlnArg: 1.415 ± 0.215
3.091GlnSer: 3.091 ± 0.517
2.515GlnThr: 2.515 ± 0.652
4.191GlnVal: 4.191 ± 0.389
0.419GlnTrp: 0.419 ± 0.167
1.467GlnTyr: 1.467 ± 0.298
0.0GlnXaa: 0.0 ± 0.0
Arg
3.143ArgAla: 3.143 ± 0.397
0.367ArgCys: 0.367 ± 0.159
2.462ArgAsp: 2.462 ± 0.341
3.143ArgGlu: 3.143 ± 0.42
1.729ArgPhe: 1.729 ± 0.265
3.405ArgGly: 3.405 ± 0.448
0.681ArgHis: 0.681 ± 0.18
2.777ArgIle: 2.777 ± 0.448
3.353ArgLys: 3.353 ± 0.473
3.353ArgLeu: 3.353 ± 0.403
1.572ArgMet: 1.572 ± 0.298
1.624ArgAsn: 1.624 ± 0.337
1.991ArgPro: 1.991 ± 0.304
1.991ArgGln: 1.991 ± 0.315
1.467ArgArg: 1.467 ± 0.314
1.991ArgSer: 1.991 ± 0.287
2.986ArgThr: 2.986 ± 0.358
3.091ArgVal: 3.091 ± 0.541
0.314ArgTrp: 0.314 ± 0.124
1.31ArgTyr: 1.31 ± 0.446
0.0ArgXaa: 0.0 ± 0.0
Ser
5.029SerAla: 5.029 ± 0.667
0.472SerCys: 0.472 ± 0.183
3.091SerAsp: 3.091 ± 0.389
2.358SerGlu: 2.358 ± 0.312
2.096SerPhe: 2.096 ± 0.31
5.658SerGly: 5.658 ± 0.815
0.838SerHis: 0.838 ± 0.298
4.296SerIle: 4.296 ± 0.48
3.929SerLys: 3.929 ± 0.482
4.977SerLeu: 4.977 ± 0.464
1.781SerMet: 1.781 ± 0.338
3.039SerAsn: 3.039 ± 0.406
2.515SerPro: 2.515 ± 0.359
3.039SerGln: 3.039 ± 0.55
2.672SerArg: 2.672 ± 0.345
4.348SerSer: 4.348 ± 0.616
4.191SerThr: 4.191 ± 0.636
4.244SerVal: 4.244 ± 0.447
0.995SerTrp: 0.995 ± 0.239
2.462SerTyr: 2.462 ± 0.38
0.0SerXaa: 0.0 ± 0.0
Thr
5.501ThrAla: 5.501 ± 0.727
0.367ThrCys: 0.367 ± 0.137
4.401ThrAsp: 4.401 ± 0.531
3.458ThrGlu: 3.458 ± 0.519
2.358ThrPhe: 2.358 ± 0.314
6.025ThrGly: 6.025 ± 0.627
1.362ThrHis: 1.362 ± 0.248
3.458ThrIle: 3.458 ± 0.512
3.248ThrLys: 3.248 ± 0.516
5.291ThrLeu: 5.291 ± 0.59
1.205ThrMet: 1.205 ± 0.228
2.567ThrAsn: 2.567 ± 0.412
3.143ThrPro: 3.143 ± 0.335
2.881ThrGln: 2.881 ± 0.473
2.672ThrArg: 2.672 ± 0.398
3.772ThrSer: 3.772 ± 0.513
4.086ThrThr: 4.086 ± 0.438
4.663ThrVal: 4.663 ± 0.411
0.786ThrTrp: 0.786 ± 0.294
2.358ThrTyr: 2.358 ± 0.384
0.0ThrXaa: 0.0 ± 0.0
Val
5.92ValAla: 5.92 ± 0.57
0.681ValCys: 0.681 ± 0.178
3.929ValAsp: 3.929 ± 0.473
4.767ValGlu: 4.767 ± 0.421
2.148ValPhe: 2.148 ± 0.383
5.92ValGly: 5.92 ± 0.611
1.205ValHis: 1.205 ± 0.27
4.663ValIle: 4.663 ± 0.53
4.348ValLys: 4.348 ± 0.527
5.239ValLeu: 5.239 ± 0.567
2.096ValMet: 2.096 ± 0.321
4.244ValAsn: 4.244 ± 0.486
3.039ValPro: 3.039 ± 0.382
2.2ValGln: 2.2 ± 0.418
2.567ValArg: 2.567 ± 0.341
4.086ValSer: 4.086 ± 0.471
4.82ValThr: 4.82 ± 0.581
6.077ValVal: 6.077 ± 0.592
1.048ValTrp: 1.048 ± 0.231
2.724ValTyr: 2.724 ± 0.39
0.0ValXaa: 0.0 ± 0.0
Trp
0.995TrpAla: 0.995 ± 0.223
0.0TrpCys: 0.0 ± 0.0
1.048TrpAsp: 1.048 ± 0.222
0.786TrpGlu: 0.786 ± 0.165
0.367TrpPhe: 0.367 ± 0.142
0.629TrpGly: 0.629 ± 0.199
0.367TrpHis: 0.367 ± 0.167
0.629TrpIle: 0.629 ± 0.195
0.891TrpLys: 0.891 ± 0.225
1.048TrpLeu: 1.048 ± 0.258
0.576TrpMet: 0.576 ± 0.149
0.733TrpAsn: 0.733 ± 0.21
0.314TrpPro: 0.314 ± 0.135
0.367TrpGln: 0.367 ± 0.164
0.733TrpArg: 0.733 ± 0.192
0.733TrpSer: 0.733 ± 0.181
0.629TrpThr: 0.629 ± 0.179
1.205TrpVal: 1.205 ± 0.25
0.472TrpTrp: 0.472 ± 0.133
0.524TrpTyr: 0.524 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.672TyrAla: 2.672 ± 0.383
0.419TyrCys: 0.419 ± 0.146
2.043TyrAsp: 2.043 ± 0.358
1.676TyrGlu: 1.676 ± 0.278
1.257TyrPhe: 1.257 ± 0.249
3.51TyrGly: 3.51 ± 0.447
0.472TyrHis: 0.472 ± 0.149
1.991TyrIle: 1.991 ± 0.341
1.624TyrLys: 1.624 ± 0.254
3.091TyrLeu: 3.091 ± 0.376
1.257TyrMet: 1.257 ± 0.262
1.257TyrAsn: 1.257 ± 0.289
1.257TyrPro: 1.257 ± 0.242
1.834TyrGln: 1.834 ± 0.256
2.043TyrArg: 2.043 ± 0.36
2.567TyrSer: 2.567 ± 0.336
2.515TyrThr: 2.515 ± 0.452
1.938TyrVal: 1.938 ± 0.413
0.472TyrTrp: 0.472 ± 0.169
1.886TyrTyr: 1.886 ± 0.326
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (19089 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski