Amino acid dipeptide frequency for Vibrio phage LP.1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
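The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function name `dipeptide_frequencies`, the use of one-letter codes with `X` standing for Xaa, and the normalisation by the total number of dipeptides are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; 'X' plays the role of Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: frequencies are normalised by the total number of
    overlapping dipeptides; the database may normalise differently.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown letter to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs never span two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input: the pairs are AA, AC, AC and CX, so AC makes up half of them.
freqs = dipeptide_frequencies(["AAC", "ACX"])
print(freqs["AC"])  # 500.0
```

Pairs are counted with overlap inside each protein, so a protein of length n contributes n - 1 dipeptides.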

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
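As a small worked example of the unit conversion: a uniform frequency of 0.25% corresponds to 2.5‰, so a table value can be compared against 2.5 directly. The helper names below are hypothetical, and using the uniform 2.5‰ value as the threshold for over- and underrepresentation is an assumption; the page does not state exactly how its expected values are derived.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla (7.128) and CysHis (0.072).
print(classify(7.128))  # over
print(classify(0.072))  # under
```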

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.128 ± 0.914
AlaCys: 0.864 ± 0.244
AlaAsp: 4.824 ± 0.57
AlaGlu: 5.688 ± 0.836
AlaPhe: 2.736 ± 0.381
AlaGly: 5.904 ± 0.699
AlaHis: 1.296 ± 0.321
AlaIle: 5.256 ± 0.628
AlaLys: 4.68 ± 0.719
AlaLeu: 7.416 ± 0.717
AlaMet: 2.88 ± 0.552
AlaAsn: 4.104 ± 0.383
AlaPro: 2.664 ± 0.418
AlaGln: 3.384 ± 0.688
AlaArg: 4.32 ± 0.721
AlaSer: 4.608 ± 0.639
AlaThr: 5.4 ± 0.847
AlaVal: 5.184 ± 0.714
AlaTrp: 0.648 ± 0.204
AlaTyr: 3.384 ± 0.537
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.08 ± 0.33
CysCys: 0.36 ± 0.197
CysAsp: 1.08 ± 0.292
CysGlu: 0.648 ± 0.228
CysPhe: 0.648 ± 0.227
CysGly: 1.08 ± 0.266
CysHis: 0.072 ± 0.076
CysIle: 0.576 ± 0.217
CysLys: 0.864 ± 0.272
CysLeu: 1.224 ± 0.297
CysMet: 0.36 ± 0.181
CysAsn: 0.72 ± 0.233
CysPro: 0.72 ± 0.197
CysGln: 0.432 ± 0.157
CysArg: 0.648 ± 0.21
CysSer: 0.576 ± 0.198
CysThr: 1.008 ± 0.247
CysVal: 0.792 ± 0.239
CysTrp: 0.288 ± 0.143
CysTyr: 0.288 ± 0.146
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.472 ± 0.579
AspCys: 0.936 ± 0.3
AspAsp: 4.248 ± 0.494
AspGlu: 5.328 ± 0.701
AspPhe: 2.52 ± 0.439
AspGly: 5.76 ± 0.738
AspHis: 1.368 ± 0.283
AspIle: 3.456 ± 0.594
AspLys: 3.6 ± 0.579
AspLeu: 6.768 ± 0.678
AspMet: 1.944 ± 0.337
AspAsn: 3.168 ± 0.402
AspPro: 2.232 ± 0.59
AspGln: 2.088 ± 0.324
AspArg: 2.88 ± 0.414
AspSer: 3.6 ± 0.571
AspThr: 3.888 ± 0.641
AspVal: 4.32 ± 0.589
AspTrp: 0.936 ± 0.27
AspTyr: 2.448 ± 0.424
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.608 ± 0.564
GluCys: 1.152 ± 0.281
GluAsp: 4.536 ± 0.675
GluGlu: 3.744 ± 0.623
GluPhe: 3.168 ± 0.499
GluGly: 4.536 ± 0.711
GluHis: 0.864 ± 0.264
GluIle: 4.464 ± 0.725
GluLys: 4.824 ± 0.841
GluLeu: 6.912 ± 0.688
GluMet: 2.664 ± 0.408
GluAsn: 2.664 ± 0.485
GluPro: 2.016 ± 0.391
GluGln: 2.232 ± 0.534
GluArg: 3.888 ± 0.468
GluSer: 4.248 ± 0.625
GluThr: 3.744 ± 0.487
GluVal: 5.112 ± 0.701
GluTrp: 1.584 ± 0.402
GluTyr: 3.672 ± 0.52
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.736 ± 0.413
PheCys: 0.792 ± 0.328
PheAsp: 3.6 ± 0.46
PheGlu: 2.232 ± 0.348
PhePhe: 0.864 ± 0.231
PheGly: 2.592 ± 0.504
PheHis: 0.288 ± 0.131
PheIle: 2.592 ± 0.431
PheLys: 2.088 ± 0.355
PheLeu: 2.088 ± 0.339
PheMet: 0.432 ± 0.147
PheAsn: 2.376 ± 0.381
PhePro: 1.224 ± 0.281
PheGln: 1.08 ± 0.269
PheArg: 1.584 ± 0.283
PheSer: 1.656 ± 0.26
PheThr: 2.808 ± 0.471
PheVal: 2.592 ± 0.369
PheTrp: 1.008 ± 0.281
PheTyr: 0.648 ± 0.195
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.408 ± 0.816
GlyCys: 1.08 ± 0.223
GlyAsp: 5.184 ± 0.747
GlyGlu: 5.688 ± 0.534
GlyPhe: 2.448 ± 0.387
GlyGly: 6.048 ± 0.872
GlyHis: 1.224 ± 0.379
GlyIle: 3.744 ± 0.633
GlyLys: 4.392 ± 0.535
GlyLeu: 5.544 ± 0.701
GlyMet: 2.232 ± 0.56
GlyAsn: 2.88 ± 0.491
GlyPro: 1.368 ± 0.355
GlyGln: 2.664 ± 0.434
GlyArg: 2.952 ± 0.358
GlySer: 4.104 ± 0.542
GlyThr: 4.32 ± 0.796
GlyVal: 6.696 ± 0.78
GlyTrp: 1.008 ± 0.316
GlyTyr: 3.096 ± 0.439
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.368 ± 0.313
HisCys: 0.144 ± 0.111
HisAsp: 1.224 ± 0.307
HisGlu: 1.224 ± 0.257
HisPhe: 0.648 ± 0.269
HisGly: 1.512 ± 0.317
HisHis: 0.36 ± 0.135
HisIle: 1.584 ± 0.404
HisLys: 0.864 ± 0.325
HisLeu: 1.152 ± 0.308
HisMet: 0.504 ± 0.173
HisAsn: 1.44 ± 0.328
HisPro: 0.936 ± 0.277
HisGln: 0.648 ± 0.174
HisArg: 0.648 ± 0.183
HisSer: 0.936 ± 0.301
HisThr: 0.72 ± 0.179
HisVal: 1.08 ± 0.359
HisTrp: 0.36 ± 0.14
HisTyr: 0.576 ± 0.252
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.256 ± 0.601
IleCys: 0.576 ± 0.2
IleAsp: 4.608 ± 0.598
IleGlu: 4.968 ± 0.552
IlePhe: 0.792 ± 0.215
IleGly: 4.176 ± 0.498
IleHis: 0.864 ± 0.222
IleIle: 2.88 ± 0.433
IleLys: 4.608 ± 0.672
IleLeu: 3.024 ± 0.549
IleMet: 1.296 ± 0.265
IleAsn: 3.456 ± 0.389
IlePro: 1.944 ± 0.32
IleGln: 1.872 ± 0.415
IleArg: 2.376 ± 0.48
IleSer: 3.672 ± 0.617
IleThr: 4.536 ± 0.589
IleVal: 4.392 ± 0.451
IleTrp: 0.648 ± 0.21
IleTyr: 2.016 ± 0.452
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.328 ± 0.682
LysCys: 0.792 ± 0.274
LysAsp: 2.448 ± 0.394
LysGlu: 4.68 ± 0.704
LysPhe: 1.944 ± 0.513
LysGly: 3.96 ± 0.657
LysHis: 1.512 ± 0.383
LysIle: 2.448 ± 0.379
LysLys: 2.448 ± 0.449
LysLeu: 4.824 ± 0.734
LysMet: 1.872 ± 0.397
LysAsn: 2.088 ± 0.382
LysPro: 2.304 ± 0.385
LysGln: 3.168 ± 0.528
LysArg: 3.6 ± 0.5
LysSer: 4.392 ± 0.569
LysThr: 3.6 ± 0.654
LysVal: 3.96 ± 0.499
LysTrp: 1.152 ± 0.33
LysTyr: 2.52 ± 0.665
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.192 ± 0.811
LeuCys: 0.72 ± 0.237
LeuAsp: 5.976 ± 0.564
LeuGlu: 5.688 ± 0.777
LeuPhe: 2.304 ± 0.343
LeuGly: 5.832 ± 0.526
LeuHis: 1.944 ± 0.38
LeuIle: 4.176 ± 0.515
LeuLys: 4.464 ± 0.632
LeuLeu: 4.824 ± 0.669
LeuMet: 2.808 ± 0.468
LeuAsn: 4.752 ± 0.513
LeuPro: 3.168 ± 0.545
LeuGln: 2.664 ± 0.443
LeuArg: 2.808 ± 0.49
LeuSer: 5.112 ± 0.618
LeuThr: 5.472 ± 0.645
LeuVal: 4.752 ± 0.684
LeuTrp: 1.008 ± 0.251
LeuTyr: 2.952 ± 0.39
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.592 ± 0.413
MetCys: 0.216 ± 0.117
MetAsp: 1.728 ± 0.438
MetGlu: 1.656 ± 0.32
MetPhe: 1.08 ± 0.284
MetGly: 1.512 ± 0.312
MetHis: 0.648 ± 0.175
MetIle: 2.088 ± 0.384
MetLys: 2.16 ± 0.464
MetLeu: 2.088 ± 0.368
MetMet: 1.08 ± 0.274
MetAsn: 1.584 ± 0.335
MetPro: 1.296 ± 0.302
MetGln: 1.08 ± 0.274
MetArg: 1.584 ± 0.289
MetSer: 2.376 ± 0.467
MetThr: 2.16 ± 0.428
MetVal: 1.512 ± 0.378
MetTrp: 0.144 ± 0.105
MetTyr: 0.864 ± 0.253
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.04 ± 0.643
AsnCys: 0.792 ± 0.211
AsnAsp: 3.096 ± 0.363
AsnGlu: 3.168 ± 0.455
AsnPhe: 0.864 ± 0.247
AsnGly: 5.184 ± 0.78
AsnHis: 0.936 ± 0.227
AsnIle: 2.376 ± 0.363
AsnLys: 2.952 ± 0.558
AsnLeu: 3.528 ± 0.552
AsnMet: 1.224 ± 0.277
AsnAsn: 2.88 ± 0.528
AsnPro: 2.592 ± 0.317
AsnGln: 2.376 ± 0.337
AsnArg: 2.448 ± 0.479
AsnSer: 3.168 ± 0.476
AsnThr: 2.952 ± 0.569
AsnVal: 3.96 ± 0.626
AsnTrp: 0.72 ± 0.213
AsnTyr: 1.512 ± 0.302
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.88 ± 0.405
ProCys: 0.792 ± 0.263
ProAsp: 3.312 ± 0.468
ProGlu: 2.736 ± 0.341
ProPhe: 1.224 ± 0.385
ProGly: 1.872 ± 0.303
ProHis: 0.72 ± 0.241
ProIle: 2.592 ± 0.408
ProLys: 2.232 ± 0.389
ProLeu: 2.376 ± 0.538
ProMet: 0.936 ± 0.274
ProAsn: 2.376 ± 0.742
ProPro: 1.08 ± 0.216
ProGln: 1.656 ± 0.358
ProArg: 1.728 ± 0.428
ProSer: 2.088 ± 0.414
ProThr: 2.088 ± 0.43
ProVal: 2.592 ± 0.389
ProTrp: 0.432 ± 0.148
ProTyr: 1.368 ± 0.347
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.6 ± 0.682
GlnCys: 0.72 ± 0.193
GlnAsp: 1.44 ± 0.232
GlnGlu: 2.448 ± 0.447
GlnPhe: 1.8 ± 0.358
GlnGly: 2.088 ± 0.441
GlnHis: 0.504 ± 0.172
GlnIle: 3.024 ± 0.523
GlnLys: 1.8 ± 0.317
GlnLeu: 3.456 ± 0.649
GlnMet: 0.864 ± 0.25
GlnAsn: 1.584 ± 0.313
GlnPro: 1.152 ± 0.25
GlnGln: 1.728 ± 0.492
GlnArg: 1.728 ± 0.275
GlnSer: 1.944 ± 0.443
GlnThr: 2.448 ± 0.368
GlnVal: 3.24 ± 0.42
GlnTrp: 0.72 ± 0.256
GlnTyr: 1.008 ± 0.187
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.744 ± 0.511
ArgCys: 0.504 ± 0.195
ArgAsp: 3.528 ± 0.507
ArgGlu: 4.104 ± 0.541
ArgPhe: 1.944 ± 0.316
ArgGly: 2.952 ± 0.508
ArgHis: 0.792 ± 0.222
ArgIle: 2.808 ± 0.447
ArgLys: 2.52 ± 0.532
ArgLeu: 3.6 ± 0.436
ArgMet: 1.368 ± 0.405
ArgAsn: 2.088 ± 0.444
ArgPro: 1.944 ± 0.308
ArgGln: 1.656 ± 0.348
ArgArg: 3.096 ± 0.533
ArgSer: 2.016 ± 0.361
ArgThr: 2.664 ± 0.462
ArgVal: 2.952 ± 0.517
ArgTrp: 0.936 ± 0.264
ArgTyr: 2.016 ± 0.448
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.968 ± 0.897
SerCys: 0.432 ± 0.165
SerAsp: 4.608 ± 0.655
SerGlu: 3.528 ± 0.605
SerPhe: 2.952 ± 0.35
SerGly: 5.112 ± 0.718
SerHis: 0.792 ± 0.276
SerIle: 3.168 ± 0.581
SerLys: 3.816 ± 0.79
SerLeu: 4.32 ± 0.446
SerMet: 1.8 ± 0.374
SerAsn: 3.312 ± 0.451
SerPro: 2.088 ± 0.284
SerGln: 2.16 ± 0.385
SerArg: 3.384 ± 0.558
SerSer: 3.744 ± 0.54
SerThr: 2.808 ± 0.428
SerVal: 4.176 ± 0.677
SerTrp: 0.72 ± 0.267
SerTyr: 1.944 ± 0.387
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.32 ± 0.584
ThrCys: 0.792 ± 0.249
ThrAsp: 2.88 ± 0.502
ThrGlu: 3.312 ± 0.385
ThrPhe: 2.736 ± 0.523
ThrGly: 5.976 ± 0.73
ThrHis: 1.368 ± 0.302
ThrIle: 3.456 ± 0.713
ThrLys: 4.32 ± 0.754
ThrLeu: 5.256 ± 0.778
ThrMet: 1.512 ± 0.359
ThrAsn: 3.168 ± 0.541
ThrPro: 2.952 ± 0.512
ThrGln: 2.52 ± 0.441
ThrArg: 2.088 ± 0.355
ThrSer: 3.744 ± 0.592
ThrThr: 3.744 ± 0.628
ThrVal: 4.968 ± 0.565
ThrTrp: 1.224 ± 0.316
ThrTyr: 2.088 ± 0.438
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.688 ± 0.494
ValCys: 0.576 ± 0.2
ValAsp: 5.04 ± 0.611
ValGlu: 6.048 ± 0.891
ValPhe: 2.232 ± 0.347
ValGly: 4.32 ± 0.753
ValHis: 0.936 ± 0.299
ValIle: 4.104 ± 0.607
ValLys: 3.168 ± 0.488
ValLeu: 5.112 ± 0.566
ValMet: 1.656 ± 0.42
ValAsn: 4.536 ± 0.546
ValPro: 2.808 ± 0.486
ValGln: 2.16 ± 0.28
ValArg: 3.096 ± 0.483
ValSer: 4.824 ± 0.854
ValThr: 5.04 ± 0.514
ValVal: 3.528 ± 0.448
ValTrp: 1.008 ± 0.299
ValTyr: 2.592 ± 0.426
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.152 ± 0.392
TrpCys: 0.36 ± 0.145
TrpAsp: 1.008 ± 0.32
TrpGlu: 1.296 ± 0.257
TrpPhe: 0.72 ± 0.246
TrpGly: 0.648 ± 0.232
TrpHis: 0.576 ± 0.202
TrpIle: 0.792 ± 0.256
TrpLys: 0.792 ± 0.285
TrpLeu: 1.44 ± 0.304
TrpMet: 0.36 ± 0.172
TrpAsn: 0.864 ± 0.259
TrpPro: 0.648 ± 0.203
TrpGln: 0.504 ± 0.162
TrpArg: 0.72 ± 0.24
TrpSer: 0.72 ± 0.211
TrpThr: 0.864 ± 0.275
TrpVal: 1.08 ± 0.338
TrpTrp: 0.216 ± 0.134
TrpTyr: 0.504 ± 0.188
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.88 ± 0.513
TyrCys: 0.864 ± 0.257
TyrAsp: 2.592 ± 0.456
TyrGlu: 2.448 ± 0.392
TyrPhe: 1.656 ± 0.342
TyrGly: 2.016 ± 0.443
TyrHis: 0.864 ± 0.268
TyrIle: 2.376 ± 0.403
TyrLys: 2.448 ± 0.505
TyrLeu: 2.664 ± 0.394
TyrMet: 1.584 ± 0.321
TyrAsn: 1.728 ± 0.387
TyrPro: 1.944 ± 0.409
TyrGln: 1.224 ± 0.342
TyrArg: 1.656 ± 0.363
TyrSer: 2.304 ± 0.44
TyrThr: 2.16 ± 0.374
TyrVal: 1.584 ± 0.337
TyrTrp: 0.504 ± 0.16
TyrTyr: 1.296 ± 0.317
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (13889 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
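A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, and the reported error is taken to be the standard deviation of the statistic across resamples (the page does not say which spread measure is used). The names `bootstrap_sd` and `dipeptide_per_mille` are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, statistic, n_resamples=100, seed=0):
    """Standard deviation of `statistic` over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(statistic(sample))
    return statistics.stdev(estimates)


# Toy proteome; real input would be the 71 protein sequences.
toy = ["MAAK", "MKAL", "AAAA", "MLKV"]
error = bootstrap_sd(toy, lambda sample: dipeptide_per_mille(sample, "AA"))
print(f"AA: {dipeptide_per_mille(toy, 'AA'):.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.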

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski