Amino acid dipeptide frequency for Vibrio virus VP882

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
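As an illustration of the idea, here is a minimal sketch of how dipeptide frequencies could be counted. It assumes one-letter amino acid codes, overlapping adjacent pairs, no pairs spanning two proteins, and normalization by the total number of pairs counted; the function name and the toy sequences are hypothetical, and the database's own counting and normalization may differ.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and frequencies are normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real VP882 sequences).
toy = ["AAG", "AG"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```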

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
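The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and using the uniform expectation as the exact cut-off for the red and blue marking is an assumption; the page does not state the threshold it applies.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked blue
    return "expected"


expected_20 = expected_uniform_per_mille(20)   # 400 possible dipeptides
expected_21 = expected_uniform_per_mille(21)   # 441 when Xaa is included

# Two values taken from the table: AlaAla and CysCys.
print(expected_20, classify(12.057, expected_20), classify(0.218, expected_20))
```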

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
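For example, the unit conversion can be written as a short sketch (the function names are hypothetical; the AlaAla value is taken from the table):

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 12.057  # AlaAla frequency in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
```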

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
12.057AlaAla: 12.057 ± 1.469
1.453AlaCys: 1.453 ± 0.352
6.174AlaAsp: 6.174 ± 0.653
6.682AlaGlu: 6.682 ± 0.709
2.978AlaPhe: 2.978 ± 0.416
9.079AlaGly: 9.079 ± 1.173
1.525AlaHis: 1.525 ± 0.327
5.084AlaIle: 5.084 ± 0.624
6.246AlaLys: 6.246 ± 0.569
8.788AlaLeu: 8.788 ± 0.947
3.995AlaMet: 3.995 ± 0.532
4.503AlaAsn: 4.503 ± 0.577
4.14AlaPro: 4.14 ± 0.646
4.285AlaGln: 4.285 ± 0.559
5.665AlaArg: 5.665 ± 0.602
5.811AlaSer: 5.811 ± 0.729
5.302AlaThr: 5.302 ± 0.719
5.956AlaVal: 5.956 ± 0.757
1.888AlaTrp: 1.888 ± 0.429
2.106AlaTyr: 2.106 ± 0.353
0.0AlaXaa: 0.0 ± 0.0
Cys
1.089CysAla: 1.089 ± 0.318
0.218CysCys: 0.218 ± 0.118
0.872CysAsp: 0.872 ± 0.252
0.581CysGlu: 0.581 ± 0.177
0.291CysPhe: 0.291 ± 0.126
1.38CysGly: 1.38 ± 0.362
0.363CysHis: 0.363 ± 0.175
0.581CysIle: 0.581 ± 0.242
0.291CysLys: 0.291 ± 0.188
0.872CysLeu: 0.872 ± 0.229
0.654CysMet: 0.654 ± 0.234
0.218CysAsn: 0.218 ± 0.114
0.654CysPro: 0.654 ± 0.241
0.436CysGln: 0.436 ± 0.18
0.944CysArg: 0.944 ± 0.292
0.944CysSer: 0.944 ± 0.33
0.944CysThr: 0.944 ± 0.283
1.307CysVal: 1.307 ± 0.352
0.0CysTrp: 0.0 ± 0.0
0.581CysTyr: 0.581 ± 0.213
0.0CysXaa: 0.0 ± 0.0
Asp
6.537AspAla: 6.537 ± 0.751
1.017AspCys: 1.017 ± 0.425
3.995AspAsp: 3.995 ± 0.601
4.794AspGlu: 4.794 ± 0.667
2.397AspPhe: 2.397 ± 0.392
3.777AspGly: 3.777 ± 0.623
0.944AspHis: 0.944 ± 0.239
3.777AspIle: 3.777 ± 0.416
2.76AspLys: 2.76 ± 0.367
5.23AspLeu: 5.23 ± 0.558
1.453AspMet: 1.453 ± 0.251
1.816AspAsn: 1.816 ± 0.289
2.687AspPro: 2.687 ± 0.342
1.816AspGln: 1.816 ± 0.334
2.833AspArg: 2.833 ± 0.517
3.123AspSer: 3.123 ± 0.427
2.978AspThr: 2.978 ± 0.471
3.414AspVal: 3.414 ± 0.393
1.235AspTrp: 1.235 ± 0.349
2.252AspTyr: 2.252 ± 0.493
0.0AspXaa: 0.0 ± 0.0
Glu
6.174GluAla: 6.174 ± 0.695
1.017GluCys: 1.017 ± 0.241
3.051GluAsp: 3.051 ± 0.502
4.503GluGlu: 4.503 ± 0.519
2.179GluPhe: 2.179 ± 0.331
3.704GluGly: 3.704 ± 0.52
2.034GluHis: 2.034 ± 0.28
3.268GluIle: 3.268 ± 0.58
2.833GluLys: 2.833 ± 0.324
8.353GluLeu: 8.353 ± 0.749
1.743GluMet: 1.743 ± 0.315
1.598GluAsn: 1.598 ± 0.346
2.542GluPro: 2.542 ± 0.529
4.721GluGln: 4.721 ± 0.543
4.14GluArg: 4.14 ± 0.525
2.615GluSer: 2.615 ± 0.4
3.341GluThr: 3.341 ± 0.485
6.028GluVal: 6.028 ± 0.596
0.508GluTrp: 0.508 ± 0.258
1.671GluTyr: 1.671 ± 0.398
0.0GluXaa: 0.0 ± 0.0
Phe
3.123PheAla: 3.123 ± 0.55
0.363PheCys: 0.363 ± 0.147
2.106PheAsp: 2.106 ± 0.48
2.469PheGlu: 2.469 ± 0.384
1.089PhePhe: 1.089 ± 0.337
2.687PheGly: 2.687 ± 0.405
0.944PheHis: 0.944 ± 0.291
1.453PheIle: 1.453 ± 0.323
2.252PheLys: 2.252 ± 0.481
1.888PheLeu: 1.888 ± 0.381
0.581PheMet: 0.581 ± 0.232
0.944PheAsn: 0.944 ± 0.259
1.017PhePro: 1.017 ± 0.245
1.671PheGln: 1.671 ± 0.306
1.162PheArg: 1.162 ± 0.261
1.888PheSer: 1.888 ± 0.366
1.525PheThr: 1.525 ± 0.441
2.397PheVal: 2.397 ± 0.513
0.363PheTrp: 0.363 ± 0.186
1.453PheTyr: 1.453 ± 0.353
0.0PheXaa: 0.0 ± 0.0
Gly
6.537GlyAla: 6.537 ± 1.011
1.38GlyCys: 1.38 ± 0.393
4.939GlyAsp: 4.939 ± 0.735
5.23GlyGlu: 5.23 ± 0.669
2.397GlyPhe: 2.397 ± 0.4
4.648GlyGly: 4.648 ± 1.097
1.525GlyHis: 1.525 ± 0.369
3.341GlyIle: 3.341 ± 0.501
4.285GlyLys: 4.285 ± 0.524
6.101GlyLeu: 6.101 ± 0.573
1.816GlyMet: 1.816 ± 0.349
2.615GlyAsn: 2.615 ± 0.515
1.525GlyPro: 1.525 ± 0.286
3.341GlyGln: 3.341 ± 0.463
5.157GlyArg: 5.157 ± 0.659
3.922GlySer: 3.922 ± 0.409
4.576GlyThr: 4.576 ± 0.507
5.23GlyVal: 5.23 ± 0.603
1.307GlyTrp: 1.307 ± 0.239
1.888GlyTyr: 1.888 ± 0.362
0.0GlyXaa: 0.0 ± 0.0
His
2.179HisAla: 2.179 ± 0.441
0.581HisCys: 0.581 ± 0.189
0.872HisAsp: 0.872 ± 0.226
1.307HisGlu: 1.307 ± 0.368
0.799HisPhe: 0.799 ± 0.274
1.671HisGly: 1.671 ± 0.476
0.799HisHis: 0.799 ± 0.269
1.307HisIle: 1.307 ± 0.256
1.017HisLys: 1.017 ± 0.291
2.76HisLeu: 2.76 ± 0.545
0.291HisMet: 0.291 ± 0.126
0.654HisAsn: 0.654 ± 0.224
1.307HisPro: 1.307 ± 0.321
1.38HisGln: 1.38 ± 0.342
1.525HisArg: 1.525 ± 0.4
1.307HisSer: 1.307 ± 0.362
1.598HisThr: 1.598 ± 0.336
1.162HisVal: 1.162 ± 0.337
0.654HisTrp: 0.654 ± 0.213
0.799HisTyr: 0.799 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
4.794IleAla: 4.794 ± 0.586
0.581IleCys: 0.581 ± 0.225
4.576IleAsp: 4.576 ± 0.715
4.285IleGlu: 4.285 ± 0.508
1.017IlePhe: 1.017 ± 0.248
3.777IleGly: 3.777 ± 0.652
0.944IleHis: 0.944 ± 0.249
2.106IleIle: 2.106 ± 0.358
2.469IleLys: 2.469 ± 0.406
2.034IleLeu: 2.034 ± 0.308
1.307IleMet: 1.307 ± 0.321
2.833IleAsn: 2.833 ± 0.377
2.106IlePro: 2.106 ± 0.371
1.307IleGln: 1.307 ± 0.285
2.978IleArg: 2.978 ± 0.532
2.905IleSer: 2.905 ± 0.487
3.486IleThr: 3.486 ± 0.486
2.252IleVal: 2.252 ± 0.411
0.436IleTrp: 0.436 ± 0.157
1.162IleTyr: 1.162 ± 0.303
0.0IleXaa: 0.0 ± 0.0
Lys
5.593LysAla: 5.593 ± 0.719
0.291LysCys: 0.291 ± 0.151
2.179LysAsp: 2.179 ± 0.52
2.615LysGlu: 2.615 ± 0.449
1.525LysPhe: 1.525 ± 0.316
3.704LysGly: 3.704 ± 0.496
1.017LysHis: 1.017 ± 0.32
2.179LysIle: 2.179 ± 0.357
3.995LysLys: 3.995 ± 0.667
3.922LysLeu: 3.922 ± 0.53
1.235LysMet: 1.235 ± 0.386
1.961LysAsn: 1.961 ± 0.387
2.687LysPro: 2.687 ± 0.431
2.615LysGln: 2.615 ± 0.446
3.922LysArg: 3.922 ± 0.533
2.905LysSer: 2.905 ± 0.386
2.833LysThr: 2.833 ± 0.537
4.067LysVal: 4.067 ± 0.504
1.162LysTrp: 1.162 ± 0.287
1.307LysTyr: 1.307 ± 0.328
0.0LysXaa: 0.0 ± 0.0
Leu
10.314LeuAla: 10.314 ± 0.908
1.017LeuCys: 1.017 ± 0.282
5.593LeuAsp: 5.593 ± 0.583
5.23LeuGlu: 5.23 ± 0.631
3.051LeuPhe: 3.051 ± 0.566
5.375LeuGly: 5.375 ± 0.691
2.542LeuHis: 2.542 ± 0.478
3.559LeuIle: 3.559 ± 0.42
4.213LeuLys: 4.213 ± 0.723
8.207LeuLeu: 8.207 ± 0.742
1.598LeuMet: 1.598 ± 0.306
2.76LeuAsn: 2.76 ± 0.434
4.14LeuPro: 4.14 ± 0.497
3.632LeuGln: 3.632 ± 0.54
6.319LeuArg: 6.319 ± 0.645
6.028LeuSer: 6.028 ± 0.909
5.084LeuThr: 5.084 ± 0.618
6.392LeuVal: 6.392 ± 0.692
1.089LeuTrp: 1.089 ± 0.292
1.162LeuTyr: 1.162 ± 0.28
0.0LeuXaa: 0.0 ± 0.0
Met
3.196MetAla: 3.196 ± 0.438
0.363MetCys: 0.363 ± 0.176
1.235MetAsp: 1.235 ± 0.31
1.307MetGlu: 1.307 ± 0.289
0.654MetPhe: 0.654 ± 0.226
2.252MetGly: 2.252 ± 0.573
0.508MetHis: 0.508 ± 0.187
1.017MetIle: 1.017 ± 0.282
1.38MetLys: 1.38 ± 0.297
1.598MetLeu: 1.598 ± 0.35
0.799MetMet: 0.799 ± 0.253
1.525MetAsn: 1.525 ± 0.306
1.743MetPro: 1.743 ± 0.333
1.017MetGln: 1.017 ± 0.287
1.671MetArg: 1.671 ± 0.339
2.034MetSer: 2.034 ± 0.382
1.671MetThr: 1.671 ± 0.31
2.397MetVal: 2.397 ± 0.392
0.363MetTrp: 0.363 ± 0.131
0.291MetTyr: 0.291 ± 0.139
0.0MetXaa: 0.0 ± 0.0
Asn
4.14AsnAla: 4.14 ± 0.769
0.581AsnCys: 0.581 ± 0.195
1.743AsnAsp: 1.743 ± 0.496
2.179AsnGlu: 2.179 ± 0.474
0.944AsnPhe: 0.944 ± 0.279
2.615AsnGly: 2.615 ± 0.396
0.872AsnHis: 0.872 ± 0.244
1.598AsnIle: 1.598 ± 0.381
2.179AsnLys: 2.179 ± 0.376
2.397AsnLeu: 2.397 ± 0.362
0.799AsnMet: 0.799 ± 0.249
1.089AsnAsn: 1.089 ± 0.273
2.034AsnPro: 2.034 ± 0.392
1.888AsnGln: 1.888 ± 0.3
2.687AsnArg: 2.687 ± 0.506
1.961AsnSer: 1.961 ± 0.496
1.888AsnThr: 1.888 ± 0.364
1.525AsnVal: 1.525 ± 0.382
0.799AsnTrp: 0.799 ± 0.294
0.944AsnTyr: 0.944 ± 0.27
0.0AsnXaa: 0.0 ± 0.0
Pro
4.794ProAla: 4.794 ± 0.576
0.581ProCys: 0.581 ± 0.222
3.196ProAsp: 3.196 ± 0.558
3.922ProGlu: 3.922 ± 0.474
1.162ProPhe: 1.162 ± 0.323
3.341ProGly: 3.341 ± 0.564
1.235ProHis: 1.235 ± 0.305
1.743ProIle: 1.743 ± 0.396
1.598ProLys: 1.598 ± 0.374
3.051ProLeu: 3.051 ± 0.605
1.089ProMet: 1.089 ± 0.326
1.235ProAsn: 1.235 ± 0.245
2.469ProPro: 2.469 ± 0.597
1.453ProGln: 1.453 ± 0.268
2.905ProArg: 2.905 ± 0.466
2.469ProSer: 2.469 ± 0.539
2.687ProThr: 2.687 ± 0.429
3.196ProVal: 3.196 ± 0.42
0.799ProTrp: 0.799 ± 0.274
1.888ProTyr: 1.888 ± 0.39
0.0ProXaa: 0.0 ± 0.0
Gln
4.794GlnAla: 4.794 ± 0.571
0.363GlnCys: 0.363 ± 0.172
2.252GlnAsp: 2.252 ± 0.389
2.397GlnGlu: 2.397 ± 0.45
1.598GlnPhe: 1.598 ± 0.306
3.196GlnGly: 3.196 ± 0.461
1.162GlnHis: 1.162 ± 0.336
1.961GlnIle: 1.961 ± 0.34
2.034GlnLys: 2.034 ± 0.433
4.939GlnLeu: 4.939 ± 0.572
1.162GlnMet: 1.162 ± 0.249
0.799GlnAsn: 0.799 ± 0.211
2.106GlnPro: 2.106 ± 0.289
2.76GlnGln: 2.76 ± 0.471
4.358GlnArg: 4.358 ± 0.514
2.397GlnSer: 2.397 ± 0.416
2.179GlnThr: 2.179 ± 0.351
3.704GlnVal: 3.704 ± 0.477
0.581GlnTrp: 0.581 ± 0.349
1.38GlnTyr: 1.38 ± 0.274
0.0GlnXaa: 0.0 ± 0.0
Arg
6.319ArgAla: 6.319 ± 0.584
0.944ArgCys: 0.944 ± 0.323
2.687ArgAsp: 2.687 ± 0.445
4.721ArgGlu: 4.721 ± 0.624
2.324ArgPhe: 2.324 ± 0.478
4.866ArgGly: 4.866 ± 0.614
2.106ArgHis: 2.106 ± 0.401
3.196ArgIle: 3.196 ± 0.408
2.978ArgLys: 2.978 ± 0.446
7.118ArgLeu: 7.118 ± 0.512
2.615ArgMet: 2.615 ± 0.451
1.961ArgAsn: 1.961 ± 0.326
2.252ArgPro: 2.252 ± 0.538
2.687ArgGln: 2.687 ± 0.52
6.392ArgArg: 6.392 ± 0.809
3.632ArgSer: 3.632 ± 0.438
2.978ArgThr: 2.978 ± 0.482
4.794ArgVal: 4.794 ± 0.641
1.598ArgTrp: 1.598 ± 0.364
1.525ArgTyr: 1.525 ± 0.335
0.0ArgXaa: 0.0 ± 0.0
Ser
5.593SerAla: 5.593 ± 0.805
1.089SerCys: 1.089 ± 0.378
2.615SerAsp: 2.615 ± 0.464
3.196SerGlu: 3.196 ± 0.46
2.034SerPhe: 2.034 ± 0.363
3.704SerGly: 3.704 ± 0.448
1.235SerHis: 1.235 ± 0.316
3.414SerIle: 3.414 ± 0.447
2.252SerLys: 2.252 ± 0.452
5.665SerLeu: 5.665 ± 0.594
2.469SerMet: 2.469 ± 0.435
2.034SerAsn: 2.034 ± 0.514
2.833SerPro: 2.833 ± 0.435
2.76SerGln: 2.76 ± 0.32
4.213SerArg: 4.213 ± 0.719
4.213SerSer: 4.213 ± 0.667
2.76SerThr: 2.76 ± 0.657
3.414SerVal: 3.414 ± 0.503
0.726SerTrp: 0.726 ± 0.203
1.235SerTyr: 1.235 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
5.811ThrAla: 5.811 ± 0.71
0.363ThrCys: 0.363 ± 0.178
2.905ThrAsp: 2.905 ± 0.54
3.704ThrGlu: 3.704 ± 0.455
1.38ThrPhe: 1.38 ± 0.281
5.593ThrGly: 5.593 ± 0.704
1.38ThrHis: 1.38 ± 0.275
2.252ThrIle: 2.252 ± 0.348
2.542ThrLys: 2.542 ± 0.502
5.665ThrLeu: 5.665 ± 0.744
1.162ThrMet: 1.162 ± 0.313
2.106ThrAsn: 2.106 ± 0.384
2.542ThrPro: 2.542 ± 0.382
2.905ThrGln: 2.905 ± 0.391
3.341ThrArg: 3.341 ± 0.598
2.542ThrSer: 2.542 ± 0.395
3.196ThrThr: 3.196 ± 0.421
4.576ThrVal: 4.576 ± 0.594
0.508ThrTrp: 0.508 ± 0.193
1.089ThrTyr: 1.089 ± 0.327
0.0ThrXaa: 0.0 ± 0.0
Val
7.263ValAla: 7.263 ± 0.698
0.436ValCys: 0.436 ± 0.192
5.447ValAsp: 5.447 ± 0.678
4.431ValGlu: 4.431 ± 0.666
2.034ValPhe: 2.034 ± 0.382
4.285ValGly: 4.285 ± 0.563
1.089ValHis: 1.089 ± 0.364
4.067ValIle: 4.067 ± 0.565
4.285ValLys: 4.285 ± 0.645
5.447ValLeu: 5.447 ± 0.595
1.453ValMet: 1.453 ± 0.34
2.469ValAsn: 2.469 ± 0.393
3.704ValPro: 3.704 ± 0.563
2.905ValGln: 2.905 ± 0.454
4.067ValArg: 4.067 ± 0.648
3.777ValSer: 3.777 ± 0.478
4.939ValThr: 4.939 ± 0.8
6.174ValVal: 6.174 ± 0.814
0.291ValTrp: 0.291 ± 0.135
2.179ValTyr: 2.179 ± 0.367
0.0ValXaa: 0.0 ± 0.0
Trp
1.38TrpAla: 1.38 ± 0.355
0.145TrpCys: 0.145 ± 0.11
0.654TrpAsp: 0.654 ± 0.256
0.872TrpGlu: 0.872 ± 0.278
0.436TrpPhe: 0.436 ± 0.148
0.944TrpGly: 0.944 ± 0.236
0.436TrpHis: 0.436 ± 0.162
0.436TrpIle: 0.436 ± 0.172
0.799TrpLys: 0.799 ± 0.284
1.453TrpLeu: 1.453 ± 0.302
0.291TrpMet: 0.291 ± 0.165
0.581TrpAsn: 0.581 ± 0.209
0.872TrpPro: 0.872 ± 0.27
1.089TrpGln: 1.089 ± 0.269
1.089TrpArg: 1.089 ± 0.298
1.089TrpSer: 1.089 ± 0.275
0.291TrpThr: 0.291 ± 0.162
1.38TrpVal: 1.38 ± 0.325
0.145TrpTrp: 0.145 ± 0.103
0.436TrpTyr: 0.436 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.034TyrAla: 2.034 ± 0.501
0.436TyrCys: 0.436 ± 0.147
1.743TyrAsp: 1.743 ± 0.393
1.598TyrGlu: 1.598 ± 0.357
1.089TyrPhe: 1.089 ± 0.304
1.162TyrGly: 1.162 ± 0.224
1.307TyrHis: 1.307 ± 0.317
1.017TyrIle: 1.017 ± 0.229
1.307TyrLys: 1.307 ± 0.298
1.816TyrLeu: 1.816 ± 0.397
0.436TyrMet: 0.436 ± 0.19
1.162TyrAsn: 1.162 ± 0.308
1.453TyrPro: 1.453 ± 0.318
1.453TyrGln: 1.453 ± 0.301
2.324TyrArg: 2.324 ± 0.444
1.961TyrSer: 1.961 ± 0.354
1.307TyrThr: 1.307 ± 0.324
1.38TyrVal: 1.38 ± 0.299
0.363TyrTrp: 0.363 ± 0.16
0.291TyrTyr: 0.291 ± 0.131
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (13769 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
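A protein-level bootstrap of this kind can be sketched as below. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates (population standard deviation here) is reported as the error. The function names, the fixed seed and the choice of standard deviation are assumptions, not the database's documented procedure.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter pair in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency (per mille) by bootstrap.

    Assumption: proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (not real VP882 sequences).
toy = ["AAGAA", "GGAG", "AAAA", "GAGA"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins keeps the pairs within each protein together, so the estimate reflects variation between proteins rather than between individual positions.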

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski