Amino acid dipepetide frequency for Vibrio phage JSF7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.553AlaAla: 10.553 ± 1.453
1.417AlaCys: 1.417 ± 0.432
4.533AlaAsp: 4.533 ± 0.457
6.162AlaGlu: 6.162 ± 1.034
3.541AlaPhe: 3.541 ± 0.501
5.808AlaGly: 5.808 ± 0.731
1.062AlaHis: 1.062 ± 0.291
4.462AlaIle: 4.462 ± 0.71
5.241AlaLys: 5.241 ± 0.851
6.02AlaLeu: 6.02 ± 0.811
3.612AlaMet: 3.612 ± 0.516
4.958AlaAsn: 4.958 ± 0.56
3.895AlaPro: 3.895 ± 0.662
5.17AlaGln: 5.17 ± 0.834
5.1AlaArg: 5.1 ± 0.563
4.179AlaSer: 4.179 ± 0.748
4.179AlaThr: 4.179 ± 0.53
7.153AlaVal: 7.153 ± 1.14
1.346AlaTrp: 1.346 ± 0.285
2.621AlaTyr: 2.621 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
1.062CysAla: 1.062 ± 0.326
0.142CysCys: 0.142 ± 0.081
0.708CysAsp: 0.708 ± 0.231
0.567CysGlu: 0.567 ± 0.196
0.212CysPhe: 0.212 ± 0.143
0.637CysGly: 0.637 ± 0.235
0.425CysHis: 0.425 ± 0.22
0.496CysIle: 0.496 ± 0.194
0.779CysLys: 0.779 ± 0.376
0.779CysLeu: 0.779 ± 0.226
0.496CysMet: 0.496 ± 0.235
0.85CysAsn: 0.85 ± 0.228
0.496CysPro: 0.496 ± 0.179
0.567CysGln: 0.567 ± 0.231
0.496CysArg: 0.496 ± 0.168
0.567CysSer: 0.567 ± 0.193
0.567CysThr: 0.567 ± 0.245
0.779CysVal: 0.779 ± 0.211
0.212CysTrp: 0.212 ± 0.12
0.425CysTyr: 0.425 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
5.1AspAla: 5.1 ± 0.765
0.425AspCys: 0.425 ± 0.197
2.479AspAsp: 2.479 ± 0.416
3.612AspGlu: 3.612 ± 0.584
2.408AspPhe: 2.408 ± 0.431
4.745AspGly: 4.745 ± 0.918
0.992AspHis: 0.992 ± 0.28
2.833AspIle: 2.833 ± 0.641
4.037AspLys: 4.037 ± 0.643
4.675AspLeu: 4.675 ± 0.577
2.479AspMet: 2.479 ± 0.397
2.55AspAsn: 2.55 ± 0.525
2.125AspPro: 2.125 ± 0.309
1.062AspGln: 1.062 ± 0.26
2.125AspArg: 2.125 ± 0.452
2.904AspSer: 2.904 ± 0.421
3.541AspThr: 3.541 ± 0.454
5.1AspVal: 5.1 ± 0.445
0.921AspTrp: 0.921 ± 0.23
2.621AspTyr: 2.621 ± 0.49
0.0AspXaa: 0.0 ± 0.0
Glu
6.445GluAla: 6.445 ± 0.751
0.637GluCys: 0.637 ± 0.174
2.833GluAsp: 2.833 ± 0.44
4.037GluGlu: 4.037 ± 0.494
2.55GluPhe: 2.55 ± 0.373
3.329GluGly: 3.329 ± 0.485
2.196GluHis: 2.196 ± 0.374
2.55GluIle: 2.55 ± 0.57
2.266GluLys: 2.266 ± 0.335
7.791GluLeu: 7.791 ± 0.765
2.125GluMet: 2.125 ± 0.395
1.983GluAsn: 1.983 ± 0.372
1.629GluPro: 1.629 ± 0.337
3.4GluGln: 3.4 ± 0.48
4.25GluArg: 4.25 ± 0.551
2.904GluSer: 2.904 ± 0.395
3.046GluThr: 3.046 ± 0.386
6.091GluVal: 6.091 ± 0.608
1.487GluTrp: 1.487 ± 0.301
2.479GluTyr: 2.479 ± 0.403
0.0GluXaa: 0.0 ± 0.0
Phe
2.691PheAla: 2.691 ± 0.435
0.212PheCys: 0.212 ± 0.112
2.337PheAsp: 2.337 ± 0.386
2.904PheGlu: 2.904 ± 0.454
1.204PhePhe: 1.204 ± 0.38
2.833PheGly: 2.833 ± 0.407
0.85PheHis: 0.85 ± 0.181
1.841PheIle: 1.841 ± 0.362
2.762PheLys: 2.762 ± 0.493
2.266PheLeu: 2.266 ± 0.325
1.133PheMet: 1.133 ± 0.286
1.912PheAsn: 1.912 ± 0.448
0.921PhePro: 0.921 ± 0.221
1.275PheGln: 1.275 ± 0.301
2.691PheArg: 2.691 ± 0.4
1.417PheSer: 1.417 ± 0.32
2.125PheThr: 2.125 ± 0.403
1.983PheVal: 1.983 ± 0.33
0.354PheTrp: 0.354 ± 0.141
1.275PheTyr: 1.275 ± 0.315
0.0PheXaa: 0.0 ± 0.0
Gly
5.383GlyAla: 5.383 ± 0.704
1.062GlyCys: 1.062 ± 0.337
4.533GlyAsp: 4.533 ± 0.634
3.4GlyGlu: 3.4 ± 0.457
2.762GlyPhe: 2.762 ± 0.473
6.02GlyGly: 6.02 ± 0.8
1.275GlyHis: 1.275 ± 0.318
4.108GlyIle: 4.108 ± 0.626
3.754GlyLys: 3.754 ± 0.491
5.17GlyLeu: 5.17 ± 0.389
2.479GlyMet: 2.479 ± 0.37
3.754GlyAsn: 3.754 ± 0.594
0.637GlyPro: 0.637 ± 0.236
2.691GlyGln: 2.691 ± 0.476
3.895GlyArg: 3.895 ± 0.541
4.391GlySer: 4.391 ± 0.563
4.25GlyThr: 4.25 ± 0.761
5.879GlyVal: 5.879 ± 0.719
1.558GlyTrp: 1.558 ± 0.31
2.975GlyTyr: 2.975 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
1.841HisAla: 1.841 ± 0.364
0.496HisCys: 0.496 ± 0.186
1.417HisAsp: 1.417 ± 0.286
0.283HisGlu: 0.283 ± 0.135
0.708HisPhe: 0.708 ± 0.22
1.558HisGly: 1.558 ± 0.362
0.496HisHis: 0.496 ± 0.289
1.417HisIle: 1.417 ± 0.302
1.346HisLys: 1.346 ± 0.305
1.629HisLeu: 1.629 ± 0.384
1.275HisMet: 1.275 ± 0.285
0.992HisAsn: 0.992 ± 0.241
1.487HisPro: 1.487 ± 0.275
0.567HisGln: 0.567 ± 0.155
0.921HisArg: 0.921 ± 0.27
1.346HisSer: 1.346 ± 0.291
0.921HisThr: 0.921 ± 0.29
1.133HisVal: 1.133 ± 0.216
0.283HisTrp: 0.283 ± 0.151
0.708HisTyr: 0.708 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
4.604IleAla: 4.604 ± 0.445
0.283IleCys: 0.283 ± 0.143
3.683IleAsp: 3.683 ± 0.355
3.966IleGlu: 3.966 ± 0.571
1.275IlePhe: 1.275 ± 0.248
3.541IleGly: 3.541 ± 0.423
1.417IleHis: 1.417 ± 0.413
2.55IleIle: 2.55 ± 0.482
3.187IleLys: 3.187 ± 0.562
2.266IleLeu: 2.266 ± 0.343
1.558IleMet: 1.558 ± 0.36
2.833IleAsn: 2.833 ± 0.4
1.841IlePro: 1.841 ± 0.467
1.771IleGln: 1.771 ± 0.378
2.196IleArg: 2.196 ± 0.362
2.762IleSer: 2.762 ± 0.561
3.046IleThr: 3.046 ± 0.427
2.408IleVal: 2.408 ± 0.368
0.992IleTrp: 0.992 ± 0.298
1.204IleTyr: 1.204 ± 0.422
0.0IleXaa: 0.0 ± 0.0
Lys
6.091LysAla: 6.091 ± 1.146
0.637LysCys: 0.637 ± 0.185
3.754LysAsp: 3.754 ± 0.557
4.037LysGlu: 4.037 ± 0.72
2.408LysPhe: 2.408 ± 0.518
3.258LysGly: 3.258 ± 0.388
1.204LysHis: 1.204 ± 0.225
2.408LysIle: 2.408 ± 0.395
2.833LysLys: 2.833 ± 0.364
5.1LysLeu: 5.1 ± 0.613
1.912LysMet: 1.912 ± 0.302
2.125LysAsn: 2.125 ± 0.421
2.691LysPro: 2.691 ± 0.411
3.683LysGln: 3.683 ± 0.578
3.966LysArg: 3.966 ± 0.491
3.258LysSer: 3.258 ± 0.321
2.55LysThr: 2.55 ± 0.398
4.391LysVal: 4.391 ± 0.771
0.85LysTrp: 0.85 ± 0.248
2.337LysTyr: 2.337 ± 0.438
0.0LysXaa: 0.0 ± 0.0
Leu
6.941LeuAla: 6.941 ± 0.868
0.425LeuCys: 0.425 ± 0.203
4.745LeuAsp: 4.745 ± 0.499
5.737LeuGlu: 5.737 ± 0.543
2.196LeuPhe: 2.196 ± 0.375
4.675LeuGly: 4.675 ± 0.653
1.558LeuHis: 1.558 ± 0.406
3.895LeuIle: 3.895 ± 0.523
4.675LeuLys: 4.675 ± 0.471
5.454LeuLeu: 5.454 ± 0.552
2.55LeuMet: 2.55 ± 0.448
4.391LeuAsn: 4.391 ± 0.621
4.25LeuPro: 4.25 ± 0.593
3.471LeuGln: 3.471 ± 0.672
4.745LeuArg: 4.745 ± 0.613
4.32LeuSer: 4.32 ± 0.512
4.604LeuThr: 4.604 ± 0.508
5.1LeuVal: 5.1 ± 0.604
1.487LeuTrp: 1.487 ± 0.316
2.762LeuTyr: 2.762 ± 0.428
0.0LeuXaa: 0.0 ± 0.0
Met
2.196MetAla: 2.196 ± 0.396
0.212MetCys: 0.212 ± 0.146
2.266MetAsp: 2.266 ± 0.332
2.408MetGlu: 2.408 ± 0.312
1.275MetPhe: 1.275 ± 0.236
1.841MetGly: 1.841 ± 0.318
0.567MetHis: 0.567 ± 0.226
1.558MetIle: 1.558 ± 0.35
1.487MetLys: 1.487 ± 0.222
3.116MetLeu: 3.116 ± 0.576
1.062MetMet: 1.062 ± 0.303
2.125MetAsn: 2.125 ± 0.305
1.629MetPro: 1.629 ± 0.309
1.983MetGln: 1.983 ± 0.505
2.762MetArg: 2.762 ± 0.39
2.408MetSer: 2.408 ± 0.393
1.912MetThr: 1.912 ± 0.381
1.629MetVal: 1.629 ± 0.369
0.425MetTrp: 0.425 ± 0.214
1.133MetTyr: 1.133 ± 0.322
0.0MetXaa: 0.0 ± 0.0
Asn
4.533AsnAla: 4.533 ± 0.511
0.779AsnCys: 0.779 ± 0.197
2.762AsnAsp: 2.762 ± 0.471
2.975AsnGlu: 2.975 ± 0.468
1.133AsnPhe: 1.133 ± 0.258
3.683AsnGly: 3.683 ± 0.721
0.708AsnHis: 0.708 ± 0.231
2.196AsnIle: 2.196 ± 0.566
4.037AsnLys: 4.037 ± 0.598
4.037AsnLeu: 4.037 ± 0.578
1.912AsnMet: 1.912 ± 0.347
2.337AsnAsn: 2.337 ± 0.519
2.975AsnPro: 2.975 ± 0.481
1.629AsnGln: 1.629 ± 0.354
2.266AsnArg: 2.266 ± 0.344
2.762AsnSer: 2.762 ± 0.424
3.895AsnThr: 3.895 ± 0.599
2.904AsnVal: 2.904 ± 0.602
1.133AsnTrp: 1.133 ± 0.259
1.629AsnTyr: 1.629 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
3.612ProAla: 3.612 ± 0.746
0.354ProCys: 0.354 ± 0.153
2.904ProAsp: 2.904 ± 0.496
3.471ProGlu: 3.471 ± 0.601
1.487ProPhe: 1.487 ± 0.34
1.558ProGly: 1.558 ± 0.316
0.567ProHis: 0.567 ± 0.181
2.621ProIle: 2.621 ± 0.434
2.904ProLys: 2.904 ± 0.446
3.116ProLeu: 3.116 ± 0.372
1.417ProMet: 1.417 ± 0.338
2.621ProAsn: 2.621 ± 0.472
1.771ProPro: 1.771 ± 0.445
2.479ProGln: 2.479 ± 0.342
2.125ProArg: 2.125 ± 0.386
1.629ProSer: 1.629 ± 0.323
2.196ProThr: 2.196 ± 0.418
3.754ProVal: 3.754 ± 0.487
0.85ProTrp: 0.85 ± 0.207
1.417ProTyr: 1.417 ± 0.364
0.0ProXaa: 0.0 ± 0.0
Gln
4.391GlnAla: 4.391 ± 0.682
0.283GlnCys: 0.283 ± 0.141
1.417GlnAsp: 1.417 ± 0.284
3.046GlnGlu: 3.046 ± 0.438
1.771GlnPhe: 1.771 ± 0.392
3.754GlnGly: 3.754 ± 0.58
1.133GlnHis: 1.133 ± 0.266
1.487GlnIle: 1.487 ± 0.308
1.7GlnLys: 1.7 ± 0.23
4.108GlnLeu: 4.108 ± 0.543
1.275GlnMet: 1.275 ± 0.327
1.771GlnAsn: 1.771 ± 0.27
2.054GlnPro: 2.054 ± 0.67
3.471GlnGln: 3.471 ± 0.701
2.479GlnArg: 2.479 ± 0.377
1.983GlnSer: 1.983 ± 0.367
2.054GlnThr: 2.054 ± 0.409
4.816GlnVal: 4.816 ± 0.552
1.487GlnTrp: 1.487 ± 0.291
1.629GlnTyr: 1.629 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
4.533ArgAla: 4.533 ± 0.537
0.85ArgCys: 0.85 ± 0.271
3.541ArgAsp: 3.541 ± 0.379
2.975ArgGlu: 2.975 ± 0.302
2.196ArgPhe: 2.196 ± 0.425
3.825ArgGly: 3.825 ± 0.539
1.417ArgHis: 1.417 ± 0.362
2.691ArgIle: 2.691 ± 0.562
3.329ArgLys: 3.329 ± 0.469
4.25ArgLeu: 4.25 ± 0.541
1.983ArgMet: 1.983 ± 0.274
3.116ArgAsn: 3.116 ± 0.473
2.479ArgPro: 2.479 ± 0.477
1.771ArgGln: 1.771 ± 0.396
2.621ArgArg: 2.621 ± 0.498
2.975ArgSer: 2.975 ± 0.411
2.833ArgThr: 2.833 ± 0.486
4.179ArgVal: 4.179 ± 0.602
0.708ArgTrp: 0.708 ± 0.189
2.266ArgTyr: 2.266 ± 0.551
0.0ArgXaa: 0.0 ± 0.0
Ser
4.675SerAla: 4.675 ± 0.486
0.354SerCys: 0.354 ± 0.203
2.904SerAsp: 2.904 ± 0.434
2.975SerGlu: 2.975 ± 0.463
1.841SerPhe: 1.841 ± 0.395
4.25SerGly: 4.25 ± 0.709
1.062SerHis: 1.062 ± 0.267
2.408SerIle: 2.408 ± 0.324
3.966SerLys: 3.966 ± 0.902
4.179SerLeu: 4.179 ± 0.522
1.275SerMet: 1.275 ± 0.286
2.904SerAsn: 2.904 ± 0.385
2.479SerPro: 2.479 ± 0.334
2.691SerGln: 2.691 ± 0.41
2.691SerArg: 2.691 ± 0.422
3.046SerSer: 3.046 ± 0.546
3.471SerThr: 3.471 ± 0.373
3.187SerVal: 3.187 ± 0.506
0.779SerTrp: 0.779 ± 0.173
2.125SerTyr: 2.125 ± 0.295
0.0SerXaa: 0.0 ± 0.0
Thr
4.887ThrAla: 4.887 ± 0.583
0.779ThrCys: 0.779 ± 0.214
3.046ThrAsp: 3.046 ± 0.491
2.975ThrGlu: 2.975 ± 0.447
1.912ThrPhe: 1.912 ± 0.291
5.595ThrGly: 5.595 ± 0.717
1.062ThrHis: 1.062 ± 0.268
2.266ThrIle: 2.266 ± 0.306
4.037ThrLys: 4.037 ± 0.486
4.887ThrLeu: 4.887 ± 0.758
1.558ThrMet: 1.558 ± 0.356
2.479ThrAsn: 2.479 ± 0.521
2.904ThrPro: 2.904 ± 0.476
2.479ThrGln: 2.479 ± 0.38
2.337ThrArg: 2.337 ± 0.341
2.975ThrSer: 2.975 ± 0.483
2.904ThrThr: 2.904 ± 0.598
3.683ThrVal: 3.683 ± 0.564
1.062ThrTrp: 1.062 ± 0.323
1.912ThrTyr: 1.912 ± 0.306
0.0ThrXaa: 0.0 ± 0.0
Val
6.799ValAla: 6.799 ± 0.846
0.921ValCys: 0.921 ± 0.324
3.683ValAsp: 3.683 ± 0.553
4.533ValGlu: 4.533 ± 0.587
2.266ValPhe: 2.266 ± 0.54
5.454ValGly: 5.454 ± 0.588
1.7ValHis: 1.7 ± 0.341
3.612ValIle: 3.612 ± 0.541
4.25ValLys: 4.25 ± 0.677
4.887ValLeu: 4.887 ± 0.694
2.266ValMet: 2.266 ± 0.223
3.825ValAsn: 3.825 ± 0.474
4.25ValPro: 4.25 ± 0.626
3.541ValGln: 3.541 ± 0.604
4.037ValArg: 4.037 ± 0.529
4.037ValSer: 4.037 ± 0.551
4.179ValThr: 4.179 ± 0.479
6.02ValVal: 6.02 ± 0.799
1.062ValTrp: 1.062 ± 0.302
3.116ValTyr: 3.116 ± 0.527
0.0ValXaa: 0.0 ± 0.0
Trp
1.629TrpAla: 1.629 ± 0.26
0.425TrpCys: 0.425 ± 0.179
1.275TrpAsp: 1.275 ± 0.248
1.204TrpGlu: 1.204 ± 0.302
0.496TrpPhe: 0.496 ± 0.177
1.204TrpGly: 1.204 ± 0.289
0.212TrpHis: 0.212 ± 0.117
0.354TrpIle: 0.354 ± 0.18
1.133TrpLys: 1.133 ± 0.255
2.054TrpLeu: 2.054 ± 0.34
0.496TrpMet: 0.496 ± 0.152
0.992TrpAsn: 0.992 ± 0.235
0.496TrpPro: 0.496 ± 0.203
0.921TrpGln: 0.921 ± 0.204
0.921TrpArg: 0.921 ± 0.337
0.921TrpSer: 0.921 ± 0.295
1.204TrpThr: 1.204 ± 0.232
1.204TrpVal: 1.204 ± 0.33
0.425TrpTrp: 0.425 ± 0.191
0.496TrpTyr: 0.496 ± 0.177
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.046TyrAla: 3.046 ± 0.403
0.567TyrCys: 0.567 ± 0.22
1.912TyrAsp: 1.912 ± 0.341
2.762TyrGlu: 2.762 ± 0.458
1.346TyrPhe: 1.346 ± 0.268
2.408TyrGly: 2.408 ± 0.418
0.921TyrHis: 0.921 ± 0.18
1.629TyrIle: 1.629 ± 0.305
1.771TyrLys: 1.771 ± 0.276
2.196TyrLeu: 2.196 ± 0.409
1.062TyrMet: 1.062 ± 0.222
1.771TyrAsn: 1.771 ± 0.3
1.841TyrPro: 1.841 ± 0.41
1.558TyrGln: 1.558 ± 0.223
1.912TyrArg: 1.912 ± 0.346
2.479TyrSer: 2.479 ± 0.478
2.266TyrThr: 2.266 ± 0.407
3.046TyrVal: 3.046 ± 0.384
0.637TyrTrp: 0.637 ± 0.267
1.912TyrTyr: 1.912 ± 0.351
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (14120 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski