Amino acid dipeptide frequency for Vibrio phage phiV141

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
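The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names (`dipeptide_permille`, `classify`) are hypothetical, and it assumes that overlapping dipeptides are counted within each protein, that any non-standard letter is mapped to Xaa (X), and that frequencies are normalised by the total number of dipeptides counted. The database may normalise differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X stands for Xaa
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille if all 400 dipeptides were equally likely


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2; a window never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"
```

For example, `dipeptide_permille(["AAAC", "CB"])` counts the dipeptides AA, AA, AC and CX (B is treated as Xaa), so AA comes out at 500‰ and would be labelled "over" by `classify`.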

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
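As a small worked example of this conversion, the sketch below turns a per mille value into a plain fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation."""
    expected_permille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_permille / expected_permille


ala_ala = 9.967  # AlaAla entry from the table, in per mille
print(permille_to_fraction(ala_ala))  # fraction of all dipeptides, roughly 0.01
print(fold_over_uniform(ala_ala))     # roughly 4 times the uniform expectation
```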

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.967 ± 1.836
AlaCys: 0.586 ± 0.222
AlaAsp: 4.764 ± 0.646
AlaGlu: 5.57 ± 0.853
AlaPhe: 3.078 ± 0.406
AlaGly: 6.889 ± 0.951
AlaHis: 1.539 ± 0.338
AlaIle: 3.444 ± 0.534
AlaLys: 4.91 ± 0.589
AlaLeu: 7.915 ± 1.01
AlaMet: 3.151 ± 0.535
AlaAsn: 4.031 ± 0.507
AlaPro: 4.837 ± 1.371
AlaGln: 6.01 ± 1.066
AlaArg: 3.298 ± 0.816
AlaSer: 3.444 ± 0.469
AlaThr: 5.79 ± 0.568
AlaVal: 7.695 ± 0.742
AlaTrp: 1.246 ± 0.242
AlaTyr: 2.199 ± 0.415
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.879 ± 0.249
CysCys: 0.147 ± 0.096
CysAsp: 0.586 ± 0.201
CysGlu: 0.733 ± 0.268
CysPhe: 0.22 ± 0.11
CysGly: 0.44 ± 0.198
CysHis: 0.366 ± 0.219
CysIle: 0.366 ± 0.153
CysLys: 0.44 ± 0.147
CysLeu: 0.879 ± 0.251
CysMet: 0.44 ± 0.249
CysAsn: 0.806 ± 0.285
CysPro: 0.22 ± 0.121
CysGln: 0.586 ± 0.187
CysArg: 0.879 ± 0.237
CysSer: 0.513 ± 0.201
CysThr: 0.44 ± 0.167
CysVal: 0.44 ± 0.195
CysTrp: 0.073 ± 0.078
CysTyr: 0.293 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.984 ± 0.521
AspCys: 0.733 ± 0.201
AspAsp: 3.371 ± 0.488
AspGlu: 4.177 ± 0.526
AspPhe: 2.345 ± 0.489
AspGly: 4.031 ± 0.742
AspHis: 0.953 ± 0.352
AspIle: 3.371 ± 0.5
AspLys: 3.664 ± 0.51
AspLeu: 5.423 ± 0.524
AspMet: 3.005 ± 0.484
AspAsn: 2.638 ± 0.586
AspPro: 2.492 ± 0.382
AspGln: 2.052 ± 0.223
AspArg: 2.272 ± 0.516
AspSer: 2.785 ± 0.3
AspThr: 3.957 ± 0.58
AspVal: 5.203 ± 0.578
AspTrp: 1.173 ± 0.309
AspTyr: 2.565 ± 0.555
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.936 ± 0.988
GluCys: 0.733 ± 0.208
GluAsp: 3.078 ± 0.438
GluGlu: 3.884 ± 0.509
GluPhe: 2.931 ± 0.532
GluGly: 3.811 ± 0.571
GluHis: 1.979 ± 0.402
GluIle: 2.345 ± 0.448
GluLys: 3.078 ± 0.586
GluLeu: 7.768 ± 0.817
GluMet: 2.565 ± 0.516
GluAsn: 2.125 ± 0.373
GluPro: 1.612 ± 0.351
GluGln: 5.277 ± 0.806
GluArg: 4.104 ± 0.518
GluSer: 2.931 ± 0.554
GluThr: 3.298 ± 0.34
GluVal: 6.376 ± 0.767
GluTrp: 1.466 ± 0.353
GluTyr: 2.638 ± 0.504
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.125 ± 0.302
PheCys: 0.44 ± 0.186
PheAsp: 2.565 ± 0.348
PheGlu: 1.832 ± 0.371
PhePhe: 1.246 ± 0.469
PheGly: 2.272 ± 0.394
PheHis: 1.099 ± 0.231
PheIle: 2.272 ± 0.397
PheLys: 2.931 ± 0.409
PheLeu: 1.905 ± 0.372
PheMet: 1.392 ± 0.235
PheAsn: 1.466 ± 0.293
PhePro: 1.539 ± 0.319
PheGln: 1.466 ± 0.317
PheArg: 2.125 ± 0.396
PheSer: 1.612 ± 0.509
PheThr: 2.712 ± 0.592
PheVal: 1.759 ± 0.397
PheTrp: 0.586 ± 0.234
PheTyr: 1.392 ± 0.392
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.79 ± 0.689
GlyCys: 0.586 ± 0.232
GlyAsp: 4.764 ± 0.662
GlyGlu: 3.957 ± 0.604
GlyPhe: 2.785 ± 0.49
GlyGly: 5.13 ± 0.811
GlyHis: 1.173 ± 0.275
GlyIle: 3.151 ± 0.545
GlyLys: 3.591 ± 0.521
GlyLeu: 5.13 ± 0.475
GlyMet: 1.832 ± 0.29
GlyAsn: 2.785 ± 0.383
GlyPro: 0.44 ± 0.187
GlyGln: 2.931 ± 0.512
GlyArg: 3.591 ± 0.479
GlySer: 4.324 ± 0.644
GlyThr: 4.544 ± 0.466
GlyVal: 7.036 ± 0.749
GlyTrp: 1.246 ± 0.293
GlyTyr: 3.078 ± 0.448
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.612 ± 0.32
HisCys: 0.366 ± 0.143
HisAsp: 1.246 ± 0.325
HisGlu: 0.879 ± 0.325
HisPhe: 0.733 ± 0.23
HisGly: 1.905 ± 0.432
HisHis: 0.293 ± 0.196
HisIle: 1.319 ± 0.299
HisLys: 2.052 ± 0.361
HisLeu: 1.612 ± 0.323
HisMet: 0.806 ± 0.219
HisAsn: 1.319 ± 0.237
HisPro: 1.539 ± 0.381
HisGln: 1.319 ± 0.322
HisArg: 0.953 ± 0.244
HisSer: 1.392 ± 0.311
HisThr: 1.466 ± 0.256
HisVal: 1.539 ± 0.281
HisTrp: 0.147 ± 0.105
HisTyr: 0.586 ± 0.224
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.811 ± 0.385
IleCys: 0.586 ± 0.205
IleAsp: 3.371 ± 0.55
IleGlu: 4.251 ± 0.617
IlePhe: 0.66 ± 0.218
IleGly: 3.078 ± 0.52
IleHis: 1.319 ± 0.38
IleIle: 1.686 ± 0.421
IleLys: 3.078 ± 0.495
IleLeu: 1.979 ± 0.362
IleMet: 1.392 ± 0.354
IleAsn: 2.931 ± 0.473
IlePro: 2.638 ± 0.487
IleGln: 1.832 ± 0.452
IleArg: 2.858 ± 0.512
IleSer: 1.979 ± 0.382
IleThr: 2.931 ± 0.504
IleVal: 2.931 ± 0.418
IleTrp: 0.44 ± 0.158
IleTyr: 1.173 ± 0.249
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.57 ± 0.723
LysCys: 0.513 ± 0.21
LysAsp: 3.664 ± 0.53
LysGlu: 5.057 ± 0.757
LysPhe: 2.272 ± 0.386
LysGly: 3.591 ± 0.493
LysHis: 2.345 ± 0.503
LysIle: 1.979 ± 0.423
LysLys: 2.712 ± 0.337
LysLeu: 5.277 ± 0.674
LysMet: 1.612 ± 0.401
LysAsn: 2.858 ± 0.406
LysPro: 2.272 ± 0.452
LysGln: 2.565 ± 0.405
LysArg: 3.444 ± 0.675
LysSer: 2.785 ± 0.501
LysThr: 2.199 ± 0.506
LysVal: 4.031 ± 0.566
LysTrp: 1.319 ± 0.248
LysTyr: 2.492 ± 0.405
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.596 ± 0.609
LeuCys: 0.66 ± 0.249
LeuAsp: 5.863 ± 0.7
LeuGlu: 5.57 ± 0.588
LeuPhe: 2.638 ± 0.299
LeuGly: 5.57 ± 0.689
LeuHis: 2.052 ± 0.46
LeuIle: 3.298 ± 0.521
LeuLys: 3.884 ± 0.508
LeuLeu: 5.35 ± 0.563
LeuMet: 1.392 ± 0.305
LeuAsn: 2.712 ± 0.445
LeuPro: 3.371 ± 0.493
LeuGln: 3.738 ± 0.646
LeuArg: 4.764 ± 0.665
LeuSer: 4.251 ± 0.637
LeuThr: 5.13 ± 0.643
LeuVal: 4.617 ± 0.733
LeuTrp: 1.026 ± 0.264
LeuTyr: 2.492 ± 0.444
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.078 ± 0.431
MetCys: 0.147 ± 0.152
MetAsp: 1.466 ± 0.301
MetGlu: 2.638 ± 0.548
MetPhe: 0.66 ± 0.207
MetGly: 2.125 ± 0.428
MetHis: 0.293 ± 0.144
MetIle: 1.392 ± 0.357
MetLys: 1.612 ± 0.304
MetLeu: 3.005 ± 0.412
MetMet: 1.246 ± 0.325
MetAsn: 1.392 ± 0.282
MetPro: 1.099 ± 0.351
MetGln: 2.492 ± 0.538
MetArg: 1.686 ± 0.373
MetSer: 2.125 ± 0.338
MetThr: 2.345 ± 0.408
MetVal: 1.905 ± 0.361
MetTrp: 0.66 ± 0.207
MetTyr: 1.099 ± 0.275
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.738 ± 0.585
AsnCys: 0.586 ± 0.223
AsnAsp: 2.345 ± 0.352
AsnGlu: 2.418 ± 0.376
AsnPhe: 1.832 ± 0.277
AsnGly: 3.811 ± 0.701
AsnHis: 0.879 ± 0.258
AsnIle: 2.565 ± 0.478
AsnLys: 2.785 ± 0.477
AsnLeu: 2.565 ± 0.501
AsnMet: 1.832 ± 0.307
AsnAsn: 2.345 ± 0.389
AsnPro: 2.199 ± 0.381
AsnGln: 1.686 ± 0.223
AsnArg: 1.466 ± 0.269
AsnSer: 2.785 ± 0.359
AsnThr: 3.005 ± 0.476
AsnVal: 3.371 ± 0.683
AsnTrp: 0.733 ± 0.2
AsnTyr: 1.612 ± 0.38
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.444 ± 0.598
ProCys: 0.44 ± 0.153
ProAsp: 3.005 ± 0.463
ProGlu: 3.738 ± 0.706
ProPhe: 1.099 ± 0.308
ProGly: 1.832 ± 0.385
ProHis: 0.66 ± 0.176
ProIle: 1.979 ± 0.376
ProLys: 2.418 ± 0.492
ProLeu: 2.931 ± 0.472
ProMet: 0.586 ± 0.223
ProAsn: 1.905 ± 0.345
ProPro: 1.099 ± 0.309
ProGln: 2.931 ± 0.597
ProArg: 1.392 ± 0.305
ProSer: 1.979 ± 0.379
ProThr: 3.371 ± 0.49
ProVal: 5.423 ± 1.027
ProTrp: 0.733 ± 0.255
ProTyr: 1.099 ± 0.257
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 6.596 ± 1.075
GlnCys: 0.44 ± 0.158
GlnAsp: 2.565 ± 0.411
GlnGlu: 3.078 ± 0.576
GlnPhe: 2.199 ± 0.498
GlnGly: 3.151 ± 0.492
GlnHis: 0.953 ± 0.267
GlnIle: 1.905 ± 0.387
GlnLys: 2.565 ± 0.445
GlnLeu: 3.811 ± 0.572
GlnMet: 2.638 ± 0.539
GlnAsn: 1.099 ± 0.291
GlnPro: 3.078 ± 0.9
GlnGln: 3.738 ± 0.88
GlnArg: 3.298 ± 0.458
GlnSer: 3.225 ± 0.576
GlnThr: 2.272 ± 0.368
GlnVal: 4.251 ± 0.729
GlnTrp: 1.392 ± 0.396
GlnTyr: 1.612 ± 0.338
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.104 ± 0.854
ArgCys: 0.44 ± 0.164
ArgAsp: 2.712 ± 0.564
ArgGlu: 3.957 ± 0.619
ArgPhe: 1.539 ± 0.372
ArgGly: 3.225 ± 0.425
ArgHis: 1.173 ± 0.353
ArgIle: 2.712 ± 0.401
ArgLys: 3.298 ± 0.503
ArgLeu: 3.151 ± 0.368
ArgMet: 1.759 ± 0.426
ArgAsn: 2.199 ± 0.297
ArgPro: 2.052 ± 0.408
ArgGln: 2.418 ± 0.496
ArgArg: 2.931 ± 0.365
ArgSer: 2.125 ± 0.404
ArgThr: 2.931 ± 0.375
ArgVal: 3.591 ± 0.473
ArgTrp: 0.733 ± 0.205
ArgTyr: 2.199 ± 0.401
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.397 ± 0.546
SerCys: 0.586 ± 0.248
SerAsp: 2.858 ± 0.41
SerGlu: 3.298 ± 0.351
SerPhe: 1.979 ± 0.383
SerGly: 3.371 ± 0.394
SerHis: 1.173 ± 0.208
SerIle: 2.638 ± 0.45
SerLys: 3.591 ± 0.459
SerLeu: 3.884 ± 0.458
SerMet: 1.686 ± 0.419
SerAsn: 2.931 ± 0.566
SerPro: 2.418 ± 0.383
SerGln: 2.565 ± 0.479
SerArg: 2.199 ± 0.459
SerSer: 2.272 ± 0.425
SerThr: 2.785 ± 0.33
SerVal: 5.057 ± 0.541
SerTrp: 1.099 ± 0.277
SerTyr: 1.466 ± 0.288
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.303 ± 0.643
ThrCys: 0.513 ± 0.223
ThrAsp: 2.785 ± 0.58
ThrGlu: 4.324 ± 0.392
ThrPhe: 2.199 ± 0.39
ThrGly: 4.104 ± 0.687
ThrHis: 1.759 ± 0.414
ThrIle: 3.518 ± 0.495
ThrLys: 4.617 ± 0.528
ThrLeu: 3.811 ± 0.548
ThrMet: 1.319 ± 0.328
ThrAsn: 2.638 ± 0.493
ThrPro: 3.298 ± 0.426
ThrGln: 2.858 ± 0.412
ThrArg: 2.565 ± 0.426
ThrSer: 3.664 ± 0.559
ThrThr: 3.371 ± 0.45
ThrVal: 3.738 ± 0.575
ThrTrp: 0.366 ± 0.149
ThrTyr: 2.492 ± 0.306
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.915 ± 0.915
ValCys: 0.513 ± 0.19
ValAsp: 5.79 ± 0.691
ValGlu: 5.423 ± 0.47
ValPhe: 2.712 ± 0.466
ValGly: 5.35 ± 0.697
ValHis: 2.125 ± 0.459
ValIle: 2.638 ± 0.463
ValLys: 4.544 ± 0.726
ValLeu: 4.984 ± 0.439
ValMet: 2.052 ± 0.342
ValAsn: 3.738 ± 0.508
ValPro: 3.738 ± 0.601
ValGln: 4.251 ± 0.836
ValArg: 3.225 ± 0.524
ValSer: 4.324 ± 0.67
ValThr: 4.69 ± 0.654
ValVal: 5.497 ± 0.645
ValTrp: 1.686 ± 0.369
ValTyr: 3.078 ± 0.448
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.953 ± 0.284
TrpCys: 0.366 ± 0.154
TrpAsp: 1.759 ± 0.318
TrpGlu: 1.319 ± 0.302
TrpPhe: 0.733 ± 0.185
TrpGly: 0.953 ± 0.234
TrpHis: 0.147 ± 0.082
TrpIle: 0.366 ± 0.135
TrpLys: 1.026 ± 0.272
TrpLeu: 1.905 ± 0.422
TrpMet: 0.44 ± 0.193
TrpAsn: 0.66 ± 0.19
TrpPro: 0.366 ± 0.172
TrpGln: 1.026 ± 0.266
TrpArg: 0.66 ± 0.228
TrpSer: 0.953 ± 0.214
TrpThr: 1.392 ± 0.322
TrpVal: 1.099 ± 0.315
TrpTrp: 0.44 ± 0.22
TrpTyr: 0.66 ± 0.193
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.565 ± 0.424
TyrCys: 0.366 ± 0.165
TyrAsp: 2.638 ± 0.402
TyrGlu: 1.905 ± 0.438
TyrPhe: 0.879 ± 0.187
TyrGly: 3.225 ± 0.415
TyrHis: 0.806 ± 0.232
TyrIle: 1.979 ± 0.394
TyrLys: 1.759 ± 0.442
TyrLeu: 1.686 ± 0.324
TyrMet: 1.246 ± 0.277
TyrAsn: 1.979 ± 0.384
TyrPro: 1.759 ± 0.314
TyrGln: 2.125 ± 0.466
TyrArg: 1.539 ± 0.362
TyrSer: 2.785 ± 0.426
TyrThr: 1.612 ± 0.462
TyrVal: 2.712 ± 0.461
TyrTrp: 0.66 ± 0.194
TyrTyr: 2.199 ± 0.334
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (13646 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
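A protein-level bootstrap of this kind can be sketched as follows. This is an illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, whole proteins are resampled with replacement, and the spread is summarised as the sample standard deviation over the resamples (the source does not say which statistic the ± values report).

```python
import random
import statistics


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, over overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the frequency over resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated spread reflects how much a dipeptide's frequency depends on which proteins happen to be in the proteome.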

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski