Amino acid dipepetide frequency for Vibrio phage MZH0603

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.23AlaAla: 9.23 ± 1.447
0.521AlaCys: 0.521 ± 0.273
5.434AlaAsp: 5.434 ± 0.863
6.401AlaGlu: 6.401 ± 0.865
2.828AlaPhe: 2.828 ± 0.442
5.88AlaGly: 5.88 ± 0.944
0.893AlaHis: 0.893 ± 0.228
3.126AlaIle: 3.126 ± 0.436
4.54AlaLys: 4.54 ± 0.626
6.327AlaLeu: 6.327 ± 0.725
1.861AlaMet: 1.861 ± 0.348
2.382AlaAsn: 2.382 ± 0.478
4.987AlaPro: 4.987 ± 1.552
4.168AlaGln: 4.168 ± 0.766
4.466AlaArg: 4.466 ± 0.685
4.168AlaSer: 4.168 ± 0.527
5.955AlaThr: 5.955 ± 0.941
6.103AlaVal: 6.103 ± 0.839
1.116AlaTrp: 1.116 ± 0.344
2.605AlaTyr: 2.605 ± 0.49
0.0AlaXaa: 0.0 ± 0.0
Cys
0.447CysAla: 0.447 ± 0.191
0.223CysCys: 0.223 ± 0.105
0.893CysAsp: 0.893 ± 0.323
0.893CysGlu: 0.893 ± 0.297
0.521CysPhe: 0.521 ± 0.205
0.298CysGly: 0.298 ± 0.148
0.298CysHis: 0.298 ± 0.173
0.521CysIle: 0.521 ± 0.203
0.819CysLys: 0.819 ± 0.269
0.968CysLeu: 0.968 ± 0.321
0.447CysMet: 0.447 ± 0.17
0.819CysAsn: 0.819 ± 0.325
0.595CysPro: 0.595 ± 0.18
0.372CysGln: 0.372 ± 0.171
0.819CysArg: 0.819 ± 0.278
0.744CysSer: 0.744 ± 0.304
0.744CysThr: 0.744 ± 0.296
1.116CysVal: 1.116 ± 0.434
0.223CysTrp: 0.223 ± 0.134
0.149CysTyr: 0.149 ± 0.123
0.0CysXaa: 0.0 ± 0.0
Asp
6.327AspAla: 6.327 ± 1.004
0.595AspCys: 0.595 ± 0.232
4.168AspAsp: 4.168 ± 0.68
3.796AspGlu: 3.796 ± 0.66
2.828AspPhe: 2.828 ± 0.417
4.987AspGly: 4.987 ± 0.601
1.489AspHis: 1.489 ± 0.375
4.392AspIle: 4.392 ± 0.562
4.392AspLys: 4.392 ± 0.615
5.88AspLeu: 5.88 ± 0.592
1.638AspMet: 1.638 ± 0.313
3.424AspAsn: 3.424 ± 0.561
2.754AspPro: 2.754 ± 0.343
1.861AspGln: 1.861 ± 0.342
2.828AspArg: 2.828 ± 0.543
3.87AspSer: 3.87 ± 0.648
2.903AspThr: 2.903 ± 0.51
5.061AspVal: 5.061 ± 0.631
1.414AspTrp: 1.414 ± 0.271
2.605AspTyr: 2.605 ± 0.466
0.0AspXaa: 0.0 ± 0.0
Glu
6.773GluAla: 6.773 ± 0.736
0.744GluCys: 0.744 ± 0.305
3.201GluAsp: 3.201 ± 0.441
4.764GluGlu: 4.764 ± 0.835
2.977GluPhe: 2.977 ± 0.541
4.168GluGly: 4.168 ± 0.676
1.712GluHis: 1.712 ± 0.281
3.945GluIle: 3.945 ± 0.707
3.424GluLys: 3.424 ± 0.446
5.88GluLeu: 5.88 ± 0.618
2.382GluMet: 2.382 ± 0.498
3.052GluAsn: 3.052 ± 0.554
2.159GluPro: 2.159 ± 0.413
3.87GluGln: 3.87 ± 0.771
3.87GluArg: 3.87 ± 0.864
4.689GluSer: 4.689 ± 0.733
3.498GluThr: 3.498 ± 0.895
5.061GluVal: 5.061 ± 0.729
0.968GluTrp: 0.968 ± 0.285
2.456GluTyr: 2.456 ± 0.524
0.0GluXaa: 0.0 ± 0.0
Phe
2.828PheAla: 2.828 ± 0.63
0.447PheCys: 0.447 ± 0.153
3.349PheAsp: 3.349 ± 0.554
2.233PheGlu: 2.233 ± 0.381
1.489PhePhe: 1.489 ± 0.266
2.828PheGly: 2.828 ± 0.396
0.595PheHis: 0.595 ± 0.29
2.828PheIle: 2.828 ± 0.501
2.382PheLys: 2.382 ± 0.42
2.159PheLeu: 2.159 ± 0.375
1.265PheMet: 1.265 ± 0.341
3.349PheAsn: 3.349 ± 0.489
0.893PhePro: 0.893 ± 0.305
0.67PheGln: 0.67 ± 0.241
1.34PheArg: 1.34 ± 0.315
2.605PheSer: 2.605 ± 0.41
2.977PheThr: 2.977 ± 0.533
1.861PheVal: 1.861 ± 0.378
0.223PheTrp: 0.223 ± 0.119
1.563PheTyr: 1.563 ± 0.392
0.0PheXaa: 0.0 ± 0.0
Gly
5.508GlyAla: 5.508 ± 0.73
0.819GlyCys: 0.819 ± 0.303
4.317GlyAsp: 4.317 ± 0.601
5.285GlyGlu: 5.285 ± 0.598
2.903GlyPhe: 2.903 ± 0.37
4.689GlyGly: 4.689 ± 0.701
1.712GlyHis: 1.712 ± 0.437
3.945GlyIle: 3.945 ± 0.544
4.168GlyLys: 4.168 ± 0.599
4.019GlyLeu: 4.019 ± 0.487
2.159GlyMet: 2.159 ± 0.376
2.605GlyAsn: 2.605 ± 0.526
1.638GlyPro: 1.638 ± 0.259
2.828GlyGln: 2.828 ± 0.58
3.349GlyArg: 3.349 ± 0.356
4.54GlySer: 4.54 ± 0.493
4.987GlyThr: 4.987 ± 0.7
5.21GlyVal: 5.21 ± 0.658
1.34GlyTrp: 1.34 ± 0.39
2.307GlyTyr: 2.307 ± 0.5
0.0GlyXaa: 0.0 ± 0.0
His
1.34HisAla: 1.34 ± 0.303
0.372HisCys: 0.372 ± 0.315
1.638HisAsp: 1.638 ± 0.445
1.116HisGlu: 1.116 ± 0.272
1.116HisPhe: 1.116 ± 0.247
1.34HisGly: 1.34 ± 0.393
0.744HisHis: 0.744 ± 0.331
1.191HisIle: 1.191 ± 0.352
1.265HisLys: 1.265 ± 0.358
1.786HisLeu: 1.786 ± 0.329
0.521HisMet: 0.521 ± 0.168
0.67HisAsn: 0.67 ± 0.194
1.116HisPro: 1.116 ± 0.369
1.191HisGln: 1.191 ± 0.28
1.116HisArg: 1.116 ± 0.261
0.67HisSer: 0.67 ± 0.182
1.34HisThr: 1.34 ± 0.431
1.563HisVal: 1.563 ± 0.351
0.447HisTrp: 0.447 ± 0.194
1.042HisTyr: 1.042 ± 0.317
0.0HisXaa: 0.0 ± 0.0
Ile
4.466IleAla: 4.466 ± 0.655
0.595IleCys: 0.595 ± 0.22
4.243IleAsp: 4.243 ± 0.626
5.136IleGlu: 5.136 ± 0.658
1.638IlePhe: 1.638 ± 0.368
4.094IleGly: 4.094 ± 0.44
1.042IleHis: 1.042 ± 0.281
2.68IleIle: 2.68 ± 0.576
2.977IleLys: 2.977 ± 0.426
3.796IleLeu: 3.796 ± 0.512
1.042IleMet: 1.042 ± 0.202
3.275IleAsn: 3.275 ± 0.546
2.307IlePro: 2.307 ± 0.416
2.605IleGln: 2.605 ± 0.452
2.605IleArg: 2.605 ± 0.388
2.977IleSer: 2.977 ± 0.451
3.945IleThr: 3.945 ± 0.669
3.796IleVal: 3.796 ± 0.573
0.67IleTrp: 0.67 ± 0.176
1.935IleTyr: 1.935 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
4.54LysAla: 4.54 ± 0.649
1.265LysCys: 1.265 ± 0.355
3.945LysAsp: 3.945 ± 0.549
3.945LysGlu: 3.945 ± 0.635
1.191LysPhe: 1.191 ± 0.268
3.722LysGly: 3.722 ± 0.532
1.489LysHis: 1.489 ± 0.402
3.722LysIle: 3.722 ± 0.546
4.019LysLys: 4.019 ± 0.843
5.731LysLeu: 5.731 ± 1.02
1.935LysMet: 1.935 ± 0.317
1.861LysAsn: 1.861 ± 0.386
2.977LysPro: 2.977 ± 0.613
2.307LysGln: 2.307 ± 0.417
3.349LysArg: 3.349 ± 0.509
3.722LysSer: 3.722 ± 0.613
2.977LysThr: 2.977 ± 0.463
4.689LysVal: 4.689 ± 0.634
1.191LysTrp: 1.191 ± 0.265
1.786LysTyr: 1.786 ± 0.35
0.0LysXaa: 0.0 ± 0.0
Leu
6.327LeuAla: 6.327 ± 0.77
0.819LeuCys: 0.819 ± 0.321
5.434LeuAsp: 5.434 ± 0.55
5.21LeuGlu: 5.21 ± 0.586
2.605LeuPhe: 2.605 ± 0.487
4.987LeuGly: 4.987 ± 0.614
2.084LeuHis: 2.084 ± 0.42
4.54LeuIle: 4.54 ± 0.584
5.136LeuLys: 5.136 ± 0.634
6.699LeuLeu: 6.699 ± 0.841
2.084LeuMet: 2.084 ± 0.331
4.392LeuAsn: 4.392 ± 0.554
2.828LeuPro: 2.828 ± 0.412
3.201LeuGln: 3.201 ± 0.468
3.498LeuArg: 3.498 ± 0.554
5.88LeuSer: 5.88 ± 0.635
5.359LeuThr: 5.359 ± 0.678
3.796LeuVal: 3.796 ± 0.667
0.968LeuTrp: 0.968 ± 0.23
2.233LeuTyr: 2.233 ± 0.331
0.0LeuXaa: 0.0 ± 0.0
Met
2.084MetAla: 2.084 ± 0.46
0.67MetCys: 0.67 ± 0.266
1.191MetAsp: 1.191 ± 0.32
1.116MetGlu: 1.116 ± 0.343
1.786MetPhe: 1.786 ± 0.44
1.786MetGly: 1.786 ± 0.32
0.298MetHis: 0.298 ± 0.149
1.935MetIle: 1.935 ± 0.421
2.605MetLys: 2.605 ± 0.47
1.935MetLeu: 1.935 ± 0.398
0.968MetMet: 0.968 ± 0.256
1.414MetAsn: 1.414 ± 0.309
0.893MetPro: 0.893 ± 0.214
0.744MetGln: 0.744 ± 0.241
1.861MetArg: 1.861 ± 0.398
2.01MetSer: 2.01 ± 0.323
1.935MetThr: 1.935 ± 0.461
2.159MetVal: 2.159 ± 0.339
0.298MetTrp: 0.298 ± 0.165
0.298MetTyr: 0.298 ± 0.13
0.0MetXaa: 0.0 ± 0.0
Asn
3.573AsnAla: 3.573 ± 0.835
0.67AsnCys: 0.67 ± 0.216
2.605AsnAsp: 2.605 ± 0.375
2.977AsnGlu: 2.977 ± 0.59
0.744AsnPhe: 0.744 ± 0.221
4.094AsnGly: 4.094 ± 0.624
0.968AsnHis: 0.968 ± 0.29
2.828AsnIle: 2.828 ± 0.485
2.828AsnLys: 2.828 ± 0.451
3.647AsnLeu: 3.647 ± 0.537
2.159AsnMet: 2.159 ± 0.461
2.605AsnAsn: 2.605 ± 0.453
1.861AsnPro: 1.861 ± 0.413
1.861AsnGln: 1.861 ± 0.382
1.935AsnArg: 1.935 ± 0.379
3.275AsnSer: 3.275 ± 0.459
3.573AsnThr: 3.573 ± 0.447
2.605AsnVal: 2.605 ± 0.447
0.521AsnTrp: 0.521 ± 0.189
1.191AsnTyr: 1.191 ± 0.273
0.0AsnXaa: 0.0 ± 0.0
Pro
4.317ProAla: 4.317 ± 1.401
0.372ProCys: 0.372 ± 0.22
2.828ProAsp: 2.828 ± 0.52
3.275ProGlu: 3.275 ± 0.589
1.638ProPhe: 1.638 ± 0.308
1.935ProGly: 1.935 ± 0.436
0.893ProHis: 0.893 ± 0.251
1.563ProIle: 1.563 ± 0.33
2.307ProLys: 2.307 ± 0.471
2.456ProLeu: 2.456 ± 0.467
0.595ProMet: 0.595 ± 0.208
1.786ProAsn: 1.786 ± 0.32
1.563ProPro: 1.563 ± 0.514
1.042ProGln: 1.042 ± 0.407
1.861ProArg: 1.861 ± 0.449
2.01ProSer: 2.01 ± 0.398
1.861ProThr: 1.861 ± 0.562
3.052ProVal: 3.052 ± 0.417
0.67ProTrp: 0.67 ± 0.264
1.191ProTyr: 1.191 ± 0.329
0.0ProXaa: 0.0 ± 0.0
Gln
4.317GlnAla: 4.317 ± 0.829
0.298GlnCys: 0.298 ± 0.139
1.712GlnAsp: 1.712 ± 0.323
2.977GlnGlu: 2.977 ± 0.705
1.786GlnPhe: 1.786 ± 0.285
2.68GlnGly: 2.68 ± 0.465
0.893GlnHis: 0.893 ± 0.262
2.605GlnIle: 2.605 ± 0.409
1.638GlnLys: 1.638 ± 0.377
4.094GlnLeu: 4.094 ± 0.424
0.819GlnMet: 0.819 ± 0.243
1.116GlnAsn: 1.116 ± 0.377
1.414GlnPro: 1.414 ± 0.369
2.605GlnGln: 2.605 ± 0.492
3.424GlnArg: 3.424 ± 0.466
2.01GlnSer: 2.01 ± 0.439
2.531GlnThr: 2.531 ± 0.445
2.903GlnVal: 2.903 ± 0.451
0.968GlnTrp: 0.968 ± 0.23
0.893GlnTyr: 0.893 ± 0.237
0.0GlnXaa: 0.0 ± 0.0
Arg
3.126ArgAla: 3.126 ± 0.526
0.744ArgCys: 0.744 ± 0.244
3.349ArgAsp: 3.349 ± 0.517
3.424ArgGlu: 3.424 ± 0.449
1.935ArgPhe: 1.935 ± 0.385
3.796ArgGly: 3.796 ± 0.628
1.265ArgHis: 1.265 ± 0.266
2.754ArgIle: 2.754 ± 0.387
3.201ArgLys: 3.201 ± 0.468
4.392ArgLeu: 4.392 ± 0.522
1.935ArgMet: 1.935 ± 0.474
2.01ArgAsn: 2.01 ± 0.447
1.116ArgPro: 1.116 ± 0.4
2.605ArgGln: 2.605 ± 0.438
2.977ArgArg: 2.977 ± 0.607
2.605ArgSer: 2.605 ± 0.346
2.977ArgThr: 2.977 ± 0.55
2.903ArgVal: 2.903 ± 0.396
1.042ArgTrp: 1.042 ± 0.246
1.861ArgTyr: 1.861 ± 0.419
0.0ArgXaa: 0.0 ± 0.0
Ser
4.243SerAla: 4.243 ± 0.655
0.298SerCys: 0.298 ± 0.154
5.136SerAsp: 5.136 ± 0.706
4.243SerGlu: 4.243 ± 0.502
2.68SerPhe: 2.68 ± 0.347
5.582SerGly: 5.582 ± 0.907
1.414SerHis: 1.414 ± 0.291
3.275SerIle: 3.275 ± 0.481
4.094SerLys: 4.094 ± 0.656
4.466SerLeu: 4.466 ± 0.566
1.563SerMet: 1.563 ± 0.336
3.498SerAsn: 3.498 ± 0.356
1.563SerPro: 1.563 ± 0.382
2.531SerGln: 2.531 ± 0.427
2.456SerArg: 2.456 ± 0.366
3.647SerSer: 3.647 ± 0.641
3.87SerThr: 3.87 ± 0.508
3.87SerVal: 3.87 ± 0.505
0.595SerTrp: 0.595 ± 0.169
2.382SerTyr: 2.382 ± 0.431
0.0SerXaa: 0.0 ± 0.0
Thr
5.657ThrAla: 5.657 ± 1.178
1.042ThrCys: 1.042 ± 0.317
4.838ThrAsp: 4.838 ± 0.658
4.838ThrGlu: 4.838 ± 0.674
2.903ThrPhe: 2.903 ± 0.606
5.582ThrGly: 5.582 ± 0.55
0.893ThrHis: 0.893 ± 0.281
3.647ThrIle: 3.647 ± 0.548
2.977ThrLys: 2.977 ± 0.47
4.019ThrLeu: 4.019 ± 0.595
1.489ThrMet: 1.489 ± 0.395
2.68ThrAsn: 2.68 ± 0.465
2.605ThrPro: 2.605 ± 0.404
2.605ThrGln: 2.605 ± 0.525
2.605ThrArg: 2.605 ± 0.514
3.647ThrSer: 3.647 ± 0.748
3.722ThrThr: 3.722 ± 0.606
4.019ThrVal: 4.019 ± 0.691
0.819ThrTrp: 0.819 ± 0.317
2.307ThrTyr: 2.307 ± 0.541
0.0ThrXaa: 0.0 ± 0.0
Val
4.838ValAla: 4.838 ± 0.672
0.447ValCys: 0.447 ± 0.198
5.731ValAsp: 5.731 ± 0.716
4.392ValGlu: 4.392 ± 0.563
2.456ValPhe: 2.456 ± 0.345
3.126ValGly: 3.126 ± 0.56
1.489ValHis: 1.489 ± 0.242
3.498ValIle: 3.498 ± 0.588
4.764ValLys: 4.764 ± 0.586
4.317ValLeu: 4.317 ± 0.495
2.159ValMet: 2.159 ± 0.322
3.349ValAsn: 3.349 ± 0.447
2.159ValPro: 2.159 ± 0.394
2.531ValGln: 2.531 ± 0.546
4.019ValArg: 4.019 ± 0.601
5.806ValSer: 5.806 ± 0.939
5.061ValThr: 5.061 ± 0.948
5.285ValVal: 5.285 ± 0.98
1.265ValTrp: 1.265 ± 0.306
2.456ValTyr: 2.456 ± 0.359
0.0ValXaa: 0.0 ± 0.0
Trp
0.819TrpAla: 0.819 ± 0.263
0.447TrpCys: 0.447 ± 0.164
0.819TrpAsp: 0.819 ± 0.283
0.968TrpGlu: 0.968 ± 0.281
0.67TrpPhe: 0.67 ± 0.165
1.116TrpGly: 1.116 ± 0.28
0.521TrpHis: 0.521 ± 0.135
1.042TrpIle: 1.042 ± 0.328
0.521TrpLys: 0.521 ± 0.226
2.531TrpLeu: 2.531 ± 0.358
0.447TrpMet: 0.447 ± 0.174
0.819TrpAsn: 0.819 ± 0.25
0.149TrpPro: 0.149 ± 0.127
0.521TrpGln: 0.521 ± 0.207
0.447TrpArg: 0.447 ± 0.195
0.819TrpSer: 0.819 ± 0.227
0.595TrpThr: 0.595 ± 0.207
1.116TrpVal: 1.116 ± 0.268
0.298TrpTrp: 0.298 ± 0.15
0.744TrpTyr: 0.744 ± 0.263
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.233TyrAla: 2.233 ± 0.357
0.521TyrCys: 0.521 ± 0.241
2.382TyrAsp: 2.382 ± 0.303
2.605TyrGlu: 2.605 ± 0.43
1.191TyrPhe: 1.191 ± 0.35
1.638TyrGly: 1.638 ± 0.286
0.893TyrHis: 0.893 ± 0.337
1.786TyrIle: 1.786 ± 0.313
2.307TyrLys: 2.307 ± 0.439
2.977TyrLeu: 2.977 ± 0.511
0.521TyrMet: 0.521 ± 0.2
1.414TyrAsn: 1.414 ± 0.376
1.638TyrPro: 1.638 ± 0.293
1.563TyrGln: 1.563 ± 0.298
1.191TyrArg: 1.191 ± 0.381
1.712TyrSer: 1.712 ± 0.439
2.084TyrThr: 2.084 ± 0.572
2.977TyrVal: 2.977 ± 0.588
0.372TyrTrp: 0.372 ± 0.185
1.935TyrTyr: 1.935 ± 0.426
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (13436 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski