Amino acid dipepetide frequency for Vibrio phage ICP2_2011_A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.767AlaAla: 6.767 ± 1.031
0.838AlaCys: 0.838 ± 0.286
4.318AlaAsp: 4.318 ± 0.529
5.542AlaGlu: 5.542 ± 0.699
2.32AlaPhe: 2.32 ± 0.408
5.993AlaGly: 5.993 ± 0.926
1.289AlaHis: 1.289 ± 0.251
3.802AlaIle: 3.802 ± 0.459
4.833AlaLys: 4.833 ± 0.536
5.671AlaLeu: 5.671 ± 0.624
3.158AlaMet: 3.158 ± 0.515
3.351AlaAsn: 3.351 ± 0.469
2.513AlaPro: 2.513 ± 0.436
3.738AlaGln: 3.738 ± 0.734
3.867AlaArg: 3.867 ± 0.54
4.06AlaSer: 4.06 ± 0.586
5.736AlaThr: 5.736 ± 0.63
5.156AlaVal: 5.156 ± 0.587
1.031AlaTrp: 1.031 ± 0.238
3.222AlaTyr: 3.222 ± 0.493
0.0AlaXaa: 0.0 ± 0.0
Cys
0.709CysAla: 0.709 ± 0.215
0.129CysCys: 0.129 ± 0.086
0.516CysAsp: 0.516 ± 0.208
0.967CysGlu: 0.967 ± 0.285
0.193CysPhe: 0.193 ± 0.116
1.031CysGly: 1.031 ± 0.283
0.322CysHis: 0.322 ± 0.16
0.516CysIle: 0.516 ± 0.178
0.902CysLys: 0.902 ± 0.217
0.967CysLeu: 0.967 ± 0.286
0.258CysMet: 0.258 ± 0.156
0.451CysAsn: 0.451 ± 0.186
0.387CysPro: 0.387 ± 0.163
0.322CysGln: 0.322 ± 0.15
0.773CysArg: 0.773 ± 0.245
0.516CysSer: 0.516 ± 0.197
0.516CysThr: 0.516 ± 0.199
0.902CysVal: 0.902 ± 0.224
0.193CysTrp: 0.193 ± 0.112
0.644CysTyr: 0.644 ± 0.181
0.0CysXaa: 0.0 ± 0.0
Asp
5.091AspAla: 5.091 ± 0.768
1.096AspCys: 1.096 ± 0.301
3.351AspAsp: 3.351 ± 0.486
4.382AspGlu: 4.382 ± 0.573
2.384AspPhe: 2.384 ± 0.408
4.833AspGly: 4.833 ± 0.717
1.096AspHis: 1.096 ± 0.309
3.802AspIle: 3.802 ± 0.523
4.576AspLys: 4.576 ± 0.57
4.382AspLeu: 4.382 ± 0.691
1.869AspMet: 1.869 ± 0.321
3.416AspAsn: 3.416 ± 0.496
2.256AspPro: 2.256 ± 0.353
1.16AspGln: 1.16 ± 0.201
2.32AspArg: 2.32 ± 0.489
3.867AspSer: 3.867 ± 0.499
3.609AspThr: 3.609 ± 0.432
4.511AspVal: 4.511 ± 0.404
0.902AspTrp: 0.902 ± 0.273
3.158AspTyr: 3.158 ± 0.382
0.0AspXaa: 0.0 ± 0.0
Glu
5.027GluAla: 5.027 ± 0.64
0.838GluCys: 0.838 ± 0.24
4.125GluAsp: 4.125 ± 0.458
5.607GluGlu: 5.607 ± 0.846
3.48GluPhe: 3.48 ± 0.419
5.542GluGly: 5.542 ± 0.848
1.16GluHis: 1.16 ± 0.26
4.769GluIle: 4.769 ± 0.635
3.544GluLys: 3.544 ± 0.586
5.607GluLeu: 5.607 ± 0.513
2.191GluMet: 2.191 ± 0.327
3.029GluAsn: 3.029 ± 0.492
1.676GluPro: 1.676 ± 0.292
3.093GluGln: 3.093 ± 0.586
3.867GluArg: 3.867 ± 0.714
4.189GluSer: 4.189 ± 0.537
2.771GluThr: 2.771 ± 0.421
4.962GluVal: 4.962 ± 0.567
1.418GluTrp: 1.418 ± 0.284
3.093GluTyr: 3.093 ± 0.463
0.0GluXaa: 0.0 ± 0.0
Phe
3.48PheAla: 3.48 ± 0.506
0.387PheCys: 0.387 ± 0.142
2.513PheAsp: 2.513 ± 0.421
2.384PheGlu: 2.384 ± 0.392
1.353PhePhe: 1.353 ± 0.263
2.836PheGly: 2.836 ± 0.406
0.451PheHis: 0.451 ± 0.157
1.998PheIle: 1.998 ± 0.458
2.771PheLys: 2.771 ± 0.312
2.384PheLeu: 2.384 ± 0.344
0.967PheMet: 0.967 ± 0.267
1.418PheAsn: 1.418 ± 0.268
1.289PhePro: 1.289 ± 0.364
1.998PheGln: 1.998 ± 0.402
2.127PheArg: 2.127 ± 0.322
2.513PheSer: 2.513 ± 0.456
2.32PheThr: 2.32 ± 0.407
2.384PheVal: 2.384 ± 0.418
0.516PheTrp: 0.516 ± 0.2
1.418PheTyr: 1.418 ± 0.32
0.0PheXaa: 0.0 ± 0.0
Gly
5.091GlyAla: 5.091 ± 0.679
1.031GlyCys: 1.031 ± 0.326
4.382GlyAsp: 4.382 ± 0.597
4.511GlyGlu: 4.511 ± 0.618
3.416GlyPhe: 3.416 ± 0.497
6.896GlyGly: 6.896 ± 1.381
0.967GlyHis: 0.967 ± 0.219
4.511GlyIle: 4.511 ± 0.509
4.511GlyLys: 4.511 ± 0.618
4.769GlyLeu: 4.769 ± 0.658
1.804GlyMet: 1.804 ± 0.366
3.802GlyAsn: 3.802 ± 0.797
3.093GlyPro: 3.093 ± 1.65
2.771GlyGln: 2.771 ± 0.594
3.416GlyArg: 3.416 ± 0.412
5.349GlySer: 5.349 ± 0.78
5.22GlyThr: 5.22 ± 0.994
5.542GlyVal: 5.542 ± 1.006
1.031GlyTrp: 1.031 ± 0.28
3.093GlyTyr: 3.093 ± 0.397
0.0GlyXaa: 0.0 ± 0.0
His
0.58HisAla: 0.58 ± 0.206
0.451HisCys: 0.451 ± 0.172
0.967HisAsp: 0.967 ± 0.277
1.031HisGlu: 1.031 ± 0.239
0.709HisPhe: 0.709 ± 0.174
0.838HisGly: 0.838 ± 0.229
0.322HisHis: 0.322 ± 0.13
0.967HisIle: 0.967 ± 0.295
1.482HisLys: 1.482 ± 0.382
1.031HisLeu: 1.031 ± 0.263
0.516HisMet: 0.516 ± 0.172
0.967HisAsn: 0.967 ± 0.276
0.838HisPro: 0.838 ± 0.313
0.516HisGln: 0.516 ± 0.179
0.644HisArg: 0.644 ± 0.208
1.289HisSer: 1.289 ± 0.304
0.967HisThr: 0.967 ± 0.252
1.096HisVal: 1.096 ± 0.274
0.258HisTrp: 0.258 ± 0.137
0.516HisTyr: 0.516 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
4.06IleAla: 4.06 ± 0.62
0.516IleCys: 0.516 ± 0.229
4.06IleAsp: 4.06 ± 0.523
3.673IleGlu: 3.673 ± 0.461
1.482IlePhe: 1.482 ± 0.358
4.382IleGly: 4.382 ± 0.572
0.58IleHis: 0.58 ± 0.222
3.093IleIle: 3.093 ± 0.397
4.382IleLys: 4.382 ± 0.621
3.416IleLeu: 3.416 ± 0.503
1.289IleMet: 1.289 ± 0.231
4.447IleAsn: 4.447 ± 0.704
1.804IlePro: 1.804 ± 0.293
2.707IleGln: 2.707 ± 0.469
3.544IleArg: 3.544 ± 0.431
3.802IleSer: 3.802 ± 0.469
3.351IleThr: 3.351 ± 0.679
3.931IleVal: 3.931 ± 0.49
0.387IleTrp: 0.387 ± 0.153
2.127IleTyr: 2.127 ± 0.38
0.0IleXaa: 0.0 ± 0.0
Lys
5.865LysAla: 5.865 ± 0.762
0.451LysCys: 0.451 ± 0.215
3.802LysAsp: 3.802 ± 0.549
5.607LysGlu: 5.607 ± 0.596
2.707LysPhe: 2.707 ± 0.464
4.962LysGly: 4.962 ± 0.747
0.838LysHis: 0.838 ± 0.207
3.673LysIle: 3.673 ± 0.587
2.771LysLys: 2.771 ± 0.528
4.769LysLeu: 4.769 ± 0.478
1.676LysMet: 1.676 ± 0.369
1.74LysAsn: 1.74 ± 0.418
1.933LysPro: 1.933 ± 0.372
2.513LysGln: 2.513 ± 0.378
3.544LysArg: 3.544 ± 0.543
3.158LysSer: 3.158 ± 0.42
3.48LysThr: 3.48 ± 0.405
5.542LysVal: 5.542 ± 0.542
1.031LysTrp: 1.031 ± 0.266
2.836LysTyr: 2.836 ± 0.455
0.0LysXaa: 0.0 ± 0.0
Leu
6.573LeuAla: 6.573 ± 0.733
0.967LeuCys: 0.967 ± 0.332
5.156LeuAsp: 5.156 ± 0.614
6.509LeuGlu: 6.509 ± 0.731
2.449LeuPhe: 2.449 ± 0.386
4.576LeuGly: 4.576 ± 0.582
1.418LeuHis: 1.418 ± 0.329
4.576LeuIle: 4.576 ± 0.657
4.447LeuLys: 4.447 ± 0.465
5.865LeuLeu: 5.865 ± 0.615
2.771LeuMet: 2.771 ± 0.405
3.48LeuAsn: 3.48 ± 0.495
2.707LeuPro: 2.707 ± 0.415
2.964LeuGln: 2.964 ± 0.378
4.253LeuArg: 4.253 ± 0.534
4.189LeuSer: 4.189 ± 0.405
4.962LeuThr: 4.962 ± 0.545
4.06LeuVal: 4.06 ± 0.511
1.16LeuTrp: 1.16 ± 0.233
2.127LeuTyr: 2.127 ± 0.384
0.0LeuXaa: 0.0 ± 0.0
Met
2.836MetAla: 2.836 ± 0.455
0.387MetCys: 0.387 ± 0.147
1.611MetAsp: 1.611 ± 0.347
2.191MetGlu: 2.191 ± 0.466
0.967MetPhe: 0.967 ± 0.245
2.384MetGly: 2.384 ± 0.342
0.322MetHis: 0.322 ± 0.156
1.096MetIle: 1.096 ± 0.263
2.191MetLys: 2.191 ± 0.348
2.578MetLeu: 2.578 ± 0.42
1.096MetMet: 1.096 ± 0.294
0.967MetAsn: 0.967 ± 0.237
1.096MetPro: 1.096 ± 0.334
0.709MetGln: 0.709 ± 0.221
1.096MetArg: 1.096 ± 0.298
2.127MetSer: 2.127 ± 0.411
2.127MetThr: 2.127 ± 0.443
1.547MetVal: 1.547 ± 0.297
0.387MetTrp: 0.387 ± 0.169
1.096MetTyr: 1.096 ± 0.236
0.0MetXaa: 0.0 ± 0.0
Asn
3.093AsnAla: 3.093 ± 0.535
0.709AsnCys: 0.709 ± 0.322
2.642AsnAsp: 2.642 ± 0.514
2.191AsnGlu: 2.191 ± 0.323
1.547AsnPhe: 1.547 ± 0.309
3.738AsnGly: 3.738 ± 0.818
0.773AsnHis: 0.773 ± 0.258
2.836AsnIle: 2.836 ± 0.427
2.9AsnLys: 2.9 ± 0.356
4.833AsnLeu: 4.833 ± 0.64
1.547AsnMet: 1.547 ± 0.361
2.127AsnAsn: 2.127 ± 0.397
2.9AsnPro: 2.9 ± 0.393
2.127AsnGln: 2.127 ± 0.331
1.804AsnArg: 1.804 ± 0.344
3.287AsnSer: 3.287 ± 0.602
2.964AsnThr: 2.964 ± 0.406
4.189AsnVal: 4.189 ± 0.57
0.709AsnTrp: 0.709 ± 0.229
1.74AsnTyr: 1.74 ± 0.335
0.0AsnXaa: 0.0 ± 0.0
Pro
2.642ProAla: 2.642 ± 0.396
0.193ProCys: 0.193 ± 0.126
2.578ProAsp: 2.578 ± 0.392
4.318ProGlu: 4.318 ± 0.428
1.611ProPhe: 1.611 ± 0.342
0.387ProGly: 0.387 ± 0.151
0.838ProHis: 0.838 ± 0.227
2.191ProIle: 2.191 ± 0.379
2.191ProLys: 2.191 ± 0.416
2.32ProLeu: 2.32 ± 0.309
1.031ProMet: 1.031 ± 0.242
2.642ProAsn: 2.642 ± 0.413
0.644ProPro: 0.644 ± 0.202
2.191ProGln: 2.191 ± 0.816
1.933ProArg: 1.933 ± 0.36
2.32ProSer: 2.32 ± 0.529
3.738ProThr: 3.738 ± 0.73
2.707ProVal: 2.707 ± 0.444
0.451ProTrp: 0.451 ± 0.139
1.16ProTyr: 1.16 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
4.06GlnAla: 4.06 ± 0.702
0.516GlnCys: 0.516 ± 0.181
2.449GlnAsp: 2.449 ± 0.35
2.062GlnGlu: 2.062 ± 0.4
1.804GlnPhe: 1.804 ± 0.388
3.48GlnGly: 3.48 ± 0.961
0.773GlnHis: 0.773 ± 0.212
2.513GlnIle: 2.513 ± 0.537
1.869GlnLys: 1.869 ± 0.307
3.544GlnLeu: 3.544 ± 0.58
1.224GlnMet: 1.224 ± 0.241
1.74GlnAsn: 1.74 ± 0.404
1.418GlnPro: 1.418 ± 0.35
1.869GlnGln: 1.869 ± 0.425
1.804GlnArg: 1.804 ± 0.362
2.127GlnSer: 2.127 ± 0.46
2.384GlnThr: 2.384 ± 0.48
2.642GlnVal: 2.642 ± 0.39
0.516GlnTrp: 0.516 ± 0.191
2.127GlnTyr: 2.127 ± 0.37
0.0GlnXaa: 0.0 ± 0.0
Arg
3.416ArgAla: 3.416 ± 0.495
0.387ArgCys: 0.387 ± 0.174
3.544ArgAsp: 3.544 ± 0.445
4.253ArgGlu: 4.253 ± 0.604
1.547ArgPhe: 1.547 ± 0.31
3.931ArgGly: 3.931 ± 0.537
0.709ArgHis: 0.709 ± 0.167
3.673ArgIle: 3.673 ± 0.542
3.287ArgLys: 3.287 ± 0.524
3.287ArgLeu: 3.287 ± 0.463
1.804ArgMet: 1.804 ± 0.307
2.578ArgAsn: 2.578 ± 0.37
1.418ArgPro: 1.418 ± 0.341
2.513ArgGln: 2.513 ± 0.449
3.029ArgArg: 3.029 ± 0.524
2.449ArgSer: 2.449 ± 0.416
2.449ArgThr: 2.449 ± 0.348
3.158ArgVal: 3.158 ± 0.464
0.967ArgTrp: 0.967 ± 0.209
1.804ArgTyr: 1.804 ± 0.396
0.0ArgXaa: 0.0 ± 0.0
Ser
4.769SerAla: 4.769 ± 0.643
0.58SerCys: 0.58 ± 0.178
4.382SerAsp: 4.382 ± 0.686
3.287SerGlu: 3.287 ± 0.407
2.127SerPhe: 2.127 ± 0.414
5.285SerGly: 5.285 ± 0.706
0.709SerHis: 0.709 ± 0.228
3.029SerIle: 3.029 ± 0.42
4.382SerLys: 4.382 ± 0.565
4.576SerLeu: 4.576 ± 0.467
1.289SerMet: 1.289 ± 0.248
3.029SerAsn: 3.029 ± 0.424
3.351SerPro: 3.351 ± 0.437
3.029SerGln: 3.029 ± 0.451
3.093SerArg: 3.093 ± 0.467
4.318SerSer: 4.318 ± 0.613
3.867SerThr: 3.867 ± 0.555
4.382SerVal: 4.382 ± 0.694
1.031SerTrp: 1.031 ± 0.23
2.127SerTyr: 2.127 ± 0.436
0.0SerXaa: 0.0 ± 0.0
Thr
4.447ThrAla: 4.447 ± 0.483
0.322ThrCys: 0.322 ± 0.142
2.964ThrAsp: 2.964 ± 0.458
3.609ThrGlu: 3.609 ± 0.42
2.191ThrPhe: 2.191 ± 0.418
5.285ThrGly: 5.285 ± 0.742
1.031ThrHis: 1.031 ± 0.287
3.867ThrIle: 3.867 ± 0.539
3.802ThrLys: 3.802 ± 0.474
4.576ThrLeu: 4.576 ± 0.477
1.16ThrMet: 1.16 ± 0.282
3.673ThrAsn: 3.673 ± 0.613
4.06ThrPro: 4.06 ± 0.495
2.256ThrGln: 2.256 ± 0.347
2.642ThrArg: 2.642 ± 0.364
4.318ThrSer: 4.318 ± 0.591
5.027ThrThr: 5.027 ± 0.557
3.802ThrVal: 3.802 ± 0.457
0.773ThrTrp: 0.773 ± 0.232
2.578ThrTyr: 2.578 ± 0.401
0.0ThrXaa: 0.0 ± 0.0
Val
5.027ValAla: 5.027 ± 0.659
0.773ValCys: 0.773 ± 0.231
5.349ValAsp: 5.349 ± 0.703
4.06ValGlu: 4.06 ± 0.503
2.9ValPhe: 2.9 ± 0.463
5.285ValGly: 5.285 ± 0.927
1.418ValHis: 1.418 ± 0.355
3.48ValIle: 3.48 ± 0.398
4.833ValLys: 4.833 ± 0.47
6.058ValLeu: 6.058 ± 0.692
1.74ValMet: 1.74 ± 0.326
2.964ValAsn: 2.964 ± 0.508
2.707ValPro: 2.707 ± 0.431
2.32ValGln: 2.32 ± 0.349
3.544ValArg: 3.544 ± 0.469
4.576ValSer: 4.576 ± 0.683
4.318ValThr: 4.318 ± 0.581
5.027ValVal: 5.027 ± 0.537
1.289ValTrp: 1.289 ± 0.261
2.32ValTyr: 2.32 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
1.224TrpAla: 1.224 ± 0.257
0.129TrpCys: 0.129 ± 0.092
0.709TrpAsp: 0.709 ± 0.173
1.031TrpGlu: 1.031 ± 0.242
1.096TrpPhe: 1.096 ± 0.285
0.644TrpGly: 0.644 ± 0.165
0.258TrpHis: 0.258 ± 0.159
0.387TrpIle: 0.387 ± 0.141
1.353TrpLys: 1.353 ± 0.286
1.353TrpLeu: 1.353 ± 0.337
0.58TrpMet: 0.58 ± 0.179
0.516TrpAsn: 0.516 ± 0.165
0.387TrpPro: 0.387 ± 0.17
1.031TrpGln: 1.031 ± 0.2
0.258TrpArg: 0.258 ± 0.133
1.224TrpSer: 1.224 ± 0.325
0.838TrpThr: 0.838 ± 0.245
0.773TrpVal: 0.773 ± 0.194
0.516TrpTrp: 0.516 ± 0.217
0.58TrpTyr: 0.58 ± 0.199
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.449TyrAla: 2.449 ± 0.324
0.516TyrCys: 0.516 ± 0.23
2.642TyrAsp: 2.642 ± 0.427
2.771TyrGlu: 2.771 ± 0.418
1.224TyrPhe: 1.224 ± 0.316
2.9TyrGly: 2.9 ± 0.321
0.709TyrHis: 0.709 ± 0.187
2.256TyrIle: 2.256 ± 0.474
1.869TyrLys: 1.869 ± 0.385
3.222TyrLeu: 3.222 ± 0.504
0.773TyrMet: 0.773 ± 0.223
2.32TyrAsn: 2.32 ± 0.336
1.611TyrPro: 1.611 ± 0.315
1.096TyrGln: 1.096 ± 0.265
2.642TyrArg: 2.642 ± 0.328
2.964TyrSer: 2.964 ± 0.4
1.74TyrThr: 1.74 ± 0.312
3.738TyrVal: 3.738 ± 0.471
0.322TyrTrp: 0.322 ± 0.171
1.353TyrTyr: 1.353 ± 0.278
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (15518 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski