Amino acid dipeptide frequency for Vibrio phage martha 12B12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
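As an illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal example, not the procedure used to build this table: the function name `dipeptide_per_mille`, the toy sequences, and the choice to normalize by the total number of counted pairs are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter codes for the 20 standard amino acids; anything else becomes "X" (Xaa).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Overlapping pairs are counted inside each protein; a pair never spans
    two proteins. Normalizing by the total pair count is an assumption.
    """
    counts = Counter()
    for seq in sequences:
        cleaned = [aa if aa in AMINO_ACIDS else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAK", "AAL"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"])  # "AA" accounts for 2 of the 5 counted pairs
print(len(freqs))   # 21 * 21 possible pairs, including Xaa
```

Returning all 441 keys, including those with zero counts, mirrors the layout of the table, where absent dipeptides are still listed.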

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
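To make the unit concrete, the short sketch below converts two values copied from the table into fractions and compares them with the uniform expectation of 1/400. The helper `describe` is hypothetical, and treating 2.5‰ as the threshold between over- and underrepresented dipeptides is an assumption based on the description above.

```python
# Uniform expectation: 1/400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def describe(per_mille):
    """Return (fraction, ratio to the uniform expectation, label) for a table value."""
    fraction = per_mille * 1e-3
    ratio = per_mille / EXPECTED_PER_MILLE
    if per_mille > EXPECTED_PER_MILLE:
        label = "over"
    elif per_mille < EXPECTED_PER_MILLE:
        label = "under"
    else:
        label = "as expected"
    return fraction, ratio, label


# Values copied from the table below: AlaAla and CysCys.
for name, value in [("AlaAla", 8.197), ("CysCys", 0.095)]:
    fraction, ratio, label = describe(value)
    print(f"{name}: fraction={fraction:.6f}, ratio={ratio:.2f}, {label}")
```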

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.197 ± 1.238
AlaCys: 0.477 ± 0.255
AlaAsp: 5.719 ± 0.555
AlaGlu: 7.53 ± 1.011
AlaPhe: 3.908 ± 0.474
AlaGly: 4.67 ± 0.566
AlaHis: 0.667 ± 0.239
AlaIle: 5.528 ± 0.44
AlaLys: 6.958 ± 0.915
AlaLeu: 7.625 ± 0.999
AlaMet: 3.145 ± 0.548
AlaAsn: 4.194 ± 0.795
AlaPro: 2.764 ± 0.504
AlaGln: 2.478 ± 0.538
AlaArg: 2.859 ± 0.477
AlaSer: 5.909 ± 0.89
AlaThr: 4.67 ± 0.931
AlaVal: 5.433 ± 0.689
AlaTrp: 1.048 ± 0.324
AlaTyr: 2.383 ± 0.493
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.477 ± 0.186
CysCys: 0.095 ± 0.111
CysAsp: 0.572 ± 0.224
CysGlu: 0.572 ± 0.196
CysPhe: 0.191 ± 0.139
CysGly: 0.572 ± 0.305
CysHis: 0.095 ± 0.096
CysIle: 0.858 ± 0.315
CysLys: 0.667 ± 0.271
CysLeu: 1.048 ± 0.37
CysMet: 0.191 ± 0.127
CysAsn: 0.0 ± 0.0
CysPro: 0.191 ± 0.14
CysGln: 0.667 ± 0.273
CysArg: 0.762 ± 0.315
CysSer: 0.286 ± 0.158
CysThr: 0.572 ± 0.234
CysVal: 0.858 ± 0.262
CysTrp: 0.095 ± 0.086
CysTyr: 0.381 ± 0.16
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.719 ± 0.875
AspCys: 0.667 ± 0.234
AspAsp: 4.194 ± 0.521
AspGlu: 5.051 ± 0.635
AspPhe: 2.192 ± 0.36
AspGly: 4.003 ± 0.678
AspHis: 0.858 ± 0.286
AspIle: 3.812 ± 0.645
AspLys: 3.145 ± 0.615
AspLeu: 5.719 ± 0.671
AspMet: 1.811 ± 0.397
AspAsn: 2.573 ± 0.43
AspPro: 2.192 ± 0.398
AspGln: 2.097 ± 0.413
AspArg: 1.334 ± 0.421
AspSer: 4.003 ± 0.586
AspThr: 3.526 ± 0.672
AspVal: 5.528 ± 0.616
AspTrp: 1.144 ± 0.317
AspTyr: 1.525 ± 0.428
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.386 ± 0.715
GluCys: 1.048 ± 0.343
GluAsp: 5.719 ± 0.782
GluGlu: 4.289 ± 0.866
GluPhe: 2.192 ± 0.456
GluGly: 4.48 ± 0.679
GluHis: 1.62 ± 0.471
GluIle: 3.908 ± 0.679
GluLys: 4.384 ± 0.682
GluLeu: 7.72 ± 1.064
GluMet: 2.287 ± 0.478
GluAsn: 3.145 ± 0.598
GluPro: 2.383 ± 0.719
GluGln: 4.194 ± 0.757
GluArg: 3.336 ± 0.52
GluSer: 4.003 ± 0.517
GluThr: 4.098 ± 0.487
GluVal: 5.433 ± 0.778
GluTrp: 1.048 ± 0.232
GluTyr: 2.383 ± 0.514
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.955 ± 0.714
PheCys: 0.095 ± 0.083
PheAsp: 2.287 ± 0.393
PheGlu: 2.383 ± 0.593
PhePhe: 0.572 ± 0.182
PheGly: 3.431 ± 0.468
PheHis: 0.667 ± 0.286
PheIle: 1.43 ± 0.323
PheLys: 2.573 ± 0.57
PheLeu: 1.906 ± 0.325
PheMet: 0.858 ± 0.231
PheAsn: 2.669 ± 0.666
PhePro: 1.334 ± 0.319
PheGln: 1.239 ± 0.307
PheArg: 1.239 ± 0.434
PheSer: 2.383 ± 0.543
PheThr: 2.287 ± 0.345
PheVal: 1.906 ± 0.415
PheTrp: 0.477 ± 0.208
PheTyr: 0.667 ± 0.239
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.717 ± 0.648
GlyCys: 0.477 ± 0.213
GlyAsp: 4.766 ± 0.795
GlyGlu: 5.433 ± 0.602
GlyPhe: 2.764 ± 0.476
GlyGly: 5.528 ± 0.937
GlyHis: 1.239 ± 0.376
GlyIle: 5.623 ± 0.535
GlyLys: 3.717 ± 0.505
GlyLeu: 5.433 ± 0.83
GlyMet: 2.287 ± 0.488
GlyAsn: 3.336 ± 0.479
GlyPro: 1.62 ± 0.266
GlyGln: 2.859 ± 0.505
GlyArg: 3.526 ± 0.651
GlySer: 4.67 ± 0.662
GlyThr: 3.145 ± 0.667
GlyVal: 6.291 ± 0.778
GlyTrp: 1.811 ± 0.478
GlyTyr: 2.383 ± 0.405
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.048 ± 0.327
HisCys: 0.477 ± 0.196
HisAsp: 0.858 ± 0.284
HisGlu: 1.239 ± 0.305
HisPhe: 0.191 ± 0.129
HisGly: 1.525 ± 0.481
HisHis: 0.477 ± 0.218
HisIle: 0.858 ± 0.268
HisLys: 1.048 ± 0.295
HisLeu: 0.953 ± 0.337
HisMet: 0.286 ± 0.177
HisAsn: 0.572 ± 0.186
HisPro: 0.762 ± 0.341
HisGln: 0.762 ± 0.303
HisArg: 0.953 ± 0.265
HisSer: 0.477 ± 0.228
HisThr: 1.144 ± 0.302
HisVal: 0.667 ± 0.274
HisTrp: 0.191 ± 0.118
HisTyr: 0.477 ± 0.243
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.1 ± 0.964
IleCys: 0.381 ± 0.221
IleAsp: 4.384 ± 0.657
IleGlu: 5.242 ± 0.765
IlePhe: 0.762 ± 0.303
IleGly: 2.478 ± 0.467
IleHis: 0.858 ± 0.297
IleIle: 1.906 ± 0.382
IleLys: 2.859 ± 0.65
IleLeu: 3.526 ± 0.638
IleMet: 1.048 ± 0.271
IleAsn: 2.573 ± 0.461
IlePro: 3.05 ± 0.525
IleGln: 2.192 ± 0.5
IleArg: 3.336 ± 0.638
IleSer: 4.003 ± 0.65
IleThr: 3.145 ± 0.58
IleVal: 3.241 ± 0.497
IleTrp: 0.667 ± 0.246
IleTyr: 1.716 ± 0.453
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.339 ± 1.12
LysCys: 0.572 ± 0.264
LysAsp: 3.145 ± 0.559
LysGlu: 4.384 ± 0.666
LysPhe: 1.906 ± 0.379
LysGly: 4.575 ± 0.796
LysHis: 0.477 ± 0.231
LysIle: 2.669 ± 0.48
LysLys: 3.908 ± 0.75
LysLeu: 5.242 ± 0.534
LysMet: 1.144 ± 0.324
LysAsn: 2.383 ± 0.62
LysPro: 2.955 ± 0.486
LysGln: 3.241 ± 0.597
LysArg: 4.098 ± 0.701
LysSer: 3.622 ± 0.523
LysThr: 3.812 ± 0.494
LysVal: 4.098 ± 0.489
LysTrp: 1.144 ± 0.279
LysTyr: 1.525 ± 0.449
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.815 ± 0.88
LeuCys: 0.381 ± 0.149
LeuAsp: 5.051 ± 0.578
LeuGlu: 6.958 ± 0.81
LeuPhe: 2.573 ± 0.421
LeuGly: 6.386 ± 0.743
LeuHis: 1.811 ± 0.387
LeuIle: 3.431 ± 0.556
LeuLys: 5.623 ± 0.716
LeuLeu: 7.053 ± 0.845
LeuMet: 2.669 ± 0.482
LeuAsn: 3.812 ± 0.736
LeuPro: 4.575 ± 0.635
LeuGln: 3.05 ± 0.497
LeuArg: 5.719 ± 0.79
LeuSer: 5.814 ± 0.792
LeuThr: 4.48 ± 0.641
LeuVal: 5.719 ± 0.622
LeuTrp: 0.953 ± 0.294
LeuTyr: 2.097 ± 0.444
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.955 ± 0.495
MetCys: 0.191 ± 0.221
MetAsp: 1.525 ± 0.341
MetGlu: 1.716 ± 0.374
MetPhe: 1.144 ± 0.36
MetGly: 1.43 ± 0.41
MetHis: 0.286 ± 0.148
MetIle: 1.239 ± 0.368
MetLys: 1.62 ± 0.357
MetLeu: 1.716 ± 0.313
MetMet: 1.239 ± 0.455
MetAsn: 1.525 ± 0.339
MetPro: 1.716 ± 0.313
MetGln: 0.953 ± 0.267
MetArg: 1.43 ± 0.455
MetSer: 2.573 ± 0.413
MetThr: 2.478 ± 0.45
MetVal: 1.334 ± 0.362
MetTrp: 0.381 ± 0.233
MetTyr: 0.477 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.48 ± 0.731
AsnCys: 0.191 ± 0.127
AsnAsp: 2.287 ± 0.618
AsnGlu: 2.955 ± 0.519
AsnPhe: 1.048 ± 0.282
AsnGly: 4.575 ± 0.783
AsnHis: 0.762 ± 0.357
AsnIle: 1.525 ± 0.348
AsnLys: 2.383 ± 0.524
AsnLeu: 3.526 ± 0.525
AsnMet: 1.048 ± 0.332
AsnAsn: 2.478 ± 0.428
AsnPro: 2.478 ± 0.458
AsnGln: 2.383 ± 0.426
AsnArg: 3.812 ± 0.568
AsnSer: 2.764 ± 0.618
AsnThr: 2.573 ± 0.412
AsnVal: 3.145 ± 0.553
AsnTrp: 0.572 ± 0.207
AsnTyr: 1.334 ± 0.324
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.955 ± 0.598
ProCys: 0.572 ± 0.225
ProAsp: 2.859 ± 0.609
ProGlu: 3.336 ± 0.695
ProPhe: 1.239 ± 0.287
ProGly: 2.669 ± 0.608
ProHis: 0.762 ± 0.29
ProIle: 1.525 ± 0.279
ProLys: 2.764 ± 0.424
ProLeu: 3.908 ± 0.63
ProMet: 0.858 ± 0.254
ProAsn: 2.478 ± 0.497
ProPro: 1.43 ± 0.412
ProGln: 1.716 ± 0.367
ProArg: 1.239 ± 0.337
ProSer: 3.717 ± 0.445
ProThr: 1.906 ± 0.489
ProVal: 3.336 ± 0.624
ProTrp: 0.953 ± 0.224
ProTyr: 0.667 ± 0.225
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.003 ± 0.777
GlnCys: 0.286 ± 0.155
GlnAsp: 2.287 ± 0.409
GlnGlu: 2.764 ± 0.544
GlnPhe: 1.716 ± 0.307
GlnGly: 2.478 ± 0.436
GlnHis: 0.191 ± 0.122
GlnIle: 2.383 ± 0.5
GlnLys: 2.097 ± 0.39
GlnLeu: 4.67 ± 0.619
GlnMet: 1.43 ± 0.433
GlnAsn: 2.097 ± 0.416
GlnPro: 0.953 ± 0.289
GlnGln: 1.811 ± 0.429
GlnArg: 3.145 ± 0.526
GlnSer: 2.669 ± 0.467
GlnThr: 3.145 ± 0.51
GlnVal: 3.526 ± 0.612
GlnTrp: 1.048 ± 0.334
GlnTyr: 1.239 ± 0.325
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.098 ± 0.605
ArgCys: 0.572 ± 0.296
ArgAsp: 2.573 ± 0.517
ArgGlu: 4.194 ± 0.519
ArgPhe: 1.716 ± 0.338
ArgGly: 2.955 ± 0.541
ArgHis: 1.334 ± 0.345
ArgIle: 3.526 ± 0.533
ArgLys: 2.573 ± 0.446
ArgLeu: 4.384 ± 0.451
ArgMet: 1.048 ± 0.309
ArgAsn: 2.955 ± 0.53
ArgPro: 2.287 ± 0.501
ArgGln: 2.955 ± 0.453
ArgArg: 2.764 ± 0.504
ArgSer: 3.05 ± 0.585
ArgThr: 2.955 ± 0.526
ArgVal: 2.669 ± 0.575
ArgTrp: 1.525 ± 0.267
ArgTyr: 2.383 ± 0.468
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.194 ± 0.707
SerCys: 0.667 ± 0.271
SerAsp: 3.336 ± 0.533
SerGlu: 4.766 ± 0.723
SerPhe: 2.573 ± 0.569
SerGly: 4.67 ± 0.82
SerHis: 0.762 ± 0.31
SerIle: 4.766 ± 0.642
SerLys: 4.384 ± 0.68
SerLeu: 5.623 ± 0.944
SerMet: 2.383 ± 0.457
SerAsn: 3.336 ± 0.482
SerPro: 3.145 ± 0.749
SerGln: 3.145 ± 0.516
SerArg: 3.05 ± 0.581
SerSer: 5.337 ± 0.879
SerThr: 3.431 ± 0.505
SerVal: 4.48 ± 0.528
SerTrp: 0.381 ± 0.179
SerTyr: 2.002 ± 0.46
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.766 ± 0.725
ThrCys: 0.477 ± 0.297
ThrAsp: 3.717 ± 0.562
ThrGlu: 2.955 ± 0.678
ThrPhe: 2.002 ± 0.453
ThrGly: 5.623 ± 0.74
ThrHis: 0.667 ± 0.219
ThrIle: 2.192 ± 0.365
ThrLys: 3.717 ± 0.643
ThrLeu: 7.053 ± 0.81
ThrMet: 1.144 ± 0.301
ThrAsn: 2.287 ± 0.527
ThrPro: 2.478 ± 0.549
ThrGln: 2.573 ± 0.64
ThrArg: 2.573 ± 0.524
ThrSer: 2.764 ± 0.63
ThrThr: 2.764 ± 0.53
ThrVal: 4.384 ± 0.615
ThrTrp: 1.048 ± 0.269
ThrTyr: 1.43 ± 0.392
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.623 ± 0.788
ValCys: 0.572 ± 0.274
ValAsp: 3.431 ± 0.673
ValGlu: 5.051 ± 0.732
ValPhe: 2.383 ± 0.487
ValGly: 4.194 ± 0.762
ValHis: 0.762 ± 0.364
ValIle: 4.289 ± 0.538
ValLys: 4.861 ± 0.67
ValLeu: 5.337 ± 0.651
ValMet: 1.62 ± 0.299
ValAsn: 2.287 ± 0.526
ValPro: 3.145 ± 0.483
ValGln: 3.241 ± 0.52
ValArg: 4.098 ± 0.719
ValSer: 6.1 ± 0.872
ValThr: 4.384 ± 0.81
ValVal: 4.289 ± 0.819
ValTrp: 1.239 ± 0.351
ValTyr: 2.287 ± 0.549
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.239 ± 0.332
TrpCys: 0.572 ± 0.273
TrpAsp: 0.953 ± 0.259
TrpGlu: 1.239 ± 0.365
TrpPhe: 1.334 ± 0.356
TrpGly: 1.43 ± 0.332
TrpHis: 0.095 ± 0.085
TrpIle: 0.762 ± 0.293
TrpLys: 1.048 ± 0.27
TrpLeu: 1.239 ± 0.366
TrpMet: 0.477 ± 0.19
TrpAsn: 0.953 ± 0.252
TrpPro: 0.572 ± 0.223
TrpGln: 0.381 ± 0.184
TrpArg: 1.144 ± 0.311
TrpSer: 0.858 ± 0.31
TrpThr: 0.762 ± 0.227
TrpVal: 0.858 ± 0.293
TrpTrp: 0.381 ± 0.204
TrpTyr: 0.286 ± 0.164
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.192 ± 0.518
TyrCys: 0.381 ± 0.196
TyrAsp: 1.43 ± 0.399
TyrGlu: 2.002 ± 0.49
TyrPhe: 0.953 ± 0.32
TyrGly: 2.955 ± 0.497
TyrHis: 0.572 ± 0.323
TyrIle: 1.239 ± 0.357
TyrLys: 1.906 ± 0.466
TyrLeu: 2.287 ± 0.408
TyrMet: 0.858 ± 0.33
TyrAsn: 0.667 ± 0.272
TyrPro: 0.858 ± 0.241
TyrGln: 2.002 ± 0.464
TyrArg: 2.097 ± 0.413
TyrSer: 1.43 ± 0.327
TyrThr: 1.334 ± 0.27
TyrVal: 2.002 ± 0.355
TyrTrp: 0.477 ± 0.237
TyrTyr: 0.858 ± 0.281
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10493 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
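A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only an illustration under assumptions. The function names, the toy sequences, and the use of the standard deviation as the error measure are not taken from the database, whose exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, dipeptide, n_replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(sequences) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome, not the 51 proteins behind the table.
toy = ["MAAKAA", "GGLAAV", "MKKLV", "AAGAA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual pairs keeps residues from the same protein together, which is presumably why the note specifies the protein level.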

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski