Amino acid dipeptide frequency for Marinomonas phage P12026

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
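A minimal sketch of how such a frequency could be computed is shown below. The function name `dipeptide_permille` and the counting convention (overlapping adjacent pairs, never spanning two proteins, pooled over all proteins) are assumptions made for illustration; the page does not state how the database counts dipeptides. One-letter amino acid codes are used for brevity.

```python
from collections import Counter


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein strings."""
    counts = Counter()
    for seq in sequences:
        # Assumption: overlapping adjacent pairs, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    # Scale each count to per mille of all dipeptides observed.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage proteins): pairs are MA, AA, AK and AA, AG.
freqs = dipeptide_permille(["MAAK", "AAG"])
```

In this toy input, `AA` accounts for two of the five dipeptides, so its frequency is 400‰.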

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all 20 amino acids were equally common, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
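The comparison against the uniform expectation can be sketched as follows. The function `representation` and its labels are hypothetical, and a plain greater-than or less-than comparison is assumed; the page does not say whether any tolerance is applied when colouring cells. The two example values are taken from the table on this page.

```python
AMINO_ACIDS = 20
# Uniform expectation: 1 / 400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / AMINO_ACIDS ** 2


def representation(observed_permille, expected=EXPECTED_PERMILLE):
    """Classify a per mille frequency relative to the expected value."""
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 12.605, "CysCys": 0.202}
labels = {name: representation(value) for name, value in examples.items()}
```

With these inputs, AlaAla is classified as overrepresented and CysCys as underrepresented.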

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.605 ± 2.426
AlaCys: 0.605 ± 0.235
AlaAsp: 4.638 ± 0.82
AlaGlu: 7.563 ± 0.995
AlaPhe: 2.823 ± 0.527
AlaGly: 6.655 ± 0.746
AlaHis: 1.412 ± 0.423
AlaIle: 7.059 ± 0.731
AlaLys: 8.571 ± 0.967
AlaLeu: 6.554 ± 0.718
AlaMet: 4.336 ± 1.012
AlaAsn: 3.731 ± 0.547
AlaPro: 3.025 ± 0.616
AlaGln: 4.437 ± 0.929
AlaArg: 3.933 ± 0.733
AlaSer: 7.26 ± 0.916
AlaThr: 4.739 ± 1.08
AlaVal: 5.344 ± 0.801
AlaTrp: 1.613 ± 0.415
AlaTyr: 2.723 ± 0.6
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.109 ± 0.273
CysCys: 0.202 ± 0.13
CysAsp: 0.807 ± 0.309
CysGlu: 0.303 ± 0.15
CysPhe: 0.202 ± 0.142
CysGly: 0.303 ± 0.152
CysHis: 0.202 ± 0.205
CysIle: 0.605 ± 0.224
CysLys: 0.605 ± 0.222
CysLeu: 1.412 ± 0.417
CysMet: 0.202 ± 0.147
CysAsn: 0.403 ± 0.303
CysPro: 0.403 ± 0.186
CysGln: 0.504 ± 0.225
CysArg: 0.504 ± 0.215
CysSer: 0.706 ± 0.248
CysThr: 0.504 ± 0.224
CysVal: 0.706 ± 0.288
CysTrp: 0.303 ± 0.167
CysTyr: 0.202 ± 0.135
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.059 ± 1.145
AspCys: 0.706 ± 0.25
AspAsp: 3.328 ± 0.753
AspGlu: 4.437 ± 0.629
AspPhe: 2.218 ± 0.317
AspGly: 5.042 ± 0.685
AspHis: 1.311 ± 0.25
AspIle: 3.428 ± 0.458
AspLys: 3.428 ± 0.594
AspLeu: 6.857 ± 0.811
AspMet: 1.412 ± 0.374
AspAsn: 2.218 ± 0.552
AspPro: 1.714 ± 0.404
AspGln: 2.218 ± 0.557
AspArg: 2.723 ± 0.553
AspSer: 3.63 ± 0.733
AspThr: 2.924 ± 0.532
AspVal: 3.832 ± 0.575
AspTrp: 1.21 ± 0.269
AspTyr: 1.513 ± 0.319
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.546 ± 0.828
GluCys: 0.706 ± 0.224
GluAsp: 3.529 ± 0.654
GluGlu: 3.227 ± 0.51
GluPhe: 2.723 ± 0.482
GluGly: 4.638 ± 0.614
GluHis: 1.21 ± 0.335
GluIle: 5.647 ± 0.878
GluLys: 4.033 ± 0.89
GluLeu: 6.252 ± 0.924
GluMet: 2.319 ± 0.522
GluAsn: 2.42 ± 0.589
GluPro: 1.916 ± 0.391
GluGln: 3.227 ± 0.513
GluArg: 4.84 ± 0.632
GluSer: 5.042 ± 0.633
GluThr: 3.025 ± 0.756
GluVal: 5.344 ± 0.719
GluTrp: 1.412 ± 0.379
GluTyr: 2.218 ± 0.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.218 ± 0.367
PheCys: 0.403 ± 0.178
PheAsp: 2.823 ± 0.553
PheGlu: 2.521 ± 0.556
PhePhe: 1.21 ± 0.344
PheGly: 2.42 ± 0.44
PheHis: 0.303 ± 0.139
PheIle: 1.714 ± 0.409
PheLys: 1.714 ± 0.327
PheLeu: 1.714 ± 0.389
PheMet: 0.807 ± 0.319
PheAsn: 1.714 ± 0.472
PhePro: 1.109 ± 0.342
PheGln: 1.008 ± 0.317
PheArg: 1.714 ± 0.369
PheSer: 2.319 ± 0.451
PheThr: 1.21 ± 0.386
PheVal: 1.815 ± 0.419
PheTrp: 0.706 ± 0.284
PheTyr: 1.613 ± 0.425
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.756 ± 1.159
GlyCys: 0.403 ± 0.177
GlyAsp: 3.63 ± 0.583
GlyGlu: 5.445 ± 0.719
GlyPhe: 1.815 ± 0.353
GlyGly: 5.143 ± 0.847
GlyHis: 1.109 ± 0.274
GlyIle: 3.428 ± 0.548
GlyLys: 4.638 ± 0.719
GlyLeu: 5.647 ± 0.623
GlyMet: 2.118 ± 0.453
GlyAsn: 2.924 ± 0.548
GlyPro: 0.605 ± 0.231
GlyGln: 2.319 ± 0.443
GlyArg: 3.328 ± 0.579
GlySer: 4.336 ± 1.027
GlyThr: 4.739 ± 0.716
GlyVal: 5.244 ± 0.728
GlyTrp: 1.21 ± 0.363
GlyTyr: 2.823 ± 0.569
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.412 ± 0.371
HisCys: 0.403 ± 0.194
HisAsp: 0.807 ± 0.263
HisGlu: 1.109 ± 0.288
HisPhe: 0.202 ± 0.138
HisGly: 1.311 ± 0.329
HisHis: 0.202 ± 0.148
HisIle: 1.109 ± 0.429
HisLys: 1.21 ± 0.33
HisLeu: 1.714 ± 0.468
HisMet: 0.202 ± 0.161
HisAsn: 1.21 ± 0.318
HisPro: 0.706 ± 0.298
HisGln: 0.908 ± 0.287
HisArg: 1.311 ± 0.333
HisSer: 0.706 ± 0.267
HisThr: 1.21 ± 0.297
HisVal: 1.008 ± 0.301
HisTrp: 0.303 ± 0.142
HisTyr: 0.504 ± 0.174
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.244 ± 0.796
IleCys: 0.908 ± 0.281
IleAsp: 5.344 ± 0.643
IleGlu: 4.437 ± 0.708
IlePhe: 1.815 ± 0.395
IleGly: 4.739 ± 0.624
IleHis: 1.21 ± 0.319
IleIle: 2.823 ± 0.483
IleLys: 4.437 ± 0.726
IleLeu: 3.933 ± 0.706
IleMet: 1.613 ± 0.319
IleAsn: 3.529 ± 0.504
IlePro: 2.723 ± 0.691
IleGln: 2.017 ± 0.415
IleArg: 3.328 ± 0.731
IleSer: 4.134 ± 0.701
IleThr: 3.933 ± 0.775
IleVal: 2.924 ± 0.728
IleTrp: 1.008 ± 0.305
IleTyr: 2.017 ± 0.516
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.159 ± 1.282
LysCys: 1.109 ± 0.334
LysAsp: 3.227 ± 0.654
LysGlu: 5.344 ± 1.029
LysPhe: 0.706 ± 0.318
LysGly: 4.134 ± 0.609
LysHis: 2.118 ± 0.408
LysIle: 4.538 ± 0.572
LysLys: 5.244 ± 0.943
LysLeu: 4.941 ± 0.783
LysMet: 1.714 ± 0.444
LysAsn: 3.428 ± 0.499
LysPro: 3.63 ± 0.82
LysGln: 2.42 ± 0.486
LysArg: 3.126 ± 0.58
LysSer: 3.529 ± 0.573
LysThr: 4.739 ± 0.69
LysVal: 3.227 ± 0.587
LysTrp: 0.807 ± 0.234
LysTyr: 1.412 ± 0.383
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.764 ± 0.786
LeuCys: 0.403 ± 0.181
LeuAsp: 5.042 ± 0.931
LeuGlu: 5.849 ± 0.879
LeuPhe: 1.815 ± 0.365
LeuGly: 4.739 ± 0.564
LeuHis: 1.21 ± 0.376
LeuIle: 5.445 ± 0.744
LeuLys: 5.244 ± 0.812
LeuLeu: 4.033 ± 0.613
LeuMet: 2.017 ± 0.368
LeuAsn: 4.134 ± 0.526
LeuPro: 2.218 ± 0.469
LeuGln: 3.227 ± 0.483
LeuArg: 4.538 ± 0.64
LeuSer: 6.958 ± 0.877
LeuThr: 5.445 ± 0.945
LeuVal: 4.235 ± 0.707
LeuTrp: 1.412 ± 0.438
LeuTyr: 1.714 ± 0.381
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.823 ± 0.595
MetCys: 0.202 ± 0.145
MetAsp: 1.412 ± 0.302
MetGlu: 0.908 ± 0.323
MetPhe: 1.21 ± 0.321
MetGly: 1.613 ± 0.338
MetHis: 0.403 ± 0.204
MetIle: 2.017 ± 0.478
MetLys: 2.017 ± 0.568
MetLeu: 2.42 ± 0.623
MetMet: 0.202 ± 0.144
MetAsn: 1.714 ± 0.429
MetPro: 1.109 ± 0.362
MetGln: 0.908 ± 0.292
MetArg: 1.815 ± 0.384
MetSer: 3.025 ± 0.544
MetThr: 2.42 ± 0.505
MetVal: 0.908 ± 0.257
MetTrp: 0.303 ± 0.143
MetTyr: 0.908 ± 0.224
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.336 ± 0.62
AsnCys: 0.706 ± 0.208
AsnAsp: 2.924 ± 0.521
AsnGlu: 3.025 ± 0.59
AsnPhe: 1.311 ± 0.375
AsnGly: 2.521 ± 0.427
AsnHis: 1.21 ± 0.339
AsnIle: 2.118 ± 0.493
AsnLys: 3.126 ± 0.801
AsnLeu: 3.731 ± 0.503
AsnMet: 1.311 ± 0.317
AsnAsn: 1.412 ± 0.316
AsnPro: 2.521 ± 0.41
AsnGln: 2.118 ± 0.424
AsnArg: 2.319 ± 0.57
AsnSer: 2.924 ± 0.586
AsnThr: 2.319 ± 0.572
AsnVal: 2.622 ± 0.342
AsnTrp: 0.403 ± 0.208
AsnTyr: 1.815 ± 0.434
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.428 ± 0.503
ProCys: 0.303 ± 0.179
ProAsp: 2.42 ± 0.523
ProGlu: 2.521 ± 0.538
ProPhe: 1.714 ± 0.444
ProGly: 1.008 ± 0.285
ProHis: 0.605 ± 0.266
ProIle: 2.42 ± 0.488
ProLys: 2.118 ± 0.569
ProLeu: 2.118 ± 0.377
ProMet: 1.311 ± 0.32
ProAsn: 1.815 ± 0.435
ProPro: 0.807 ± 0.284
ProGln: 2.017 ± 0.658
ProArg: 1.311 ± 0.325
ProSer: 3.428 ± 0.52
ProThr: 2.218 ± 0.444
ProVal: 2.218 ± 0.318
ProTrp: 0.605 ± 0.217
ProTyr: 1.21 ± 0.295
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.235 ± 0.7
GlnCys: 0.504 ± 0.202
GlnAsp: 2.017 ± 0.437
GlnGlu: 2.823 ± 0.477
GlnPhe: 1.109 ± 0.331
GlnGly: 2.924 ± 0.607
GlnHis: 0.202 ± 0.136
GlnIle: 2.118 ± 0.563
GlnLys: 3.328 ± 0.595
GlnLeu: 3.731 ± 0.481
GlnMet: 1.008 ± 0.296
GlnAsn: 1.21 ± 0.341
GlnPro: 1.311 ± 0.335
GlnGln: 2.118 ± 0.461
GlnArg: 1.916 ± 0.36
GlnSer: 1.916 ± 0.312
GlnThr: 2.017 ± 0.372
GlnVal: 2.118 ± 0.422
GlnTrp: 0.706 ± 0.248
GlnTyr: 1.311 ± 0.334
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.638 ± 0.555
ArgCys: 0.504 ± 0.209
ArgAsp: 3.428 ± 0.541
ArgGlu: 4.437 ± 0.656
ArgPhe: 1.916 ± 0.424
ArgGly: 2.218 ± 0.405
ArgHis: 0.908 ± 0.296
ArgIle: 3.529 ± 0.476
ArgLys: 3.025 ± 0.646
ArgLeu: 4.235 ± 0.917
ArgMet: 1.21 ± 0.309
ArgAsn: 1.916 ± 0.475
ArgPro: 2.521 ± 0.463
ArgGln: 1.513 ± 0.406
ArgArg: 2.118 ± 0.462
ArgSer: 4.134 ± 0.715
ArgThr: 3.126 ± 0.495
ArgVal: 3.227 ± 0.618
ArgTrp: 0.807 ± 0.276
ArgTyr: 0.504 ± 0.165
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.949 ± 0.916
SerCys: 0.202 ± 0.124
SerAsp: 4.739 ± 0.733
SerGlu: 4.739 ± 0.787
SerPhe: 2.723 ± 0.363
SerGly: 5.748 ± 0.655
SerHis: 1.412 ± 0.315
SerIle: 3.63 ± 0.595
SerLys: 3.832 ± 0.507
SerLeu: 6.252 ± 0.588
SerMet: 2.622 ± 0.462
SerAsn: 4.336 ± 0.746
SerPro: 2.723 ± 0.504
SerGln: 2.42 ± 0.457
SerArg: 3.025 ± 0.495
SerSer: 4.235 ± 0.694
SerThr: 4.033 ± 0.709
SerVal: 6.252 ± 0.759
SerTrp: 0.706 ± 0.223
SerTyr: 2.521 ± 0.487
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.454 ± 0.841
ThrCys: 0.504 ± 0.19
ThrAsp: 3.529 ± 0.484
ThrGlu: 3.328 ± 0.844
ThrPhe: 2.218 ± 0.466
ThrGly: 5.244 ± 0.718
ThrHis: 1.008 ± 0.318
ThrIle: 3.933 ± 0.667
ThrLys: 3.529 ± 0.57
ThrLeu: 4.638 ± 0.64
ThrMet: 1.311 ± 0.329
ThrAsn: 2.017 ± 0.573
ThrPro: 3.126 ± 0.695
ThrGln: 2.218 ± 0.495
ThrArg: 1.916 ± 0.414
ThrSer: 2.924 ± 0.419
ThrThr: 2.823 ± 0.644
ThrVal: 4.941 ± 0.583
ThrTrp: 1.109 ± 0.35
ThrTyr: 2.118 ± 0.529
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.655 ± 1.248
ValCys: 0.504 ± 0.197
ValAsp: 4.033 ± 0.561
ValGlu: 4.739 ± 0.716
ValPhe: 2.017 ± 0.501
ValGly: 3.933 ± 0.636
ValHis: 0.403 ± 0.209
ValIle: 3.328 ± 0.581
ValLys: 4.538 ± 0.771
ValLeu: 4.235 ± 0.677
ValMet: 1.613 ± 0.368
ValAsn: 2.218 ± 0.486
ValPro: 2.118 ± 0.408
ValGln: 1.412 ± 0.376
ValArg: 3.227 ± 0.432
ValSer: 6.554 ± 0.842
ValThr: 4.941 ± 0.621
ValVal: 4.437 ± 0.734
ValTrp: 0.303 ± 0.166
ValTyr: 1.916 ± 0.443
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.311 ± 0.352
TrpCys: 0.403 ± 0.182
TrpAsp: 1.412 ± 0.317
TrpGlu: 0.605 ± 0.199
TrpPhe: 0.706 ± 0.237
TrpGly: 1.008 ± 0.279
TrpHis: 0.504 ± 0.212
TrpIle: 1.311 ± 0.338
TrpLys: 0.504 ± 0.205
TrpLeu: 1.513 ± 0.381
TrpMet: 0.202 ± 0.124
TrpAsn: 0.908 ± 0.287
TrpPro: 0.403 ± 0.193
TrpGln: 0.908 ± 0.273
TrpArg: 1.008 ± 0.321
TrpSer: 1.008 ± 0.283
TrpThr: 0.706 ± 0.278
TrpVal: 1.21 ± 0.274
TrpTrp: 0.303 ± 0.214
TrpTyr: 0.101 ± 0.1
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.924 ± 0.491
TyrCys: 0.504 ± 0.202
TyrAsp: 2.118 ± 0.578
TyrGlu: 1.916 ± 0.35
TyrPhe: 0.605 ± 0.235
TyrGly: 2.319 ± 0.561
TyrHis: 0.605 ± 0.2
TyrIle: 1.916 ± 0.392
TyrLys: 1.412 ± 0.348
TyrLeu: 1.513 ± 0.31
TyrMet: 0.706 ± 0.284
TyrAsn: 1.513 ± 0.375
TyrPro: 0.908 ± 0.298
TyrGln: 0.706 ± 0.29
TyrArg: 1.916 ± 0.489
TyrSer: 3.328 ± 0.643
TyrThr: 1.815 ± 0.384
TyrVal: 1.613 ± 0.514
TyrTrp: 0.807 ± 0.248
TyrTyr: 0.807 ± 0.289
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (9918 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
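A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation of the replicates is taken as the error. The names `dipeptide_permille` and `bootstrap_error`, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(sequences):
    """Per mille frequency of each overlapping dipeptide (assumed convention)."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


# Toy proteins in one-letter code (not taken from the phage proteome).
proteins = ["MAAKL", "MKAAGG", "MSTAAV", "MLLKK"]
error = bootstrap_error(proteins, "AA")
```

Resampling proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects variation between proteins.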

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski