Amino acid dipeptide frequency for Gordonia phage GMA5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
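As an illustration of what a dipeptide frequency is, here is a minimal sketch in Python. It assumes sequences are given as one-letter-code strings (the table uses three-letter codes) and that pairs are counted as overlapping windows inside each protein, never across protein boundaries. The page does not state how its own counts were made, so the function name and these choices are assumptions, not the database's method.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input for illustration only, not taken from the GMA5 proteome.
freqs = dipeptide_frequencies(["MAAV", "AAK"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The frequencies of all observed dipeptides sum to 1000‰ by construction.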

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
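The expected uniform frequency and the over/under comparison can be sketched as follows. The function names are hypothetical, and the page does not say whether its colouring uses the 400-dipeptide or the 441-dipeptide baseline, so the alphabet size is left as a parameter.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"       # shown in red on the original page
    if observed_per_mille < expected:
        return "under"      # shown in blue on the original page
    return "as expected"


print(expected_per_mille(20))   # 400 possible dipeptides
print(expected_per_mille(21))   # 441 possible dipeptides
print(classify(24.877))         # AlaAla value from the table
print(classify(0.0))            # CysCys value from the table
```

Values between the two baselines, such as 2.361‰ for AlaAsn, are classified differently depending on which alphabet size is assumed.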

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
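As a small worked conversion, the snippet below turns the AlaAla value from the table into a fraction and a percentage. The helper name is an assumption made for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala = 24.877                           # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # roughly 0.024877
percent = fraction * 100                   # roughly 2.4877 %
print(fraction, percent)
```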

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 24.877 ± 3.313
AlaCys: 1.09 ± 0.472
AlaAsp: 9.261 ± 1.314
AlaGlu: 7.627 ± 1.762
AlaPhe: 4.358 ± 0.739
AlaGly: 8.898 ± 1.486
AlaHis: 3.269 ± 0.787
AlaIle: 8.716 ± 1.442
AlaLys: 2.724 ± 0.71
AlaLeu: 8.898 ± 1.463
AlaMet: 3.087 ± 0.481
AlaAsn: 2.361 ± 0.575
AlaPro: 5.084 ± 0.998
AlaGln: 4.721 ± 1.0
AlaArg: 8.535 ± 1.776
AlaSer: 8.535 ± 1.156
AlaThr: 9.261 ± 1.064
AlaVal: 12.893 ± 2.189
AlaTrp: 2.179 ± 0.49
AlaTyr: 3.087 ± 0.667
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.997 ± 0.712
CysCys: 0.0 ± 0.0
CysAsp: 0.363 ± 0.236
CysGlu: 0.182 ± 0.249
CysPhe: 0.0 ± 0.0
CysGly: 0.908 ± 0.572
CysHis: 0.363 ± 0.325
CysIle: 0.0 ± 0.0
CysLys: 0.182 ± 0.159
CysLeu: 0.726 ± 0.356
CysMet: 0.182 ± 0.197
CysAsn: 0.363 ± 0.325
CysPro: 0.545 ± 0.391
CysGln: 0.545 ± 0.288
CysArg: 0.908 ± 0.452
CysSer: 0.545 ± 0.4
CysThr: 0.363 ± 0.268
CysVal: 0.182 ± 0.145
CysTrp: 0.182 ± 0.179
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.171 ± 1.681
AspCys: 0.182 ± 0.174
AspAsp: 4.54 ± 1.099
AspGlu: 3.087 ± 0.691
AspPhe: 1.453 ± 0.406
AspGly: 3.45 ± 0.746
AspHis: 0.908 ± 0.402
AspIle: 2.542 ± 0.931
AspLys: 1.634 ± 0.489
AspLeu: 6.719 ± 0.813
AspMet: 1.816 ± 0.624
AspAsn: 2.179 ± 0.529
AspPro: 4.54 ± 0.975
AspGln: 1.271 ± 0.557
AspArg: 4.903 ± 1.179
AspSer: 3.087 ± 0.816
AspThr: 4.358 ± 0.754
AspVal: 4.177 ± 0.983
AspTrp: 1.453 ± 0.563
AspTyr: 1.997 ± 0.649
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.995 ± 1.02
GluCys: 0.363 ± 0.299
GluAsp: 1.634 ± 0.416
GluGlu: 0.545 ± 0.302
GluPhe: 1.816 ± 0.589
GluGly: 3.995 ± 0.701
GluHis: 1.453 ± 0.521
GluIle: 1.271 ± 0.348
GluLys: 0.726 ± 0.469
GluLeu: 4.177 ± 1.247
GluMet: 0.726 ± 0.301
GluAsn: 1.453 ± 0.66
GluPro: 2.179 ± 0.758
GluGln: 1.271 ± 0.423
GluArg: 2.905 ± 0.839
GluSer: 2.179 ± 0.814
GluThr: 3.45 ± 1.13
GluVal: 4.903 ± 1.168
GluTrp: 0.908 ± 0.414
GluTyr: 0.908 ± 0.471
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.724 ± 0.92
PheCys: 0.182 ± 0.195
PheAsp: 2.542 ± 0.542
PheGlu: 1.453 ± 0.391
PhePhe: 0.363 ± 0.242
PheGly: 3.087 ± 0.769
PheHis: 0.726 ± 0.338
PheIle: 0.908 ± 0.457
PheLys: 0.545 ± 0.259
PheLeu: 1.453 ± 0.464
PheMet: 0.908 ± 0.456
PheAsn: 0.545 ± 0.33
PhePro: 0.908 ± 0.462
PheGln: 0.726 ± 0.475
PheArg: 1.634 ± 0.457
PheSer: 1.997 ± 0.803
PheThr: 1.997 ± 0.683
PheVal: 1.997 ± 0.855
PheTrp: 0.908 ± 0.399
PheTyr: 0.726 ± 0.376
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.535 ± 1.4
GlyCys: 0.545 ± 0.301
GlyAsp: 3.45 ± 0.768
GlyGlu: 3.632 ± 1.016
GlyPhe: 2.724 ± 0.732
GlyGly: 6.356 ± 0.929
GlyHis: 2.179 ± 0.584
GlyIle: 4.177 ± 1.537
GlyLys: 2.179 ± 0.812
GlyLeu: 6.9 ± 0.904
GlyMet: 1.816 ± 0.52
GlyAsn: 1.634 ± 0.454
GlyPro: 4.358 ± 1.017
GlyGln: 2.179 ± 0.561
GlyArg: 6.174 ± 1.265
GlySer: 4.177 ± 0.998
GlyThr: 7.627 ± 1.214
GlyVal: 6.9 ± 1.127
GlyTrp: 2.179 ± 0.653
GlyTyr: 1.997 ± 0.569
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.997 ± 0.668
HisCys: 0.182 ± 0.195
HisAsp: 1.09 ± 0.527
HisGlu: 0.545 ± 0.269
HisPhe: 0.908 ± 0.421
HisGly: 0.908 ± 0.451
HisHis: 1.09 ± 0.704
HisIle: 0.545 ± 0.397
HisLys: 0.545 ± 0.432
HisLeu: 3.45 ± 1.153
HisMet: 0.545 ± 0.259
HisAsn: 0.0 ± 0.0
HisPro: 2.179 ± 0.682
HisGln: 0.545 ± 0.291
HisArg: 3.269 ± 0.98
HisSer: 1.09 ± 0.5
HisThr: 1.816 ± 0.646
HisVal: 0.908 ± 0.529
HisTrp: 0.182 ± 0.249
HisTyr: 0.363 ± 0.289
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.082 ± 1.552
IleCys: 0.726 ± 0.455
IleAsp: 4.903 ± 0.884
IleGlu: 3.269 ± 0.851
IlePhe: 0.726 ± 0.373
IleGly: 3.632 ± 0.943
IleHis: 0.0 ± 0.0
IleIle: 2.179 ± 0.964
IleLys: 1.453 ± 0.828
IleLeu: 3.269 ± 0.714
IleMet: 0.726 ± 0.393
IleAsn: 0.726 ± 0.267
IlePro: 4.177 ± 1.062
IleGln: 1.453 ± 0.735
IleArg: 3.45 ± 0.779
IleSer: 1.997 ± 0.897
IleThr: 6.537 ± 1.727
IleVal: 3.813 ± 0.893
IleTrp: 1.634 ± 0.421
IleTyr: 0.908 ± 0.476
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.632 ± 1.033
LysCys: 0.182 ± 0.179
LysAsp: 0.908 ± 0.398
LysGlu: 0.545 ± 0.426
LysPhe: 0.363 ± 0.198
LysGly: 1.09 ± 0.383
LysHis: 0.726 ± 0.28
LysIle: 1.271 ± 0.445
LysLys: 0.545 ± 0.354
LysLeu: 2.179 ± 0.477
LysMet: 0.182 ± 0.154
LysAsn: 0.908 ± 0.324
LysPro: 1.816 ± 0.493
LysGln: 0.182 ± 0.179
LysArg: 2.361 ± 0.793
LysSer: 0.726 ± 0.445
LysThr: 1.997 ± 0.762
LysVal: 2.179 ± 0.8
LysTrp: 0.726 ± 0.408
LysTyr: 0.363 ± 0.352
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.077 ± 1.339
LeuCys: 0.908 ± 0.415
LeuAsp: 5.992 ± 1.221
LeuGlu: 1.816 ± 0.621
LeuPhe: 1.09 ± 0.416
LeuGly: 9.261 ± 2.393
LeuHis: 1.816 ± 0.642
LeuIle: 4.358 ± 0.721
LeuLys: 1.634 ± 0.536
LeuLeu: 5.448 ± 0.777
LeuMet: 1.634 ± 0.668
LeuAsn: 3.087 ± 0.856
LeuPro: 3.632 ± 0.9
LeuGln: 0.726 ± 0.38
LeuArg: 5.084 ± 1.746
LeuSer: 4.903 ± 1.001
LeuThr: 5.629 ± 1.06
LeuVal: 5.811 ± 0.885
LeuTrp: 1.997 ± 0.634
LeuTyr: 0.726 ± 0.432
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.997 ± 0.522
MetCys: 0.545 ± 0.285
MetAsp: 1.09 ± 0.546
MetGlu: 0.182 ± 0.211
MetPhe: 1.09 ± 0.336
MetGly: 0.908 ± 0.406
MetHis: 0.545 ± 0.311
MetIle: 1.09 ± 0.441
MetLys: 0.726 ± 0.353
MetLeu: 1.997 ± 0.681
MetMet: 0.182 ± 0.245
MetAsn: 1.271 ± 0.533
MetPro: 1.271 ± 0.502
MetGln: 0.545 ± 0.278
MetArg: 1.271 ± 0.498
MetSer: 2.361 ± 0.54
MetThr: 2.905 ± 0.458
MetVal: 1.09 ± 0.379
MetTrp: 0.182 ± 0.145
MetTyr: 0.182 ± 0.159
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.361 ± 0.648
AsnCys: 0.182 ± 0.195
AsnAsp: 1.634 ± 0.587
AsnGlu: 0.182 ± 0.154
AsnPhe: 0.0 ± 0.0
AsnGly: 4.903 ± 1.07
AsnHis: 0.363 ± 0.244
AsnIle: 0.908 ± 0.448
AsnLys: 0.726 ± 0.361
AsnLeu: 0.726 ± 0.406
AsnMet: 0.363 ± 0.3
AsnAsn: 0.908 ± 0.375
AsnPro: 2.724 ± 0.828
AsnGln: 0.908 ± 0.431
AsnArg: 2.179 ± 0.605
AsnSer: 1.997 ± 0.692
AsnThr: 1.09 ± 0.476
AsnVal: 2.542 ± 0.7
AsnTrp: 0.182 ± 0.222
AsnTyr: 0.545 ± 0.394
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 12.893 ± 1.882
ProCys: 0.182 ± 0.208
ProAsp: 3.45 ± 0.679
ProGlu: 1.634 ± 0.664
ProPhe: 1.634 ± 0.55
ProGly: 5.266 ± 0.948
ProHis: 0.908 ± 0.416
ProIle: 3.087 ± 0.715
ProLys: 1.09 ± 0.501
ProLeu: 4.903 ± 1.204
ProMet: 1.271 ± 0.455
ProAsn: 1.634 ± 0.667
ProPro: 3.087 ± 0.642
ProGln: 1.816 ± 0.549
ProArg: 2.724 ± 0.803
ProSer: 3.269 ± 0.75
ProThr: 3.813 ± 0.86
ProVal: 4.358 ± 0.788
ProTrp: 1.453 ± 0.393
ProTyr: 1.09 ± 0.445
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.087 ± 0.654
GlnCys: 0.363 ± 0.259
GlnAsp: 0.726 ± 0.349
GlnGlu: 1.634 ± 0.413
GlnPhe: 0.908 ± 0.346
GlnGly: 1.453 ± 0.377
GlnHis: 1.09 ± 0.464
GlnIle: 1.453 ± 0.526
GlnLys: 0.0 ± 0.0
GlnLeu: 4.177 ± 0.976
GlnMet: 1.271 ± 0.447
GlnAsn: 0.182 ± 0.159
GlnPro: 0.726 ± 0.376
GlnGln: 0.726 ± 0.354
GlnArg: 1.634 ± 0.541
GlnSer: 1.816 ± 0.463
GlnThr: 2.361 ± 0.599
GlnVal: 2.724 ± 0.66
GlnTrp: 0.363 ± 0.255
GlnTyr: 0.182 ± 0.195
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.261 ± 2.146
ArgCys: 0.726 ± 0.371
ArgAsp: 5.084 ± 1.327
ArgGlu: 2.905 ± 1.039
ArgPhe: 1.816 ± 0.611
ArgGly: 4.903 ± 1.12
ArgHis: 1.816 ± 0.659
ArgIle: 3.45 ± 0.889
ArgLys: 1.09 ± 0.46
ArgLeu: 5.448 ± 0.919
ArgMet: 2.179 ± 0.499
ArgAsn: 1.271 ± 0.517
ArgPro: 5.266 ± 1.038
ArgGln: 2.179 ± 0.746
ArgArg: 6.537 ± 1.554
ArgSer: 3.813 ± 0.958
ArgThr: 5.266 ± 1.171
ArgVal: 6.174 ± 1.169
ArgTrp: 1.453 ± 0.489
ArgTyr: 1.997 ± 0.72
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.992 ± 1.083
SerCys: 0.726 ± 0.424
SerAsp: 3.087 ± 0.771
SerGlu: 1.09 ± 0.433
SerPhe: 1.634 ± 0.496
SerGly: 4.903 ± 0.94
SerHis: 1.634 ± 0.659
SerIle: 4.721 ± 0.968
SerLys: 2.361 ± 0.662
SerLeu: 3.813 ± 0.619
SerMet: 1.634 ± 0.622
SerAsn: 1.453 ± 0.515
SerPro: 3.632 ± 0.855
SerGln: 2.179 ± 0.717
SerArg: 4.721 ± 1.082
SerSer: 3.632 ± 0.814
SerThr: 4.54 ± 1.073
SerVal: 4.903 ± 0.973
SerTrp: 1.634 ± 0.617
SerTyr: 1.816 ± 0.674
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 11.803 ± 1.423
ThrCys: 0.182 ± 0.169
ThrAsp: 3.813 ± 1.014
ThrGlu: 2.905 ± 0.834
ThrPhe: 2.361 ± 0.761
ThrGly: 5.811 ± 0.959
ThrHis: 0.363 ± 0.315
ThrIle: 3.45 ± 1.009
ThrLys: 1.816 ± 0.548
ThrLeu: 5.811 ± 1.224
ThrMet: 0.908 ± 0.422
ThrAsn: 1.634 ± 0.548
ThrPro: 6.174 ± 1.051
ThrGln: 1.634 ± 0.385
ThrArg: 4.903 ± 1.054
ThrSer: 4.721 ± 1.035
ThrThr: 6.537 ± 1.183
ThrVal: 8.353 ± 1.411
ThrTrp: 1.816 ± 0.708
ThrTyr: 1.453 ± 0.641
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.348 ± 1.914
ValCys: 1.09 ± 0.507
ValAsp: 6.174 ± 0.91
ValGlu: 5.448 ± 0.94
ValPhe: 1.816 ± 0.536
ValGly: 5.811 ± 1.137
ValHis: 1.634 ± 0.556
ValIle: 6.174 ± 1.357
ValLys: 1.997 ± 0.511
ValLeu: 3.995 ± 1.044
ValMet: 1.09 ± 0.395
ValAsn: 1.997 ± 0.728
ValPro: 4.903 ± 0.778
ValGln: 2.361 ± 0.767
ValArg: 5.629 ± 1.364
ValSer: 5.266 ± 1.195
ValThr: 4.721 ± 0.909
ValVal: 7.082 ± 1.188
ValTrp: 1.997 ± 0.919
ValTyr: 1.453 ± 0.556
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.269 ± 0.75
TrpCys: 0.0 ± 0.0
TrpAsp: 1.453 ± 0.609
TrpGlu: 0.908 ± 0.325
TrpPhe: 0.908 ± 0.407
TrpGly: 1.997 ± 0.772
TrpHis: 0.363 ± 0.258
TrpIle: 0.908 ± 0.323
TrpLys: 0.726 ± 0.325
TrpLeu: 1.634 ± 0.51
TrpMet: 0.363 ± 0.316
TrpAsn: 1.634 ± 1.044
TrpPro: 0.908 ± 0.368
TrpGln: 0.363 ± 0.253
TrpArg: 1.634 ± 0.496
TrpSer: 2.542 ± 0.577
TrpThr: 1.09 ± 0.53
TrpVal: 0.726 ± 0.323
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.545 ± 0.296
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.361 ± 0.911
TyrCys: 0.182 ± 0.174
TyrAsp: 1.271 ± 0.594
TyrGlu: 1.271 ± 0.506
TyrPhe: 0.363 ± 0.338
TyrGly: 2.179 ± 0.703
TyrHis: 0.908 ± 0.368
TyrIle: 1.453 ± 0.484
TyrLys: 0.182 ± 0.195
TyrLeu: 0.908 ± 0.346
TyrMet: 0.545 ± 0.332
TyrAsn: 0.182 ± 0.191
TyrPro: 1.453 ± 0.489
TyrGln: 0.545 ± 0.26
TyrArg: 2.179 ± 0.935
TyrSer: 1.453 ± 0.562
TyrThr: 1.09 ± 0.347
TyrVal: 1.271 ± 0.505
TyrTrp: 0.545 ± 0.314
TyrTyr: 0.545 ± 0.297
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (5508 amino acids)

Note: The error has been estimated by bootstrapping (100 iterations) at the protein level.
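A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This sketch assumes that the error is the sample standard deviation over 100 resamples and that pairs are counted within each protein. The page states neither detail, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences for illustration only, not the GMA5 proteome.
toy = ["MAAVK", "AAKLA", "MKVLT", "GAAAG"]
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.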

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski