Amino acid dipepetide frequency for Geobacillus phage GBK2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.715AlaAla: 7.715 ± 1.299
0.406AlaCys: 0.406 ± 0.189
2.924AlaAsp: 2.924 ± 0.459
5.36AlaGlu: 5.36 ± 0.646
3.411AlaPhe: 3.411 ± 0.619
4.629AlaGly: 4.629 ± 0.586
1.381AlaHis: 1.381 ± 0.287
6.416AlaIle: 6.416 ± 0.717
5.198AlaLys: 5.198 ± 0.829
7.066AlaLeu: 7.066 ± 0.871
2.193AlaMet: 2.193 ± 0.375
4.223AlaAsn: 4.223 ± 0.475
2.518AlaPro: 2.518 ± 0.406
3.086AlaGln: 3.086 ± 0.458
3.655AlaArg: 3.655 ± 0.605
3.33AlaSer: 3.33 ± 0.586
2.924AlaThr: 2.924 ± 0.422
5.035AlaVal: 5.035 ± 0.666
1.706AlaTrp: 1.706 ± 0.338
3.167AlaTyr: 3.167 ± 0.419
0.0AlaXaa: 0.0 ± 0.0
Cys
0.406CysAla: 0.406 ± 0.205
0.162CysCys: 0.162 ± 0.092
0.487CysAsp: 0.487 ± 0.193
0.569CysGlu: 0.569 ± 0.231
0.244CysPhe: 0.244 ± 0.124
1.381CysGly: 1.381 ± 0.396
0.081CysHis: 0.081 ± 0.08
1.137CysIle: 1.137 ± 0.42
0.975CysLys: 0.975 ± 0.299
0.244CysLeu: 0.244 ± 0.145
0.162CysMet: 0.162 ± 0.112
0.812CysAsn: 0.812 ± 0.234
0.65CysPro: 0.65 ± 0.243
0.244CysGln: 0.244 ± 0.139
0.731CysArg: 0.731 ± 0.203
0.406CysSer: 0.406 ± 0.164
0.325CysThr: 0.325 ± 0.153
0.325CysVal: 0.325 ± 0.146
0.0CysTrp: 0.0 ± 0.0
0.244CysTyr: 0.244 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
4.71AspAla: 4.71 ± 0.407
0.812AspCys: 0.812 ± 0.245
4.386AspAsp: 4.386 ± 0.751
5.279AspGlu: 5.279 ± 0.733
2.924AspPhe: 2.924 ± 0.508
3.167AspGly: 3.167 ± 0.521
0.162AspHis: 0.162 ± 0.118
4.223AspIle: 4.223 ± 0.537
3.655AspLys: 3.655 ± 0.508
5.198AspLeu: 5.198 ± 0.634
1.543AspMet: 1.543 ± 0.322
2.03AspAsn: 2.03 ± 0.416
1.462AspPro: 1.462 ± 0.34
1.299AspGln: 1.299 ± 0.305
1.949AspArg: 1.949 ± 0.397
1.624AspSer: 1.624 ± 0.379
2.518AspThr: 2.518 ± 0.376
3.655AspVal: 3.655 ± 0.517
0.731AspTrp: 0.731 ± 0.273
2.599AspTyr: 2.599 ± 0.59
0.0AspXaa: 0.0 ± 0.0
Glu
6.01GluAla: 6.01 ± 0.82
0.406GluCys: 0.406 ± 0.2
2.843GluAsp: 2.843 ± 0.449
5.117GluGlu: 5.117 ± 0.848
3.573GluPhe: 3.573 ± 0.497
4.71GluGly: 4.71 ± 0.633
0.975GluHis: 0.975 ± 0.318
5.441GluIle: 5.441 ± 0.595
7.309GluLys: 7.309 ± 0.89
7.878GluLeu: 7.878 ± 0.763
3.411GluMet: 3.411 ± 0.629
5.198GluAsn: 5.198 ± 0.743
2.68GluPro: 2.68 ± 0.391
3.898GluGln: 3.898 ± 0.506
5.198GluArg: 5.198 ± 0.746
1.868GluSer: 1.868 ± 0.372
3.817GluThr: 3.817 ± 0.576
5.117GluVal: 5.117 ± 0.726
1.218GluTrp: 1.218 ± 0.321
2.355GluTyr: 2.355 ± 0.504
0.0GluXaa: 0.0 ± 0.0
Phe
2.924PheAla: 2.924 ± 0.379
0.487PheCys: 0.487 ± 0.246
2.03PheAsp: 2.03 ± 0.379
2.355PheGlu: 2.355 ± 0.399
1.381PhePhe: 1.381 ± 0.355
2.436PheGly: 2.436 ± 0.439
0.65PheHis: 0.65 ± 0.199
3.573PheIle: 3.573 ± 0.437
3.005PheLys: 3.005 ± 0.37
2.761PheLeu: 2.761 ± 0.441
1.624PheMet: 1.624 ± 0.318
2.68PheAsn: 2.68 ± 0.387
1.218PhePro: 1.218 ± 0.409
1.056PheGln: 1.056 ± 0.349
1.787PheArg: 1.787 ± 0.399
3.086PheSer: 3.086 ± 0.44
1.949PheThr: 1.949 ± 0.442
2.03PheVal: 2.03 ± 0.386
0.812PheTrp: 0.812 ± 0.2
2.518PheTyr: 2.518 ± 0.538
0.0PheXaa: 0.0 ± 0.0
Gly
4.71GlyAla: 4.71 ± 0.662
0.731GlyCys: 0.731 ± 0.23
2.843GlyAsp: 2.843 ± 0.554
4.629GlyGlu: 4.629 ± 0.619
3.411GlyPhe: 3.411 ± 0.591
4.223GlyGly: 4.223 ± 0.455
1.299GlyHis: 1.299 ± 0.394
5.279GlyIle: 5.279 ± 0.782
5.523GlyLys: 5.523 ± 0.777
5.36GlyLeu: 5.36 ± 0.848
1.299GlyMet: 1.299 ± 0.256
3.167GlyAsn: 3.167 ± 0.478
2.924GlyPro: 2.924 ± 0.544
3.411GlyGln: 3.411 ± 0.418
3.33GlyArg: 3.33 ± 0.51
2.355GlySer: 2.355 ± 0.367
4.386GlyThr: 4.386 ± 0.846
4.386GlyVal: 4.386 ± 0.616
0.65GlyTrp: 0.65 ± 0.303
4.061GlyTyr: 4.061 ± 0.656
0.0GlyXaa: 0.0 ± 0.0
His
1.462HisAla: 1.462 ± 0.444
0.325HisCys: 0.325 ± 0.157
1.218HisAsp: 1.218 ± 0.242
1.462HisGlu: 1.462 ± 0.293
0.569HisPhe: 0.569 ± 0.176
0.731HisGly: 0.731 ± 0.247
0.081HisHis: 0.081 ± 0.073
1.218HisIle: 1.218 ± 0.312
0.975HisLys: 0.975 ± 0.274
1.056HisLeu: 1.056 ± 0.263
0.487HisMet: 0.487 ± 0.234
0.812HisAsn: 0.812 ± 0.261
0.731HisPro: 0.731 ± 0.299
0.731HisGln: 0.731 ± 0.23
1.056HisArg: 1.056 ± 0.277
0.731HisSer: 0.731 ± 0.277
0.487HisThr: 0.487 ± 0.208
0.65HisVal: 0.65 ± 0.245
0.244HisTrp: 0.244 ± 0.168
0.569HisTyr: 0.569 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
6.497IleAla: 6.497 ± 0.995
1.218IleCys: 1.218 ± 0.384
4.71IleAsp: 4.71 ± 0.453
7.878IleGlu: 7.878 ± 0.77
2.355IlePhe: 2.355 ± 0.437
4.71IleGly: 4.71 ± 0.527
0.731IleHis: 0.731 ± 0.21
3.817IleIle: 3.817 ± 0.637
7.066IleLys: 7.066 ± 0.772
4.792IleLeu: 4.792 ± 0.545
1.787IleMet: 1.787 ± 0.4
4.061IleAsn: 4.061 ± 0.558
2.68IlePro: 2.68 ± 0.639
2.112IleGln: 2.112 ± 0.375
3.573IleArg: 3.573 ± 0.424
1.949IleSer: 1.949 ± 0.348
4.304IleThr: 4.304 ± 0.553
4.142IleVal: 4.142 ± 0.466
0.893IleTrp: 0.893 ± 0.241
2.436IleTyr: 2.436 ± 0.473
0.0IleXaa: 0.0 ± 0.0
Lys
6.903LysAla: 6.903 ± 0.959
0.569LysCys: 0.569 ± 0.222
4.467LysAsp: 4.467 ± 0.573
6.741LysGlu: 6.741 ± 0.967
3.005LysPhe: 3.005 ± 0.527
4.71LysGly: 4.71 ± 0.641
1.706LysHis: 1.706 ± 0.393
5.847LysIle: 5.847 ± 0.607
6.984LysLys: 6.984 ± 0.772
5.279LysLeu: 5.279 ± 0.55
3.167LysMet: 3.167 ± 0.549
4.304LysAsn: 4.304 ± 0.559
3.005LysPro: 3.005 ± 0.649
3.492LysGln: 3.492 ± 0.417
5.198LysArg: 5.198 ± 0.554
3.249LysSer: 3.249 ± 0.526
4.873LysThr: 4.873 ± 0.681
4.548LysVal: 4.548 ± 0.518
1.056LysTrp: 1.056 ± 0.296
3.086LysTyr: 3.086 ± 0.497
0.0LysXaa: 0.0 ± 0.0
Leu
4.467LeuAla: 4.467 ± 0.535
0.731LeuCys: 0.731 ± 0.212
3.98LeuAsp: 3.98 ± 0.578
6.01LeuGlu: 6.01 ± 0.711
1.706LeuPhe: 1.706 ± 0.264
5.279LeuGly: 5.279 ± 0.77
0.731LeuHis: 0.731 ± 0.202
5.117LeuIle: 5.117 ± 0.597
7.797LeuLys: 7.797 ± 0.783
6.01LeuLeu: 6.01 ± 0.812
3.005LeuMet: 3.005 ± 0.439
4.71LeuAsn: 4.71 ± 0.57
2.599LeuPro: 2.599 ± 0.424
4.304LeuGln: 4.304 ± 0.596
3.736LeuArg: 3.736 ± 0.471
4.548LeuSer: 4.548 ± 0.746
4.223LeuThr: 4.223 ± 0.535
4.142LeuVal: 4.142 ± 0.63
1.543LeuTrp: 1.543 ± 0.29
3.167LeuTyr: 3.167 ± 0.637
0.0LeuXaa: 0.0 ± 0.0
Met
3.005MetAla: 3.005 ± 0.589
0.081MetCys: 0.081 ± 0.079
2.68MetAsp: 2.68 ± 0.558
2.761MetGlu: 2.761 ± 0.54
0.975MetPhe: 0.975 ± 0.272
2.112MetGly: 2.112 ± 0.394
0.406MetHis: 0.406 ± 0.178
1.787MetIle: 1.787 ± 0.315
1.868MetLys: 1.868 ± 0.346
2.112MetLeu: 2.112 ± 0.516
1.056MetMet: 1.056 ± 0.316
1.462MetAsn: 1.462 ± 0.261
1.381MetPro: 1.381 ± 0.324
1.218MetGln: 1.218 ± 0.313
1.949MetArg: 1.949 ± 0.424
2.03MetSer: 2.03 ± 0.407
1.381MetThr: 1.381 ± 0.282
1.706MetVal: 1.706 ± 0.357
0.081MetTrp: 0.081 ± 0.083
0.975MetTyr: 0.975 ± 0.333
0.0MetXaa: 0.0 ± 0.0
Asn
4.792AsnAla: 4.792 ± 0.75
0.893AsnCys: 0.893 ± 0.257
2.599AsnAsp: 2.599 ± 0.473
4.954AsnGlu: 4.954 ± 0.76
2.193AsnPhe: 2.193 ± 0.31
6.01AsnGly: 6.01 ± 0.759
0.975AsnHis: 0.975 ± 0.297
2.761AsnIle: 2.761 ± 0.487
4.629AsnLys: 4.629 ± 0.716
4.142AsnLeu: 4.142 ± 0.665
2.03AsnMet: 2.03 ± 0.357
2.924AsnAsn: 2.924 ± 0.397
3.167AsnPro: 3.167 ± 0.545
0.975AsnGln: 0.975 ± 0.359
2.843AsnArg: 2.843 ± 0.488
1.299AsnSer: 1.299 ± 0.288
2.274AsnThr: 2.274 ± 0.586
3.655AsnVal: 3.655 ± 0.485
1.137AsnTrp: 1.137 ± 0.289
2.355AsnTyr: 2.355 ± 0.321
0.0AsnXaa: 0.0 ± 0.0
Pro
3.005ProAla: 3.005 ± 0.674
0.081ProCys: 0.081 ± 0.08
2.518ProAsp: 2.518 ± 0.462
2.436ProGlu: 2.436 ± 0.514
1.868ProPhe: 1.868 ± 0.466
3.167ProGly: 3.167 ± 0.533
0.893ProHis: 0.893 ± 0.234
2.274ProIle: 2.274 ± 0.257
2.843ProLys: 2.843 ± 0.446
3.33ProLeu: 3.33 ± 0.573
1.218ProMet: 1.218 ± 0.295
2.274ProAsn: 2.274 ± 0.423
1.381ProPro: 1.381 ± 0.392
1.218ProGln: 1.218 ± 0.279
1.056ProArg: 1.056 ± 0.265
2.355ProSer: 2.355 ± 0.481
2.355ProThr: 2.355 ± 0.616
3.005ProVal: 3.005 ± 0.444
0.893ProTrp: 0.893 ± 0.258
1.706ProTyr: 1.706 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
2.599GlnAla: 2.599 ± 0.463
0.244GlnCys: 0.244 ± 0.12
1.381GlnAsp: 1.381 ± 0.345
3.736GlnGlu: 3.736 ± 0.549
1.624GlnPhe: 1.624 ± 0.313
2.355GlnGly: 2.355 ± 0.353
0.731GlnHis: 0.731 ± 0.25
2.355GlnIle: 2.355 ± 0.374
3.086GlnLys: 3.086 ± 0.666
3.655GlnLeu: 3.655 ± 0.572
2.193GlnMet: 2.193 ± 0.364
1.949GlnAsn: 1.949 ± 0.416
1.787GlnPro: 1.787 ± 0.346
1.949GlnGln: 1.949 ± 0.409
2.03GlnArg: 2.03 ± 0.329
1.218GlnSer: 1.218 ± 0.342
2.112GlnThr: 2.112 ± 0.521
1.787GlnVal: 1.787 ± 0.35
0.893GlnTrp: 0.893 ± 0.348
1.299GlnTyr: 1.299 ± 0.213
0.0GlnXaa: 0.0 ± 0.0
Arg
3.655ArgAla: 3.655 ± 0.545
0.487ArgCys: 0.487 ± 0.243
2.355ArgAsp: 2.355 ± 0.417
4.548ArgGlu: 4.548 ± 0.683
1.787ArgPhe: 1.787 ± 0.415
2.436ArgGly: 2.436 ± 0.405
0.975ArgHis: 0.975 ± 0.261
4.304ArgIle: 4.304 ± 0.616
3.411ArgLys: 3.411 ± 0.579
4.061ArgLeu: 4.061 ± 0.754
1.462ArgMet: 1.462 ± 0.358
2.599ArgAsn: 2.599 ± 0.385
2.518ArgPro: 2.518 ± 0.471
2.599ArgGln: 2.599 ± 0.399
2.436ArgArg: 2.436 ± 0.43
1.381ArgSer: 1.381 ± 0.282
2.843ArgThr: 2.843 ± 0.388
3.411ArgVal: 3.411 ± 0.434
0.893ArgTrp: 0.893 ± 0.424
2.355ArgTyr: 2.355 ± 0.466
0.0ArgXaa: 0.0 ± 0.0
Ser
2.355SerAla: 2.355 ± 0.532
0.244SerCys: 0.244 ± 0.131
2.68SerAsp: 2.68 ± 0.569
2.924SerGlu: 2.924 ± 0.51
1.706SerPhe: 1.706 ± 0.381
3.655SerGly: 3.655 ± 0.418
0.569SerHis: 0.569 ± 0.247
2.924SerIle: 2.924 ± 0.376
3.492SerLys: 3.492 ± 0.562
2.436SerLeu: 2.436 ± 0.404
0.731SerMet: 0.731 ± 0.276
2.112SerAsn: 2.112 ± 0.383
2.112SerPro: 2.112 ± 0.344
1.299SerGln: 1.299 ± 0.432
2.518SerArg: 2.518 ± 0.502
1.056SerSer: 1.056 ± 0.326
1.949SerThr: 1.949 ± 0.476
2.03SerVal: 2.03 ± 0.324
0.569SerTrp: 0.569 ± 0.229
1.381SerTyr: 1.381 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
4.142ThrAla: 4.142 ± 0.661
0.325ThrCys: 0.325 ± 0.183
2.924ThrAsp: 2.924 ± 0.549
3.005ThrGlu: 3.005 ± 0.514
3.249ThrPhe: 3.249 ± 0.541
4.873ThrGly: 4.873 ± 0.703
0.569ThrHis: 0.569 ± 0.267
4.467ThrIle: 4.467 ± 0.641
4.304ThrLys: 4.304 ± 0.622
3.411ThrLeu: 3.411 ± 0.536
1.299ThrMet: 1.299 ± 0.334
3.573ThrAsn: 3.573 ± 0.591
1.868ThrPro: 1.868 ± 0.512
1.868ThrGln: 1.868 ± 0.32
2.436ThrArg: 2.436 ± 0.407
1.462ThrSer: 1.462 ± 0.534
2.03ThrThr: 2.03 ± 0.357
3.33ThrVal: 3.33 ± 0.64
1.056ThrTrp: 1.056 ± 0.275
2.03ThrTyr: 2.03 ± 0.397
0.0ThrXaa: 0.0 ± 0.0
Val
3.736ValAla: 3.736 ± 0.477
0.569ValCys: 0.569 ± 0.307
3.167ValAsp: 3.167 ± 0.464
4.548ValGlu: 4.548 ± 0.608
2.355ValPhe: 2.355 ± 0.46
3.898ValGly: 3.898 ± 0.671
1.056ValHis: 1.056 ± 0.218
4.873ValIle: 4.873 ± 0.686
5.604ValLys: 5.604 ± 0.749
4.467ValLeu: 4.467 ± 0.572
1.056ValMet: 1.056 ± 0.271
3.573ValAsn: 3.573 ± 0.583
3.005ValPro: 3.005 ± 0.39
2.03ValGln: 2.03 ± 0.381
2.193ValArg: 2.193 ± 0.41
2.274ValSer: 2.274 ± 0.38
3.411ValThr: 3.411 ± 0.475
3.167ValVal: 3.167 ± 0.625
0.812ValTrp: 0.812 ± 0.255
3.655ValTyr: 3.655 ± 0.514
0.0ValXaa: 0.0 ± 0.0
Trp
0.893TrpAla: 0.893 ± 0.202
0.0TrpCys: 0.0 ± 0.0
0.812TrpAsp: 0.812 ± 0.306
1.624TrpGlu: 1.624 ± 0.363
0.812TrpPhe: 0.812 ± 0.296
0.731TrpGly: 0.731 ± 0.22
0.65TrpHis: 0.65 ± 0.247
0.893TrpIle: 0.893 ± 0.249
0.975TrpLys: 0.975 ± 0.296
0.812TrpLeu: 0.812 ± 0.292
0.325TrpMet: 0.325 ± 0.143
1.218TrpAsn: 1.218 ± 0.385
0.569TrpPro: 0.569 ± 0.183
0.731TrpGln: 0.731 ± 0.305
1.056TrpArg: 1.056 ± 0.3
0.893TrpSer: 0.893 ± 0.24
1.218TrpThr: 1.218 ± 0.319
1.137TrpVal: 1.137 ± 0.257
0.081TrpTrp: 0.081 ± 0.073
0.569TrpTyr: 0.569 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.355TyrAla: 2.355 ± 0.48
0.812TyrCys: 0.812 ± 0.311
2.924TyrAsp: 2.924 ± 0.572
3.086TyrGlu: 3.086 ± 0.487
1.218TyrPhe: 1.218 ± 0.295
2.761TyrGly: 2.761 ± 0.592
0.975TyrHis: 0.975 ± 0.315
3.005TyrIle: 3.005 ± 0.477
3.573TyrLys: 3.573 ± 0.56
3.736TyrLeu: 3.736 ± 0.429
0.893TyrMet: 0.893 ± 0.354
2.924TyrAsn: 2.924 ± 0.494
1.624TyrPro: 1.624 ± 0.474
1.462TyrGln: 1.462 ± 0.306
1.787TyrArg: 1.787 ± 0.451
1.624TyrSer: 1.624 ± 0.371
2.843TyrThr: 2.843 ± 0.677
2.193TyrVal: 2.193 ± 0.42
0.731TyrTrp: 0.731 ± 0.192
1.543TyrTyr: 1.543 ± 0.39
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12314 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski