Amino acid dipeptide frequency for Gordonia phage Crocheter

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
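As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping pairs of adjacent residues within each protein and pools the counts over the whole proteome. This is a minimal sketch under assumptions: the function name `dipeptide_frequencies` is hypothetical, sequences use one-letter codes rather than the three-letter codes of the table, and whether Proteome-pI counts overlapping pairs and pools them in exactly this way is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies (as fractions) pooled over all proteins.

    Assumption: overlapping pairs are counted, and a pair never spans
    two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy input for illustration only, not taken from the Crocheter proteome.
toy_proteins = ["MKAAL", "AAG"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

Pooling counts before dividing weights long proteins more heavily than short ones; averaging per-protein frequencies instead would be a different, equally plausible convention.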

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
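The unit conversion and the comparison with the uniform expectation can be sketched as follows. With 400 equally likely dipeptides the expected value is 1000/400 = 2.5‰. The helper names `per_mille_to_fraction` and `classify` are hypothetical, and it is an assumption that the red/blue marking uses exactly this threshold (the page does not say whether the 441-dipeptide expectation or the error margin plays a role). The three sample values are copied from the table.

```python
# Expected frequency for 400 equally likely dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Values taken from the table (in per mille).
sample = {"AlaAla": 9.347, "CysCys": 0.0, "AspAsn": 2.573}
labels = {name: classify(value) for name, value in sample.items()}
print(labels)
print(per_mille_to_fraction(sample["AlaAla"]))
```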

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.347 ± 1.375
AlaCys: 0.315 ± 0.144
AlaAsp: 4.936 ± 0.515
AlaGlu: 5.986 ± 0.593
AlaPhe: 3.571 ± 0.471
AlaGly: 6.249 ± 1.05
AlaHis: 1.628 ± 0.298
AlaIle: 5.514 ± 0.623
AlaLys: 6.039 ± 0.588
AlaLeu: 6.564 ± 1.11
AlaMet: 2.993 ± 0.435
AlaAsn: 3.781 ± 0.58
AlaPro: 1.628 ± 0.33
AlaGln: 3.098 ± 0.369
AlaArg: 4.673 ± 0.512
AlaSer: 4.988 ± 0.483
AlaThr: 5.514 ± 0.718
AlaVal: 7.194 ± 0.817
AlaTrp: 0.84 ± 0.235
AlaTyr: 2.363 ± 0.447
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.368 ± 0.154
CysCys: 0.0 ± 0.0
CysAsp: 0.368 ± 0.141
CysGlu: 0.473 ± 0.185
CysPhe: 0.053 ± 0.051
CysGly: 0.63 ± 0.229
CysHis: 0.053 ± 0.056
CysIle: 0.525 ± 0.223
CysLys: 0.368 ± 0.154
CysLeu: 0.84 ± 0.25
CysMet: 0.105 ± 0.074
CysAsn: 0.158 ± 0.078
CysPro: 0.315 ± 0.136
CysGln: 0.263 ± 0.119
CysArg: 0.42 ± 0.156
CysSer: 0.578 ± 0.222
CysThr: 0.21 ± 0.111
CysVal: 0.368 ± 0.139
CysTrp: 0.315 ± 0.126
CysTyr: 0.158 ± 0.094
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.041 ± 0.629
AspCys: 0.263 ± 0.133
AspAsp: 3.728 ± 0.613
AspGlu: 5.251 ± 0.649
AspPhe: 2.783 ± 0.388
AspGly: 4.411 ± 0.504
AspHis: 1.103 ± 0.292
AspIle: 4.043 ± 0.435
AspLys: 3.571 ± 0.551
AspLeu: 4.463 ± 0.69
AspMet: 1.785 ± 0.342
AspAsn: 2.573 ± 0.344
AspPro: 3.098 ± 0.472
AspGln: 2.258 ± 0.365
AspArg: 3.413 ± 0.426
AspSer: 3.623 ± 0.454
AspThr: 3.151 ± 0.401
AspVal: 3.938 ± 0.411
AspTrp: 0.998 ± 0.218
AspTyr: 1.943 ± 0.328
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.724 ± 0.517
GluCys: 0.63 ± 0.243
GluAsp: 3.833 ± 0.52
GluGlu: 4.988 ± 0.6
GluPhe: 2.258 ± 0.425
GluGly: 5.146 ± 0.537
GluHis: 1.26 ± 0.291
GluIle: 4.306 ± 0.462
GluLys: 4.883 ± 0.674
GluLeu: 6.669 ± 0.675
GluMet: 2.678 ± 0.322
GluAsn: 2.993 ± 0.443
GluPro: 2.258 ± 0.439
GluGln: 3.046 ± 0.412
GluArg: 2.625 ± 0.42
GluSer: 4.043 ± 0.463
GluThr: 3.098 ± 0.352
GluVal: 4.568 ± 0.435
GluTrp: 1.47 ± 0.242
GluTyr: 2.1 ± 0.283
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.363 ± 0.335
PheCys: 0.473 ± 0.153
PheAsp: 3.518 ± 0.406
PheGlu: 3.203 ± 0.519
PhePhe: 1.365 ± 0.253
PheGly: 2.783 ± 0.386
PheHis: 0.368 ± 0.135
PheIle: 1.838 ± 0.291
PheLys: 1.943 ± 0.316
PheLeu: 2.731 ± 0.382
PheMet: 1.155 ± 0.26
PheAsn: 1.89 ± 0.308
PhePro: 1.628 ± 0.32
PheGln: 0.788 ± 0.244
PheArg: 1.838 ± 0.308
PheSer: 1.785 ± 0.409
PheThr: 2.048 ± 0.385
PheVal: 1.943 ± 0.413
PheTrp: 0.368 ± 0.125
PheTyr: 1.05 ± 0.24
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.564 ± 0.816
GlyCys: 0.473 ± 0.167
GlyAsp: 4.831 ± 0.496
GlyGlu: 4.463 ± 0.442
GlyPhe: 2.468 ± 0.399
GlyGly: 5.934 ± 0.714
GlyHis: 1.155 ± 0.329
GlyIle: 4.936 ± 0.908
GlyLys: 3.781 ± 0.507
GlyLeu: 5.671 ± 1.007
GlyMet: 2.678 ± 0.406
GlyAsn: 2.941 ± 0.421
GlyPro: 2.836 ± 0.355
GlyGln: 2.678 ± 0.51
GlyArg: 3.151 ± 0.391
GlySer: 4.988 ± 0.635
GlyThr: 5.356 ± 0.453
GlyVal: 6.459 ± 0.785
GlyTrp: 0.893 ± 0.214
GlyTyr: 2.573 ± 0.365
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.47 ± 0.321
HisCys: 0.105 ± 0.076
HisAsp: 0.998 ± 0.262
HisGlu: 1.313 ± 0.296
HisPhe: 0.578 ± 0.206
HisGly: 1.628 ± 0.325
HisHis: 0.63 ± 0.17
HisIle: 1.68 ± 0.406
HisLys: 0.735 ± 0.17
HisLeu: 1.418 ± 0.275
HisMet: 0.42 ± 0.15
HisAsn: 0.893 ± 0.199
HisPro: 0.998 ± 0.306
HisGln: 0.63 ± 0.18
HisArg: 1.418 ± 0.279
HisSer: 1.103 ± 0.255
HisThr: 0.788 ± 0.194
HisVal: 1.47 ± 0.303
HisTrp: 0.158 ± 0.099
HisTyr: 1.208 ± 0.231
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.776 ± 0.554
IleCys: 0.578 ± 0.196
IleAsp: 4.358 ± 0.484
IleGlu: 4.358 ± 0.563
IlePhe: 2.048 ± 0.326
IleGly: 4.831 ± 0.772
IleHis: 1.103 ± 0.262
IleIle: 2.625 ± 0.447
IleLys: 3.518 ± 0.427
IleLeu: 4.516 ± 0.572
IleMet: 0.893 ± 0.207
IleAsn: 1.89 ± 0.327
IlePro: 2.573 ± 0.342
IleGln: 1.943 ± 0.332
IleArg: 2.993 ± 0.379
IleSer: 3.308 ± 0.497
IleThr: 3.466 ± 0.447
IleVal: 4.988 ± 0.519
IleTrp: 0.735 ± 0.164
IleTyr: 1.785 ± 0.34
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.198 ± 0.476
LysCys: 0.315 ± 0.149
LysAsp: 3.256 ± 0.549
LysGlu: 4.358 ± 0.446
LysPhe: 1.785 ± 0.404
LysGly: 4.253 ± 0.582
LysHis: 1.89 ± 0.338
LysIle: 3.151 ± 0.493
LysLys: 3.676 ± 0.648
LysLeu: 4.883 ± 0.548
LysMet: 1.785 ± 0.342
LysAsn: 2.941 ± 0.378
LysPro: 2.993 ± 0.465
LysGln: 1.995 ± 0.27
LysArg: 2.468 ± 0.419
LysSer: 3.781 ± 0.427
LysThr: 3.151 ± 0.353
LysVal: 4.726 ± 0.581
LysTrp: 1.05 ± 0.217
LysTyr: 1.89 ± 0.339
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.561 ± 1.317
LeuCys: 0.578 ± 0.175
LeuAsp: 5.356 ± 0.605
LeuGlu: 3.728 ± 0.549
LeuPhe: 2.573 ± 0.39
LeuGly: 6.144 ± 1.014
LeuHis: 1.208 ± 0.338
LeuIle: 4.253 ± 0.505
LeuLys: 5.198 ± 0.596
LeuLeu: 5.356 ± 0.567
LeuMet: 2.205 ± 0.347
LeuAsn: 4.568 ± 0.556
LeuPro: 2.573 ± 0.366
LeuGln: 2.205 ± 0.345
LeuArg: 3.886 ± 0.484
LeuSer: 6.249 ± 0.615
LeuThr: 6.879 ± 0.58
LeuVal: 6.669 ± 0.542
LeuTrp: 0.998 ± 0.209
LeuTyr: 2.783 ± 0.442
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.731 ± 0.458
MetCys: 0.21 ± 0.114
MetAsp: 1.523 ± 0.294
MetGlu: 1.628 ± 0.388
MetPhe: 0.788 ± 0.214
MetGly: 1.89 ± 0.28
MetHis: 0.525 ± 0.141
MetIle: 2.153 ± 0.243
MetLys: 1.68 ± 0.283
MetLeu: 2.783 ± 0.444
MetMet: 0.998 ± 0.235
MetAsn: 1.05 ± 0.201
MetPro: 1.365 ± 0.305
MetGln: 0.84 ± 0.172
MetArg: 1.418 ± 0.242
MetSer: 2.258 ± 0.37
MetThr: 2.625 ± 0.397
MetVal: 1.575 ± 0.277
MetTrp: 0.158 ± 0.095
MetTyr: 0.525 ± 0.14
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.676 ± 0.524
AsnCys: 0.21 ± 0.109
AsnAsp: 3.413 ± 0.484
AsnGlu: 3.308 ± 0.396
AsnPhe: 1.575 ± 0.241
AsnGly: 3.098 ± 0.454
AsnHis: 1.155 ± 0.22
AsnIle: 1.89 ± 0.3
AsnLys: 2.31 ± 0.418
AsnLeu: 3.938 ± 0.45
AsnMet: 1.103 ± 0.273
AsnAsn: 1.68 ± 0.328
AsnPro: 2.731 ± 0.376
AsnGln: 1.418 ± 0.267
AsnArg: 2.941 ± 0.449
AsnSer: 2.993 ± 0.323
AsnThr: 2.625 ± 0.379
AsnVal: 2.205 ± 0.396
AsnTrp: 0.315 ± 0.133
AsnTyr: 1.47 ± 0.317
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.308 ± 0.407
ProCys: 0.473 ± 0.167
ProAsp: 2.993 ± 0.414
ProGlu: 3.833 ± 0.43
ProPhe: 1.838 ± 0.291
ProGly: 2.888 ± 0.395
ProHis: 0.893 ± 0.234
ProIle: 1.838 ± 0.282
ProLys: 2.1 ± 0.298
ProLeu: 2.468 ± 0.348
ProMet: 1.365 ± 0.255
ProAsn: 1.995 ± 0.388
ProPro: 2.153 ± 0.503
ProGln: 1.365 ± 0.238
ProArg: 1.838 ± 0.326
ProSer: 2.836 ± 0.468
ProThr: 2.993 ± 0.598
ProVal: 3.361 ± 0.412
ProTrp: 0.473 ± 0.14
ProTyr: 1.523 ± 0.286
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.256 ± 0.413
GlnCys: 0.158 ± 0.086
GlnAsp: 1.47 ± 0.301
GlnGlu: 1.575 ± 0.302
GlnPhe: 0.998 ± 0.241
GlnGly: 2.468 ± 0.426
GlnHis: 1.208 ± 0.234
GlnIle: 1.838 ± 0.335
GlnLys: 1.838 ± 0.311
GlnLeu: 3.938 ± 0.458
GlnMet: 1.05 ± 0.256
GlnAsn: 1.47 ± 0.25
GlnPro: 1.47 ± 0.406
GlnGln: 1.89 ± 0.37
GlnArg: 1.943 ± 0.29
GlnSer: 1.943 ± 0.309
GlnThr: 2.153 ± 0.334
GlnVal: 2.678 ± 0.36
GlnTrp: 0.368 ± 0.154
GlnTyr: 1.155 ± 0.256
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.096 ± 0.596
ArgCys: 0.315 ± 0.106
ArgAsp: 2.573 ± 0.364
ArgGlu: 3.151 ± 0.474
ArgPhe: 1.68 ± 0.352
ArgGly: 2.836 ± 0.431
ArgHis: 1.05 ± 0.251
ArgIle: 2.783 ± 0.337
ArgLys: 3.361 ± 0.518
ArgLeu: 4.621 ± 0.621
ArgMet: 1.418 ± 0.277
ArgAsn: 2.468 ± 0.412
ArgPro: 2.836 ± 0.465
ArgGln: 2.625 ± 0.461
ArgArg: 3.098 ± 0.496
ArgSer: 2.52 ± 0.512
ArgThr: 3.413 ± 0.38
ArgVal: 2.888 ± 0.43
ArgTrp: 0.578 ± 0.207
ArgTyr: 1.47 ± 0.298
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.411 ± 0.494
SerCys: 0.263 ± 0.154
SerAsp: 3.781 ± 0.449
SerGlu: 5.671 ± 0.49
SerPhe: 1.943 ± 0.302
SerGly: 5.356 ± 0.591
SerHis: 1.208 ± 0.23
SerIle: 3.466 ± 0.376
SerLys: 3.938 ± 0.61
SerLeu: 5.146 ± 0.471
SerMet: 1.943 ± 0.307
SerAsn: 3.203 ± 0.456
SerPro: 2.363 ± 0.344
SerGln: 2.205 ± 0.34
SerArg: 2.52 ± 0.39
SerSer: 4.516 ± 0.58
SerThr: 4.673 ± 0.527
SerVal: 4.253 ± 0.504
SerTrp: 1.155 ± 0.312
SerTyr: 2.258 ± 0.325
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.039 ± 0.855
ThrCys: 0.368 ± 0.139
ThrAsp: 3.571 ± 0.339
ThrGlu: 3.676 ± 0.444
ThrPhe: 2.31 ± 0.33
ThrGly: 5.566 ± 0.624
ThrHis: 0.893 ± 0.25
ThrIle: 4.358 ± 0.627
ThrLys: 3.623 ± 0.463
ThrLeu: 5.514 ± 0.539
ThrMet: 1.785 ± 0.273
ThrAsn: 2.52 ± 0.386
ThrPro: 3.518 ± 0.426
ThrGln: 1.89 ± 0.297
ThrArg: 2.52 ± 0.382
ThrSer: 4.673 ± 0.399
ThrThr: 4.726 ± 0.571
ThrVal: 4.883 ± 0.644
ThrTrp: 0.525 ± 0.182
ThrTyr: 2.1 ± 0.369
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.564 ± 0.596
ValCys: 0.368 ± 0.163
ValAsp: 4.411 ± 0.392
ValGlu: 4.936 ± 0.537
ValPhe: 2.468 ± 0.316
ValGly: 4.936 ± 0.579
ValHis: 1.418 ± 0.303
ValIle: 4.673 ± 0.692
ValLys: 4.043 ± 0.414
ValLeu: 5.724 ± 0.653
ValMet: 1.575 ± 0.301
ValAsn: 2.625 ± 0.348
ValPro: 3.203 ± 0.412
ValGln: 2.468 ± 0.364
ValArg: 4.096 ± 0.532
ValSer: 5.093 ± 0.524
ValThr: 5.304 ± 0.705
ValVal: 5.829 ± 0.684
ValTrp: 1.155 ± 0.263
ValTyr: 2.468 ± 0.386
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.84 ± 0.174
TrpCys: 0.158 ± 0.095
TrpAsp: 0.683 ± 0.234
TrpGlu: 1.103 ± 0.211
TrpPhe: 0.525 ± 0.196
TrpGly: 0.945 ± 0.196
TrpHis: 0.368 ± 0.135
TrpIle: 0.578 ± 0.173
TrpLys: 0.683 ± 0.184
TrpLeu: 1.208 ± 0.297
TrpMet: 0.21 ± 0.098
TrpAsn: 1.208 ± 0.244
TrpPro: 0.683 ± 0.219
TrpGln: 0.263 ± 0.099
TrpArg: 0.735 ± 0.174
TrpSer: 0.735 ± 0.203
TrpThr: 0.945 ± 0.247
TrpVal: 0.683 ± 0.181
TrpTrp: 0.105 ± 0.078
TrpTyr: 0.525 ± 0.174
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.888 ± 0.548
TyrCys: 0.315 ± 0.139
TyrAsp: 1.838 ± 0.361
TyrGlu: 1.575 ± 0.309
TyrPhe: 1.47 ± 0.314
TyrGly: 2.836 ± 0.383
TyrHis: 0.525 ± 0.181
TyrIle: 1.838 ± 0.313
TyrLys: 2.258 ± 0.416
TyrLeu: 2.52 ± 0.35
TyrMet: 0.525 ± 0.194
TyrAsn: 1.208 ± 0.249
TyrPro: 1.313 ± 0.256
TyrGln: 0.998 ± 0.234
TyrArg: 1.785 ± 0.304
TyrSer: 2.31 ± 0.411
TyrThr: 1.943 ± 0.358
TyrVal: 2.678 ± 0.349
TyrTrp: 0.525 ± 0.154
TyrTyr: 0.683 ± 0.225
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (19045 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
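One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the spread of the 100 estimates. The sketch below follows that reading. The function names, the fixed random seed, and the use of the sample standard deviation as the error are assumptions; the page does not specify which spread measure is reported.

```python
import random
from collections import Counter
from statistics import stdev


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(resample, pair))
    return stdev(estimates)


# Toy input for illustration only, not taken from the Crocheter proteome.
toy_proteins = ["MKAAL", "AAG", "GGAA", "KLM"]
point_estimate = pair_frequency(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {point_estimate:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins rather than between positions.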

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski