Amino acid dipeptide frequency for Gordonia phage William

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and all underrepresented ones are marked in blue.
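The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build this page: the function names are hypothetical, and it assumes that frequencies are normalized by the total number of overlapping residue pairs, that pairs never span two proteins, and that any non-standard letter is mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else is treated as X (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation from the text: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: the denominator is the total number of overlapping pairs.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each protein is scanned separately, so no pair spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2)")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_permille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input, not real phage proteins.
freqs = dipeptide_permille(["AAAC", "ACB"])
print(freqs["AA"], classify(freqs["AA"]))
```

In the toy input the letter B is not a standard residue, so the pair CB is counted as CX (CysXaa in the three-letter notation of the table).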

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
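As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and compares it with the uniform expectation of 2.5‰. The helper names are hypothetical; the two input values (AlaAla and CysCys) are taken from the table.

```python
# Uniform expectation: 1 in 400 dipeptides, i.e. 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """How many times more (or less) frequent than the uniform expectation."""
    return value_permille / EXPECTED_PERMILLE


for name, value in [("AlaAla", 18.584), ("CysCys", 0.121)]:
    print(name,
          round(permille_to_fraction(value), 6),
          round(fold_vs_uniform(value), 2))
```

AlaAla comes out at roughly 7.4 times the uniform expectation, while CysCys is at roughly 0.05 times.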

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.584 ± 1.933
AlaCys: 0.969 ± 0.307
AlaAsp: 7.93 ± 0.578
AlaGlu: 7.506 ± 0.76
AlaPhe: 3.511 ± 0.535
AlaGly: 7.93 ± 0.899
AlaHis: 1.877 ± 0.31
AlaIle: 5.327 ± 0.566
AlaLys: 3.45 ± 0.679
AlaLeu: 10.835 ± 0.787
AlaMet: 2.421 ± 0.514
AlaAsn: 3.45 ± 0.476
AlaPro: 5.872 ± 0.672
AlaGln: 5.508 ± 0.678
AlaArg: 9.927 ± 0.951
AlaSer: 5.448 ± 0.969
AlaThr: 7.809 ± 0.797
AlaVal: 8.232 ± 0.869
AlaTrp: 1.998 ± 0.421
AlaTyr: 2.3 ± 0.384
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.545 ± 0.169
CysCys: 0.121 ± 0.096
CysAsp: 1.211 ± 0.394
CysGlu: 0.424 ± 0.17
CysPhe: 0.182 ± 0.109
CysGly: 0.787 ± 0.261
CysHis: 0.242 ± 0.105
CysIle: 0.424 ± 0.167
CysLys: 0.242 ± 0.132
CysLeu: 0.303 ± 0.14
CysMet: 0.182 ± 0.118
CysAsn: 0.242 ± 0.121
CysPro: 0.726 ± 0.213
CysGln: 0.242 ± 0.128
CysArg: 0.726 ± 0.278
CysSer: 0.726 ± 0.242
CysThr: 0.908 ± 0.26
CysVal: 0.242 ± 0.134
CysTrp: 0.303 ± 0.19
CysTyr: 0.121 ± 0.076
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.414 ± 0.732
AspCys: 0.545 ± 0.2
AspAsp: 5.63 ± 0.738
AspGlu: 4.358 ± 0.522
AspPhe: 1.755 ± 0.282
AspGly: 7.143 ± 0.792
AspHis: 1.755 ± 0.317
AspIle: 3.087 ± 0.353
AspLys: 1.937 ± 0.374
AspLeu: 5.508 ± 0.656
AspMet: 1.15 ± 0.214
AspAsn: 1.816 ± 0.279
AspPro: 4.479 ± 0.469
AspGln: 3.269 ± 0.484
AspArg: 4.903 ± 0.803
AspSer: 3.148 ± 0.437
AspThr: 4.782 ± 0.511
AspVal: 5.993 ± 0.563
AspTrp: 0.847 ± 0.206
AspTyr: 1.695 ± 0.36
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.811 ± 0.487
GluCys: 0.484 ± 0.162
GluAsp: 3.208 ± 0.483
GluGlu: 2.724 ± 0.498
GluPhe: 2.361 ± 0.52
GluGly: 3.329 ± 0.377
GluHis: 1.634 ± 0.365
GluIle: 2.906 ± 0.4
GluLys: 1.755 ± 0.322
GluLeu: 4.54 ± 0.66
GluMet: 1.513 ± 0.318
GluAsn: 1.332 ± 0.292
GluPro: 2.785 ± 0.42
GluGln: 2.845 ± 0.443
GluArg: 5.085 ± 0.699
GluSer: 3.39 ± 0.42
GluThr: 2.724 ± 0.344
GluVal: 5.508 ± 0.528
GluTrp: 1.271 ± 0.324
GluTyr: 1.271 ± 0.324
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.027 ± 0.373
PheCys: 0.303 ± 0.173
PheAsp: 2.3 ± 0.415
PheGlu: 1.937 ± 0.314
PhePhe: 0.726 ± 0.2
PheGly: 2.421 ± 0.396
PheHis: 0.424 ± 0.164
PheIle: 1.453 ± 0.355
PheLys: 0.969 ± 0.312
PheLeu: 1.695 ± 0.436
PheMet: 0.424 ± 0.175
PheAsn: 0.605 ± 0.234
PhePro: 1.332 ± 0.231
PheGln: 0.363 ± 0.12
PheArg: 1.877 ± 0.358
PheSer: 1.332 ± 0.294
PheThr: 2.179 ± 0.319
PheVal: 2.724 ± 0.327
PheTrp: 0.424 ± 0.151
PheTyr: 0.726 ± 0.2
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.446 ± 1.226
GlyCys: 0.666 ± 0.216
GlyAsp: 5.69 ± 0.468
GlyGlu: 4.6 ± 0.531
GlyPhe: 2.542 ± 0.413
GlyGly: 7.022 ± 1.077
GlyHis: 1.513 ± 0.313
GlyIle: 3.571 ± 0.773
GlyLys: 3.814 ± 0.526
GlyLeu: 7.143 ± 0.893
GlyMet: 1.453 ± 0.299
GlyAsn: 2.542 ± 0.395
GlyPro: 3.511 ± 0.574
GlyGln: 2.603 ± 0.357
GlyArg: 6.235 ± 0.607
GlySer: 4.298 ± 0.509
GlyThr: 5.63 ± 0.737
GlyVal: 5.932 ± 0.638
GlyTrp: 2.119 ± 0.377
GlyTyr: 2.542 ± 0.343
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.179 ± 0.38
HisCys: 0.242 ± 0.116
HisAsp: 1.877 ± 0.347
HisGlu: 0.726 ± 0.189
HisPhe: 0.605 ± 0.177
HisGly: 1.695 ± 0.358
HisHis: 0.484 ± 0.176
HisIle: 0.666 ± 0.23
HisLys: 0.484 ± 0.16
HisLeu: 1.937 ± 0.393
HisMet: 0.303 ± 0.117
HisAsn: 0.424 ± 0.162
HisPro: 1.755 ± 0.345
HisGln: 0.847 ± 0.248
HisArg: 1.211 ± 0.286
HisSer: 0.847 ± 0.209
HisThr: 1.211 ± 0.252
HisVal: 1.332 ± 0.269
HisTrp: 0.484 ± 0.178
HisTyr: 0.182 ± 0.096
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.448 ± 0.515
IleCys: 0.424 ± 0.162
IleAsp: 4.54 ± 0.589
IleGlu: 3.208 ± 0.348
IlePhe: 0.787 ± 0.198
IleGly: 4.116 ± 0.996
IleHis: 0.666 ± 0.224
IleIle: 1.634 ± 0.54
IleLys: 1.574 ± 0.625
IleLeu: 2.058 ± 0.31
IleMet: 0.666 ± 0.199
IleAsn: 0.908 ± 0.263
IlePro: 2.966 ± 0.404
IleGln: 1.15 ± 0.284
IleArg: 3.753 ± 0.496
IleSer: 2.421 ± 0.356
IleThr: 3.39 ± 0.458
IleVal: 4.237 ± 0.567
IleTrp: 0.303 ± 0.119
IleTyr: 1.211 ± 0.257
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.54 ± 0.73
LysCys: 0.121 ± 0.09
LysAsp: 1.998 ± 0.405
LysGlu: 1.332 ± 0.293
LysPhe: 1.332 ± 0.377
LysGly: 2.785 ± 0.484
LysHis: 0.484 ± 0.205
LysIle: 1.513 ± 0.315
LysLys: 1.332 ± 0.289
LysLeu: 3.148 ± 0.365
LysMet: 0.666 ± 0.238
LysAsn: 1.453 ± 0.306
LysPro: 2.361 ± 0.372
LysGln: 0.908 ± 0.284
LysArg: 1.513 ± 0.324
LysSer: 2.058 ± 0.375
LysThr: 2.058 ± 0.311
LysVal: 2.785 ± 0.32
LysTrp: 0.424 ± 0.128
LysTyr: 0.666 ± 0.196
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.048 ± 0.8
LeuCys: 0.969 ± 0.302
LeuAsp: 5.751 ± 0.622
LeuGlu: 3.208 ± 0.475
LeuPhe: 1.998 ± 0.378
LeuGly: 5.751 ± 0.622
LeuHis: 1.453 ± 0.289
LeuIle: 3.45 ± 0.417
LeuLys: 2.724 ± 0.385
LeuLeu: 5.024 ± 0.727
LeuMet: 1.634 ± 0.29
LeuAsn: 1.816 ± 0.365
LeuPro: 4.54 ± 0.557
LeuGln: 1.998 ± 0.397
LeuArg: 5.932 ± 0.696
LeuSer: 5.266 ± 0.605
LeuThr: 5.024 ± 0.587
LeuVal: 6.901 ± 0.625
LeuTrp: 1.998 ± 0.337
LeuTyr: 1.392 ± 0.304
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.027 ± 0.459
MetCys: 0.182 ± 0.143
MetAsp: 0.847 ± 0.203
MetGlu: 0.545 ± 0.178
MetPhe: 0.787 ± 0.204
MetGly: 1.271 ± 0.344
MetHis: 0.303 ± 0.123
MetIle: 1.211 ± 0.233
MetLys: 0.605 ± 0.206
MetLeu: 1.271 ± 0.273
MetMet: 0.424 ± 0.264
MetAsn: 0.787 ± 0.198
MetPro: 1.392 ± 0.276
MetGln: 0.484 ± 0.176
MetArg: 1.634 ± 0.434
MetSer: 1.877 ± 0.352
MetThr: 2.058 ± 0.355
MetVal: 1.029 ± 0.265
MetTrp: 0.605 ± 0.261
MetTyr: 0.242 ± 0.111
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.329 ± 0.319
AsnCys: 0.121 ± 0.088
AsnAsp: 1.755 ± 0.372
AsnGlu: 1.029 ± 0.27
AsnPhe: 0.484 ± 0.182
AsnGly: 2.966 ± 0.385
AsnHis: 0.363 ± 0.141
AsnIle: 0.847 ± 0.271
AsnLys: 0.847 ± 0.238
AsnLeu: 2.361 ± 0.355
AsnMet: 0.484 ± 0.156
AsnAsn: 0.182 ± 0.092
AsnPro: 2.361 ± 0.283
AsnGln: 0.787 ± 0.205
AsnArg: 2.421 ± 0.504
AsnSer: 1.998 ± 0.388
AsnThr: 2.482 ± 0.459
AsnVal: 1.755 ± 0.36
AsnTrp: 0.605 ± 0.165
AsnTyr: 0.605 ± 0.222
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.627 ± 0.973
ProCys: 0.545 ± 0.174
ProAsp: 4.54 ± 0.506
ProGlu: 3.995 ± 0.505
ProPhe: 1.211 ± 0.277
ProGly: 5.569 ± 0.631
ProHis: 1.211 ± 0.302
ProIle: 2.663 ± 0.437
ProLys: 2.542 ± 0.373
ProLeu: 3.027 ± 0.534
ProMet: 1.211 ± 0.337
ProAsn: 1.816 ± 0.307
ProPro: 3.027 ± 0.617
ProGln: 1.937 ± 0.354
ProArg: 2.482 ± 0.396
ProSer: 3.027 ± 0.457
ProThr: 4.358 ± 0.607
ProVal: 3.632 ± 0.515
ProTrp: 1.453 ± 0.288
ProTyr: 0.969 ± 0.218
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.39 ± 0.402
GlnCys: 0.121 ± 0.098
GlnAsp: 1.513 ± 0.275
GlnGlu: 1.332 ± 0.298
GlnPhe: 1.271 ± 0.281
GlnGly: 2.845 ± 0.379
GlnHis: 0.847 ± 0.256
GlnIle: 1.695 ± 0.374
GlnLys: 1.09 ± 0.274
GlnLeu: 3.632 ± 0.409
GlnMet: 0.787 ± 0.173
GlnAsn: 1.15 ± 0.276
GlnPro: 2.058 ± 0.421
GlnGln: 1.332 ± 0.245
GlnArg: 3.753 ± 0.475
GlnSer: 1.332 ± 0.268
GlnThr: 2.361 ± 0.484
GlnVal: 2.119 ± 0.411
GlnTrp: 0.726 ± 0.216
GlnTyr: 0.908 ± 0.229
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.567 ± 0.815
ArgCys: 0.666 ± 0.195
ArgAsp: 5.448 ± 0.632
ArgGlu: 6.174 ± 0.816
ArgPhe: 1.937 ± 0.359
ArgGly: 6.598 ± 0.673
ArgHis: 1.513 ± 0.402
ArgIle: 4.298 ± 0.522
ArgLys: 2.542 ± 0.331
ArgLeu: 6.053 ± 0.629
ArgMet: 2.179 ± 0.406
ArgAsn: 2.119 ± 0.37
ArgPro: 3.269 ± 0.487
ArgGln: 2.421 ± 0.371
ArgArg: 6.78 ± 0.969
ArgSer: 4.116 ± 0.498
ArgThr: 4.298 ± 0.415
ArgVal: 4.419 ± 0.661
ArgTrp: 1.755 ± 0.395
ArgTyr: 1.634 ± 0.36
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.961 ± 1.007
SerCys: 0.424 ± 0.205
SerAsp: 3.632 ± 0.418
SerGlu: 2.785 ± 0.368
SerPhe: 1.15 ± 0.257
SerGly: 5.387 ± 0.781
SerHis: 0.969 ± 0.247
SerIle: 2.906 ± 0.379
SerLys: 1.755 ± 0.236
SerLeu: 3.874 ± 0.565
SerMet: 1.15 ± 0.226
SerAsn: 1.513 ± 0.39
SerPro: 3.087 ± 0.383
SerGln: 1.513 ± 0.361
SerArg: 3.814 ± 0.492
SerSer: 2.421 ± 0.438
SerThr: 4.54 ± 0.635
SerVal: 4.298 ± 0.451
SerTrp: 1.09 ± 0.238
SerTyr: 0.908 ± 0.247
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.898 ± 1.302
ThrCys: 0.484 ± 0.173
ThrAsp: 5.63 ± 0.723
ThrGlu: 3.087 ± 0.526
ThrPhe: 1.877 ± 0.34
ThrGly: 5.508 ± 0.767
ThrHis: 1.392 ± 0.363
ThrIle: 3.632 ± 0.486
ThrLys: 2.482 ± 0.406
ThrLeu: 5.206 ± 0.52
ThrMet: 1.513 ± 0.316
ThrAsn: 1.392 ± 0.296
ThrPro: 4.237 ± 0.633
ThrGln: 1.574 ± 0.284
ThrArg: 3.995 ± 0.54
ThrSer: 3.874 ± 0.671
ThrThr: 3.692 ± 0.713
ThrVal: 6.416 ± 0.818
ThrTrp: 1.998 ± 0.303
ThrTyr: 1.392 ± 0.284
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.443 ± 0.742
ValCys: 0.969 ± 0.277
ValAsp: 6.659 ± 0.605
ValGlu: 5.63 ± 0.742
ValPhe: 1.513 ± 0.276
ValGly: 5.508 ± 0.542
ValHis: 1.15 ± 0.279
ValIle: 2.724 ± 0.366
ValLys: 2.361 ± 0.408
ValLeu: 5.993 ± 0.628
ValMet: 1.392 ± 0.406
ValAsn: 2.179 ± 0.382
ValPro: 4.177 ± 0.546
ValGln: 2.785 ± 0.401
ValArg: 5.751 ± 0.728
ValSer: 3.935 ± 0.59
ValThr: 5.811 ± 0.846
ValVal: 7.022 ± 0.625
ValTrp: 1.453 ± 0.328
ValTyr: 1.574 ± 0.279
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.058 ± 0.398
TrpCys: 0.182 ± 0.1
TrpAsp: 1.211 ± 0.275
TrpGlu: 0.787 ± 0.239
TrpPhe: 0.545 ± 0.163
TrpGly: 0.787 ± 0.257
TrpHis: 0.666 ± 0.222
TrpIle: 0.666 ± 0.176
TrpLys: 0.484 ± 0.168
TrpLeu: 2.058 ± 0.429
TrpMet: 0.605 ± 0.193
TrpAsn: 1.271 ± 0.34
TrpPro: 1.755 ± 0.363
TrpGln: 0.787 ± 0.212
TrpArg: 1.816 ± 0.255
TrpSer: 1.271 ± 0.249
TrpThr: 1.392 ± 0.271
TrpVal: 1.574 ± 0.281
TrpTrp: 0.605 ± 0.203
TrpTyr: 0.363 ± 0.164
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.603 ± 0.472
TyrCys: 0.363 ± 0.156
TyrAsp: 0.847 ± 0.272
TyrGlu: 1.09 ± 0.36
TyrPhe: 0.666 ± 0.162
TyrGly: 1.453 ± 0.284
TyrHis: 0.545 ± 0.159
TyrIle: 0.787 ± 0.265
TyrLys: 0.484 ± 0.163
TyrLeu: 1.211 ± 0.337
TyrMet: 0.363 ± 0.169
TyrAsn: 0.787 ± 0.196
TyrPro: 1.15 ± 0.249
TyrGln: 0.908 ± 0.26
TyrArg: 2.179 ± 0.384
TyrSer: 1.392 ± 0.294
TyrThr: 1.574 ± 0.387
TyrVal: 1.877 ± 0.403
TyrTrp: 0.424 ± 0.138
TyrTyr: 0.787 ± 0.279
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (16521 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
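A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the reported error is assumed to be the standard deviation of those replicates. The function names and the fixed random seed are hypothetical.

```python
import random
import statistics


def dipeptide_permille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences in one-letter code, not real phage proteins.
toy = ["AAGL", "MAAV", "GLKR", "AVAA", "KRGL"]
print(dipeptide_permille_single(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the spread reflects variation between proteins.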

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski