Amino acid dipeptide frequency for Gordonia phage Nedarya

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
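The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the pipeline used to build this page: the function names are hypothetical, pairs are counted as overlapping windows within each protein, the counts are normalised by the total number of pairs (the page does not state which denominator it uses), and the 2.5‰ threshold is simply the uniform 1/400 expectation mentioned above.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (non-standard or unknown)


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted within each protein and the
    counts are divided by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation (2.5 per mille)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKAAL", "AAC"])  # toy sequences, not real data
    print(freqs["AA"], classify(freqs["AA"]))
```

With the values from the table, `classify(9.92)` (AlaAla) returns `"over"` and `classify(0.796)` (AlaCys) returns `"under"`.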

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Cells: frequency ± error (‰).

    | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 9.92 ± 0.982 | 0.796 ± 0.227 | 5.144 ± 0.676 | 6.797 ± 0.727 | 3.735 ± 0.49 | 7.899 ± 0.881 | 1.837 ± 0.331 | 4.225 ± 0.422 | 5.082 ± 0.716 | 6.736 ± 0.747 | 2.266 ± 0.415 | 2.572 ± 0.412 | 4.041 ± 0.573 | 3.368 ± 0.523 | 6.368 ± 0.59 | 5.511 ± 0.61 | 5.633 ± 0.812 | 6.919 ± 0.673 | 1.715 ± 0.303 | 1.959 ± 0.381 | 0.0 ± 0.0
Cys | 0.796 ± 0.225 | 0.122 ± 0.124 | 0.796 ± 0.211 | 0.367 ± 0.139 | 0.306 ± 0.173 | 0.735 ± 0.258 | 0.245 ± 0.14 | 0.245 ± 0.134 | 0.367 ± 0.153 | 0.612 ± 0.218 | 0.367 ± 0.136 | 0.551 ± 0.172 | 0.612 ± 0.231 | 0.306 ± 0.187 | 0.306 ± 0.151 | 0.429 ± 0.171 | 0.122 ± 0.087 | 0.551 ± 0.193 | 0.367 ± 0.148 | 0.245 ± 0.117 | 0.0 ± 0.0
Asp | 5.878 ± 0.694 | 0.674 ± 0.188 | 4.654 ± 0.69 | 4.715 ± 0.604 | 2.082 ± 0.385 | 5.756 ± 0.557 | 1.592 ± 0.366 | 2.511 ± 0.38 | 2.327 ± 0.458 | 6.429 ± 0.777 | 0.918 ± 0.236 | 1.653 ± 0.355 | 4.286 ± 0.551 | 2.082 ± 0.367 | 3.123 ± 0.501 | 3.613 ± 0.522 | 4.103 ± 0.559 | 4.899 ± 0.468 | 1.286 ± 0.284 | 2.817 ± 0.411 | 0.0 ± 0.0
Glu | 7.838 ± 0.791 | 0.122 ± 0.08 | 4.409 ± 0.506 | 4.286 ± 0.67 | 2.204 ± 0.372 | 5.021 ± 0.552 | 1.898 ± 0.38 | 4.103 ± 0.481 | 2.327 ± 0.414 | 7.96 ± 0.778 | 1.592 ± 0.329 | 2.143 ± 0.325 | 2.878 ± 0.489 | 2.143 ± 0.35 | 3.98 ± 0.501 | 3.429 ± 0.48 | 4.286 ± 0.506 | 6.123 ± 0.77 | 1.041 ± 0.259 | 1.653 ± 0.332 | 0.0 ± 0.0
Phe | 3.735 ± 0.544 | 0.367 ± 0.206 | 2.633 ± 0.351 | 2.939 ± 0.548 | 0.796 ± 0.271 | 3.307 ± 0.401 | 0.796 ± 0.205 | 0.918 ± 0.273 | 1.163 ± 0.25 | 2.327 ± 0.352 | 0.735 ± 0.163 | 1.898 ± 0.318 | 1.959 ± 0.357 | 1.47 ± 0.272 | 1.47 ± 0.31 | 1.837 ± 0.304 | 1.837 ± 0.455 | 1.898 ± 0.357 | 0.306 ± 0.124 | 0.857 ± 0.247 | 0.0 ± 0.0
Gly | 5.94 ± 0.648 | 0.98 ± 0.256 | 6.613 ± 0.726 | 5.389 ± 0.668 | 3.429 ± 0.534 | 8.144 ± 1.319 | 1.898 ± 0.334 | 4.041 ± 0.554 | 3.98 ± 0.565 | 5.94 ± 0.867 | 1.592 ± 0.304 | 2.817 ± 0.469 | 3.0 ± 0.513 | 2.572 ± 0.441 | 4.409 ± 0.558 | 5.327 ± 0.865 | 5.45 ± 0.579 | 6.307 ± 0.676 | 2.449 ± 0.425 | 3.184 ± 0.382 | 0.0 ± 0.0
His | 1.592 ± 0.304 | 0.367 ± 0.133 | 0.857 ± 0.211 | 1.776 ± 0.396 | 0.612 ± 0.203 | 1.776 ± 0.414 | 0.367 ± 0.182 | 0.918 ± 0.237 | 0.857 ± 0.27 | 1.653 ± 0.437 | 0.367 ± 0.168 | 0.612 ± 0.215 | 1.041 ± 0.266 | 0.551 ± 0.184 | 2.021 ± 0.371 | 0.98 ± 0.228 | 1.102 ± 0.254 | 1.347 ± 0.266 | 0.306 ± 0.172 | 0.735 ± 0.198 | 0.0 ± 0.0
Ile | 4.592 ± 0.512 | 0.306 ± 0.131 | 3.368 ± 0.432 | 4.715 ± 0.547 | 1.225 ± 0.235 | 4.592 ± 0.719 | 0.98 ± 0.258 | 2.449 ± 0.455 | 2.021 ± 0.343 | 3.368 ± 0.437 | 0.98 ± 0.211 | 1.531 ± 0.283 | 3.552 ± 0.473 | 1.531 ± 0.3 | 3.49 ± 0.412 | 3.49 ± 0.378 | 3.858 ± 0.504 | 3.552 ± 0.356 | 1.102 ± 0.271 | 1.408 ± 0.278 | 0.0 ± 0.0
Lys | 4.225 ± 0.462 | 0.306 ± 0.144 | 3.123 ± 0.391 | 2.511 ± 0.421 | 0.98 ± 0.21 | 3.307 ± 0.431 | 0.857 ± 0.23 | 2.694 ± 0.507 | 2.939 ± 0.595 | 4.103 ± 0.479 | 1.041 ± 0.273 | 1.592 ± 0.363 | 2.388 ± 0.436 | 1.715 ± 0.345 | 2.388 ± 0.369 | 2.021 ± 0.366 | 2.327 ± 0.463 | 3.858 ± 0.551 | 0.796 ± 0.22 | 1.408 ± 0.341 | 0.0 ± 0.0
Leu | 7.226 ± 0.628 | 0.674 ± 0.213 | 4.47 ± 0.483 | 6.613 ± 0.806 | 2.143 ± 0.37 | 6.185 ± 0.636 | 1.715 ± 0.327 | 4.409 ± 0.498 | 2.755 ± 0.5 | 6.246 ± 0.707 | 2.204 ± 0.521 | 3.245 ± 0.402 | 4.409 ± 0.472 | 2.939 ± 0.486 | 5.878 ± 0.662 | 4.409 ± 0.531 | 5.021 ± 0.533 | 5.94 ± 0.521 | 1.837 ± 0.293 | 2.449 ± 0.419 | 0.0 ± 0.0
Met | 2.572 ± 0.397 | 0.184 ± 0.11 | 0.98 ± 0.264 | 1.102 ± 0.255 | 1.041 ± 0.257 | 1.347 ± 0.359 | 0.367 ± 0.141 | 1.959 ± 0.379 | 0.918 ± 0.266 | 1.531 ± 0.384 | 0.551 ± 0.176 | 0.735 ± 0.19 | 1.408 ± 0.251 | 0.49 ± 0.205 | 1.531 ± 0.291 | 2.449 ± 0.42 | 2.266 ± 0.376 | 0.796 ± 0.24 | 0.184 ± 0.092 | 0.857 ± 0.246 | 0.0 ± 0.0
Asn | 3.307 ± 0.399 | 0.429 ± 0.2 | 1.837 ± 0.307 | 2.388 ± 0.366 | 0.918 ± 0.219 | 3.307 ± 0.501 | 0.735 ± 0.205 | 1.837 ± 0.463 | 1.286 ± 0.259 | 3.796 ± 0.492 | 0.796 ± 0.196 | 0.796 ± 0.242 | 2.082 ± 0.349 | 1.837 ± 0.291 | 1.898 ± 0.3 | 1.408 ± 0.304 | 2.878 ± 0.49 | 1.776 ± 0.29 | 0.612 ± 0.202 | 1.531 ± 0.275 | 0.0 ± 0.0
Pro | 3.98 ± 0.581 | 0.367 ± 0.169 | 4.103 ± 0.549 | 4.164 ± 0.553 | 1.592 ± 0.307 | 4.225 ± 0.595 | 0.674 ± 0.191 | 3.123 ± 0.466 | 2.511 ± 0.398 | 2.939 ± 0.492 | 1.347 ± 0.274 | 2.204 ± 0.353 | 2.021 ± 0.407 | 1.653 ± 0.401 | 2.694 ± 0.467 | 2.755 ± 0.452 | 4.103 ± 0.566 | 3.49 ± 0.563 | 0.551 ± 0.201 | 1.592 ± 0.358 | 0.0 ± 0.0
Gln | 3.98 ± 0.57 | 0.184 ± 0.111 | 1.653 ± 0.26 | 2.511 ± 0.391 | 1.347 ± 0.321 | 2.694 ± 0.41 | 0.551 ± 0.181 | 2.266 ± 0.302 | 1.347 ± 0.263 | 3.062 ± 0.391 | 0.857 ± 0.247 | 1.041 ± 0.265 | 1.286 ± 0.312 | 1.225 ± 0.314 | 2.082 ± 0.454 | 1.715 ± 0.365 | 2.021 ± 0.327 | 2.082 ± 0.423 | 0.735 ± 0.233 | 1.531 ± 0.23 | 0.0 ± 0.0
Arg | 5.389 ± 0.555 | 0.551 ± 0.332 | 3.919 ± 0.625 | 4.103 ± 0.485 | 2.817 ± 0.481 | 5.144 ± 0.7 | 1.531 ± 0.34 | 4.103 ± 0.474 | 2.694 ± 0.387 | 5.205 ± 0.655 | 1.898 ± 0.296 | 2.021 ± 0.376 | 2.388 ± 0.407 | 1.837 ± 0.337 | 5.144 ± 0.674 | 3.245 ± 0.528 | 2.755 ± 0.454 | 4.47 ± 0.526 | 0.98 ± 0.288 | 2.204 ± 0.404 | 0.0 ± 0.0
Ser | 5.817 ± 0.435 | 0.367 ± 0.173 | 3.613 ± 0.557 | 3.49 ± 0.484 | 2.143 ± 0.317 | 5.389 ± 0.945 | 0.735 ± 0.19 | 2.755 ± 0.365 | 2.327 ± 0.411 | 4.225 ± 0.695 | 1.776 ± 0.324 | 2.021 ± 0.431 | 2.572 ± 0.328 | 2.266 ± 0.393 | 4.654 ± 0.513 | 3.919 ± 0.557 | 3.735 ± 0.544 | 4.103 ± 0.559 | 1.102 ± 0.228 | 1.653 ± 0.391 | 0.0 ± 0.0
Thr | 4.776 ± 0.61 | 0.551 ± 0.281 | 4.47 ± 0.379 | 3.429 ± 0.5 | 1.653 ± 0.319 | 6.246 ± 0.599 | 0.735 ± 0.202 | 3.429 ± 0.496 | 3.98 ± 0.588 | 5.572 ± 0.701 | 1.347 ± 0.247 | 1.959 ± 0.384 | 4.225 ± 0.616 | 2.266 ± 0.43 | 3.062 ± 0.461 | 3.368 ± 0.441 | 3.919 ± 0.557 | 5.021 ± 0.597 | 1.47 ± 0.336 | 1.898 ± 0.361 | 0.0 ± 0.0
Val | 6.736 ± 0.661 | 0.612 ± 0.215 | 5.205 ± 0.563 | 4.715 ± 0.501 | 2.694 ± 0.466 | 4.164 ± 0.463 | 1.225 ± 0.236 | 4.103 ± 0.586 | 3.429 ± 0.449 | 4.776 ± 0.67 | 1.592 ± 0.29 | 3.552 ± 0.395 | 4.041 ± 0.55 | 2.266 ± 0.441 | 4.286 ± 0.55 | 5.082 ± 0.636 | 4.47 ± 0.53 | 5.327 ± 0.77 | 1.163 ± 0.249 | 2.572 ± 0.377 | 0.0 ± 0.0
Trp | 1.715 ± 0.314 | 0.429 ± 0.184 | 1.347 ± 0.283 | 1.408 ± 0.29 | 0.796 ± 0.224 | 1.225 ± 0.257 | 0.367 ± 0.145 | 1.102 ± 0.262 | 0.735 ± 0.199 | 1.102 ± 0.275 | 0.184 ± 0.093 | 1.041 ± 0.251 | 0.796 ± 0.252 | 0.735 ± 0.228 | 1.163 ± 0.285 | 1.163 ± 0.258 | 1.47 ± 0.276 | 1.102 ± 0.269 | 0.551 ± 0.208 | 0.49 ± 0.165 | 0.0 ± 0.0
Tyr | 2.511 ± 0.387 | 0.061 ± 0.052 | 2.204 ± 0.36 | 2.021 ± 0.353 | 0.735 ± 0.22 | 3.0 ± 0.348 | 0.735 ± 0.242 | 0.918 ± 0.233 | 1.531 ± 0.356 | 3.245 ± 0.366 | 0.796 ± 0.225 | 1.47 ± 0.315 | 1.102 ± 0.276 | 0.918 ± 0.221 | 2.449 ± 0.473 | 2.511 ± 0.461 | 2.021 ± 0.417 | 2.449 ± 0.478 | 0.367 ± 0.229 | 0.796 ± 0.216 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 97 proteins (16332 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
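A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's own code: the function names, the fixed seed, and the use of the standard deviation across resamples as the reported error are all assumptions, since the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, statistic, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the statistic
    over n_rounds resampled proteomes of the original size.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(statistic(sample))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAL", "AACD", "GAAV", "LLKE"]  # toy sequences, not real data
    value = pair_permille(toy, "AA")
    error = bootstrap_error(toy, lambda s: pair_permille(s, "AA"))
    print(f"AlaAla: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the adjacent pairs inside every resampled protein are real ones.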

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski