Amino acid dipepetide frequency for Gordonia phage Stultus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.544AlaAla: 17.544 ± 1.042
0.758AlaCys: 0.758 ± 0.266
7.96AlaAsp: 7.96 ± 0.756
9.151AlaGlu: 9.151 ± 1.081
2.924AlaPhe: 2.924 ± 0.478
10.992AlaGly: 10.992 ± 0.919
2.112AlaHis: 2.112 ± 0.36
6.714AlaIle: 6.714 ± 0.805
3.682AlaLys: 3.682 ± 0.471
9.422AlaLeu: 9.422 ± 0.954
2.491AlaMet: 2.491 ± 0.43
3.032AlaAsn: 3.032 ± 0.544
6.931AlaPro: 6.931 ± 0.815
4.657AlaGln: 4.657 ± 0.489
8.555AlaArg: 8.555 ± 0.911
5.74AlaSer: 5.74 ± 0.775
7.743AlaThr: 7.743 ± 0.831
8.664AlaVal: 8.664 ± 0.686
2.437AlaTrp: 2.437 ± 0.392
2.058AlaTyr: 2.058 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
1.083CysAla: 1.083 ± 0.283
0.0CysCys: 0.0 ± 0.0
0.596CysAsp: 0.596 ± 0.195
0.379CysGlu: 0.379 ± 0.146
0.108CysPhe: 0.108 ± 0.076
1.137CysGly: 1.137 ± 0.306
0.433CysHis: 0.433 ± 0.164
0.054CysIle: 0.054 ± 0.054
0.108CysLys: 0.108 ± 0.07
0.271CysLeu: 0.271 ± 0.117
0.054CysMet: 0.054 ± 0.055
0.054CysAsn: 0.054 ± 0.056
0.812CysPro: 0.812 ± 0.213
0.271CysGln: 0.271 ± 0.138
0.379CysArg: 0.379 ± 0.143
0.325CysSer: 0.325 ± 0.158
0.596CysThr: 0.596 ± 0.203
0.433CysVal: 0.433 ± 0.17
0.054CysTrp: 0.054 ± 0.062
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
8.339AspAla: 8.339 ± 0.716
0.379AspCys: 0.379 ± 0.183
6.768AspAsp: 6.768 ± 0.982
5.902AspGlu: 5.902 ± 0.888
1.245AspPhe: 1.245 ± 0.317
6.931AspGly: 6.931 ± 0.77
1.624AspHis: 1.624 ± 0.38
1.462AspIle: 1.462 ± 0.25
1.679AspLys: 1.679 ± 0.367
5.306AspLeu: 5.306 ± 0.616
1.949AspMet: 1.949 ± 0.295
2.274AspAsn: 2.274 ± 0.433
7.418AspPro: 7.418 ± 0.887
2.22AspGln: 2.22 ± 0.504
5.144AspArg: 5.144 ± 0.551
2.328AspSer: 2.328 ± 0.289
3.628AspThr: 3.628 ± 0.509
6.335AspVal: 6.335 ± 0.582
1.841AspTrp: 1.841 ± 0.319
1.895AspTyr: 1.895 ± 0.372
0.0AspXaa: 0.0 ± 0.0
Glu
5.306GluAla: 5.306 ± 0.525
0.541GluCys: 0.541 ± 0.195
3.357GluAsp: 3.357 ± 0.612
1.354GluGlu: 1.354 ± 0.295
2.166GluPhe: 2.166 ± 0.354
4.278GluGly: 4.278 ± 0.574
2.382GluHis: 2.382 ± 0.401
2.382GluIle: 2.382 ± 0.346
1.245GluLys: 1.245 ± 0.308
4.873GluLeu: 4.873 ± 0.747
1.029GluMet: 1.029 ± 0.223
1.516GluAsn: 1.516 ± 0.247
3.357GluPro: 3.357 ± 0.529
3.899GluGln: 3.899 ± 0.592
4.115GluArg: 4.115 ± 0.605
3.032GluSer: 3.032 ± 0.457
2.762GluThr: 2.762 ± 0.44
4.873GluVal: 4.873 ± 0.524
1.733GluTrp: 1.733 ± 0.356
1.624GluTyr: 1.624 ± 0.382
0.0GluXaa: 0.0 ± 0.0
Phe
3.357PheAla: 3.357 ± 0.415
0.379PheCys: 0.379 ± 0.141
2.762PheAsp: 2.762 ± 0.514
1.624PheGlu: 1.624 ± 0.313
0.596PhePhe: 0.596 ± 0.207
2.599PheGly: 2.599 ± 0.348
0.379PheHis: 0.379 ± 0.132
1.408PheIle: 1.408 ± 0.314
0.541PheLys: 0.541 ± 0.174
1.624PheLeu: 1.624 ± 0.325
0.596PheMet: 0.596 ± 0.157
0.541PheAsn: 0.541 ± 0.189
1.083PhePro: 1.083 ± 0.237
0.541PheGln: 0.541 ± 0.192
0.975PheArg: 0.975 ± 0.222
1.029PheSer: 1.029 ± 0.292
1.733PheThr: 1.733 ± 0.32
2.274PheVal: 2.274 ± 0.292
0.271PheTrp: 0.271 ± 0.141
0.217PheTyr: 0.217 ± 0.099
0.0PheXaa: 0.0 ± 0.0
Gly
9.422GlyAla: 9.422 ± 0.921
0.487GlyCys: 0.487 ± 0.189
6.66GlyAsp: 6.66 ± 0.804
4.386GlyGlu: 4.386 ± 0.558
1.895GlyPhe: 1.895 ± 0.358
8.176GlyGly: 8.176 ± 1.031
2.22GlyHis: 2.22 ± 0.38
4.169GlyIle: 4.169 ± 0.476
3.249GlyLys: 3.249 ± 0.465
5.794GlyLeu: 5.794 ± 0.743
1.679GlyMet: 1.679 ± 0.301
2.707GlyAsn: 2.707 ± 0.427
3.899GlyPro: 3.899 ± 0.549
3.574GlyGln: 3.574 ± 0.464
5.631GlyArg: 5.631 ± 0.571
5.415GlySer: 5.415 ± 0.707
5.415GlyThr: 5.415 ± 0.915
7.364GlyVal: 7.364 ± 0.605
1.57GlyTrp: 1.57 ± 0.3
1.787GlyTyr: 1.787 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
2.058HisAla: 2.058 ± 0.399
0.054HisCys: 0.054 ± 0.053
1.949HisAsp: 1.949 ± 0.38
1.3HisGlu: 1.3 ± 0.292
0.487HisPhe: 0.487 ± 0.177
1.408HisGly: 1.408 ± 0.214
0.325HisHis: 0.325 ± 0.114
0.975HisIle: 0.975 ± 0.209
0.433HisLys: 0.433 ± 0.187
1.57HisLeu: 1.57 ± 0.267
0.596HisMet: 0.596 ± 0.179
0.379HisAsn: 0.379 ± 0.16
1.462HisPro: 1.462 ± 0.29
0.812HisGln: 0.812 ± 0.172
1.733HisArg: 1.733 ± 0.301
0.812HisSer: 0.812 ± 0.213
1.949HisThr: 1.949 ± 0.38
1.787HisVal: 1.787 ± 0.304
0.379HisTrp: 0.379 ± 0.134
0.433HisTyr: 0.433 ± 0.153
0.0HisXaa: 0.0 ± 0.0
Ile
6.281IleAla: 6.281 ± 0.52
0.162IleCys: 0.162 ± 0.085
4.061IleAsp: 4.061 ± 0.459
3.682IleGlu: 3.682 ± 0.438
0.487IlePhe: 0.487 ± 0.21
4.711IleGly: 4.711 ± 1.004
0.541IleHis: 0.541 ± 0.156
2.491IleIle: 2.491 ± 0.438
0.758IleLys: 0.758 ± 0.292
2.274IleLeu: 2.274 ± 0.335
0.921IleMet: 0.921 ± 0.239
0.866IleAsn: 0.866 ± 0.199
3.086IlePro: 3.086 ± 0.4
1.408IleGln: 1.408 ± 0.284
2.924IleArg: 2.924 ± 0.317
1.733IleSer: 1.733 ± 0.353
3.628IleThr: 3.628 ± 0.5
3.465IleVal: 3.465 ± 0.338
0.758IleTrp: 0.758 ± 0.183
0.921IleTyr: 0.921 ± 0.227
0.0IleXaa: 0.0 ± 0.0
Lys
2.707LysAla: 2.707 ± 0.45
0.108LysCys: 0.108 ± 0.083
0.921LysAsp: 0.921 ± 0.222
0.596LysGlu: 0.596 ± 0.18
1.029LysPhe: 1.029 ± 0.253
1.516LysGly: 1.516 ± 0.316
0.596LysHis: 0.596 ± 0.164
1.354LysIle: 1.354 ± 0.293
0.541LysLys: 0.541 ± 0.194
2.382LysLeu: 2.382 ± 0.414
0.487LysMet: 0.487 ± 0.165
0.921LysAsn: 0.921 ± 0.264
1.895LysPro: 1.895 ± 0.368
0.921LysGln: 0.921 ± 0.219
2.22LysArg: 2.22 ± 0.371
1.733LysSer: 1.733 ± 0.368
1.733LysThr: 1.733 ± 0.318
2.274LysVal: 2.274 ± 0.332
0.487LysTrp: 0.487 ± 0.153
0.866LysTyr: 0.866 ± 0.238
0.0LysXaa: 0.0 ± 0.0
Leu
9.043LeuAla: 9.043 ± 0.872
0.433LeuCys: 0.433 ± 0.174
5.794LeuAsp: 5.794 ± 0.722
2.274LeuGlu: 2.274 ± 0.376
2.058LeuPhe: 2.058 ± 0.361
6.985LeuGly: 6.985 ± 1.094
1.733LeuHis: 1.733 ± 0.312
3.357LeuIle: 3.357 ± 0.435
1.083LeuLys: 1.083 ± 0.212
5.902LeuLeu: 5.902 ± 0.599
1.3LeuMet: 1.3 ± 0.311
2.166LeuAsn: 2.166 ± 0.312
4.711LeuPro: 4.711 ± 0.534
3.086LeuGln: 3.086 ± 0.417
6.335LeuArg: 6.335 ± 0.501
3.682LeuSer: 3.682 ± 0.474
5.902LeuThr: 5.902 ± 0.592
6.227LeuVal: 6.227 ± 0.538
1.949LeuTrp: 1.949 ± 0.37
1.787LeuTyr: 1.787 ± 0.338
0.0LeuXaa: 0.0 ± 0.0
Met
2.058MetAla: 2.058 ± 0.339
0.108MetCys: 0.108 ± 0.079
0.866MetAsp: 0.866 ± 0.193
0.812MetGlu: 0.812 ± 0.25
0.541MetPhe: 0.541 ± 0.231
1.191MetGly: 1.191 ± 0.262
0.271MetHis: 0.271 ± 0.11
0.704MetIle: 0.704 ± 0.159
0.704MetLys: 0.704 ± 0.188
2.003MetLeu: 2.003 ± 0.338
0.379MetMet: 0.379 ± 0.15
0.541MetAsn: 0.541 ± 0.213
2.166MetPro: 2.166 ± 0.357
0.921MetGln: 0.921 ± 0.323
1.624MetArg: 1.624 ± 0.353
1.137MetSer: 1.137 ± 0.212
3.086MetThr: 3.086 ± 0.369
1.462MetVal: 1.462 ± 0.331
0.433MetTrp: 0.433 ± 0.152
0.162MetTyr: 0.162 ± 0.097
0.0MetXaa: 0.0 ± 0.0
Asn
3.682AsnAla: 3.682 ± 0.553
0.271AsnCys: 0.271 ± 0.102
2.112AsnAsp: 2.112 ± 0.352
1.354AsnGlu: 1.354 ± 0.305
0.541AsnPhe: 0.541 ± 0.158
2.653AsnGly: 2.653 ± 0.364
0.379AsnHis: 0.379 ± 0.155
1.029AsnIle: 1.029 ± 0.236
0.433AsnLys: 0.433 ± 0.166
2.058AsnLeu: 2.058 ± 0.363
0.65AsnMet: 0.65 ± 0.186
0.704AsnAsn: 0.704 ± 0.192
2.437AsnPro: 2.437 ± 0.49
0.812AsnGln: 0.812 ± 0.253
1.679AsnArg: 1.679 ± 0.286
0.975AsnSer: 0.975 ± 0.228
1.462AsnThr: 1.462 ± 0.274
2.003AsnVal: 2.003 ± 0.384
0.812AsnTrp: 0.812 ± 0.227
0.704AsnTyr: 0.704 ± 0.21
0.0AsnXaa: 0.0 ± 0.0
Pro
8.934ProAla: 8.934 ± 0.922
0.271ProCys: 0.271 ± 0.179
5.794ProAsp: 5.794 ± 0.837
3.411ProGlu: 3.411 ± 0.513
1.624ProPhe: 1.624 ± 0.3
5.252ProGly: 5.252 ± 0.461
0.975ProHis: 0.975 ± 0.244
3.303ProIle: 3.303 ± 0.456
1.516ProLys: 1.516 ± 0.312
4.115ProLeu: 4.115 ± 0.442
1.462ProMet: 1.462 ± 0.252
2.166ProAsn: 2.166 ± 0.398
4.548ProPro: 4.548 ± 0.694
2.22ProGln: 2.22 ± 0.366
3.141ProArg: 3.141 ± 0.439
3.736ProSer: 3.736 ± 0.484
4.494ProThr: 4.494 ± 0.549
5.306ProVal: 5.306 ± 0.559
0.975ProTrp: 0.975 ± 0.235
1.137ProTyr: 1.137 ± 0.227
0.0ProXaa: 0.0 ± 0.0
Gln
5.415GlnAla: 5.415 ± 0.481
0.162GlnCys: 0.162 ± 0.098
1.083GlnAsp: 1.083 ± 0.335
0.812GlnGlu: 0.812 ± 0.226
1.354GlnPhe: 1.354 ± 0.229
2.003GlnGly: 2.003 ± 0.304
0.975GlnHis: 0.975 ± 0.232
2.22GlnIle: 2.22 ± 0.353
0.921GlnLys: 0.921 ± 0.204
4.007GlnLeu: 4.007 ± 0.489
1.029GlnMet: 1.029 ± 0.26
0.866GlnAsn: 0.866 ± 0.205
2.599GlnPro: 2.599 ± 0.348
2.816GlnGln: 2.816 ± 0.489
3.628GlnArg: 3.628 ± 0.41
1.516GlnSer: 1.516 ± 0.241
2.545GlnThr: 2.545 ± 0.437
2.762GlnVal: 2.762 ± 0.391
0.975GlnTrp: 0.975 ± 0.209
0.758GlnTyr: 0.758 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
9.476ArgAla: 9.476 ± 0.922
0.596ArgCys: 0.596 ± 0.209
4.873ArgAsp: 4.873 ± 0.557
4.061ArgGlu: 4.061 ± 0.481
1.895ArgPhe: 1.895 ± 0.319
4.548ArgGly: 4.548 ± 0.55
1.462ArgHis: 1.462 ± 0.348
3.574ArgIle: 3.574 ± 0.44
2.058ArgLys: 2.058 ± 0.424
5.036ArgLeu: 5.036 ± 0.556
2.058ArgMet: 2.058 ± 0.332
2.003ArgAsn: 2.003 ± 0.284
3.844ArgPro: 3.844 ± 0.53
2.437ArgGln: 2.437 ± 0.318
7.906ArgArg: 7.906 ± 1.134
3.141ArgSer: 3.141 ± 0.363
4.224ArgThr: 4.224 ± 0.486
5.469ArgVal: 5.469 ± 0.647
1.57ArgTrp: 1.57 ± 0.344
1.787ArgTyr: 1.787 ± 0.346
0.0ArgXaa: 0.0 ± 0.0
Ser
4.927SerAla: 4.927 ± 0.708
0.487SerCys: 0.487 ± 0.197
3.736SerAsp: 3.736 ± 0.526
2.762SerGlu: 2.762 ± 0.451
0.921SerPhe: 0.921 ± 0.275
5.523SerGly: 5.523 ± 0.658
0.758SerHis: 0.758 ± 0.206
2.058SerIle: 2.058 ± 0.42
1.3SerLys: 1.3 ± 0.247
3.465SerLeu: 3.465 ± 0.507
0.921SerMet: 0.921 ± 0.275
0.975SerAsn: 0.975 ± 0.24
2.978SerPro: 2.978 ± 0.424
1.191SerGln: 1.191 ± 0.372
3.682SerArg: 3.682 ± 0.504
2.762SerSer: 2.762 ± 0.502
4.115SerThr: 4.115 ± 0.427
3.736SerVal: 3.736 ± 0.35
1.137SerTrp: 1.137 ± 0.3
1.3SerTyr: 1.3 ± 0.283
0.0SerXaa: 0.0 ± 0.0
Thr
10.126ThrAla: 10.126 ± 0.954
0.758ThrCys: 0.758 ± 0.245
4.873ThrAsp: 4.873 ± 0.523
3.195ThrGlu: 3.195 ± 0.441
1.3ThrPhe: 1.3 ± 0.273
5.74ThrGly: 5.74 ± 0.676
1.3ThrHis: 1.3 ± 0.321
2.924ThrIle: 2.924 ± 0.414
1.895ThrLys: 1.895 ± 0.319
5.902ThrLeu: 5.902 ± 0.586
1.029ThrMet: 1.029 ± 0.215
1.679ThrAsn: 1.679 ± 0.302
5.036ThrPro: 5.036 ± 0.415
1.679ThrGln: 1.679 ± 0.218
3.736ThrArg: 3.736 ± 0.442
3.628ThrSer: 3.628 ± 0.481
4.657ThrThr: 4.657 ± 0.75
5.09ThrVal: 5.09 ± 0.747
1.137ThrTrp: 1.137 ± 0.239
1.57ThrTyr: 1.57 ± 0.325
0.0ThrXaa: 0.0 ± 0.0
Val
9.097ValAla: 9.097 ± 0.894
0.65ValCys: 0.65 ± 0.213
7.797ValAsp: 7.797 ± 0.657
5.902ValGlu: 5.902 ± 0.609
1.895ValPhe: 1.895 ± 0.264
6.877ValGly: 6.877 ± 0.614
1.679ValHis: 1.679 ± 0.319
3.249ValIle: 3.249 ± 0.398
1.895ValLys: 1.895 ± 0.271
6.01ValLeu: 6.01 ± 0.54
1.137ValMet: 1.137 ± 0.248
2.545ValAsn: 2.545 ± 0.439
3.574ValPro: 3.574 ± 0.482
3.249ValGln: 3.249 ± 0.359
4.765ValArg: 4.765 ± 0.44
3.953ValSer: 3.953 ± 0.588
5.469ValThr: 5.469 ± 0.596
6.335ValVal: 6.335 ± 0.747
1.787ValTrp: 1.787 ± 0.224
2.058ValTyr: 2.058 ± 0.352
0.0ValXaa: 0.0 ± 0.0
Trp
1.841TrpAla: 1.841 ± 0.305
0.271TrpCys: 0.271 ± 0.109
0.921TrpAsp: 0.921 ± 0.256
1.083TrpGlu: 1.083 ± 0.262
0.704TrpPhe: 0.704 ± 0.177
1.137TrpGly: 1.137 ± 0.218
0.325TrpHis: 0.325 ± 0.128
0.921TrpIle: 0.921 ± 0.244
0.866TrpLys: 0.866 ± 0.213
2.112TrpLeu: 2.112 ± 0.378
0.866TrpMet: 0.866 ± 0.205
0.541TrpAsn: 0.541 ± 0.166
1.3TrpPro: 1.3 ± 0.23
1.191TrpGln: 1.191 ± 0.19
1.787TrpArg: 1.787 ± 0.366
1.245TrpSer: 1.245 ± 0.242
1.083TrpThr: 1.083 ± 0.261
2.003TrpVal: 2.003 ± 0.253
0.325TrpTrp: 0.325 ± 0.165
0.487TrpTyr: 0.487 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.87TyrAla: 2.87 ± 0.49
0.271TyrCys: 0.271 ± 0.124
1.787TyrAsp: 1.787 ± 0.337
1.787TyrGlu: 1.787 ± 0.314
0.758TyrPhe: 0.758 ± 0.185
1.841TyrGly: 1.841 ± 0.232
0.433TyrHis: 0.433 ± 0.128
0.812TyrIle: 0.812 ± 0.201
0.379TyrLys: 0.379 ± 0.13
1.679TyrLeu: 1.679 ± 0.292
0.487TyrMet: 0.487 ± 0.163
0.271TyrAsn: 0.271 ± 0.112
1.245TyrPro: 1.245 ± 0.318
0.541TyrGln: 0.541 ± 0.166
2.112TyrArg: 2.112 ± 0.33
0.812TyrSer: 0.812 ± 0.216
1.029TyrThr: 1.029 ± 0.258
2.058TyrVal: 2.058 ± 0.358
0.379TyrTrp: 0.379 ± 0.116
0.379TyrTyr: 0.379 ± 0.133
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (18469 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski