Amino acid dipeptide frequency for Gordonia phage WhoseManz

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
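As a minimal sketch of how such a frequency could be computed, the Python below counts overlapping adjacent residue pairs within each sequence and normalises by the total number of pairs. The function name dipeptide_frequencies, the one-letter input alphabet (the table on this page uses three-letter codes) and the choice of denominator are illustrative assumptions; the exact procedure used to build the table is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption: pairs are counted with overlap inside each protein and never
    across protein boundaries; the denominator is the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every adjacent (first, second) residue pair
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the WhoseManz proteome
toy = ["MAAG", "AAGA"]
freqs = dipeptide_frequencies(toy)
```

Because the pairs are ordered, "AG" and "GA" are counted separately, which matches the table's distinction between, for example, AlaGly and GlyAla.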

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random and with equal probability, each one should appear with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
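The arithmetic behind this baseline can be sketched in a few lines of Python. The classify helper is hypothetical, and it assumes that the colouring threshold is the uniform expectation itself, with no tolerance band; the page does not say how the colours were actually assigned.

```python
STANDARD = 20          # standard amino acids
WITH_XAA = 21          # plus Xaa for non-standard or unknown residues

expected_400 = 1000.0 / STANDARD**2   # 2.5 per mille, i.e. 0.25 %
expected_441 = 1000.0 / WITH_XAA**2   # about 2.268 per mille


def classify(per_mille, expected=expected_400):
    """Label a table value relative to the uniform expectation."""
    if per_mille > expected:
        return "over"    # red in the original table (assumed threshold)
    if per_mille < expected:
        return "under"   # blue in the original table (assumed threshold)
    return "expected"


# Values copied from the table on this page
examples = {"AlaAla": 14.082, "CysCys": 0.147, "LysLys": 2.494}
labels = {name: classify(value) for name, value in examples.items()}
```

Under these assumptions AlaAla is overrepresented, while CysCys and LysLys fall below the 2.5‰ baseline.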

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
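As a small worked example of this conversion, the sketch below turns the AlaAla value from the table into a fraction and a percentage. The helper name per_mille_to_fraction is illustrative, and the final step, which scales by the 20452 amino acids reported for this proteome to get a rough occurrence count, is an assumption about the denominator rather than something the page states.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = per_mille_to_fraction(14.082)   # AlaAla value from the table
percent = ala_ala * 100                   # the same value as a percentage

# Assumption: scaling by the 20452 residues reported for this proteome
# gives a rough occurrence count; the true denominator is not stated.
approx_count = round(ala_ala * 20452)
```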

For more information, see the sequence space article on Wikipedia.

Residue order (rows: first residue; columns: second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Each entry is given as dipeptide: frequency ± error.

Ala
AlaAla: 14.082 ± 1.398
AlaCys: 0.782 ± 0.194
AlaAsp: 7.09 ± 0.926
AlaGlu: 7.432 ± 0.724
AlaPhe: 3.325 ± 0.418
AlaGly: 8.166 ± 0.77
AlaHis: 2.445 ± 0.528
AlaIle: 4.743 ± 0.451
AlaLys: 5.036 ± 0.57
AlaLeu: 8.899 ± 0.839
AlaMet: 2.347 ± 0.389
AlaAsn: 2.836 ± 0.429
AlaPro: 4.401 ± 0.553
AlaGln: 4.205 ± 0.486
AlaArg: 6.406 ± 0.74
AlaSer: 5.281 ± 0.557
AlaThr: 6.014 ± 0.6
AlaVal: 6.846 ± 0.627
AlaTrp: 1.565 ± 0.33
AlaTyr: 2.249 ± 0.334
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.733 ± 0.19
CysCys: 0.147 ± 0.098
CysAsp: 0.782 ± 0.196
CysGlu: 0.538 ± 0.142
CysPhe: 0.098 ± 0.072
CysGly: 0.685 ± 0.173
CysHis: 0.293 ± 0.131
CysIle: 0.293 ± 0.112
CysLys: 0.244 ± 0.112
CysLeu: 1.027 ± 0.206
CysMet: 0.0 ± 0.0
CysAsn: 0.196 ± 0.084
CysPro: 0.538 ± 0.188
CysGln: 0.342 ± 0.118
CysArg: 0.44 ± 0.182
CysSer: 0.587 ± 0.21
CysThr: 0.489 ± 0.147
CysVal: 0.88 ± 0.206
CysTrp: 0.147 ± 0.098
CysTyr: 0.196 ± 0.136
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.872 ± 0.768
AspCys: 0.636 ± 0.182
AspAsp: 7.139 ± 1.218
AspGlu: 4.694 ± 0.674
AspPhe: 2.249 ± 0.305
AspGly: 7.041 ± 0.765
AspHis: 1.76 ± 0.324
AspIle: 3.57 ± 0.413
AspLys: 2.151 ± 0.314
AspLeu: 5.428 ± 0.521
AspMet: 1.711 ± 0.278
AspAsn: 2.103 ± 0.402
AspPro: 4.401 ± 0.461
AspGln: 1.956 ± 0.329
AspArg: 4.645 ± 0.484
AspSer: 3.032 ± 0.627
AspThr: 3.423 ± 0.4
AspVal: 5.232 ± 0.492
AspTrp: 1.271 ± 0.232
AspTyr: 2.005 ± 0.365
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.819 ± 0.486
GluCys: 0.636 ± 0.168
GluAsp: 4.45 ± 0.593
GluGlu: 4.45 ± 0.488
GluPhe: 2.103 ± 0.338
GluGly: 3.472 ± 0.505
GluHis: 1.027 ± 0.172
GluIle: 4.45 ± 0.445
GluLys: 2.934 ± 0.423
GluLeu: 5.917 ± 0.561
GluMet: 1.809 ± 0.31
GluAsn: 2.151 ± 0.305
GluPro: 3.325 ± 0.496
GluGln: 2.347 ± 0.447
GluArg: 4.841 ± 0.722
GluSer: 3.618 ± 0.371
GluThr: 3.227 ± 0.387
GluVal: 4.792 ± 0.423
GluTrp: 1.076 ± 0.206
GluTyr: 1.76 ± 0.33
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.64 ± 0.316
PheCys: 0.391 ± 0.147
PheAsp: 2.592 ± 0.434
PheGlu: 2.298 ± 0.316
PhePhe: 0.733 ± 0.181
PheGly: 2.885 ± 0.432
PheHis: 0.782 ± 0.176
PheIle: 1.271 ± 0.249
PheLys: 0.88 ± 0.184
PheLeu: 2.347 ± 0.32
PheMet: 0.538 ± 0.138
PheAsn: 0.782 ± 0.175
PhePro: 1.271 ± 0.241
PheGln: 0.733 ± 0.207
PheArg: 2.151 ± 0.321
PheSer: 2.054 ± 0.446
PheThr: 2.054 ± 0.292
PheVal: 2.298 ± 0.381
PheTrp: 0.831 ± 0.22
PheTyr: 1.027 ± 0.24
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.726 ± 0.72
GlyCys: 0.831 ± 0.233
GlyAsp: 6.454 ± 0.493
GlyGlu: 5.623 ± 0.507
GlyPhe: 2.787 ± 0.356
GlyGly: 8.508 ± 1.515
GlyHis: 1.663 ± 0.284
GlyIle: 4.596 ± 0.559
GlyLys: 3.521 ± 0.493
GlyLeu: 6.112 ± 0.577
GlyMet: 1.956 ± 0.395
GlyAsn: 2.543 ± 0.366
GlyPro: 3.423 ± 0.458
GlyGln: 3.081 ± 0.485
GlyArg: 5.672 ± 0.626
GlySer: 5.965 ± 0.754
GlyThr: 4.988 ± 0.557
GlyVal: 5.819 ± 0.561
GlyTrp: 1.663 ± 0.298
GlyTyr: 2.885 ± 0.357
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.005 ± 0.326
HisCys: 0.196 ± 0.087
HisAsp: 1.565 ± 0.295
HisGlu: 1.125 ± 0.235
HisPhe: 0.88 ± 0.196
HisGly: 1.858 ± 0.386
HisHis: 0.44 ± 0.158
HisIle: 0.88 ± 0.19
HisLys: 0.733 ± 0.2
HisLeu: 1.174 ± 0.242
HisMet: 0.538 ± 0.189
HisAsn: 0.244 ± 0.111
HisPro: 1.614 ± 0.386
HisGln: 0.293 ± 0.105
HisArg: 1.467 ± 0.301
HisSer: 0.929 ± 0.231
HisThr: 1.125 ± 0.214
HisVal: 1.858 ± 0.343
HisTrp: 0.489 ± 0.148
HisTyr: 0.636 ± 0.204
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.77 ± 0.554
IleCys: 0.196 ± 0.11
IleAsp: 3.374 ± 0.41
IleGlu: 4.156 ± 0.517
IlePhe: 0.831 ± 0.243
IleGly: 4.303 ± 0.607
IleHis: 1.027 ± 0.236
IleIle: 2.103 ± 0.277
IleLys: 1.809 ± 0.338
IleLeu: 3.178 ± 0.343
IleMet: 0.782 ± 0.174
IleAsn: 2.054 ± 0.307
IlePro: 2.445 ± 0.346
IleGln: 1.614 ± 0.365
IleArg: 2.592 ± 0.382
IleSer: 2.983 ± 0.354
IleThr: 2.592 ± 0.384
IleVal: 2.738 ± 0.305
IleTrp: 0.88 ± 0.231
IleTyr: 1.222 ± 0.321
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.303 ± 0.552
LysCys: 0.244 ± 0.101
LysAsp: 3.227 ± 0.375
LysGlu: 1.663 ± 0.309
LysPhe: 1.125 ± 0.23
LysGly: 3.423 ± 0.475
LysHis: 0.733 ± 0.159
LysIle: 2.054 ± 0.289
LysLys: 2.494 ± 0.388
LysLeu: 3.374 ± 0.46
LysMet: 0.978 ± 0.196
LysAsn: 1.516 ± 0.316
LysPro: 2.103 ± 0.362
LysGln: 1.271 ± 0.19
LysArg: 3.227 ± 0.522
LysSer: 1.467 ± 0.348
LysThr: 2.787 ± 0.448
LysVal: 3.276 ± 0.373
LysTrp: 0.88 ± 0.22
LysTyr: 1.467 ± 0.265
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 8.948 ± 0.765
LeuCys: 1.076 ± 0.228
LeuAsp: 5.868 ± 0.519
LeuGlu: 3.912 ± 0.4
LeuPhe: 2.2 ± 0.314
LeuGly: 6.063 ± 0.699
LeuHis: 1.222 ± 0.276
LeuIle: 2.494 ± 0.431
LeuLys: 3.667 ± 0.453
LeuLeu: 5.183 ± 0.518
LeuMet: 1.614 ± 0.223
LeuAsn: 2.347 ± 0.322
LeuPro: 4.694 ± 0.495
LeuGln: 2.103 ± 0.312
LeuArg: 5.672 ± 0.486
LeuSer: 4.499 ± 0.509
LeuThr: 5.623 ± 0.666
LeuVal: 5.672 ± 0.554
LeuTrp: 1.174 ± 0.179
LeuTyr: 1.663 ± 0.345
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.64 ± 0.388
MetCys: 0.0 ± 0.0
MetAsp: 1.125 ± 0.236
MetGlu: 1.125 ± 0.239
MetPhe: 0.978 ± 0.226
MetGly: 0.88 ± 0.22
MetHis: 0.342 ± 0.191
MetIle: 1.418 ± 0.321
MetLys: 1.174 ± 0.23
MetLeu: 1.663 ± 0.282
MetMet: 0.587 ± 0.154
MetAsn: 0.978 ± 0.187
MetPro: 1.76 ± 0.283
MetGln: 0.489 ± 0.147
MetArg: 1.907 ± 0.303
MetSer: 1.858 ± 0.312
MetThr: 1.956 ± 0.358
MetVal: 0.88 ± 0.203
MetTrp: 0.44 ± 0.133
MetTyr: 0.685 ± 0.146
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.494 ± 0.311
AsnCys: 0.293 ± 0.103
AsnAsp: 2.103 ± 0.378
AsnGlu: 2.151 ± 0.272
AsnPhe: 1.174 ± 0.218
AsnGly: 3.618 ± 0.599
AsnHis: 0.733 ± 0.187
AsnIle: 1.271 ± 0.299
AsnLys: 1.222 ± 0.294
AsnLeu: 2.298 ± 0.31
AsnMet: 0.978 ± 0.229
AsnAsn: 0.88 ± 0.224
AsnPro: 2.445 ± 0.38
AsnGln: 0.88 ± 0.174
AsnArg: 2.151 ± 0.325
AsnSer: 1.711 ± 0.34
AsnThr: 2.592 ± 0.416
AsnVal: 2.347 ± 0.472
AsnTrp: 0.831 ± 0.218
AsnTyr: 0.782 ± 0.201
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.183 ± 0.617
ProCys: 0.293 ± 0.134
ProAsp: 4.254 ± 0.515
ProGlu: 3.765 ± 0.491
ProPhe: 1.858 ± 0.289
ProGly: 4.645 ± 0.506
ProHis: 0.636 ± 0.167
ProIle: 2.592 ± 0.513
ProLys: 2.787 ± 0.388
ProLeu: 4.01 ± 0.401
ProMet: 1.32 ± 0.256
ProAsn: 2.005 ± 0.364
ProPro: 3.081 ± 0.458
ProGln: 1.76 ± 0.248
ProArg: 3.863 ± 0.436
ProSer: 3.325 ± 0.48
ProThr: 4.107 ± 0.476
ProVal: 4.107 ± 0.457
ProTrp: 0.636 ± 0.147
ProTyr: 1.027 ± 0.217
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.423 ± 0.469
GlnCys: 0.196 ± 0.098
GlnAsp: 1.271 ± 0.23
GlnGlu: 1.565 ± 0.297
GlnPhe: 1.076 ± 0.229
GlnGly: 2.005 ± 0.311
GlnHis: 0.489 ± 0.154
GlnIle: 1.858 ± 0.287
GlnLys: 1.369 ± 0.227
GlnLeu: 2.983 ± 0.485
GlnMet: 1.271 ± 0.229
GlnAsn: 1.418 ± 0.265
GlnPro: 1.32 ± 0.229
GlnGln: 1.418 ± 0.353
GlnArg: 3.423 ± 0.452
GlnSer: 1.125 ± 0.18
GlnThr: 2.054 ± 0.314
GlnVal: 2.396 ± 0.443
GlnTrp: 0.733 ± 0.167
GlnTyr: 0.636 ± 0.173
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.846 ± 0.735
ArgCys: 1.125 ± 0.296
ArgAsp: 4.45 ± 0.634
ArgGlu: 5.623 ± 0.689
ArgPhe: 2.689 ± 0.477
ArgGly: 6.161 ± 0.668
ArgHis: 1.956 ± 0.364
ArgIle: 3.081 ± 0.374
ArgLys: 3.57 ± 0.401
ArgLeu: 4.596 ± 0.482
ArgMet: 2.005 ± 0.315
ArgAsn: 2.2 ± 0.306
ArgPro: 3.374 ± 0.407
ArgGln: 2.592 ± 0.442
ArgArg: 7.286 ± 0.711
ArgSer: 3.178 ± 0.411
ArgThr: 3.961 ± 0.66
ArgVal: 5.085 ± 0.516
ArgTrp: 1.32 ± 0.236
ArgTyr: 1.76 ± 0.294
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.259 ± 0.646
SerCys: 0.244 ± 0.109
SerAsp: 4.058 ± 0.7
SerGlu: 3.129 ± 0.351
SerPhe: 1.32 ± 0.225
SerGly: 5.77 ± 0.547
SerHis: 1.174 ± 0.244
SerIle: 2.543 ± 0.335
SerLys: 2.005 ± 0.391
SerLeu: 3.227 ± 0.425
SerMet: 1.418 ± 0.262
SerAsn: 2.445 ± 0.4
SerPro: 2.885 ± 0.399
SerGln: 1.467 ± 0.27
SerArg: 3.814 ± 0.533
SerSer: 3.081 ± 0.595
SerThr: 3.716 ± 0.402
SerVal: 3.325 ± 0.327
SerTrp: 1.076 ± 0.187
SerTyr: 1.32 ± 0.201
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.063 ± 0.693
ThrCys: 0.44 ± 0.127
ThrAsp: 3.667 ± 0.38
ThrGlu: 2.885 ± 0.406
ThrPhe: 2.054 ± 0.266
ThrGly: 7.286 ± 0.604
ThrHis: 1.222 ± 0.28
ThrIle: 2.983 ± 0.403
ThrLys: 2.543 ± 0.362
ThrLeu: 4.743 ± 0.524
ThrMet: 1.076 ± 0.194
ThrAsn: 1.809 ± 0.308
ThrPro: 5.183 ± 0.652
ThrGln: 1.956 ± 0.367
ThrArg: 4.401 ± 0.509
ThrSer: 2.885 ± 0.341
ThrThr: 4.303 ± 0.453
ThrVal: 4.89 ± 0.543
ThrTrp: 1.467 ± 0.236
ThrTyr: 1.956 ± 0.354
ThrXaa: 0.0 ± 0.0

Val
ValAla: 7.432 ± 0.725
ValCys: 0.44 ± 0.182
ValAsp: 5.672 ± 0.562
ValGlu: 5.428 ± 0.564
ValPhe: 1.809 ± 0.271
ValGly: 5.183 ± 0.483
ValHis: 1.076 ± 0.232
ValIle: 3.129 ± 0.449
ValLys: 2.396 ± 0.348
ValLeu: 5.232 ± 0.441
ValMet: 0.978 ± 0.223
ValAsn: 2.592 ± 0.374
ValPro: 3.912 ± 0.405
ValGln: 2.249 ± 0.276
ValArg: 5.574 ± 0.626
ValSer: 4.205 ± 0.394
ValThr: 5.917 ± 0.596
ValVal: 5.33 ± 0.528
ValTrp: 1.222 ± 0.319
ValTyr: 1.663 ± 0.292
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.858 ± 0.296
TrpCys: 0.196 ± 0.105
TrpAsp: 1.418 ± 0.272
TrpGlu: 1.174 ± 0.235
TrpPhe: 0.489 ± 0.131
TrpGly: 1.222 ± 0.3
TrpHis: 0.538 ± 0.168
TrpIle: 0.831 ± 0.216
TrpLys: 0.489 ± 0.175
TrpLeu: 1.614 ± 0.332
TrpMet: 0.293 ± 0.114
TrpAsn: 0.782 ± 0.193
TrpPro: 0.782 ± 0.182
TrpGln: 0.782 ± 0.231
TrpArg: 1.076 ± 0.245
TrpSer: 1.174 ± 0.216
TrpThr: 1.663 ± 0.316
TrpVal: 1.809 ± 0.338
TrpTrp: 0.782 ± 0.181
TrpTyr: 0.196 ± 0.093
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.054 ± 0.306
TyrCys: 0.293 ± 0.129
TyrAsp: 1.76 ± 0.316
TyrGlu: 1.467 ± 0.259
TyrPhe: 0.782 ± 0.175
TyrGly: 2.64 ± 0.338
TyrHis: 0.489 ± 0.19
TyrIle: 0.489 ± 0.19
TyrLys: 0.587 ± 0.155
TyrLeu: 2.689 ± 0.467
TyrMet: 0.636 ± 0.166
TyrAsn: 1.076 ± 0.234
TyrPro: 2.347 ± 0.369
TyrGln: 0.489 ± 0.139
TyrArg: 2.249 ± 0.308
TyrSer: 1.32 ± 0.277
TyrThr: 1.222 ± 0.227
TyrVal: 1.858 ± 0.325
TyrTrp: 0.685 ± 0.221
TyrTyr: 0.489 ± 0.163
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 92 proteins (20452 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
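A protein-level bootstrap of this kind can be sketched as follows: whole proteins are drawn with replacement, the dipeptide frequency is recomputed for each resampled proteome, and the spread of those estimates is taken as the error. The function names, the toy sequences, the use of the population standard deviation as the spread measure and the fixed seed are all assumptions made for illustration; only the number of replicates (100) and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the proteome size
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


toy = ["MAAG", "AAGA", "MKKL", "GAAK"]   # toy sequences, not real data
err = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.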

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski