Amino acid dipeptide frequency for Gordonia phage DumpsterDude

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
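As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and compares each frequency with the uniform 2.5‰ expectation. The function names (`dipeptide_per_mille`, `label`), the use of one-letter codes with `X` standing in for Xaa, and the normalization by the total number of dipeptides are assumptions made for illustration; the page does not state the exact procedure used by the database, so values may differ slightly.

```python
from collections import Counter

# The 20 standard amino acids as one-letter codes (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation over the 400 standard dipeptides: 1000/400 = 2.5 per mille (0.25%).
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown letter is mapped to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted inside one protein only, never across protein boundaries.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Classify a frequency relative to the expected value."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input (hypothetical sequences, not taken from the phage proteome).
toy = ["MAAGL", "AAGX"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), label(value))
```

With real data, `proteins` would hold the 71 protein sequences of the proteome; in this sketch, "over" corresponds to the red cells described above and "under" to the blue ones.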

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
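The unit conversion can be illustrated with a short sketch; the helper names below are hypothetical and only restate the arithmetic (1‰ = 10⁻³ = 0.1%).

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value / 10.0


# AlaAla is listed at 18.414 per mille in the table.
print(per_mille_to_fraction(18.414))
print(per_mille_to_percent(18.414))
```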

For more information, see the sequence space article on Wikipedia.

Residues (row and column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.414 ± 1.629
AlaCys: 1.402 ± 0.366
AlaAsp: 8.414 ± 0.787
AlaGlu: 8.841 ± 0.695
AlaPhe: 2.988 ± 0.429
AlaGly: 10.853 ± 1.321
AlaHis: 2.195 ± 0.49
AlaIle: 6.341 ± 0.572
AlaLys: 5.305 ± 0.882
AlaLeu: 9.573 ± 0.966
AlaMet: 2.866 ± 0.429
AlaAsn: 3.536 ± 0.482
AlaPro: 7.865 ± 0.817
AlaGln: 3.658 ± 0.518
AlaArg: 8.597 ± 1.043
AlaSer: 5.853 ± 0.777
AlaThr: 5.731 ± 0.649
AlaVal: 7.682 ± 0.719
AlaTrp: 2.195 ± 0.391
AlaTyr: 2.561 ± 0.311
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.793 ± 0.255
CysCys: 0.366 ± 0.197
CysAsp: 0.854 ± 0.29
CysGlu: 0.549 ± 0.204
CysPhe: 0.0 ± 0.0
CysGly: 1.158 ± 0.328
CysHis: 0.244 ± 0.114
CysIle: 0.183 ± 0.093
CysLys: 0.122 ± 0.083
CysLeu: 0.366 ± 0.175
CysMet: 0.122 ± 0.082
CysAsn: 0.183 ± 0.107
CysPro: 0.488 ± 0.169
CysGln: 0.183 ± 0.1
CysArg: 0.732 ± 0.23
CysSer: 0.915 ± 0.229
CysThr: 0.793 ± 0.184
CysVal: 0.549 ± 0.162
CysTrp: 0.244 ± 0.152
CysTyr: 0.183 ± 0.097
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.414 ± 0.764
AspCys: 0.427 ± 0.164
AspAsp: 6.036 ± 0.723
AspGlu: 4.207 ± 0.615
AspPhe: 1.585 ± 0.391
AspGly: 6.768 ± 0.686
AspHis: 2.073 ± 0.386
AspIle: 3.11 ± 0.391
AspLys: 1.402 ± 0.31
AspLeu: 6.036 ± 0.696
AspMet: 1.341 ± 0.263
AspAsn: 1.768 ± 0.411
AspPro: 4.207 ± 0.556
AspGln: 2.5 ± 0.338
AspArg: 4.878 ± 0.609
AspSer: 2.866 ± 0.472
AspThr: 4.024 ± 0.485
AspVal: 5.0 ± 0.595
AspTrp: 1.646 ± 0.285
AspTyr: 1.585 ± 0.477
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.036 ± 0.651
GluCys: 0.488 ± 0.182
GluAsp: 3.414 ± 0.413
GluGlu: 2.256 ± 0.393
GluPhe: 1.707 ± 0.336
GluGly: 3.414 ± 0.465
GluHis: 0.976 ± 0.25
GluIle: 2.317 ± 0.34
GluLys: 2.927 ± 0.531
GluLeu: 6.28 ± 0.711
GluMet: 0.854 ± 0.209
GluAsn: 1.158 ± 0.255
GluPro: 2.622 ± 0.367
GluGln: 2.012 ± 0.278
GluArg: 5.244 ± 0.744
GluSer: 2.988 ± 0.433
GluThr: 3.719 ± 0.519
GluVal: 4.451 ± 0.52
GluTrp: 1.28 ± 0.214
GluTyr: 1.341 ± 0.251
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.353 ± 0.438
PheCys: 0.122 ± 0.078
PheAsp: 1.585 ± 0.318
PheGlu: 1.341 ± 0.23
PhePhe: 0.305 ± 0.117
PheGly: 2.683 ± 0.469
PheHis: 0.793 ± 0.24
PheIle: 1.097 ± 0.268
PheLys: 1.097 ± 0.317
PheLeu: 1.646 ± 0.388
PheMet: 0.671 ± 0.17
PheAsn: 0.732 ± 0.199
PhePro: 1.28 ± 0.303
PheGln: 0.549 ± 0.179
PheArg: 2.073 ± 0.321
PheSer: 1.524 ± 0.315
PheThr: 2.561 ± 0.373
PheVal: 1.89 ± 0.318
PheTrp: 0.671 ± 0.186
PheTyr: 0.549 ± 0.152
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.634 ± 0.962
GlyCys: 0.488 ± 0.207
GlyAsp: 6.463 ± 0.861
GlyGlu: 4.512 ± 0.577
GlyPhe: 2.988 ± 0.51
GlyGly: 8.048 ± 1.622
GlyHis: 1.341 ± 0.28
GlyIle: 4.329 ± 0.992
GlyLys: 2.866 ± 0.327
GlyLeu: 7.195 ± 0.764
GlyMet: 1.951 ± 0.379
GlyAsn: 2.622 ± 0.36
GlyPro: 4.085 ± 0.53
GlyGln: 3.11 ± 0.439
GlyArg: 6.097 ± 0.616
GlySer: 5.426 ± 0.802
GlyThr: 5.731 ± 0.56
GlyVal: 6.951 ± 0.626
GlyTrp: 1.646 ± 0.318
GlyTyr: 1.768 ± 0.37
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.256 ± 0.393
HisCys: 0.244 ± 0.106
HisAsp: 1.158 ± 0.274
HisGlu: 1.037 ± 0.247
HisPhe: 0.549 ± 0.203
HisGly: 1.524 ± 0.281
HisHis: 0.793 ± 0.323
HisIle: 1.037 ± 0.282
HisLys: 0.549 ± 0.175
HisLeu: 1.89 ± 0.367
HisMet: 0.183 ± 0.112
HisAsn: 0.793 ± 0.193
HisPro: 1.89 ± 0.399
HisGln: 0.61 ± 0.163
HisArg: 1.341 ± 0.393
HisSer: 1.158 ± 0.277
HisThr: 1.89 ± 0.284
HisVal: 1.646 ± 0.355
HisTrp: 0.305 ± 0.129
HisTyr: 0.427 ± 0.155
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.036 ± 0.869
IleCys: 0.366 ± 0.17
IleAsp: 4.878 ± 0.534
IleGlu: 3.536 ± 0.521
IlePhe: 1.219 ± 0.273
IleGly: 4.085 ± 0.724
IleHis: 0.854 ± 0.221
IleIle: 1.89 ± 0.34
IleLys: 1.89 ± 0.545
IleLeu: 2.5 ± 0.413
IleMet: 0.671 ± 0.203
IleAsn: 0.915 ± 0.205
IlePro: 2.683 ± 0.486
IleGln: 1.341 ± 0.331
IleArg: 3.049 ± 0.424
IleSer: 2.439 ± 0.366
IleThr: 3.719 ± 0.493
IleVal: 3.292 ± 0.389
IleTrp: 0.427 ± 0.164
IleTyr: 1.097 ± 0.257
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.329 ± 0.853
LysCys: 0.244 ± 0.119
LysAsp: 1.707 ± 0.34
LysGlu: 1.524 ± 0.354
LysPhe: 1.097 ± 0.311
LysGly: 2.744 ± 0.387
LysHis: 0.427 ± 0.166
LysIle: 1.463 ± 0.449
LysLys: 1.158 ± 0.304
LysLeu: 2.5 ± 0.432
LysMet: 0.61 ± 0.175
LysAsn: 1.28 ± 0.587
LysPro: 2.195 ± 0.309
LysGln: 1.158 ± 0.308
LysArg: 2.5 ± 0.373
LysSer: 1.89 ± 0.298
LysThr: 2.988 ± 0.428
LysVal: 3.78 ± 0.394
LysTrp: 0.305 ± 0.134
LysTyr: 0.549 ± 0.209
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.512 ± 0.727
LeuCys: 0.671 ± 0.197
LeuAsp: 6.158 ± 0.598
LeuGlu: 4.268 ± 0.435
LeuPhe: 2.561 ± 0.419
LeuGly: 6.829 ± 0.522
LeuHis: 1.89 ± 0.328
LeuIle: 3.11 ± 0.455
LeuLys: 1.89 ± 0.34
LeuLeu: 5.853 ± 0.556
LeuMet: 1.524 ± 0.345
LeuAsn: 2.256 ± 0.404
LeuPro: 3.597 ± 0.465
LeuGln: 2.073 ± 0.314
LeuArg: 6.402 ± 0.712
LeuSer: 4.573 ± 0.643
LeuThr: 6.097 ± 0.652
LeuVal: 5.548 ± 0.61
LeuTrp: 1.037 ± 0.234
LeuTyr: 1.463 ± 0.283
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.439 ± 0.322
MetCys: 0.366 ± 0.129
MetAsp: 0.732 ± 0.186
MetGlu: 0.305 ± 0.119
MetPhe: 0.427 ± 0.155
MetGly: 1.28 ± 0.302
MetHis: 0.244 ± 0.14
MetIle: 0.854 ± 0.161
MetLys: 0.488 ± 0.156
MetLeu: 1.402 ± 0.333
MetMet: 0.366 ± 0.143
MetAsn: 0.854 ± 0.198
MetPro: 0.915 ± 0.208
MetGln: 0.793 ± 0.193
MetArg: 1.037 ± 0.2
MetSer: 1.524 ± 0.261
MetThr: 2.256 ± 0.337
MetVal: 1.28 ± 0.213
MetTrp: 0.244 ± 0.119
MetTyr: 0.183 ± 0.095
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.049 ± 0.517
AsnCys: 0.183 ± 0.103
AsnAsp: 1.524 ± 0.347
AsnGlu: 1.219 ± 0.294
AsnPhe: 0.915 ± 0.261
AsnGly: 3.232 ± 0.526
AsnHis: 0.915 ± 0.184
AsnIle: 0.671 ± 0.19
AsnLys: 0.854 ± 0.251
AsnLeu: 2.073 ± 0.484
AsnMet: 0.122 ± 0.099
AsnAsn: 0.427 ± 0.138
AsnPro: 2.561 ± 0.399
AsnGln: 0.854 ± 0.224
AsnArg: 1.89 ± 0.354
AsnSer: 1.524 ± 0.356
AsnThr: 1.829 ± 0.29
AsnVal: 2.195 ± 0.455
AsnTrp: 0.671 ± 0.283
AsnTyr: 0.305 ± 0.127
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.256 ± 0.901
ProCys: 0.549 ± 0.213
ProAsp: 5.426 ± 0.588
ProGlu: 3.171 ± 0.465
ProPhe: 0.976 ± 0.212
ProGly: 4.878 ± 0.535
ProHis: 0.976 ± 0.265
ProIle: 2.561 ± 0.371
ProLys: 2.317 ± 0.327
ProLeu: 2.988 ± 0.434
ProMet: 1.097 ± 0.271
ProAsn: 1.158 ± 0.254
ProPro: 3.292 ± 0.54
ProGln: 2.256 ± 0.415
ProArg: 4.207 ± 0.639
ProSer: 3.232 ± 0.394
ProThr: 4.329 ± 0.501
ProVal: 4.817 ± 0.387
ProTrp: 1.037 ± 0.276
ProTyr: 1.341 ± 0.376
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.085 ± 0.721
GlnCys: 0.183 ± 0.131
GlnAsp: 1.28 ± 0.301
GlnGlu: 1.158 ± 0.335
GlnPhe: 1.341 ± 0.314
GlnGly: 2.134 ± 0.381
GlnHis: 1.037 ± 0.24
GlnIle: 1.402 ± 0.312
GlnLys: 1.037 ± 0.298
GlnLeu: 3.292 ± 0.393
GlnMet: 0.366 ± 0.128
GlnAsn: 0.854 ± 0.224
GlnPro: 1.768 ± 0.294
GlnGln: 1.341 ± 0.279
GlnArg: 2.744 ± 0.471
GlnSer: 1.829 ± 0.342
GlnThr: 1.768 ± 0.336
GlnVal: 2.561 ± 0.366
GlnTrp: 0.976 ± 0.299
GlnTyr: 0.61 ± 0.248
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.231 ± 0.808
ArgCys: 1.037 ± 0.273
ArgAsp: 4.939 ± 0.533
ArgGlu: 4.939 ± 0.632
ArgPhe: 1.951 ± 0.406
ArgGly: 5.67 ± 0.604
ArgHis: 1.951 ± 0.413
ArgIle: 3.78 ± 0.49
ArgLys: 2.317 ± 0.501
ArgLeu: 5.975 ± 0.786
ArgMet: 1.28 ± 0.302
ArgAsn: 2.317 ± 0.379
ArgPro: 4.085 ± 0.635
ArgGln: 2.622 ± 0.501
ArgArg: 6.768 ± 0.961
ArgSer: 3.11 ± 0.444
ArgThr: 4.39 ± 0.572
ArgVal: 5.487 ± 0.666
ArgTrp: 1.585 ± 0.352
ArgTyr: 1.463 ± 0.327
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.561 ± 0.792
SerCys: 0.244 ± 0.125
SerAsp: 2.866 ± 0.451
SerGlu: 2.561 ± 0.43
SerPhe: 0.976 ± 0.265
SerGly: 6.89 ± 1.003
SerHis: 1.037 ± 0.207
SerIle: 2.805 ± 0.346
SerLys: 1.951 ± 0.323
SerLeu: 3.719 ± 0.653
SerMet: 1.28 ± 0.307
SerAsn: 1.28 ± 0.253
SerPro: 2.683 ± 0.476
SerGln: 2.012 ± 0.368
SerArg: 3.597 ± 0.598
SerSer: 2.866 ± 0.467
SerThr: 4.085 ± 0.514
SerVal: 4.817 ± 0.619
SerTrp: 0.793 ± 0.184
SerTyr: 1.037 ± 0.211
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.658 ± 1.004
ThrCys: 0.61 ± 0.212
ThrAsp: 4.878 ± 0.596
ThrGlu: 3.11 ± 0.525
ThrPhe: 2.012 ± 0.404
ThrGly: 6.219 ± 0.688
ThrHis: 1.402 ± 0.364
ThrIle: 4.634 ± 0.447
ThrLys: 2.317 ± 0.58
ThrLeu: 5.061 ± 0.636
ThrMet: 1.037 ± 0.249
ThrAsn: 1.341 ± 0.314
ThrPro: 4.207 ± 0.449
ThrGln: 1.28 ± 0.27
ThrArg: 4.878 ± 0.669
ThrSer: 3.414 ± 0.504
ThrThr: 5.366 ± 0.572
ThrVal: 5.67 ± 0.566
ThrTrp: 0.915 ± 0.221
ThrTyr: 1.707 ± 0.362
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.999 ± 0.893
ValCys: 0.61 ± 0.247
ValAsp: 5.244 ± 0.666
ValGlu: 4.756 ± 0.683
ValPhe: 1.89 ± 0.313
ValGly: 5.548 ± 0.628
ValHis: 1.28 ± 0.225
ValIle: 3.232 ± 0.436
ValLys: 2.866 ± 0.313
ValLeu: 6.402 ± 0.542
ValMet: 0.915 ± 0.244
ValAsn: 2.134 ± 0.31
ValPro: 5.122 ± 0.466
ValGln: 2.134 ± 0.367
ValArg: 5.0 ± 0.601
ValSer: 4.939 ± 0.605
ValThr: 5.122 ± 0.516
ValVal: 6.036 ± 0.801
ValTrp: 2.073 ± 0.368
ValTyr: 0.732 ± 0.207
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.463 ± 0.262
TrpCys: 0.244 ± 0.115
TrpAsp: 1.402 ± 0.301
TrpGlu: 0.854 ± 0.238
TrpPhe: 0.671 ± 0.183
TrpGly: 1.219 ± 0.278
TrpHis: 0.488 ± 0.194
TrpIle: 1.097 ± 0.249
TrpLys: 0.366 ± 0.141
TrpLeu: 1.768 ± 0.37
TrpMet: 0.427 ± 0.152
TrpAsn: 0.793 ± 0.337
TrpPro: 1.28 ± 0.213
TrpGln: 0.793 ± 0.242
TrpArg: 1.28 ± 0.234
TrpSer: 1.463 ± 0.266
TrpThr: 1.219 ± 0.31
TrpVal: 1.037 ± 0.233
TrpTrp: 0.427 ± 0.146
TrpTyr: 0.549 ± 0.235
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.353 ± 0.422
TyrCys: 0.183 ± 0.101
TyrAsp: 0.915 ± 0.203
TyrGlu: 1.341 ± 0.358
TyrPhe: 0.366 ± 0.133
TyrGly: 2.073 ± 0.319
TyrHis: 0.366 ± 0.142
TyrIle: 0.976 ± 0.251
TyrLys: 0.732 ± 0.213
TyrLeu: 0.793 ± 0.21
TyrMet: 0.244 ± 0.145
TyrAsn: 0.61 ± 0.191
TyrPro: 1.037 ± 0.245
TyrGln: 0.488 ± 0.172
TyrArg: 1.585 ± 0.337
TyrSer: 1.524 ± 0.237
TyrThr: 1.097 ± 0.375
TyrVal: 1.341 ± 0.381
TyrTrp: 0.427 ± 0.157
TyrTyr: 0.427 ± 0.211
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (16402 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
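A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names, the fixed seed, and the choice of the sample standard deviation as the error measure are assumptions for illustration; the page only states that 100 bootstrap resamples were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the reported error is the standard deviation across resamples.
    return statistics.stdev(estimates)


# Toy input with one-letter codes (hypothetical sequences).
toy = ["ACAC", "AAAA", "CCAC", "ACCA"]
print(pair_per_mille(toy, "AC"), "±", bootstrap_error(toy, "AC"))
```

Resampling whole proteins rather than individual dipeptides keeps the residue pairs of each protein together, so the error reflects variation between proteins.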

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
This proteome is also available in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski