Amino acid dipeptide frequency for Gordonia phage Ebert

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
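As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences, overlapping dipeptide windows that never span two proteins, and that any non-standard symbol is mapped to X (Xaa). The function name `dipeptide_frequencies` is hypothetical, and this is not necessarily how the database computes its values.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids plus X for non-standard/unknown residues (assumed alphabet).
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies for all 441 ordered residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X.
        cleaned = "".join(c if c in ALPHABET else "X" for c in seq.upper())
        # Overlapping windows of length 2 within a single protein.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


# Under a uniform model over the 400 standard pairs, each pair is expected at 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400
```

Calling `dipeptide_frequencies(["AAC", "CA"])` counts the three dipeptides AA, AC and CA, so each of them receives one third of the total.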

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
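The unit conversion and the comparison with the uniform expectation can be sketched in Python as follows. The helper names are hypothetical, and the assumption that the red/blue marking uses exactly the 2.5‰ threshold is ours, not something the page states.

```python
# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_EXPECTATION_PER_MILLE):
    """Label a dipeptide relative to the assumed uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


# Two values taken from the table: AlaAla = 18.239, CysCys = 0.134 (per mille).
ala_ala_fraction = per_mille_to_fraction(18.239)  # about 0.018239, i.e. about 1.8%
ala_ala_label = classify(18.239)
cys_cys_label = classify(0.134)
```

With these definitions AlaAla falls above the uniform expectation and CysCys below it.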

For more information see sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.239 ± 2.216
AlaCys: 0.802 ± 0.217
AlaAsp: 8.418 ± 0.866
AlaGlu: 8.886 ± 0.907
AlaPhe: 2.806 ± 0.464
AlaGly: 9.487 ± 0.739
AlaHis: 2.272 ± 0.344
AlaIle: 7.149 ± 0.674
AlaLys: 3.942 ± 0.457
AlaLeu: 8.886 ± 0.847
AlaMet: 3.474 ± 0.427
AlaAsn: 4.142 ± 0.678
AlaPro: 5.746 ± 0.64
AlaGln: 4.476 ± 0.619
AlaArg: 7.817 ± 0.788
AlaSer: 6.948 ± 0.643
AlaThr: 8.084 ± 0.947
AlaVal: 7.75 ± 0.805
AlaTrp: 1.67 ± 0.302
AlaTyr: 2.272 ± 0.371
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.002 ± 0.299
CysCys: 0.134 ± 0.096
CysAsp: 0.601 ± 0.311
CysGlu: 0.468 ± 0.198
CysPhe: 0.134 ± 0.071
CysGly: 0.869 ± 0.311
CysHis: 0.2 ± 0.111
CysIle: 0.067 ± 0.061
CysLys: 0.267 ± 0.123
CysLeu: 0.534 ± 0.186
CysMet: 0.134 ± 0.087
CysAsn: 0.134 ± 0.111
CysPro: 0.534 ± 0.184
CysGln: 0.267 ± 0.172
CysArg: 0.802 ± 0.244
CysSer: 0.534 ± 0.214
CysThr: 0.534 ± 0.183
CysVal: 0.401 ± 0.172
CysTrp: 0.267 ± 0.108
CysTyr: 0.2 ± 0.105
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.616 ± 0.699
AspCys: 0.802 ± 0.309
AspAsp: 4.877 ± 0.756
AspGlu: 4.677 ± 0.54
AspPhe: 2.071 ± 0.365
AspGly: 6.414 ± 0.574
AspHis: 1.403 ± 0.321
AspIle: 2.739 ± 0.453
AspLys: 1.603 ± 0.316
AspLeu: 7.149 ± 0.779
AspMet: 1.336 ± 0.322
AspAsn: 2.071 ± 0.468
AspPro: 5.077 ± 0.536
AspGln: 1.804 ± 0.351
AspArg: 5.144 ± 0.489
AspSer: 2.672 ± 0.34
AspThr: 3.942 ± 0.436
AspVal: 4.543 ± 0.428
AspTrp: 1.603 ± 0.275
AspTyr: 1.47 ± 0.33
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.215 ± 0.772
GluCys: 0.935 ± 0.311
GluAsp: 2.539 ± 0.413
GluGlu: 2.873 ± 0.56
GluPhe: 2.004 ± 0.35
GluGly: 3.942 ± 0.51
GluHis: 1.203 ± 0.317
GluIle: 3.274 ± 0.496
GluLys: 1.737 ± 0.264
GluLeu: 5.679 ± 0.685
GluMet: 1.403 ± 0.328
GluAsn: 1.47 ± 0.251
GluPro: 3.407 ± 0.427
GluGln: 3.274 ± 0.47
GluArg: 5.211 ± 0.692
GluSer: 3.407 ± 0.575
GluThr: 3.407 ± 0.538
GluVal: 2.873 ± 0.398
GluTrp: 0.802 ± 0.24
GluTyr: 1.737 ± 0.402
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.407 ± 0.434
PheCys: 0.134 ± 0.09
PheAsp: 2.672 ± 0.548
PheGlu: 1.269 ± 0.295
PhePhe: 0.735 ± 0.224
PheGly: 3.274 ± 0.48
PheHis: 0.334 ± 0.135
PheIle: 0.869 ± 0.227
PheLys: 0.668 ± 0.204
PheLeu: 1.47 ± 0.262
PheMet: 0.401 ± 0.143
PheAsn: 0.534 ± 0.184
PhePro: 0.935 ± 0.267
PheGln: 0.534 ± 0.182
PheArg: 1.804 ± 0.38
PheSer: 1.403 ± 0.261
PheThr: 2.272 ± 0.423
PheVal: 2.004 ± 0.492
PheTrp: 1.47 ± 0.379
PheTyr: 0.267 ± 0.119
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.084 ± 0.882
GlyCys: 0.401 ± 0.207
GlyAsp: 6.414 ± 0.61
GlyGlu: 5.011 ± 0.486
GlyPhe: 2.138 ± 0.409
GlyGly: 7.883 ± 1.109
GlyHis: 1.537 ± 0.261
GlyIle: 4.276 ± 0.429
GlyLys: 3.608 ± 0.439
GlyLeu: 6.414 ± 0.678
GlyMet: 1.871 ± 0.353
GlyAsn: 2.873 ± 0.534
GlyPro: 4.142 ± 0.512
GlyGln: 3.073 ± 0.411
GlyArg: 6.547 ± 0.647
GlySer: 5.077 ± 0.745
GlyThr: 5.679 ± 0.834
GlyVal: 6.414 ± 0.623
GlyTrp: 1.336 ± 0.296
GlyTyr: 2.606 ± 0.389
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.737 ± 0.384
HisCys: 0.267 ± 0.131
HisAsp: 1.336 ± 0.296
HisGlu: 1.069 ± 0.274
HisPhe: 0.401 ± 0.148
HisGly: 1.871 ± 0.443
HisHis: 0.735 ± 0.204
HisIle: 0.668 ± 0.218
HisLys: 0.601 ± 0.169
HisLeu: 1.871 ± 0.405
HisMet: 0.2 ± 0.089
HisAsn: 0.534 ± 0.2
HisPro: 1.67 ± 0.337
HisGln: 1.002 ± 0.244
HisArg: 1.269 ± 0.361
HisSer: 0.534 ± 0.19
HisThr: 1.136 ± 0.234
HisVal: 1.002 ± 0.262
HisTrp: 0.668 ± 0.232
HisTyr: 0.601 ± 0.182
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.08 ± 0.683
IleCys: 0.334 ± 0.141
IleAsp: 4.61 ± 0.447
IleGlu: 3.207 ± 0.454
IlePhe: 1.47 ± 0.388
IleGly: 4.743 ± 0.421
IleHis: 0.869 ± 0.245
IleIle: 1.737 ± 0.325
IleLys: 1.136 ± 0.334
IleLeu: 3.207 ± 0.512
IleMet: 0.601 ± 0.188
IleAsn: 1.336 ± 0.31
IlePro: 2.272 ± 0.304
IleGln: 2.472 ± 0.372
IleArg: 3.474 ± 0.433
IleSer: 3.006 ± 0.437
IleThr: 3.006 ± 0.436
IleVal: 4.209 ± 0.531
IleTrp: 0.334 ± 0.152
IleTyr: 1.537 ± 0.338
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.808 ± 0.541
LysCys: 0.067 ± 0.07
LysAsp: 2.004 ± 0.38
LysGlu: 1.203 ± 0.355
LysPhe: 0.802 ± 0.302
LysGly: 2.138 ± 0.415
LysHis: 0.668 ± 0.237
LysIle: 1.203 ± 0.258
LysLys: 0.735 ± 0.279
LysLeu: 2.94 ± 0.525
LysMet: 0.802 ± 0.282
LysAsn: 0.802 ± 0.239
LysPro: 1.937 ± 0.417
LysGln: 1.136 ± 0.259
LysArg: 2.138 ± 0.397
LysSer: 1.804 ± 0.436
LysThr: 1.937 ± 0.338
LysVal: 2.539 ± 0.332
LysTrp: 0.534 ± 0.19
LysTyr: 0.668 ± 0.205
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.623 ± 0.902
LeuCys: 0.802 ± 0.28
LeuAsp: 5.211 ± 0.535
LeuGlu: 3.741 ± 0.415
LeuPhe: 1.804 ± 0.494
LeuGly: 7.616 ± 0.766
LeuHis: 1.203 ± 0.31
LeuIle: 3.675 ± 0.521
LeuLys: 1.937 ± 0.42
LeuLeu: 5.345 ± 0.47
LeuMet: 1.737 ± 0.375
LeuAsn: 2.071 ± 0.404
LeuPro: 4.944 ± 0.452
LeuGln: 2.272 ± 0.389
LeuArg: 5.412 ± 0.737
LeuSer: 4.543 ± 0.569
LeuThr: 5.545 ± 0.669
LeuVal: 6.28 ± 0.59
LeuTrp: 1.269 ± 0.248
LeuTyr: 1.537 ± 0.288
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.94 ± 0.553
MetCys: 0.334 ± 0.172
MetAsp: 1.002 ± 0.277
MetGlu: 0.802 ± 0.193
MetPhe: 0.468 ± 0.168
MetGly: 1.603 ± 0.351
MetHis: 0.401 ± 0.16
MetIle: 1.336 ± 0.334
MetLys: 0.534 ± 0.212
MetLeu: 1.804 ± 0.33
MetMet: 0.401 ± 0.155
MetAsn: 0.735 ± 0.284
MetPro: 1.136 ± 0.273
MetGln: 0.601 ± 0.217
MetArg: 1.336 ± 0.308
MetSer: 1.804 ± 0.348
MetThr: 2.472 ± 0.425
MetVal: 0.869 ± 0.201
MetTrp: 0.267 ± 0.123
MetTyr: 0.267 ± 0.12
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.875 ± 0.548
AsnCys: 0.067 ± 0.074
AsnAsp: 1.47 ± 0.251
AsnGlu: 1.47 ± 0.34
AsnPhe: 0.802 ± 0.434
AsnGly: 2.739 ± 0.486
AsnHis: 0.802 ± 0.376
AsnIle: 1.403 ± 0.315
AsnLys: 0.935 ± 0.277
AsnLeu: 1.871 ± 0.349
AsnMet: 0.401 ± 0.168
AsnAsn: 0.601 ± 0.151
AsnPro: 2.138 ± 0.524
AsnGln: 1.269 ± 0.358
AsnArg: 2.606 ± 0.571
AsnSer: 1.67 ± 0.32
AsnThr: 1.937 ± 0.468
AsnVal: 1.47 ± 0.267
AsnTrp: 0.401 ± 0.149
AsnTyr: 0.869 ± 0.209
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.683 ± 0.896
ProCys: 0.468 ± 0.221
ProAsp: 4.944 ± 0.597
ProGlu: 4.61 ± 0.602
ProPhe: 0.935 ± 0.263
ProGly: 5.077 ± 0.51
ProHis: 1.069 ± 0.272
ProIle: 2.539 ± 0.445
ProLys: 1.737 ± 0.388
ProLeu: 3.34 ± 0.428
ProMet: 1.002 ± 0.266
ProAsn: 2.004 ± 0.422
ProPro: 3.541 ± 0.462
ProGln: 1.67 ± 0.277
ProArg: 4.075 ± 0.579
ProSer: 2.873 ± 0.409
ProThr: 3.541 ± 0.595
ProVal: 3.541 ± 0.499
ProTrp: 1.47 ± 0.279
ProTyr: 1.002 ± 0.22
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.942 ± 0.53
GlnCys: 0.267 ± 0.128
GlnAsp: 1.67 ± 0.336
GlnGlu: 1.537 ± 0.323
GlnPhe: 1.069 ± 0.264
GlnGly: 2.004 ± 0.4
GlnHis: 0.802 ± 0.223
GlnIle: 2.739 ± 0.387
GlnLys: 1.136 ± 0.3
GlnLeu: 2.739 ± 0.326
GlnMet: 1.603 ± 0.3
GlnAsn: 1.002 ± 0.246
GlnPro: 2.004 ± 0.329
GlnGln: 2.004 ± 0.45
GlnArg: 3.608 ± 0.713
GlnSer: 1.804 ± 0.36
GlnThr: 2.606 ± 0.434
GlnVal: 3.006 ± 0.421
GlnTrp: 0.735 ± 0.2
GlnTyr: 0.668 ± 0.253
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.616 ± 0.617
ArgCys: 0.534 ± 0.218
ArgAsp: 4.009 ± 0.568
ArgGlu: 5.011 ± 0.675
ArgPhe: 2.672 ± 0.416
ArgGly: 4.81 ± 0.565
ArgHis: 1.136 ± 0.343
ArgIle: 4.476 ± 0.549
ArgLys: 3.14 ± 0.415
ArgLeu: 5.812 ± 0.646
ArgMet: 1.937 ± 0.36
ArgAsn: 2.539 ± 0.342
ArgPro: 4.276 ± 0.604
ArgGln: 2.472 ± 0.405
ArgArg: 6.547 ± 0.866
ArgSer: 3.741 ± 0.465
ArgThr: 4.81 ± 0.594
ArgVal: 5.812 ± 0.634
ArgTrp: 1.603 ± 0.37
ArgTyr: 1.804 ± 0.402
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.215 ± 0.828
SerCys: 0.334 ± 0.141
SerAsp: 3.207 ± 0.425
SerGlu: 2.806 ± 0.367
SerPhe: 1.67 ± 0.337
SerGly: 6.08 ± 0.69
SerHis: 0.735 ± 0.212
SerIle: 3.006 ± 0.486
SerLys: 1.603 ± 0.374
SerLeu: 4.476 ± 0.6
SerMet: 0.735 ± 0.238
SerAsn: 1.336 ± 0.273
SerPro: 3.34 ± 0.542
SerGln: 2.138 ± 0.327
SerArg: 3.407 ± 0.532
SerSer: 2.739 ± 0.459
SerThr: 3.608 ± 0.488
SerVal: 3.34 ± 0.446
SerTrp: 1.804 ± 0.331
SerTyr: 1.136 ± 0.241
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.42 ± 1.279
ThrCys: 0.534 ± 0.188
ThrAsp: 4.61 ± 0.555
ThrGlu: 3.474 ± 0.497
ThrPhe: 1.136 ± 0.304
ThrGly: 6.28 ± 0.753
ThrHis: 1.136 ± 0.282
ThrIle: 4.61 ± 0.507
ThrLys: 1.403 ± 0.346
ThrLeu: 5.478 ± 0.498
ThrMet: 1.069 ± 0.273
ThrAsn: 1.804 ± 0.418
ThrPro: 3.675 ± 0.465
ThrGln: 2.071 ± 0.362
ThrArg: 3.808 ± 0.442
ThrSer: 4.276 ± 0.551
ThrThr: 5.077 ± 0.664
ThrVal: 5.278 ± 0.64
ThrTrp: 1.47 ± 0.28
ThrTyr: 0.735 ± 0.213
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.284 ± 0.801
ValCys: 0.534 ± 0.194
ValAsp: 6.881 ± 0.684
ValGlu: 4.142 ± 0.396
ValPhe: 2.205 ± 0.392
ValGly: 4.944 ± 0.657
ValHis: 1.336 ± 0.337
ValIle: 2.94 ± 0.46
ValLys: 1.804 ± 0.322
ValLeu: 4.677 ± 0.55
ValMet: 1.203 ± 0.325
ValAsn: 1.937 ± 0.45
ValPro: 4.075 ± 0.604
ValGln: 2.472 ± 0.374
ValArg: 6.146 ± 0.579
ValSer: 3.808 ± 0.602
ValThr: 5.011 ± 0.587
ValVal: 6.347 ± 0.764
ValTrp: 1.069 ± 0.257
ValTyr: 1.203 ± 0.348
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.937 ± 0.357
TrpCys: 0.267 ± 0.144
TrpAsp: 1.269 ± 0.342
TrpGlu: 1.002 ± 0.212
TrpPhe: 0.735 ± 0.249
TrpGly: 1.67 ± 0.351
TrpHis: 0.401 ± 0.189
TrpIle: 0.401 ± 0.179
TrpLys: 0.401 ± 0.133
TrpLeu: 2.606 ± 0.399
TrpMet: 0.601 ± 0.185
TrpAsn: 0.468 ± 0.205
TrpPro: 1.269 ± 0.322
TrpGln: 0.802 ± 0.226
TrpArg: 1.871 ± 0.29
TrpSer: 1.136 ± 0.236
TrpThr: 1.203 ± 0.264
TrpVal: 1.069 ± 0.209
TrpTrp: 0.735 ± 0.258
TrpTyr: 0.267 ± 0.135
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.606 ± 0.423
TyrCys: 0.067 ± 0.072
TyrAsp: 1.269 ± 0.316
TyrGlu: 1.136 ± 0.332
TyrPhe: 0.401 ± 0.156
TyrGly: 1.937 ± 0.352
TyrHis: 0.935 ± 0.347
TyrIle: 0.534 ± 0.157
TyrLys: 0.935 ± 0.17
TyrLeu: 1.269 ± 0.226
TyrMet: 0.134 ± 0.069
TyrAsn: 0.401 ± 0.149
TyrPro: 1.002 ± 0.267
TyrGln: 1.136 ± 0.31
TyrArg: 1.737 ± 0.443
TyrSer: 0.935 ± 0.201
TyrThr: 1.537 ± 0.306
TyrVal: 2.272 ± 0.409
TyrTrp: 0.601 ± 0.23
TyrTyr: 0.401 ± 0.153
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (14969 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
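A protein-level bootstrap of this kind could look roughly like the Python sketch below. It assumes that whole proteins are resampled with replacement and that the reported error is the standard deviation across replicates; the page states neither detail, and the function names are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide over overlapping windows."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over resampled proteomes."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each dipeptide inside its original sequence context, so the spread reflects variation between proteins.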

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski