Amino acid dipepetide frequency for Arthrobacter phage Faja

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.0AlaAla: 18.0 ± 2.583
0.906AlaCys: 0.906 ± 0.223
7.49AlaAsp: 7.49 ± 0.871
9.241AlaGlu: 9.241 ± 0.998
2.174AlaPhe: 2.174 ± 0.334
11.053AlaGly: 11.053 ± 1.189
2.96AlaHis: 2.96 ± 0.418
5.859AlaIle: 5.859 ± 0.685
5.678AlaLys: 5.678 ± 0.768
9.06AlaLeu: 9.06 ± 0.933
3.262AlaMet: 3.262 ± 0.359
3.805AlaAsn: 3.805 ± 0.505
4.47AlaPro: 4.47 ± 0.781
5.013AlaGln: 5.013 ± 0.644
7.007AlaArg: 7.007 ± 0.709
6.04AlaSer: 6.04 ± 0.78
6.705AlaThr: 6.705 ± 0.633
8.819AlaVal: 8.819 ± 0.683
2.476AlaTrp: 2.476 ± 0.328
2.778AlaTyr: 2.778 ± 0.501
0.0AlaXaa: 0.0 ± 0.0
Cys
0.664CysAla: 0.664 ± 0.186
0.242CysCys: 0.242 ± 0.113
0.664CysAsp: 0.664 ± 0.191
0.664CysGlu: 0.664 ± 0.176
0.181CysPhe: 0.181 ± 0.098
0.966CysGly: 0.966 ± 0.319
0.302CysHis: 0.302 ± 0.142
0.121CysIle: 0.121 ± 0.088
0.846CysLys: 0.846 ± 0.215
0.423CysLeu: 0.423 ± 0.169
0.06CysMet: 0.06 ± 0.051
0.302CysAsn: 0.302 ± 0.127
1.208CysPro: 1.208 ± 0.298
0.302CysGln: 0.302 ± 0.123
0.725CysArg: 0.725 ± 0.258
0.544CysSer: 0.544 ± 0.22
0.906CysThr: 0.906 ± 0.234
0.604CysVal: 0.604 ± 0.205
0.181CysTrp: 0.181 ± 0.101
0.06CysTyr: 0.06 ± 0.055
0.0CysXaa: 0.0 ± 0.0
Asp
8.215AspAla: 8.215 ± 0.793
0.423AspCys: 0.423 ± 0.185
3.322AspAsp: 3.322 ± 0.523
3.443AspGlu: 3.443 ± 0.51
1.631AspPhe: 1.631 ± 0.243
7.792AspGly: 7.792 ± 0.863
1.027AspHis: 1.027 ± 0.256
3.02AspIle: 3.02 ± 0.454
2.537AspLys: 2.537 ± 0.395
5.315AspLeu: 5.315 ± 0.454
1.087AspMet: 1.087 ± 0.307
1.57AspAsn: 1.57 ± 0.301
3.926AspPro: 3.926 ± 0.507
1.993AspGln: 1.993 ± 0.282
2.899AspArg: 2.899 ± 0.418
2.778AspSer: 2.778 ± 0.394
2.476AspThr: 2.476 ± 0.355
4.349AspVal: 4.349 ± 0.495
1.329AspTrp: 1.329 ± 0.254
1.812AspTyr: 1.812 ± 0.382
0.0AspXaa: 0.0 ± 0.0
Glu
8.275GluAla: 8.275 ± 0.737
0.483GluCys: 0.483 ± 0.176
3.564GluAsp: 3.564 ± 0.45
2.778GluGlu: 2.778 ± 0.387
1.812GluPhe: 1.812 ± 0.357
4.228GluGly: 4.228 ± 0.561
1.45GluHis: 1.45 ± 0.301
2.416GluIle: 2.416 ± 0.436
2.899GluLys: 2.899 ± 0.45
4.651GluLeu: 4.651 ± 0.524
1.148GluMet: 1.148 ± 0.285
2.235GluAsn: 2.235 ± 0.396
2.597GluPro: 2.597 ± 0.439
2.718GluGln: 2.718 ± 0.42
4.651GluArg: 4.651 ± 0.498
2.899GluSer: 2.899 ± 0.466
3.805GluThr: 3.805 ± 0.556
4.228GluVal: 4.228 ± 0.524
1.148GluTrp: 1.148 ± 0.281
1.268GluTyr: 1.268 ± 0.261
0.0GluXaa: 0.0 ± 0.0
Phe
2.537PheAla: 2.537 ± 0.409
0.302PheCys: 0.302 ± 0.152
2.658PheAsp: 2.658 ± 0.387
1.329PheGlu: 1.329 ± 0.279
0.906PhePhe: 0.906 ± 0.241
2.718PheGly: 2.718 ± 0.355
1.087PheHis: 1.087 ± 0.227
0.664PheIle: 0.664 ± 0.212
1.148PheLys: 1.148 ± 0.299
2.114PheLeu: 2.114 ± 0.433
0.725PheMet: 0.725 ± 0.216
0.966PheAsn: 0.966 ± 0.229
1.268PhePro: 1.268 ± 0.251
1.208PheGln: 1.208 ± 0.242
2.054PheArg: 2.054 ± 0.322
2.114PheSer: 2.114 ± 0.359
1.993PheThr: 1.993 ± 0.372
1.933PheVal: 1.933 ± 0.451
0.544PheTrp: 0.544 ± 0.149
0.966PheTyr: 0.966 ± 0.303
0.0PheXaa: 0.0 ± 0.0
Gly
9.121GlyAla: 9.121 ± 1.076
0.906GlyCys: 0.906 ± 0.185
4.711GlyAsp: 4.711 ± 0.515
4.107GlyGlu: 4.107 ± 0.491
3.08GlyPhe: 3.08 ± 0.429
6.282GlyGly: 6.282 ± 0.937
1.389GlyHis: 1.389 ± 0.36
3.02GlyIle: 3.02 ± 0.37
4.168GlyLys: 4.168 ± 0.491
6.886GlyLeu: 6.886 ± 0.673
2.476GlyMet: 2.476 ± 0.413
3.08GlyAsn: 3.08 ± 0.454
4.409GlyPro: 4.409 ± 1.112
2.597GlyGln: 2.597 ± 0.418
5.496GlyArg: 5.496 ± 0.633
4.892GlySer: 4.892 ± 0.556
5.799GlyThr: 5.799 ± 0.636
6.221GlyVal: 6.221 ± 0.618
1.933GlyTrp: 1.933 ± 0.378
2.658GlyTyr: 2.658 ± 0.368
0.0GlyXaa: 0.0 ± 0.0
His
2.718HisAla: 2.718 ± 0.348
0.423HisCys: 0.423 ± 0.154
1.087HisAsp: 1.087 ± 0.235
2.295HisGlu: 2.295 ± 0.471
0.906HisPhe: 0.906 ± 0.26
1.57HisGly: 1.57 ± 0.313
0.966HisHis: 0.966 ± 0.259
1.45HisIle: 1.45 ± 0.285
0.483HisLys: 0.483 ± 0.173
1.389HisLeu: 1.389 ± 0.293
0.544HisMet: 0.544 ± 0.183
1.389HisAsn: 1.389 ± 0.308
1.027HisPro: 1.027 ± 0.237
0.906HisGln: 0.906 ± 0.202
1.57HisArg: 1.57 ± 0.275
1.087HisSer: 1.087 ± 0.232
1.51HisThr: 1.51 ± 0.29
1.51HisVal: 1.51 ± 0.281
0.121HisTrp: 0.121 ± 0.086
0.906HisTyr: 0.906 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
4.832IleAla: 4.832 ± 0.629
0.362IleCys: 0.362 ± 0.131
3.141IleAsp: 3.141 ± 0.429
2.778IleGlu: 2.778 ± 0.314
1.148IlePhe: 1.148 ± 0.239
3.141IleGly: 3.141 ± 0.407
1.208IleHis: 1.208 ± 0.238
2.416IleIle: 2.416 ± 0.353
1.812IleLys: 1.812 ± 0.291
3.382IleLeu: 3.382 ± 0.431
0.544IleMet: 0.544 ± 0.158
1.268IleAsn: 1.268 ± 0.309
2.96IlePro: 2.96 ± 0.377
1.631IleGln: 1.631 ± 0.313
3.382IleArg: 3.382 ± 0.443
2.356IleSer: 2.356 ± 0.358
3.926IleThr: 3.926 ± 0.508
3.08IleVal: 3.08 ± 0.417
0.242IleTrp: 0.242 ± 0.105
0.906IleTyr: 0.906 ± 0.256
0.0IleXaa: 0.0 ± 0.0
Lys
5.194LysAla: 5.194 ± 0.848
0.664LysCys: 0.664 ± 0.217
2.537LysAsp: 2.537 ± 0.363
2.235LysGlu: 2.235 ± 0.4
1.268LysPhe: 1.268 ± 0.274
3.02LysGly: 3.02 ± 0.491
0.785LysHis: 0.785 ± 0.241
1.51LysIle: 1.51 ± 0.362
2.054LysLys: 2.054 ± 0.314
3.624LysLeu: 3.624 ± 0.497
1.691LysMet: 1.691 ± 0.323
1.148LysAsn: 1.148 ± 0.222
2.718LysPro: 2.718 ± 0.419
1.51LysGln: 1.51 ± 0.307
3.443LysArg: 3.443 ± 0.412
2.537LysSer: 2.537 ± 0.477
2.899LysThr: 2.899 ± 0.355
2.778LysVal: 2.778 ± 0.345
0.604LysTrp: 0.604 ± 0.179
1.389LysTyr: 1.389 ± 0.25
0.0LysXaa: 0.0 ± 0.0
Leu
9.725LeuAla: 9.725 ± 0.846
0.544LeuCys: 0.544 ± 0.194
5.255LeuAsp: 5.255 ± 0.552
4.772LeuGlu: 4.772 ± 0.538
1.933LeuPhe: 1.933 ± 0.437
7.248LeuGly: 7.248 ± 0.756
1.45LeuHis: 1.45 ± 0.354
3.805LeuIle: 3.805 ± 0.538
2.839LeuLys: 2.839 ± 0.398
6.644LeuLeu: 6.644 ± 0.737
1.45LeuMet: 1.45 ± 0.285
2.114LeuAsn: 2.114 ± 0.357
5.255LeuPro: 5.255 ± 0.704
1.993LeuGln: 1.993 ± 0.426
4.772LeuArg: 4.772 ± 0.462
4.53LeuSer: 4.53 ± 0.597
6.04LeuThr: 6.04 ± 0.533
5.617LeuVal: 5.617 ± 0.739
1.268LeuTrp: 1.268 ± 0.276
1.933LeuTyr: 1.933 ± 0.35
0.0LeuXaa: 0.0 ± 0.0
Met
3.141MetAla: 3.141 ± 0.498
0.181MetCys: 0.181 ± 0.092
0.966MetAsp: 0.966 ± 0.266
0.785MetGlu: 0.785 ± 0.199
0.785MetPhe: 0.785 ± 0.21
1.631MetGly: 1.631 ± 0.683
0.785MetHis: 0.785 ± 0.196
1.027MetIle: 1.027 ± 0.277
1.087MetLys: 1.087 ± 0.24
1.691MetLeu: 1.691 ± 0.314
0.423MetMet: 0.423 ± 0.127
1.208MetAsn: 1.208 ± 0.261
1.45MetPro: 1.45 ± 0.287
0.604MetGln: 0.604 ± 0.171
1.027MetArg: 1.027 ± 0.251
1.812MetSer: 1.812 ± 0.276
2.174MetThr: 2.174 ± 0.334
1.51MetVal: 1.51 ± 0.328
0.181MetTrp: 0.181 ± 0.091
0.483MetTyr: 0.483 ± 0.195
0.0MetXaa: 0.0 ± 0.0
Asn
4.53AsnAla: 4.53 ± 0.504
0.181AsnCys: 0.181 ± 0.11
1.812AsnAsp: 1.812 ± 0.39
1.268AsnGlu: 1.268 ± 0.278
0.785AsnPhe: 0.785 ± 0.189
2.899AsnGly: 2.899 ± 0.501
1.208AsnHis: 1.208 ± 0.316
1.51AsnIle: 1.51 ± 0.291
1.51AsnLys: 1.51 ± 0.381
2.718AsnLeu: 2.718 ± 0.432
0.362AsnMet: 0.362 ± 0.128
0.785AsnAsn: 0.785 ± 0.214
2.718AsnPro: 2.718 ± 0.415
1.027AsnGln: 1.027 ± 0.266
2.295AsnArg: 2.295 ± 0.38
1.812AsnSer: 1.812 ± 0.353
1.812AsnThr: 1.812 ± 0.455
1.933AsnVal: 1.933 ± 0.333
0.664AsnTrp: 0.664 ± 0.194
0.483AsnTyr: 0.483 ± 0.16
0.0AsnXaa: 0.0 ± 0.0
Pro
6.04ProAla: 6.04 ± 0.704
0.846ProCys: 0.846 ± 0.226
3.684ProAsp: 3.684 ± 0.539
3.866ProGlu: 3.866 ± 0.553
1.45ProPhe: 1.45 ± 0.31
5.496ProGly: 5.496 ± 1.04
1.087ProHis: 1.087 ± 0.236
2.114ProIle: 2.114 ± 0.335
2.658ProLys: 2.658 ± 0.38
3.443ProLeu: 3.443 ± 0.702
1.027ProMet: 1.027 ± 0.237
1.631ProAsn: 1.631 ± 0.298
3.684ProPro: 3.684 ± 0.719
1.57ProGln: 1.57 ± 0.335
2.718ProArg: 2.718 ± 0.422
3.926ProSer: 3.926 ± 0.586
3.684ProThr: 3.684 ± 0.481
3.564ProVal: 3.564 ± 0.546
1.027ProTrp: 1.027 ± 0.298
0.725ProTyr: 0.725 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
4.168GlnAla: 4.168 ± 0.557
0.362GlnCys: 0.362 ± 0.162
1.812GlnAsp: 1.812 ± 0.364
1.027GlnGlu: 1.027 ± 0.248
1.027GlnPhe: 1.027 ± 0.251
2.778GlnGly: 2.778 ± 0.503
0.604GlnHis: 0.604 ± 0.169
1.933GlnIle: 1.933 ± 0.275
1.57GlnLys: 1.57 ± 0.241
2.96GlnLeu: 2.96 ± 0.402
0.906GlnMet: 0.906 ± 0.258
0.664GlnAsn: 0.664 ± 0.194
1.812GlnPro: 1.812 ± 0.385
1.993GlnGln: 1.993 ± 0.321
2.778GlnArg: 2.778 ± 0.455
1.631GlnSer: 1.631 ± 0.298
2.476GlnThr: 2.476 ± 0.413
3.02GlnVal: 3.02 ± 0.522
0.423GlnTrp: 0.423 ± 0.164
0.302GlnTyr: 0.302 ± 0.11
0.0GlnXaa: 0.0 ± 0.0
Arg
6.403ArgAla: 6.403 ± 0.556
1.027ArgCys: 1.027 ± 0.236
3.805ArgAsp: 3.805 ± 0.601
3.805ArgGlu: 3.805 ± 0.482
2.537ArgPhe: 2.537 ± 0.35
4.288ArgGly: 4.288 ± 0.56
2.054ArgHis: 2.054 ± 0.363
3.926ArgIle: 3.926 ± 0.461
2.778ArgLys: 2.778 ± 0.375
5.738ArgLeu: 5.738 ± 0.626
1.329ArgMet: 1.329 ± 0.28
1.993ArgAsn: 1.993 ± 0.303
2.718ArgPro: 2.718 ± 0.543
1.933ArgGln: 1.933 ± 0.312
5.013ArgArg: 5.013 ± 0.748
3.805ArgSer: 3.805 ± 0.411
3.262ArgThr: 3.262 ± 0.384
3.564ArgVal: 3.564 ± 0.562
1.631ArgTrp: 1.631 ± 0.355
1.933ArgTyr: 1.933 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
6.705SerAla: 6.705 ± 0.916
0.302SerCys: 0.302 ± 0.127
3.382SerAsp: 3.382 ± 0.51
3.322SerGlu: 3.322 ± 0.473
1.752SerPhe: 1.752 ± 0.331
5.496SerGly: 5.496 ± 0.567
1.027SerHis: 1.027 ± 0.262
3.08SerIle: 3.08 ± 0.351
2.054SerLys: 2.054 ± 0.302
5.557SerLeu: 5.557 ± 0.661
1.872SerMet: 1.872 ± 0.27
1.45SerAsn: 1.45 ± 0.289
2.899SerPro: 2.899 ± 0.537
1.389SerGln: 1.389 ± 0.254
3.262SerArg: 3.262 ± 0.399
3.08SerSer: 3.08 ± 0.445
3.866SerThr: 3.866 ± 0.414
3.805SerVal: 3.805 ± 0.456
0.846SerTrp: 0.846 ± 0.229
1.45SerTyr: 1.45 ± 0.304
0.0SerXaa: 0.0 ± 0.0
Thr
9.121ThrAla: 9.121 ± 0.657
0.604ThrCys: 0.604 ± 0.183
3.564ThrAsp: 3.564 ± 0.443
4.892ThrGlu: 4.892 ± 0.682
2.718ThrPhe: 2.718 ± 0.402
4.651ThrGly: 4.651 ± 0.512
1.329ThrHis: 1.329 ± 0.25
3.322ThrIle: 3.322 ± 0.445
2.839ThrLys: 2.839 ± 0.453
4.288ThrLeu: 4.288 ± 0.59
1.268ThrMet: 1.268 ± 0.339
2.96ThrAsn: 2.96 ± 0.418
3.443ThrPro: 3.443 ± 0.548
2.054ThrGln: 2.054 ± 0.296
2.778ThrArg: 2.778 ± 0.512
3.805ThrSer: 3.805 ± 0.597
4.168ThrThr: 4.168 ± 0.46
5.315ThrVal: 5.315 ± 0.666
1.027ThrTrp: 1.027 ± 0.261
2.295ThrTyr: 2.295 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
8.154ValAla: 8.154 ± 0.728
0.423ValCys: 0.423 ± 0.183
4.953ValAsp: 4.953 ± 0.62
3.745ValGlu: 3.745 ± 0.407
2.054ValPhe: 2.054 ± 0.342
4.832ValGly: 4.832 ± 0.665
1.691ValHis: 1.691 ± 0.34
2.658ValIle: 2.658 ± 0.612
3.201ValLys: 3.201 ± 0.339
5.557ValLeu: 5.557 ± 0.813
1.933ValMet: 1.933 ± 0.304
2.174ValAsn: 2.174 ± 0.313
3.684ValPro: 3.684 ± 0.519
2.356ValGln: 2.356 ± 0.378
4.409ValArg: 4.409 ± 0.451
4.409ValSer: 4.409 ± 0.45
5.617ValThr: 5.617 ± 0.698
5.013ValVal: 5.013 ± 0.488
1.027ValTrp: 1.027 ± 0.221
2.356ValTyr: 2.356 ± 0.435
0.0ValXaa: 0.0 ± 0.0
Trp
2.356TrpAla: 2.356 ± 0.406
0.242TrpCys: 0.242 ± 0.101
1.148TrpAsp: 1.148 ± 0.246
0.785TrpGlu: 0.785 ± 0.204
0.302TrpPhe: 0.302 ± 0.132
0.846TrpGly: 0.846 ± 0.229
0.785TrpHis: 0.785 ± 0.226
0.181TrpIle: 0.181 ± 0.103
0.604TrpLys: 0.604 ± 0.164
1.993TrpLeu: 1.993 ± 0.362
0.181TrpMet: 0.181 ± 0.088
0.846TrpAsn: 0.846 ± 0.25
0.785TrpPro: 0.785 ± 0.229
0.664TrpGln: 0.664 ± 0.182
1.45TrpArg: 1.45 ± 0.275
1.208TrpSer: 1.208 ± 0.294
1.208TrpThr: 1.208 ± 0.234
1.208TrpVal: 1.208 ± 0.309
0.181TrpTrp: 0.181 ± 0.105
0.242TrpTyr: 0.242 ± 0.125
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.899TyrAla: 2.899 ± 0.384
0.544TyrCys: 0.544 ± 0.237
1.389TyrAsp: 1.389 ± 0.31
2.174TyrGlu: 2.174 ± 0.295
0.785TyrPhe: 0.785 ± 0.186
2.295TyrGly: 2.295 ± 0.395
0.664TyrHis: 0.664 ± 0.185
0.483TyrIle: 0.483 ± 0.144
0.846TyrLys: 0.846 ± 0.178
1.752TyrLeu: 1.752 ± 0.31
0.604TyrMet: 0.604 ± 0.14
0.725TyrAsn: 0.725 ± 0.195
1.208TyrPro: 1.208 ± 0.295
0.785TyrGln: 0.785 ± 0.231
1.812TyrArg: 1.812 ± 0.289
1.329TyrSer: 1.329 ± 0.303
2.114TyrThr: 2.114 ± 0.318
2.174TyrVal: 2.174 ± 0.354
0.362TyrTrp: 0.362 ± 0.184
0.664TyrTyr: 0.664 ± 0.177
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (16557 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski