Amino acid dipepetide frequency for Streptomyces phage IchabodCrane

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.247AlaAla: 8.247 ± 0.987
0.65AlaCys: 0.65 ± 0.142
4.7AlaAsp: 4.7 ± 0.457
6.326AlaGlu: 6.326 ± 0.649
3.222AlaPhe: 3.222 ± 0.306
6.888AlaGly: 6.888 ± 0.64
1.448AlaHis: 1.448 ± 0.17
4.109AlaIle: 4.109 ± 0.451
5.528AlaLys: 5.528 ± 0.457
6.828AlaLeu: 6.828 ± 0.555
2.838AlaMet: 2.838 ± 0.304
3.606AlaAsn: 3.606 ± 0.47
2.956AlaPro: 2.956 ± 0.32
3.222AlaGln: 3.222 ± 0.416
4.405AlaArg: 4.405 ± 0.447
4.493AlaSer: 4.493 ± 0.489
4.582AlaThr: 4.582 ± 0.679
5.883AlaVal: 5.883 ± 0.527
1.508AlaTrp: 1.508 ± 0.165
3.163AlaTyr: 3.163 ± 0.3
0.0AlaXaa: 0.0 ± 0.0
Cys
0.709CysAla: 0.709 ± 0.129
0.089CysCys: 0.089 ± 0.048
0.503CysAsp: 0.503 ± 0.146
0.65CysGlu: 0.65 ± 0.163
0.355CysPhe: 0.355 ± 0.112
1.242CysGly: 1.242 ± 0.25
0.266CysHis: 0.266 ± 0.094
0.621CysIle: 0.621 ± 0.146
0.65CysLys: 0.65 ± 0.15
0.739CysLeu: 0.739 ± 0.155
0.325CysMet: 0.325 ± 0.095
0.532CysAsn: 0.532 ± 0.121
0.443CysPro: 0.443 ± 0.13
0.266CysGln: 0.266 ± 0.093
0.739CysArg: 0.739 ± 0.168
0.739CysSer: 0.739 ± 0.163
0.473CysThr: 0.473 ± 0.123
0.562CysVal: 0.562 ± 0.14
0.296CysTrp: 0.296 ± 0.088
0.325CysTyr: 0.325 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
5.676AspAla: 5.676 ± 0.539
0.709AspCys: 0.709 ± 0.159
3.636AspAsp: 3.636 ± 0.411
4.641AspGlu: 4.641 ± 0.426
3.045AspPhe: 3.045 ± 0.301
5.764AspGly: 5.764 ± 0.505
0.946AspHis: 0.946 ± 0.215
3.547AspIle: 3.547 ± 0.399
4.227AspLys: 4.227 ± 0.379
4.818AspLeu: 4.818 ± 0.38
2.454AspMet: 2.454 ± 0.253
2.631AspAsn: 2.631 ± 0.244
2.217AspPro: 2.217 ± 0.247
1.596AspGln: 1.596 ± 0.179
2.749AspArg: 2.749 ± 0.255
3.932AspSer: 3.932 ± 0.348
3.281AspThr: 3.281 ± 0.355
4.05AspVal: 4.05 ± 0.347
1.537AspTrp: 1.537 ± 0.203
2.808AspTyr: 2.808 ± 0.296
0.0AspXaa: 0.0 ± 0.0
Glu
6.119GluAla: 6.119 ± 0.507
0.769GluCys: 0.769 ± 0.157
4.7GluAsp: 4.7 ± 0.463
5.084GluGlu: 5.084 ± 0.538
3.163GluPhe: 3.163 ± 0.307
4.611GluGly: 4.611 ± 0.408
1.567GluHis: 1.567 ± 0.281
4.375GluIle: 4.375 ± 0.34
4.641GluLys: 4.641 ± 0.533
5.587GluLeu: 5.587 ± 0.44
1.833GluMet: 1.833 ± 0.261
2.808GluAsn: 2.808 ± 0.281
1.774GluPro: 1.774 ± 0.218
2.867GluGln: 2.867 ± 0.354
3.991GluArg: 3.991 ± 0.387
3.193GluSer: 3.193 ± 0.329
3.932GluThr: 3.932 ± 0.413
5.232GluVal: 5.232 ± 0.487
1.478GluTrp: 1.478 ± 0.245
2.542GluTyr: 2.542 ± 0.355
0.0GluXaa: 0.0 ± 0.0
Phe
2.601PheAla: 2.601 ± 0.296
0.414PheCys: 0.414 ± 0.111
3.104PheAsp: 3.104 ± 0.313
3.34PheGlu: 3.34 ± 0.31
1.271PhePhe: 1.271 ± 0.193
3.045PheGly: 3.045 ± 0.266
0.798PheHis: 0.798 ± 0.178
1.833PheIle: 1.833 ± 0.226
1.655PheLys: 1.655 ± 0.258
2.306PheLeu: 2.306 ± 0.33
1.005PheMet: 1.005 ± 0.18
2.247PheAsn: 2.247 ± 0.196
1.271PhePro: 1.271 ± 0.217
0.916PheGln: 0.916 ± 0.185
2.158PheArg: 2.158 ± 0.252
2.72PheSer: 2.72 ± 0.408
2.542PheThr: 2.542 ± 0.293
2.631PheVal: 2.631 ± 0.289
0.503PheTrp: 0.503 ± 0.113
1.685PheTyr: 1.685 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
4.937GlyAla: 4.937 ± 0.475
0.798GlyCys: 0.798 ± 0.174
4.759GlyAsp: 4.759 ± 0.4
4.257GlyGlu: 4.257 ± 0.371
3.104GlyPhe: 3.104 ± 0.309
5.084GlyGly: 5.084 ± 0.489
1.478GlyHis: 1.478 ± 0.205
4.641GlyIle: 4.641 ± 0.439
5.528GlyLys: 5.528 ± 0.386
5.291GlyLeu: 5.291 ± 0.426
2.601GlyMet: 2.601 ± 0.253
3.902GlyAsn: 3.902 ± 0.359
2.187GlyPro: 2.187 ± 0.297
2.394GlyGln: 2.394 ± 0.27
3.961GlyArg: 3.961 ± 0.321
3.902GlySer: 3.902 ± 0.502
5.735GlyThr: 5.735 ± 0.744
5.587GlyVal: 5.587 ± 0.452
1.567GlyTrp: 1.567 ± 0.205
2.956GlyTyr: 2.956 ± 0.273
0.0GlyXaa: 0.0 ± 0.0
His
1.064HisAla: 1.064 ± 0.193
0.177HisCys: 0.177 ± 0.067
1.035HisAsp: 1.035 ± 0.184
1.36HisGlu: 1.36 ± 0.212
0.621HisPhe: 0.621 ± 0.133
1.448HisGly: 1.448 ± 0.227
0.503HisHis: 0.503 ± 0.115
0.916HisIle: 0.916 ± 0.157
0.946HisLys: 0.946 ± 0.163
1.123HisLeu: 1.123 ± 0.165
0.473HisMet: 0.473 ± 0.119
1.035HisAsn: 1.035 ± 0.214
0.798HisPro: 0.798 ± 0.189
0.532HisGln: 0.532 ± 0.105
1.33HisArg: 1.33 ± 0.203
0.946HisSer: 0.946 ± 0.15
0.769HisThr: 0.769 ± 0.166
1.389HisVal: 1.389 ± 0.223
0.266HisTrp: 0.266 ± 0.093
0.769HisTyr: 0.769 ± 0.141
0.0HisXaa: 0.0 ± 0.0
Ile
5.025IleAla: 5.025 ± 0.384
0.591IleCys: 0.591 ± 0.134
4.375IleAsp: 4.375 ± 0.412
4.345IleGlu: 4.345 ± 0.408
1.685IlePhe: 1.685 ± 0.23
3.872IleGly: 3.872 ± 0.416
0.828IleHis: 0.828 ± 0.17
2.72IleIle: 2.72 ± 0.294
4.168IleLys: 4.168 ± 0.344
3.193IleLeu: 3.193 ± 0.335
1.626IleMet: 1.626 ± 0.205
2.187IleAsn: 2.187 ± 0.305
1.774IlePro: 1.774 ± 0.176
1.862IleGln: 1.862 ± 0.284
2.986IleArg: 2.986 ± 0.272
2.69IleSer: 2.69 ± 0.341
3.488IleThr: 3.488 ± 0.355
4.464IleVal: 4.464 ± 0.411
0.769IleTrp: 0.769 ± 0.17
1.715IleTyr: 1.715 ± 0.22
0.0IleXaa: 0.0 ± 0.0
Lys
5.705LysAla: 5.705 ± 0.531
0.769LysCys: 0.769 ± 0.192
4.079LysAsp: 4.079 ± 0.5
3.784LysGlu: 3.784 ± 0.394
2.483LysPhe: 2.483 ± 0.249
3.725LysGly: 3.725 ± 0.335
0.975LysHis: 0.975 ± 0.167
4.464LysIle: 4.464 ± 0.36
5.025LysLys: 5.025 ± 0.465
4.168LysLeu: 4.168 ± 0.468
2.276LysMet: 2.276 ± 0.244
3.636LysAsn: 3.636 ± 0.347
2.808LysPro: 2.808 ± 0.324
2.779LysGln: 2.779 ± 0.264
3.577LysArg: 3.577 ± 0.394
3.813LysSer: 3.813 ± 0.322
3.577LysThr: 3.577 ± 0.328
4.405LysVal: 4.405 ± 0.369
1.36LysTrp: 1.36 ± 0.205
2.69LysTyr: 2.69 ± 0.256
0.0LysXaa: 0.0 ± 0.0
Leu
6.769LeuAla: 6.769 ± 0.517
0.857LeuCys: 0.857 ± 0.181
5.084LeuAsp: 5.084 ± 0.389
5.823LeuGlu: 5.823 ± 0.559
2.542LeuPhe: 2.542 ± 0.345
4.907LeuGly: 4.907 ± 0.423
1.36LeuHis: 1.36 ± 0.224
3.961LeuIle: 3.961 ± 0.421
4.405LeuLys: 4.405 ± 0.412
4.759LeuLeu: 4.759 ± 0.419
1.774LeuMet: 1.774 ± 0.256
3.222LeuAsn: 3.222 ± 0.303
2.72LeuPro: 2.72 ± 0.269
1.478LeuGln: 1.478 ± 0.279
4.434LeuArg: 4.434 ± 0.385
5.084LeuSer: 5.084 ± 0.307
4.789LeuThr: 4.789 ± 0.371
4.079LeuVal: 4.079 ± 0.363
1.626LeuTrp: 1.626 ± 0.275
2.424LeuTyr: 2.424 ± 0.336
0.0LeuXaa: 0.0 ± 0.0
Met
3.193MetAla: 3.193 ± 0.328
0.414MetCys: 0.414 ± 0.112
1.774MetAsp: 1.774 ± 0.195
1.448MetGlu: 1.448 ± 0.204
0.828MetPhe: 0.828 ± 0.213
1.921MetGly: 1.921 ± 0.367
0.532MetHis: 0.532 ± 0.131
1.685MetIle: 1.685 ± 0.21
1.981MetLys: 1.981 ± 0.268
1.774MetLeu: 1.774 ± 0.195
0.916MetMet: 0.916 ± 0.158
1.774MetAsn: 1.774 ± 0.209
1.33MetPro: 1.33 ± 0.201
1.123MetGln: 1.123 ± 0.204
2.187MetArg: 2.187 ± 0.263
2.187MetSer: 2.187 ± 0.234
2.513MetThr: 2.513 ± 0.287
1.508MetVal: 1.508 ± 0.206
0.296MetTrp: 0.296 ± 0.101
1.035MetTyr: 1.035 ± 0.189
0.0MetXaa: 0.0 ± 0.0
Asn
3.459AsnAla: 3.459 ± 0.412
0.384AsnCys: 0.384 ± 0.1
2.897AsnAsp: 2.897 ± 0.309
3.518AsnGlu: 3.518 ± 0.316
1.744AsnPhe: 1.744 ± 0.22
3.961AsnGly: 3.961 ± 0.422
0.887AsnHis: 0.887 ± 0.157
1.774AsnIle: 1.774 ± 0.206
3.37AsnLys: 3.37 ± 0.401
3.459AsnLeu: 3.459 ± 0.391
1.182AsnMet: 1.182 ± 0.203
1.862AsnAsn: 1.862 ± 0.261
2.276AsnPro: 2.276 ± 0.31
1.36AsnGln: 1.36 ± 0.2
2.365AsnArg: 2.365 ± 0.245
2.779AsnSer: 2.779 ± 0.304
2.69AsnThr: 2.69 ± 0.445
3.311AsnVal: 3.311 ± 0.368
0.857AsnTrp: 0.857 ± 0.156
1.951AsnTyr: 1.951 ± 0.262
0.0AsnXaa: 0.0 ± 0.0
Pro
3.074ProAla: 3.074 ± 0.279
0.414ProCys: 0.414 ± 0.129
2.838ProAsp: 2.838 ± 0.377
2.631ProGlu: 2.631 ± 0.3
1.567ProPhe: 1.567 ± 0.229
2.956ProGly: 2.956 ± 0.357
0.621ProHis: 0.621 ± 0.11
1.537ProIle: 1.537 ± 0.202
1.892ProLys: 1.892 ± 0.23
2.513ProLeu: 2.513 ± 0.347
1.035ProMet: 1.035 ± 0.176
1.508ProAsn: 1.508 ± 0.23
1.123ProPro: 1.123 ± 0.263
0.916ProGln: 0.916 ± 0.172
1.774ProArg: 1.774 ± 0.242
2.069ProSer: 2.069 ± 0.344
2.454ProThr: 2.454 ± 0.5
3.37ProVal: 3.37 ± 0.335
0.384ProTrp: 0.384 ± 0.119
1.182ProTyr: 1.182 ± 0.195
0.0ProXaa: 0.0 ± 0.0
Gln
3.636GlnAla: 3.636 ± 0.495
0.296GlnCys: 0.296 ± 0.094
1.212GlnAsp: 1.212 ± 0.211
2.276GlnGlu: 2.276 ± 0.284
1.478GlnPhe: 1.478 ± 0.232
1.774GlnGly: 1.774 ± 0.225
0.414GlnHis: 0.414 ± 0.112
1.744GlnIle: 1.744 ± 0.232
2.394GlnLys: 2.394 ± 0.273
2.572GlnLeu: 2.572 ± 0.252
0.946GlnMet: 0.946 ± 0.211
1.478GlnAsn: 1.478 ± 0.258
1.035GlnPro: 1.035 ± 0.204
1.242GlnGln: 1.242 ± 0.34
2.128GlnArg: 2.128 ± 0.305
2.306GlnSer: 2.306 ± 0.262
1.537GlnThr: 1.537 ± 0.243
2.365GlnVal: 2.365 ± 0.299
0.621GlnTrp: 0.621 ± 0.132
1.182GlnTyr: 1.182 ± 0.167
0.0GlnXaa: 0.0 ± 0.0
Arg
4.996ArgAla: 4.996 ± 0.5
0.739ArgCys: 0.739 ± 0.176
3.311ArgAsp: 3.311 ± 0.296
4.345ArgGlu: 4.345 ± 0.425
2.04ArgPhe: 2.04 ± 0.299
3.636ArgGly: 3.636 ± 0.3
0.916ArgHis: 0.916 ± 0.215
2.66ArgIle: 2.66 ± 0.335
3.843ArgLys: 3.843 ± 0.44
4.227ArgLeu: 4.227 ± 0.413
2.099ArgMet: 2.099 ± 0.312
2.483ArgAsn: 2.483 ± 0.267
1.685ArgPro: 1.685 ± 0.24
2.01ArgGln: 2.01 ± 0.286
3.133ArgArg: 3.133 ± 0.429
2.956ArgSer: 2.956 ± 0.323
2.926ArgThr: 2.926 ± 0.306
3.754ArgVal: 3.754 ± 0.359
1.242ArgTrp: 1.242 ± 0.241
2.424ArgTyr: 2.424 ± 0.285
0.0ArgXaa: 0.0 ± 0.0
Ser
4.611SerAla: 4.611 ± 0.582
0.414SerCys: 0.414 ± 0.11
3.459SerAsp: 3.459 ± 0.34
3.37SerGlu: 3.37 ± 0.304
2.542SerPhe: 2.542 ± 0.258
5.528SerGly: 5.528 ± 0.604
1.005SerHis: 1.005 ± 0.168
3.459SerIle: 3.459 ± 0.308
3.784SerLys: 3.784 ± 0.363
4.493SerLeu: 4.493 ± 0.374
2.01SerMet: 2.01 ± 0.253
2.365SerAsn: 2.365 ± 0.37
2.04SerPro: 2.04 ± 0.327
1.744SerGln: 1.744 ± 0.247
3.488SerArg: 3.488 ± 0.301
3.252SerSer: 3.252 ± 0.491
3.872SerThr: 3.872 ± 0.678
4.02SerVal: 4.02 ± 0.427
1.36SerTrp: 1.36 ± 0.194
1.803SerTyr: 1.803 ± 0.268
0.0SerXaa: 0.0 ± 0.0
Thr
4.848ThrAla: 4.848 ± 0.625
0.68ThrCys: 0.68 ± 0.152
3.932ThrAsp: 3.932 ± 0.332
3.902ThrGlu: 3.902 ± 0.385
2.276ThrPhe: 2.276 ± 0.286
5.498ThrGly: 5.498 ± 0.861
0.769ThrHis: 0.769 ± 0.178
3.547ThrIle: 3.547 ± 0.442
3.459ThrLys: 3.459 ± 0.298
4.434ThrLeu: 4.434 ± 0.381
1.301ThrMet: 1.301 ± 0.182
2.956ThrAsn: 2.956 ± 0.498
3.015ThrPro: 3.015 ± 0.405
2.394ThrGln: 2.394 ± 0.287
2.867ThrArg: 2.867 ± 0.307
3.37ThrSer: 3.37 ± 0.373
3.784ThrThr: 3.784 ± 0.644
4.671ThrVal: 4.671 ± 0.483
1.33ThrTrp: 1.33 ± 0.199
2.099ThrTyr: 2.099 ± 0.382
0.0ThrXaa: 0.0 ± 0.0
Val
5.735ValAla: 5.735 ± 0.389
0.739ValCys: 0.739 ± 0.158
4.818ValAsp: 4.818 ± 0.452
4.405ValGlu: 4.405 ± 0.364
2.276ValPhe: 2.276 ± 0.241
4.405ValGly: 4.405 ± 0.361
1.182ValHis: 1.182 ± 0.177
4.05ValIle: 4.05 ± 0.288
4.789ValLys: 4.789 ± 0.389
4.759ValLeu: 4.759 ± 0.394
2.01ValMet: 2.01 ± 0.225
2.749ValAsn: 2.749 ± 0.318
2.601ValPro: 2.601 ± 0.287
2.128ValGln: 2.128 ± 0.269
4.286ValArg: 4.286 ± 0.389
4.671ValSer: 4.671 ± 0.382
4.7ValThr: 4.7 ± 0.633
4.966ValVal: 4.966 ± 0.449
1.596ValTrp: 1.596 ± 0.245
3.015ValTyr: 3.015 ± 0.366
0.0ValXaa: 0.0 ± 0.0
Trp
1.419TrpAla: 1.419 ± 0.218
0.266TrpCys: 0.266 ± 0.107
1.242TrpAsp: 1.242 ± 0.195
2.069TrpGlu: 2.069 ± 0.239
0.65TrpPhe: 0.65 ± 0.14
1.478TrpGly: 1.478 ± 0.206
0.414TrpHis: 0.414 ± 0.106
0.857TrpIle: 0.857 ± 0.162
1.36TrpLys: 1.36 ± 0.225
1.655TrpLeu: 1.655 ± 0.285
0.828TrpMet: 0.828 ± 0.22
0.916TrpAsn: 0.916 ± 0.187
0.414TrpPro: 0.414 ± 0.119
0.65TrpGln: 0.65 ± 0.138
0.975TrpArg: 0.975 ± 0.162
1.094TrpSer: 1.094 ± 0.159
1.153TrpThr: 1.153 ± 0.241
1.005TrpVal: 1.005 ± 0.182
0.384TrpTrp: 0.384 ± 0.118
0.769TrpTyr: 0.769 ± 0.186
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.779TyrAla: 2.779 ± 0.322
0.355TyrCys: 0.355 ± 0.113
2.808TyrAsp: 2.808 ± 0.328
2.69TyrGlu: 2.69 ± 0.323
1.153TyrPhe: 1.153 ± 0.281
2.926TyrGly: 2.926 ± 0.356
0.591TyrHis: 0.591 ± 0.128
1.862TyrIle: 1.862 ± 0.218
2.454TyrLys: 2.454 ± 0.326
3.399TyrLeu: 3.399 ± 0.342
0.857TyrMet: 0.857 ± 0.156
2.217TyrAsn: 2.217 ± 0.31
1.389TyrPro: 1.389 ± 0.211
1.153TyrGln: 1.153 ± 0.173
1.892TyrArg: 1.892 ± 0.241
2.424TyrSer: 2.424 ± 0.241
2.335TyrThr: 2.335 ± 0.292
2.631TyrVal: 2.631 ± 0.392
0.621TyrTrp: 0.621 ± 0.173
1.389TyrTyr: 1.389 ± 0.199
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 214 proteins (33830 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski