Amino acid dipepetide frequency for Staphylococcus phage vB_SscM-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.024AlaAla: 0.024 ± 0.024
0.341AlaCys: 0.341 ± 0.087
2.362AlaAsp: 2.362 ± 0.276
3.166AlaGlu: 3.166 ± 0.341
1.656AlaPhe: 1.656 ± 0.199
2.776AlaGly: 2.776 ± 0.361
0.633AlaHis: 0.633 ± 0.135
3.434AlaIle: 3.434 ± 0.277
4.237AlaLys: 4.237 ± 0.35
3.58AlaLeu: 3.58 ± 0.38
0.925AlaMet: 0.925 ± 0.219
1.973AlaAsn: 1.973 ± 0.198
1.486AlaPro: 1.486 ± 0.198
1.729AlaGln: 1.729 ± 0.235
1.534AlaArg: 1.534 ± 0.182
3.215AlaSer: 3.215 ± 0.321
2.825AlaThr: 2.825 ± 0.319
2.557AlaVal: 2.557 ± 0.242
0.438AlaTrp: 0.438 ± 0.086
2.46AlaTyr: 2.46 ± 0.23
0.0AlaXaa: 0.0 ± 0.0
Cys
0.17CysAla: 0.17 ± 0.059
0.097CysCys: 0.097 ± 0.052
0.244CysAsp: 0.244 ± 0.077
0.365CysGlu: 0.365 ± 0.089
0.341CysPhe: 0.341 ± 0.101
0.463CysGly: 0.463 ± 0.122
0.122CysHis: 0.122 ± 0.048
0.341CysIle: 0.341 ± 0.091
0.804CysLys: 0.804 ± 0.14
0.511CysLeu: 0.511 ± 0.101
0.0CysMet: 0.0 ± 0.0
0.341CysAsn: 0.341 ± 0.094
0.219CysPro: 0.219 ± 0.101
0.073CysGln: 0.073 ± 0.042
0.39CysArg: 0.39 ± 0.121
0.292CysSer: 0.292 ± 0.083
0.268CysThr: 0.268 ± 0.091
0.414CysVal: 0.414 ± 0.103
0.073CysTrp: 0.073 ± 0.043
0.414CysTyr: 0.414 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
2.825AspAla: 2.825 ± 0.28
0.414AspCys: 0.414 ± 0.095
3.336AspAsp: 3.336 ± 0.317
4.822AspGlu: 4.822 ± 0.419
3.385AspPhe: 3.385 ± 0.263
3.361AspGly: 3.361 ± 0.362
0.536AspHis: 0.536 ± 0.124
6.429AspIle: 6.429 ± 0.414
6.478AspLys: 6.478 ± 0.487
5.918AspLeu: 5.918 ± 0.319
1.924AspMet: 1.924 ± 0.199
5.041AspAsn: 5.041 ± 0.314
1.583AspPro: 1.583 ± 0.208
0.852AspGln: 0.852 ± 0.198
2.655AspArg: 2.655 ± 0.219
4.213AspSer: 4.213 ± 0.312
3.921AspThr: 3.921 ± 0.242
4.164AspVal: 4.164 ± 0.323
0.609AspTrp: 0.609 ± 0.123
4.262AspTyr: 4.262 ± 0.332
0.0AspXaa: 0.0 ± 0.0
Glu
4.457GluAla: 4.457 ± 0.394
0.39GluCys: 0.39 ± 0.102
6.308GluAsp: 6.308 ± 0.443
10.545GluGlu: 10.545 ± 1.026
4.091GluPhe: 4.091 ± 0.321
5.309GluGly: 5.309 ± 0.38
1.632GluHis: 1.632 ± 0.212
5.796GluIle: 5.796 ± 0.367
7.379GluLys: 7.379 ± 0.494
7.501GluLeu: 7.501 ± 0.443
2.338GluMet: 2.338 ± 0.233
4.116GluAsn: 4.116 ± 0.334
2.314GluPro: 2.314 ± 0.454
3.336GluGln: 3.336 ± 0.36
3.409GluArg: 3.409 ± 0.284
4.505GluSer: 4.505 ± 0.311
3.629GluThr: 3.629 ± 0.297
6.04GluVal: 6.04 ± 0.409
0.998GluTrp: 0.998 ± 0.159
3.994GluTyr: 3.994 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
1.364PheAla: 1.364 ± 0.167
0.146PheCys: 0.146 ± 0.064
2.849PheAsp: 2.849 ± 0.254
2.874PheGlu: 2.874 ± 0.26
1.145PhePhe: 1.145 ± 0.158
2.07PheGly: 2.07 ± 0.285
0.511PheHis: 0.511 ± 0.105
3.312PheIle: 3.312 ± 0.338
3.872PheLys: 3.872 ± 0.367
2.874PheLeu: 2.874 ± 0.309
1.218PheMet: 1.218 ± 0.175
3.166PheAsn: 3.166 ± 0.278
0.998PhePro: 0.998 ± 0.165
0.998PheGln: 0.998 ± 0.151
1.461PheArg: 1.461 ± 0.192
2.898PheSer: 2.898 ± 0.293
2.655PheThr: 2.655 ± 0.272
2.241PheVal: 2.241 ± 0.246
0.219PheTrp: 0.219 ± 0.069
2.338PheTyr: 2.338 ± 0.216
0.0PheXaa: 0.0 ± 0.0
Gly
2.606GlyAla: 2.606 ± 0.284
0.365GlyCys: 0.365 ± 0.102
3.409GlyAsp: 3.409 ± 0.291
4.53GlyGlu: 4.53 ± 0.391
2.314GlyPhe: 2.314 ± 0.261
3.872GlyGly: 3.872 ± 0.719
0.901GlyHis: 0.901 ± 0.142
4.408GlyIle: 4.408 ± 0.34
5.528GlyLys: 5.528 ± 0.395
4.627GlyLeu: 4.627 ± 0.386
1.778GlyMet: 1.778 ± 0.268
3.921GlyAsn: 3.921 ± 0.343
0.0GlyPro: 0.0 ± 0.0
2.046GlyGln: 2.046 ± 0.265
2.046GlyArg: 2.046 ± 0.238
4.116GlySer: 4.116 ± 0.471
4.116GlyThr: 4.116 ± 0.41
4.116GlyVal: 4.116 ± 0.297
0.804GlyTrp: 0.804 ± 0.156
3.409GlyTyr: 3.409 ± 0.277
0.0GlyXaa: 0.0 ± 0.0
His
0.56HisAla: 0.56 ± 0.119
0.122HisCys: 0.122 ± 0.05
0.974HisAsp: 0.974 ± 0.167
1.242HisGlu: 1.242 ± 0.2
0.658HisPhe: 0.658 ± 0.126
0.877HisGly: 0.877 ± 0.146
0.511HisHis: 0.511 ± 0.113
1.437HisIle: 1.437 ± 0.188
1.412HisLys: 1.412 ± 0.216
1.339HisLeu: 1.339 ± 0.22
0.268HisMet: 0.268 ± 0.084
0.852HisAsn: 0.852 ± 0.12
0.633HisPro: 0.633 ± 0.126
0.584HisGln: 0.584 ± 0.111
0.682HisArg: 0.682 ± 0.132
0.852HisSer: 0.852 ± 0.142
0.998HisThr: 0.998 ± 0.154
1.145HisVal: 1.145 ± 0.174
0.219HisTrp: 0.219 ± 0.073
0.804HisTyr: 0.804 ± 0.122
0.0HisXaa: 0.0 ± 0.0
Ile
2.825IleAla: 2.825 ± 0.332
0.341IleCys: 0.341 ± 0.088
5.309IleAsp: 5.309 ± 0.407
7.33IleGlu: 7.33 ± 0.528
2.289IlePhe: 2.289 ± 0.213
3.629IleGly: 3.629 ± 0.292
1.266IleHis: 1.266 ± 0.157
5.358IleIle: 5.358 ± 0.446
6.941IleLys: 6.941 ± 0.332
5.577IleLeu: 5.577 ± 0.423
2.021IleMet: 2.021 ± 0.215
5.285IleAsn: 5.285 ± 0.42
2.07IlePro: 2.07 ± 0.224
2.484IleGln: 2.484 ± 0.224
2.947IleArg: 2.947 ± 0.264
4.968IleSer: 4.968 ± 0.376
5.553IleThr: 5.553 ± 0.507
4.311IleVal: 4.311 ± 0.388
0.365IleTrp: 0.365 ± 0.084
3.239IleTyr: 3.239 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
4.067LysAla: 4.067 ± 0.375
0.511LysCys: 0.511 ± 0.137
7.793LysAsp: 7.793 ± 0.523
9.936LysGlu: 9.936 ± 0.604
3.142LysPhe: 3.142 ± 0.242
5.723LysGly: 5.723 ± 0.416
2.021LysHis: 2.021 ± 0.241
4.749LysIle: 4.749 ± 0.337
7.769LysLys: 7.769 ± 0.668
6.892LysLeu: 6.892 ± 0.426
2.241LysMet: 2.241 ± 0.259
4.919LysAsn: 4.919 ± 0.4
2.338LysPro: 2.338 ± 0.259
2.898LysGln: 2.898 ± 0.286
3.215LysArg: 3.215 ± 0.265
5.406LysSer: 5.406 ± 0.436
4.311LysThr: 4.311 ± 0.318
6.965LysVal: 6.965 ± 0.379
0.56LysTrp: 0.56 ± 0.125
4.481LysTyr: 4.481 ± 0.362
0.0LysXaa: 0.0 ± 0.0
Leu
3.336LeuAla: 3.336 ± 0.322
0.39LeuCys: 0.39 ± 0.092
6.356LeuAsp: 6.356 ± 0.428
7.647LeuGlu: 7.647 ± 0.498
2.849LeuPhe: 2.849 ± 0.304
4.846LeuGly: 4.846 ± 0.391
1.145LeuHis: 1.145 ± 0.157
5.48LeuIle: 5.48 ± 0.399
7.136LeuLys: 7.136 ± 0.391
5.991LeuLeu: 5.991 ± 0.422
2.046LeuMet: 2.046 ± 0.202
4.968LeuAsn: 4.968 ± 0.329
2.947LeuPro: 2.947 ± 0.288
3.385LeuGln: 3.385 ± 0.308
3.556LeuArg: 3.556 ± 0.265
5.041LeuSer: 5.041 ± 0.31
4.773LeuThr: 4.773 ± 0.347
4.7LeuVal: 4.7 ± 0.388
0.584LeuTrp: 0.584 ± 0.106
3.263LeuTyr: 3.263 ± 0.249
0.0LeuXaa: 0.0 ± 0.0
Met
1.412MetAla: 1.412 ± 0.179
0.17MetCys: 0.17 ± 0.068
1.778MetAsp: 1.778 ± 0.238
1.924MetGlu: 1.924 ± 0.203
1.023MetPhe: 1.023 ± 0.158
1.023MetGly: 1.023 ± 0.197
0.341MetHis: 0.341 ± 0.089
1.973MetIle: 1.973 ± 0.272
2.776MetLys: 2.776 ± 0.287
2.094MetLeu: 2.094 ± 0.212
0.536MetMet: 0.536 ± 0.121
1.632MetAsn: 1.632 ± 0.182
0.536MetPro: 0.536 ± 0.105
0.682MetGln: 0.682 ± 0.124
1.023MetArg: 1.023 ± 0.149
1.924MetSer: 1.924 ± 0.189
1.534MetThr: 1.534 ± 0.18
1.559MetVal: 1.559 ± 0.189
0.17MetTrp: 0.17 ± 0.059
1.096MetTyr: 1.096 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
2.581AsnAla: 2.581 ± 0.255
0.438AsnCys: 0.438 ± 0.108
3.507AsnAsp: 3.507 ± 0.26
4.457AsnGlu: 4.457 ± 0.349
2.192AsnPhe: 2.192 ± 0.197
3.994AsnGly: 3.994 ± 0.453
0.877AsnHis: 0.877 ± 0.165
5.674AsnIle: 5.674 ± 0.438
5.796AsnLys: 5.796 ± 0.396
4.992AsnLeu: 4.992 ± 0.409
1.607AsnMet: 1.607 ± 0.183
4.919AsnAsn: 4.919 ± 0.306
2.557AsnPro: 2.557 ± 0.259
1.583AsnGln: 1.583 ± 0.183
2.703AsnArg: 2.703 ± 0.252
4.237AsnSer: 4.237 ± 0.356
4.262AsnThr: 4.262 ± 0.348
3.385AsnVal: 3.385 ± 0.327
0.414AsnTrp: 0.414 ± 0.11
3.044AsnTyr: 3.044 ± 0.31
0.0AsnXaa: 0.0 ± 0.0
Pro
1.169ProAla: 1.169 ± 0.214
0.073ProCys: 0.073 ± 0.034
1.51ProAsp: 1.51 ± 0.261
2.581ProGlu: 2.581 ± 0.258
1.145ProPhe: 1.145 ± 0.175
1.51ProGly: 1.51 ± 0.225
0.463ProHis: 0.463 ± 0.115
1.924ProIle: 1.924 ± 0.205
2.411ProLys: 2.411 ± 0.261
1.827ProLeu: 1.827 ± 0.215
0.584ProMet: 0.584 ± 0.118
1.753ProAsn: 1.753 ± 0.242
0.755ProPro: 0.755 ± 0.231
1.072ProGln: 1.072 ± 0.231
0.877ProArg: 0.877 ± 0.134
2.021ProSer: 2.021 ± 0.351
2.533ProThr: 2.533 ± 0.248
1.729ProVal: 1.729 ± 0.178
0.097ProTrp: 0.097 ± 0.045
1.68ProTyr: 1.68 ± 0.214
0.0ProXaa: 0.0 ± 0.0
Gln
2.192GlnAla: 2.192 ± 0.212
0.195GlnCys: 0.195 ± 0.075
2.265GlnAsp: 2.265 ± 0.232
3.361GlnGlu: 3.361 ± 0.376
1.023GlnPhe: 1.023 ± 0.154
2.241GlnGly: 2.241 ± 0.324
0.487GlnHis: 0.487 ± 0.116
2.07GlnIle: 2.07 ± 0.236
2.241GlnLys: 2.241 ± 0.238
2.728GlnLeu: 2.728 ± 0.257
0.974GlnMet: 0.974 ± 0.155
1.51GlnAsn: 1.51 ± 0.217
0.95GlnPro: 0.95 ± 0.303
1.875GlnGln: 1.875 ± 0.311
1.047GlnArg: 1.047 ± 0.157
2.119GlnSer: 2.119 ± 0.219
1.607GlnThr: 1.607 ± 0.177
2.338GlnVal: 2.338 ± 0.207
0.219GlnTrp: 0.219 ± 0.066
1.875GlnTyr: 1.875 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
1.607ArgAla: 1.607 ± 0.246
0.244ArgCys: 0.244 ± 0.086
3.117ArgAsp: 3.117 ± 0.268
3.799ArgGlu: 3.799 ± 0.279
1.802ArgPhe: 1.802 ± 0.212
2.46ArgGly: 2.46 ± 0.213
0.536ArgHis: 0.536 ± 0.115
2.63ArgIle: 2.63 ± 0.242
3.044ArgLys: 3.044 ± 0.3
3.507ArgLeu: 3.507 ± 0.307
1.193ArgMet: 1.193 ± 0.18
2.192ArgAsn: 2.192 ± 0.233
0.779ArgPro: 0.779 ± 0.113
1.339ArgGln: 1.339 ± 0.179
1.315ArgArg: 1.315 ± 0.178
1.534ArgSer: 1.534 ± 0.204
2.07ArgThr: 2.07 ± 0.254
2.801ArgVal: 2.801 ± 0.244
0.292ArgTrp: 0.292 ± 0.103
1.802ArgTyr: 1.802 ± 0.216
0.0ArgXaa: 0.0 ± 0.0
Ser
2.776SerAla: 2.776 ± 0.292
0.317SerCys: 0.317 ± 0.083
3.799SerAsp: 3.799 ± 0.255
4.384SerGlu: 4.384 ± 0.333
2.728SerPhe: 2.728 ± 0.244
3.775SerGly: 3.775 ± 0.328
0.95SerHis: 0.95 ± 0.158
5.455SerIle: 5.455 ± 0.367
6.137SerLys: 6.137 ± 0.399
5.114SerLeu: 5.114 ± 0.297
1.607SerMet: 1.607 ± 0.174
4.432SerAsn: 4.432 ± 0.353
1.559SerPro: 1.559 ± 0.187
1.997SerGln: 1.997 ± 0.216
2.143SerArg: 2.143 ± 0.235
4.14SerSer: 4.14 ± 0.351
4.554SerThr: 4.554 ± 0.458
3.872SerVal: 3.872 ± 0.336
0.633SerTrp: 0.633 ± 0.098
3.215SerTyr: 3.215 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
2.46ThrAla: 2.46 ± 0.243
0.17ThrCys: 0.17 ± 0.073
3.288ThrAsp: 3.288 ± 0.282
4.822ThrGlu: 4.822 ± 0.479
2.898ThrPhe: 2.898 ± 0.237
3.97ThrGly: 3.97 ± 0.397
1.193ThrHis: 1.193 ± 0.152
4.749ThrIle: 4.749 ± 0.411
4.968ThrLys: 4.968 ± 0.351
5.285ThrLeu: 5.285 ± 0.329
1.047ThrMet: 1.047 ± 0.163
3.775ThrAsn: 3.775 ± 0.336
2.533ThrPro: 2.533 ± 0.252
1.875ThrGln: 1.875 ± 0.243
2.581ThrArg: 2.581 ± 0.282
4.116ThrSer: 4.116 ± 0.315
3.775ThrThr: 3.775 ± 0.348
5.139ThrVal: 5.139 ± 0.356
0.438ThrTrp: 0.438 ± 0.087
2.898ThrTyr: 2.898 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
2.435ValAla: 2.435 ± 0.269
0.633ValCys: 0.633 ± 0.112
4.725ValAsp: 4.725 ± 0.39
6.137ValGlu: 6.137 ± 0.437
2.314ValPhe: 2.314 ± 0.228
3.19ValGly: 3.19 ± 0.237
0.925ValHis: 0.925 ± 0.147
4.627ValIle: 4.627 ± 0.348
5.82ValLys: 5.82 ± 0.385
5.309ValLeu: 5.309 ± 0.445
1.51ValMet: 1.51 ± 0.21
4.067ValAsn: 4.067 ± 0.297
1.9ValPro: 1.9 ± 0.211
2.362ValGln: 2.362 ± 0.217
2.192ValArg: 2.192 ± 0.211
4.505ValSer: 4.505 ± 0.375
4.725ValThr: 4.725 ± 0.299
4.018ValVal: 4.018 ± 0.349
0.731ValTrp: 0.731 ± 0.14
3.799ValTyr: 3.799 ± 0.316
0.0ValXaa: 0.0 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.125
0.073TrpCys: 0.073 ± 0.038
0.682TrpAsp: 0.682 ± 0.11
0.633TrpGlu: 0.633 ± 0.125
0.341TrpPhe: 0.341 ± 0.085
0.584TrpGly: 0.584 ± 0.121
0.195TrpHis: 0.195 ± 0.063
0.487TrpIle: 0.487 ± 0.109
0.731TrpLys: 0.731 ± 0.134
0.877TrpLeu: 0.877 ± 0.147
0.146TrpMet: 0.146 ± 0.056
0.414TrpAsn: 0.414 ± 0.108
0.0TrpPro: 0.0 ± 0.0
0.341TrpGln: 0.341 ± 0.072
0.268TrpArg: 0.268 ± 0.074
0.438TrpSer: 0.438 ± 0.108
0.292TrpThr: 0.292 ± 0.078
0.755TrpVal: 0.755 ± 0.127
0.195TrpTrp: 0.195 ± 0.074
0.584TrpTyr: 0.584 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.997TyrAla: 1.997 ± 0.199
0.511TyrCys: 0.511 ± 0.117
3.117TyrAsp: 3.117 ± 0.296
3.531TyrGlu: 3.531 ± 0.339
1.875TyrPhe: 1.875 ± 0.212
3.239TyrGly: 3.239 ± 0.29
0.925TyrHis: 0.925 ± 0.151
3.775TyrIle: 3.775 ± 0.314
4.457TyrLys: 4.457 ± 0.297
3.897TyrLeu: 3.897 ± 0.297
1.218TyrMet: 1.218 ± 0.146
3.945TyrAsn: 3.945 ± 0.252
1.559TyrPro: 1.559 ± 0.223
1.802TyrGln: 1.802 ± 0.205
2.021TyrArg: 2.021 ± 0.215
3.044TyrSer: 3.044 ± 0.27
3.58TyrThr: 3.58 ± 0.327
3.677TyrVal: 3.677 ± 0.318
0.463TyrTrp: 0.463 ± 0.109
2.947TyrTyr: 2.947 ± 0.329
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 202 proteins (41063 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski