Amino acid dipeptide frequency for Staphylococcus phage K

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
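As a rough illustration of what such a calculation can look like, here is a minimal Python sketch. It assumes one-letter amino acid codes, overlapping pairs that do not span protein boundaries, and normalization by the total number of counted pairs; the function name `dipeptide_permille` and the toy sequences are hypothetical, and the exact normalization used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and the denominator is the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage K proteins).
toy = ["MKKLA", "AKK"]
freqs = dipeptide_permille(toy)
```

In the toy input, `KK` occurs twice among six pairs, so its frequency is one third of the total (about 333‰).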

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
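The uniform expectation and the over/under comparison described above can be sketched as follows. This is only an illustration: the helper names are hypothetical, the comparison ignores the ± error, and it assumes the uniform 1/400 baseline mentioned in the text rather than any other null model.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide as 'over', 'under' or 'equal' relative to expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "equal"


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


expected = uniform_expectation_permille()          # 20 standard amino acids
expected_with_xaa = uniform_expectation_permille(21)

# Two values taken from the table: LysGlu (9.964) and TrpCys (0.03).
lys_glu = classify(9.964, expected)
trp_cys = classify(0.03, expected)
```

With 20 amino acids the baseline is 2.5‰, so LysGlu comes out as overrepresented and TrpCys as underrepresented under this simple comparison.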

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.091 ± 0.054
AlaCys: 0.122 ± 0.065
AlaAsp: 2.43 ± 0.305
AlaGlu: 3.099 ± 0.423
AlaPhe: 1.58 ± 0.169
AlaGly: 2.339 ± 0.484
AlaHis: 0.881 ± 0.165
AlaIle: 3.737 ± 0.35
AlaLys: 4.101 ± 0.511
AlaLeu: 3.463 ± 0.366
AlaMet: 1.306 ± 0.32
AlaAsn: 2.582 ± 0.256
AlaPro: 1.367 ± 0.185
AlaGln: 2.127 ± 0.284
AlaArg: 1.762 ± 0.216
AlaSer: 3.585 ± 0.531
AlaThr: 3.311 ± 0.359
AlaVal: 2.886 ± 0.277
AlaTrp: 0.425 ± 0.118
AlaTyr: 2.278 ± 0.236
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.152 ± 0.075
CysCys: 0.061 ± 0.038
CysAsp: 0.273 ± 0.101
CysGlu: 0.334 ± 0.123
CysPhe: 0.182 ± 0.09
CysGly: 0.365 ± 0.098
CysHis: 0.122 ± 0.068
CysIle: 0.395 ± 0.124
CysLys: 0.516 ± 0.143
CysLeu: 0.425 ± 0.121
CysMet: 0.061 ± 0.044
CysAsn: 0.091 ± 0.066
CysPro: 0.243 ± 0.099
CysGln: 0.243 ± 0.101
CysArg: 0.182 ± 0.084
CysSer: 0.395 ± 0.114
CysThr: 0.334 ± 0.095
CysVal: 0.213 ± 0.075
CysTrp: 0.061 ± 0.054
CysTyr: 0.304 ± 0.106
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.281 ± 0.289
AspCys: 0.273 ± 0.103
AspAsp: 4.709 ± 0.488
AspGlu: 5.225 ± 0.395
AspPhe: 3.433 ± 0.374
AspGly: 3.676 ± 0.37
AspHis: 0.486 ± 0.126
AspIle: 6.289 ± 0.518
AspLys: 6.927 ± 0.465
AspLeu: 5.954 ± 0.441
AspMet: 2.37 ± 0.276
AspAsn: 5.863 ± 0.489
AspPro: 1.823 ± 0.203
AspGln: 1.276 ± 0.251
AspArg: 2.461 ± 0.28
AspSer: 4.891 ± 0.416
AspThr: 3.98 ± 0.354
AspVal: 4.739 ± 0.413
AspTrp: 0.668 ± 0.124
AspTyr: 4.01 ± 0.422
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.797 ± 0.342
GluCys: 0.334 ± 0.126
GluAsp: 6.592 ± 0.594
GluGlu: 7.716 ± 0.712
GluPhe: 2.977 ± 0.271
GluGly: 4.435 ± 0.303
GluHis: 1.762 ± 0.214
GluIle: 5.408 ± 0.491
GluLys: 7.504 ± 0.522
GluLeu: 6.866 ± 0.515
GluMet: 2.157 ± 0.253
GluAsn: 4.314 ± 0.427
GluPro: 2.248 ± 0.649
GluGln: 3.828 ± 0.498
GluArg: 2.886 ± 0.298
GluSer: 4.587 ± 0.356
GluThr: 3.676 ± 0.313
GluVal: 5.165 ± 0.425
GluTrp: 0.668 ± 0.161
GluTyr: 3.919 ± 0.408
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.489 ± 0.224
PheCys: 0.213 ± 0.075
PheAsp: 2.643 ± 0.293
PheGlu: 2.673 ± 0.276
PhePhe: 1.367 ± 0.176
PheGly: 2.278 ± 0.35
PheHis: 0.486 ± 0.121
PheIle: 3.281 ± 0.371
PheLys: 3.524 ± 0.314
PheLeu: 2.704 ± 0.305
PheMet: 0.729 ± 0.168
PheAsn: 3.129 ± 0.345
PhePro: 1.033 ± 0.164
PheGln: 1.701 ± 0.203
PheArg: 1.215 ± 0.164
PheSer: 2.704 ± 0.297
PheThr: 2.582 ± 0.28
PheVal: 2.643 ± 0.327
PheTrp: 0.365 ± 0.105
PheTyr: 2.309 ± 0.341
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.038 ± 0.776
GlyCys: 0.243 ± 0.088
GlyAsp: 4.04 ± 0.371
GlyGlu: 4.618 ± 0.342
GlyPhe: 2.309 ± 0.245
GlyGly: 4.618 ± 1.438
GlyHis: 1.033 ± 0.177
GlyIle: 3.98 ± 0.403
GlyLys: 6.197 ± 0.642
GlyLeu: 4.83 ± 0.415
GlyMet: 1.215 ± 0.256
GlyAsn: 4.192 ± 0.379
GlyPro: 0.0 ± 0.0
GlyGln: 2.066 ± 0.32
GlyArg: 2.278 ± 0.322
GlySer: 4.891 ± 0.643
GlyThr: 3.858 ± 0.478
GlyVal: 3.737 ± 0.398
GlyTrp: 0.547 ± 0.19
GlyTyr: 3.402 ± 0.287
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.699 ± 0.128
HisCys: 0.273 ± 0.096
HisAsp: 1.003 ± 0.189
HisGlu: 1.033 ± 0.182
HisPhe: 0.638 ± 0.135
HisGly: 1.033 ± 0.181
HisHis: 0.334 ± 0.116
HisIle: 1.61 ± 0.26
HisLys: 1.063 ± 0.183
HisLeu: 1.458 ± 0.252
HisMet: 0.304 ± 0.104
HisAsn: 0.972 ± 0.19
HisPro: 0.608 ± 0.116
HisGln: 0.516 ± 0.11
HisArg: 0.759 ± 0.174
HisSer: 0.972 ± 0.187
HisThr: 0.942 ± 0.156
HisVal: 0.82 ± 0.142
HisTrp: 0.182 ± 0.078
HisTyr: 0.638 ± 0.146
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.977 ± 0.351
IleCys: 0.273 ± 0.094
IleAsp: 5.833 ± 0.414
IleGlu: 5.62 ± 0.501
IlePhe: 2.339 ± 0.245
IleGly: 4.284 ± 0.438
IleHis: 0.911 ± 0.165
IleIle: 5.559 ± 0.603
IleLys: 6.44 ± 0.483
IleLeu: 4.8 ± 0.423
IleMet: 1.792 ± 0.231
IleAsn: 5.225 ± 0.397
IlePro: 2.43 ± 0.244
IleGln: 2.521 ± 0.286
IleArg: 2.765 ± 0.266
IleSer: 4.344 ± 0.32
IleThr: 4.921 ± 0.414
IleVal: 4.375 ± 0.433
IleTrp: 0.486 ± 0.135
IleTyr: 2.765 ± 0.251
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.98 ± 0.519
LysCys: 0.425 ± 0.124
LysAsp: 7.382 ± 0.467
LysGlu: 9.964 ± 0.781
LysPhe: 3.129 ± 0.249
LysGly: 6.289 ± 0.76
LysHis: 1.762 ± 0.265
LysIle: 4.405 ± 0.317
LysLys: 7.625 ± 0.733
LysLeu: 6.532 ± 0.463
LysMet: 2.309 ± 0.261
LysAsn: 5.468 ± 0.429
LysPro: 3.19 ± 0.373
LysGln: 4.01 ± 0.401
LysArg: 2.734 ± 0.251
LysSer: 5.134 ± 0.426
LysThr: 4.223 ± 0.337
LysVal: 6.866 ± 0.402
LysTrp: 0.729 ± 0.147
LysTyr: 4.527 ± 0.43
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.372 ± 0.361
LeuCys: 0.365 ± 0.116
LeuAsp: 6.228 ± 0.455
LeuGlu: 6.41 ± 0.563
LeuPhe: 2.582 ± 0.241
LeuGly: 4.891 ± 0.494
LeuHis: 1.003 ± 0.201
LeuIle: 4.83 ± 0.418
LeuLys: 7.625 ± 0.484
LeuLeu: 5.742 ± 0.52
LeuMet: 1.884 ± 0.23
LeuAsn: 5.468 ± 0.505
LeuPro: 2.613 ± 0.269
LeuGln: 2.795 ± 0.334
LeuArg: 3.281 ± 0.317
LeuSer: 5.863 ± 0.41
LeuThr: 5.62 ± 0.385
LeuVal: 3.858 ± 0.402
LeuTrp: 0.729 ± 0.133
LeuTyr: 2.886 ± 0.322
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.823 ± 0.323
MetCys: 0.213 ± 0.096
MetAsp: 1.64 ± 0.264
MetGlu: 2.066 ± 0.251
MetPhe: 1.094 ± 0.236
MetGly: 1.428 ± 0.306
MetHis: 0.304 ± 0.101
MetIle: 1.549 ± 0.268
MetLys: 2.157 ± 0.258
MetLeu: 1.367 ± 0.208
MetMet: 0.608 ± 0.147
MetAsn: 1.428 ± 0.208
MetPro: 0.577 ± 0.142
MetGln: 0.881 ± 0.169
MetArg: 1.185 ± 0.188
MetSer: 1.762 ± 0.2
MetThr: 1.489 ± 0.184
MetVal: 1.276 ± 0.206
MetTrp: 0.122 ± 0.052
MetTyr: 1.003 ± 0.16
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.251 ± 0.336
AsnCys: 0.304 ± 0.088
AsnAsp: 4.527 ± 0.353
AsnGlu: 4.618 ± 0.369
AsnPhe: 2.704 ± 0.333
AsnGly: 4.01 ± 0.301
AsnHis: 1.185 ± 0.216
AsnIle: 4.709 ± 0.401
AsnLys: 6.744 ± 0.467
AsnLeu: 5.165 ± 0.444
AsnMet: 1.458 ± 0.191
AsnAsn: 4.921 ± 0.525
AsnPro: 2.613 ± 0.342
AsnGln: 2.248 ± 0.271
AsnArg: 2.613 ± 0.245
AsnSer: 4.071 ± 0.298
AsnThr: 3.98 ± 0.331
AsnVal: 3.98 ± 0.381
AsnTrp: 0.425 ± 0.119
AsnTyr: 3.524 ± 0.392
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.306 ± 0.174
ProCys: 0.122 ± 0.07
ProAsp: 1.792 ± 0.25
ProGlu: 2.613 ± 0.449
ProPhe: 1.276 ± 0.188
ProGly: 1.124 ± 0.208
ProHis: 0.456 ± 0.108
ProIle: 2.248 ± 0.209
ProLys: 2.643 ± 0.318
ProLeu: 1.914 ± 0.25
ProMet: 0.759 ± 0.148
ProAsn: 2.248 ± 0.269
ProPro: 0.82 ± 0.207
ProGln: 1.337 ± 0.249
ProArg: 0.851 ± 0.155
ProSer: 2.521 ± 0.384
ProThr: 1.975 ± 0.236
ProVal: 1.762 ± 0.205
ProTrp: 0.182 ± 0.078
ProTyr: 1.58 ± 0.239
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.278 ± 0.268
GlnCys: 0.122 ± 0.062
GlnAsp: 2.491 ± 0.281
GlnGlu: 3.251 ± 0.434
GlnPhe: 1.458 ± 0.229
GlnGly: 2.886 ± 0.371
GlnHis: 0.577 ± 0.139
GlnIle: 2.461 ± 0.32
GlnLys: 2.309 ± 0.262
GlnLeu: 3.372 ± 0.34
GlnMet: 0.79 ± 0.153
GlnAsn: 1.61 ± 0.247
GlnPro: 1.367 ± 0.273
GlnGln: 2.218 ± 0.377
GlnArg: 1.063 ± 0.148
GlnSer: 3.008 ± 0.316
GlnThr: 2.005 ± 0.234
GlnVal: 2.005 ± 0.232
GlnTrp: 0.304 ± 0.084
GlnTyr: 2.005 ± 0.265
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.64 ± 0.204
ArgCys: 0.304 ± 0.105
ArgAsp: 2.673 ± 0.26
ArgGlu: 2.856 ± 0.306
ArgPhe: 1.792 ± 0.196
ArgGly: 2.157 ± 0.201
ArgHis: 0.425 ± 0.098
ArgIle: 2.491 ± 0.275
ArgLys: 3.372 ± 0.326
ArgLeu: 3.159 ± 0.264
ArgMet: 0.881 ± 0.148
ArgAsn: 2.218 ± 0.252
ArgPro: 1.124 ± 0.175
ArgGln: 1.519 ± 0.202
ArgArg: 1.458 ± 0.199
ArgSer: 1.914 ± 0.249
ArgThr: 2.127 ± 0.299
ArgVal: 2.278 ± 0.23
ArgTrp: 0.182 ± 0.081
ArgTyr: 1.61 ± 0.209
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.129 ± 0.363
SerCys: 0.213 ± 0.08
SerAsp: 4.527 ± 0.486
SerGlu: 4.557 ± 0.354
SerPhe: 3.251 ± 0.321
SerGly: 4.132 ± 0.583
SerHis: 1.154 ± 0.191
SerIle: 4.921 ± 0.393
SerLys: 6.289 ± 0.504
SerLeu: 5.408 ± 0.309
SerMet: 1.397 ± 0.183
SerAsn: 4.861 ± 0.387
SerPro: 2.127 ± 0.249
SerGln: 1.61 ± 0.257
SerArg: 2.127 ± 0.24
SerSer: 4.891 ± 0.46
SerThr: 4.375 ± 0.441
SerVal: 4.587 ± 0.435
SerTrp: 0.699 ± 0.135
SerTyr: 3.737 ± 0.302
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.37 ± 0.286
ThrCys: 0.122 ± 0.055
ThrAsp: 4.01 ± 0.331
ThrGlu: 5.165 ± 0.459
ThrPhe: 2.521 ± 0.302
ThrGly: 4.405 ± 0.485
ThrHis: 0.942 ± 0.161
ThrIle: 4.527 ± 0.462
ThrLys: 4.405 ± 0.395
ThrLeu: 4.77 ± 0.422
ThrMet: 0.972 ± 0.146
ThrAsn: 3.281 ± 0.294
ThrPro: 2.096 ± 0.288
ThrGln: 2.491 ± 0.358
ThrArg: 2.552 ± 0.291
ThrSer: 4.162 ± 0.405
ThrThr: 2.886 ± 0.42
ThrVal: 4.314 ± 0.42
ThrTrp: 0.668 ± 0.142
ThrTyr: 2.856 ± 0.266
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.4 ± 0.277
ValCys: 0.577 ± 0.119
ValAsp: 4.83 ± 0.368
ValGlu: 5.134 ± 0.515
ValPhe: 2.4 ± 0.256
ValGly: 3.311 ± 0.341
ValHis: 0.972 ± 0.17
ValIle: 4.314 ± 0.354
ValLys: 5.924 ± 0.521
ValLeu: 5.043 ± 0.391
ValMet: 1.367 ± 0.206
ValAsn: 4.344 ± 0.402
ValPro: 1.853 ± 0.219
ValGln: 1.884 ± 0.275
ValArg: 2.157 ± 0.238
ValSer: 5.073 ± 0.44
ValThr: 3.949 ± 0.413
ValVal: 3.706 ± 0.418
ValTrp: 0.365 ± 0.092
ValTyr: 3.19 ± 0.352
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.304 ± 0.1
TrpCys: 0.03 ± 0.029
TrpAsp: 0.638 ± 0.127
TrpGlu: 0.79 ± 0.144
TrpPhe: 0.182 ± 0.074
TrpGly: 0.547 ± 0.174
TrpHis: 0.122 ± 0.058
TrpIle: 0.699 ± 0.156
TrpLys: 0.851 ± 0.173
TrpLeu: 0.608 ± 0.143
TrpMet: 0.182 ± 0.08
TrpAsn: 0.577 ± 0.132
TrpPro: 0.0 ± 0.0
TrpGln: 0.365 ± 0.109
TrpArg: 0.182 ± 0.066
TrpSer: 0.516 ± 0.133
TrpThr: 0.395 ± 0.144
TrpVal: 0.577 ± 0.127
TrpTrp: 0.152 ± 0.07
TrpTyr: 0.608 ± 0.15
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.096 ± 0.264
TyrCys: 0.365 ± 0.109
TyrAsp: 4.071 ± 0.375
TyrGlu: 3.099 ± 0.342
TyrPhe: 1.853 ± 0.235
TyrGly: 2.886 ± 0.288
TyrHis: 0.942 ± 0.169
TyrIle: 3.251 ± 0.361
TyrLys: 4.466 ± 0.462
TyrLeu: 4.466 ± 0.433
TyrMet: 1.397 ± 0.168
TyrAsn: 4.284 ± 0.399
TyrPro: 1.397 ± 0.201
TyrGln: 1.944 ± 0.239
TyrArg: 1.671 ± 0.167
TyrSer: 2.704 ± 0.283
TyrThr: 2.886 ± 0.355
TyrVal: 3.008 ± 0.359
TyrTrp: 0.365 ± 0.103
TyrTyr: 2.916 ± 0.369
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 115 proteins (32918 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
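One way to picture a protein-level bootstrap is the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function name `bootstrap_sd`, the fixed seed, the use of the population standard deviation, and the normalization by total pair count are all assumptions made for illustration; the page does not describe the database's actual procedure in this detail.

```python
import random
import statistics
from collections import Counter


def bootstrap_sd(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide's frequency.

    Assumption: proteins are resampled with replacement as whole units,
    and the spread is the standard deviation over the resampled estimates.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein so resampling is cheap.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (not real phage K proteins).
toy = ["MKKLA", "AKK", "GGKKG"]
sd_kk = bootstrap_sd(toy, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.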

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski