Amino acid dipepetide frequency for Staphylococcus phage 6ec

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.808AlaAla: 1.808 ± 0.436
0.362AlaCys: 0.362 ± 0.117
2.531AlaAsp: 2.531 ± 0.316
3.181AlaGlu: 3.181 ± 0.418
1.988AlaPhe: 1.988 ± 0.244
2.639AlaGly: 2.639 ± 0.455
0.723AlaHis: 0.723 ± 0.183
4.013AlaIle: 4.013 ± 0.296
4.193AlaLys: 4.193 ± 0.45
3.76AlaLeu: 3.76 ± 0.46
1.627AlaMet: 1.627 ± 0.288
2.133AlaAsn: 2.133 ± 0.354
0.94AlaPro: 0.94 ± 0.232
1.735AlaGln: 1.735 ± 0.296
2.169AlaArg: 2.169 ± 0.329
2.277AlaSer: 2.277 ± 0.384
2.711AlaThr: 2.711 ± 0.38
2.675AlaVal: 2.675 ± 0.347
0.434AlaTrp: 0.434 ± 0.176
1.663AlaTyr: 1.663 ± 0.233
0.0AlaXaa: 0.0 ± 0.0
Cys
0.217CysAla: 0.217 ± 0.071
0.072CysCys: 0.072 ± 0.055
0.434CysAsp: 0.434 ± 0.144
0.47CysGlu: 0.47 ± 0.141
0.325CysPhe: 0.325 ± 0.116
0.578CysGly: 0.578 ± 0.17
0.217CysHis: 0.217 ± 0.104
0.506CysIle: 0.506 ± 0.137
0.651CysLys: 0.651 ± 0.191
0.542CysLeu: 0.542 ± 0.133
0.289CysMet: 0.289 ± 0.087
0.434CysAsn: 0.434 ± 0.112
0.253CysPro: 0.253 ± 0.134
0.253CysGln: 0.253 ± 0.1
0.145CysArg: 0.145 ± 0.075
0.47CysSer: 0.47 ± 0.135
0.362CysThr: 0.362 ± 0.119
0.506CysVal: 0.506 ± 0.167
0.181CysTrp: 0.181 ± 0.095
0.651CysTyr: 0.651 ± 0.151
0.0CysXaa: 0.0 ± 0.0
Asp
2.603AspAla: 2.603 ± 0.314
0.542AspCys: 0.542 ± 0.17
4.338AspAsp: 4.338 ± 0.581
5.82AspGlu: 5.82 ± 0.558
3.434AspPhe: 3.434 ± 0.406
4.627AspGly: 4.627 ± 0.611
0.506AspHis: 0.506 ± 0.104
6.616AspIle: 6.616 ± 0.591
7.194AspLys: 7.194 ± 0.644
6.218AspLeu: 6.218 ± 0.404
2.169AspMet: 2.169 ± 0.296
5.097AspAsn: 5.097 ± 0.508
1.085AspPro: 1.085 ± 0.192
0.687AspGln: 0.687 ± 0.129
1.735AspArg: 1.735 ± 0.273
4.157AspSer: 4.157 ± 0.407
3.687AspThr: 3.687 ± 0.371
3.904AspVal: 3.904 ± 0.402
1.012AspTrp: 1.012 ± 0.158
4.374AspTyr: 4.374 ± 0.424
0.0AspXaa: 0.0 ± 0.0
Glu
3.254GluAla: 3.254 ± 0.377
0.831GluCys: 0.831 ± 0.154
5.676GluAsp: 5.676 ± 0.656
6.543GluGlu: 6.543 ± 0.667
3.507GluPhe: 3.507 ± 0.388
3.254GluGly: 3.254 ± 0.358
1.193GluHis: 1.193 ± 0.216
6.507GluIle: 6.507 ± 0.658
8.17GluLys: 8.17 ± 0.543
7.809GluLeu: 7.809 ± 0.611
2.314GluMet: 2.314 ± 0.299
5.17GluAsn: 5.17 ± 0.523
1.554GluPro: 1.554 ± 0.218
3.217GluGln: 3.217 ± 0.288
3.073GluArg: 3.073 ± 0.351
4.049GluSer: 4.049 ± 0.386
3.977GluThr: 3.977 ± 0.325
4.736GluVal: 4.736 ± 0.621
0.759GluTrp: 0.759 ± 0.168
3.904GluTyr: 3.904 ± 0.448
0.0GluXaa: 0.0 ± 0.0
Phe
1.952PheAla: 1.952 ± 0.236
0.325PheCys: 0.325 ± 0.106
3.001PheAsp: 3.001 ± 0.356
2.82PheGlu: 2.82 ± 0.346
1.699PhePhe: 1.699 ± 0.234
2.277PheGly: 2.277 ± 0.44
0.506PheHis: 0.506 ± 0.144
3.94PheIle: 3.94 ± 0.442
4.157PheLys: 4.157 ± 0.361
3.651PheLeu: 3.651 ± 0.493
1.048PheMet: 1.048 ± 0.194
3.362PheAsn: 3.362 ± 0.4
0.434PhePro: 0.434 ± 0.133
0.94PheGln: 0.94 ± 0.201
1.374PheArg: 1.374 ± 0.215
2.784PheSer: 2.784 ± 0.36
2.133PheThr: 2.133 ± 0.287
2.422PheVal: 2.422 ± 0.269
0.325PheTrp: 0.325 ± 0.098
2.422PheTyr: 2.422 ± 0.34
0.0PheXaa: 0.0 ± 0.0
Gly
2.567GlyAla: 2.567 ± 0.482
0.253GlyCys: 0.253 ± 0.092
3.579GlyAsp: 3.579 ± 0.386
2.603GlyGlu: 2.603 ± 0.31
2.603GlyPhe: 2.603 ± 0.274
3.109GlyGly: 3.109 ± 0.499
0.94GlyHis: 0.94 ± 0.216
4.157GlyIle: 4.157 ± 0.532
5.965GlyLys: 5.965 ± 0.597
5.603GlyLeu: 5.603 ± 0.514
1.735GlyMet: 1.735 ± 0.271
3.904GlyAsn: 3.904 ± 0.413
0.723GlyPro: 0.723 ± 0.181
2.024GlyGln: 2.024 ± 0.377
1.591GlyArg: 1.591 ± 0.245
3.724GlySer: 3.724 ± 0.399
3.579GlyThr: 3.579 ± 0.539
3.868GlyVal: 3.868 ± 0.366
0.904GlyTrp: 0.904 ± 0.319
3.326GlyTyr: 3.326 ± 0.351
0.0GlyXaa: 0.0 ± 0.0
His
0.759HisAla: 0.759 ± 0.163
0.289HisCys: 0.289 ± 0.107
1.301HisAsp: 1.301 ± 0.2
0.723HisGlu: 0.723 ± 0.171
0.651HisPhe: 0.651 ± 0.142
1.121HisGly: 1.121 ± 0.188
0.506HisHis: 0.506 ± 0.113
1.699HisIle: 1.699 ± 0.211
1.338HisLys: 1.338 ± 0.197
1.012HisLeu: 1.012 ± 0.176
0.325HisMet: 0.325 ± 0.096
1.085HisAsn: 1.085 ± 0.217
0.398HisPro: 0.398 ± 0.105
0.362HisGln: 0.362 ± 0.108
0.868HisArg: 0.868 ± 0.187
1.482HisSer: 1.482 ± 0.231
0.578HisThr: 0.578 ± 0.131
1.012HisVal: 1.012 ± 0.177
0.072HisTrp: 0.072 ± 0.051
0.687HisTyr: 0.687 ± 0.149
0.0HisXaa: 0.0 ± 0.0
Ile
3.398IleAla: 3.398 ± 0.4
0.506IleCys: 0.506 ± 0.13
6.109IleAsp: 6.109 ± 0.53
7.194IleGlu: 7.194 ± 0.653
2.784IlePhe: 2.784 ± 0.313
4.049IleGly: 4.049 ± 0.445
0.868IleHis: 0.868 ± 0.159
6.146IleIle: 6.146 ± 0.729
8.242IleLys: 8.242 ± 0.599
6.579IleLeu: 6.579 ± 0.627
2.205IleMet: 2.205 ± 0.309
6.254IleAsn: 6.254 ± 0.516
2.494IlePro: 2.494 ± 0.278
1.952IleGln: 1.952 ± 0.234
2.675IleArg: 2.675 ± 0.358
4.989IleSer: 4.989 ± 0.39
5.097IleThr: 5.097 ± 0.397
5.314IleVal: 5.314 ± 0.415
0.615IleTrp: 0.615 ± 0.22
3.687IleTyr: 3.687 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
4.049LysAla: 4.049 ± 0.454
0.615LysCys: 0.615 ± 0.179
7.086LysAsp: 7.086 ± 0.438
9.688LysGlu: 9.688 ± 0.67
2.675LysPhe: 2.675 ± 0.345
6.109LysGly: 6.109 ± 0.605
2.422LysHis: 2.422 ± 0.303
6.543LysIle: 6.543 ± 0.548
9.182LysLys: 9.182 ± 0.707
6.001LysLeu: 6.001 ± 0.455
3.29LysMet: 3.29 ± 0.317
7.194LysAsn: 7.194 ± 0.661
1.699LysPro: 1.699 ± 0.242
4.266LysGln: 4.266 ± 0.479
3.832LysArg: 3.832 ± 0.339
5.459LysSer: 5.459 ± 0.604
5.459LysThr: 5.459 ± 0.433
6.399LysVal: 6.399 ± 0.532
0.831LysTrp: 0.831 ± 0.186
5.17LysTyr: 5.17 ± 0.471
0.0LysXaa: 0.0 ± 0.0
Leu
3.507LeuAla: 3.507 ± 0.422
0.578LeuCys: 0.578 ± 0.154
6.037LeuAsp: 6.037 ± 0.528
7.302LeuGlu: 7.302 ± 0.603
3.181LeuPhe: 3.181 ± 0.4
4.41LeuGly: 4.41 ± 0.532
1.338LeuHis: 1.338 ± 0.214
6.073LeuIle: 6.073 ± 0.518
7.664LeuLys: 7.664 ± 0.533
6.941LeuLeu: 6.941 ± 0.614
1.88LeuMet: 1.88 ± 0.254
6.254LeuAsn: 6.254 ± 0.572
2.314LeuPro: 2.314 ± 0.364
3.47LeuGln: 3.47 ± 0.384
2.458LeuArg: 2.458 ± 0.358
5.314LeuSer: 5.314 ± 0.453
4.049LeuThr: 4.049 ± 0.397
4.374LeuVal: 4.374 ± 0.43
0.868LeuTrp: 0.868 ± 0.252
3.615LeuTyr: 3.615 ± 0.42
0.0LeuXaa: 0.0 ± 0.0
Met
1.808MetAla: 1.808 ± 0.275
0.289MetCys: 0.289 ± 0.089
1.627MetAsp: 1.627 ± 0.217
2.277MetGlu: 2.277 ± 0.27
0.868MetPhe: 0.868 ± 0.173
1.193MetGly: 1.193 ± 0.256
0.362MetHis: 0.362 ± 0.103
1.735MetIle: 1.735 ± 0.28
2.856MetLys: 2.856 ± 0.298
1.808MetLeu: 1.808 ± 0.197
0.506MetMet: 0.506 ± 0.147
2.314MetAsn: 2.314 ± 0.284
0.759MetPro: 0.759 ± 0.188
0.578MetGln: 0.578 ± 0.134
0.615MetArg: 0.615 ± 0.155
2.205MetSer: 2.205 ± 0.264
1.41MetThr: 1.41 ± 0.184
1.518MetVal: 1.518 ± 0.248
0.506MetTrp: 0.506 ± 0.124
1.048MetTyr: 1.048 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
2.567AsnAla: 2.567 ± 0.305
0.362AsnCys: 0.362 ± 0.097
5.133AsnAsp: 5.133 ± 0.461
5.423AsnGlu: 5.423 ± 0.515
2.784AsnPhe: 2.784 ± 0.315
5.423AsnGly: 5.423 ± 0.585
1.374AsnHis: 1.374 ± 0.195
6.652AsnIle: 6.652 ± 0.688
8.098AsnLys: 8.098 ± 0.511
4.7AsnLeu: 4.7 ± 0.436
1.952AsnMet: 1.952 ± 0.252
6.037AsnAsn: 6.037 ± 0.664
1.844AsnPro: 1.844 ± 0.244
1.771AsnGln: 1.771 ± 0.276
3.217AsnArg: 3.217 ± 0.33
3.687AsnSer: 3.687 ± 0.388
3.579AsnThr: 3.579 ± 0.395
3.868AsnVal: 3.868 ± 0.376
0.687AsnTrp: 0.687 ± 0.165
3.254AsnTyr: 3.254 ± 0.389
0.0AsnXaa: 0.0 ± 0.0
Pro
1.121ProAla: 1.121 ± 0.204
0.145ProCys: 0.145 ± 0.085
1.012ProAsp: 1.012 ± 0.2
2.277ProGlu: 2.277 ± 0.255
1.121ProPhe: 1.121 ± 0.235
0.651ProGly: 0.651 ± 0.191
0.362ProHis: 0.362 ± 0.103
1.844ProIle: 1.844 ± 0.248
1.41ProLys: 1.41 ± 0.227
1.554ProLeu: 1.554 ± 0.216
0.615ProMet: 0.615 ± 0.138
1.735ProAsn: 1.735 ± 0.29
0.506ProPro: 0.506 ± 0.137
0.651ProGln: 0.651 ± 0.166
0.723ProArg: 0.723 ± 0.131
1.338ProSer: 1.338 ± 0.212
1.627ProThr: 1.627 ± 0.208
1.735ProVal: 1.735 ± 0.26
0.036ProTrp: 0.036 ± 0.045
1.301ProTyr: 1.301 ± 0.211
0.0ProXaa: 0.0 ± 0.0
Gln
1.338GlnAla: 1.338 ± 0.264
0.145GlnCys: 0.145 ± 0.074
1.735GlnAsp: 1.735 ± 0.211
3.145GlnGlu: 3.145 ± 0.331
1.518GlnPhe: 1.518 ± 0.256
1.771GlnGly: 1.771 ± 0.325
0.506GlnHis: 0.506 ± 0.112
2.458GlnIle: 2.458 ± 0.277
3.29GlnLys: 3.29 ± 0.35
2.856GlnLeu: 2.856 ± 0.432
0.94GlnMet: 0.94 ± 0.201
2.205GlnAsn: 2.205 ± 0.252
0.398GlnPro: 0.398 ± 0.131
1.41GlnGln: 1.41 ± 0.245
1.554GlnArg: 1.554 ± 0.264
1.916GlnSer: 1.916 ± 0.24
1.482GlnThr: 1.482 ± 0.279
1.446GlnVal: 1.446 ± 0.263
0.578GlnTrp: 0.578 ± 0.181
1.374GlnTyr: 1.374 ± 0.248
0.0GlnXaa: 0.0 ± 0.0
Arg
1.663ArgAla: 1.663 ± 0.204
0.253ArgCys: 0.253 ± 0.124
2.711ArgAsp: 2.711 ± 0.293
2.639ArgGlu: 2.639 ± 0.349
1.41ArgPhe: 1.41 ± 0.207
1.627ArgGly: 1.627 ± 0.249
0.759ArgHis: 0.759 ± 0.13
2.82ArgIle: 2.82 ± 0.317
3.29ArgLys: 3.29 ± 0.419
2.675ArgLeu: 2.675 ± 0.35
0.831ArgMet: 0.831 ± 0.189
2.964ArgAsn: 2.964 ± 0.315
0.795ArgPro: 0.795 ± 0.173
1.338ArgGln: 1.338 ± 0.217
1.374ArgArg: 1.374 ± 0.226
1.735ArgSer: 1.735 ± 0.334
1.771ArgThr: 1.771 ± 0.26
1.88ArgVal: 1.88 ± 0.227
0.398ArgTrp: 0.398 ± 0.128
2.061ArgTyr: 2.061 ± 0.238
0.0ArgXaa: 0.0 ± 0.0
Ser
2.747SerAla: 2.747 ± 0.324
0.362SerCys: 0.362 ± 0.149
4.772SerAsp: 4.772 ± 0.408
4.193SerGlu: 4.193 ± 0.351
3.217SerPhe: 3.217 ± 0.289
3.724SerGly: 3.724 ± 0.452
0.831SerHis: 0.831 ± 0.165
4.989SerIle: 4.989 ± 0.358
5.242SerLys: 5.242 ± 0.44
5.097SerLeu: 5.097 ± 0.413
1.301SerMet: 1.301 ± 0.231
4.302SerAsn: 4.302 ± 0.32
1.121SerPro: 1.121 ± 0.174
2.241SerGln: 2.241 ± 0.311
1.88SerArg: 1.88 ± 0.226
4.013SerSer: 4.013 ± 0.484
3.724SerThr: 3.724 ± 0.364
4.013SerVal: 4.013 ± 0.424
0.687SerTrp: 0.687 ± 0.155
3.073SerTyr: 3.073 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
2.603ThrAla: 2.603 ± 0.349
0.362ThrCys: 0.362 ± 0.129
3.832ThrAsp: 3.832 ± 0.4
4.338ThrGlu: 4.338 ± 0.375
2.277ThrPhe: 2.277 ± 0.289
3.651ThrGly: 3.651 ± 0.62
1.121ThrHis: 1.121 ± 0.187
4.808ThrIle: 4.808 ± 0.424
4.519ThrLys: 4.519 ± 0.397
4.808ThrLeu: 4.808 ± 0.395
0.759ThrMet: 0.759 ± 0.156
2.747ThrAsn: 2.747 ± 0.298
1.808ThrPro: 1.808 ± 0.235
1.916ThrGln: 1.916 ± 0.31
1.808ThrArg: 1.808 ± 0.247
3.217ThrSer: 3.217 ± 0.461
3.362ThrThr: 3.362 ± 0.318
3.724ThrVal: 3.724 ± 0.522
0.398ThrTrp: 0.398 ± 0.102
2.567ThrTyr: 2.567 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
3.398ValAla: 3.398 ± 0.365
0.506ValCys: 0.506 ± 0.167
4.519ValAsp: 4.519 ± 0.338
4.555ValGlu: 4.555 ± 0.492
2.314ValPhe: 2.314 ± 0.295
2.964ValGly: 2.964 ± 0.355
0.831ValHis: 0.831 ± 0.137
4.663ValIle: 4.663 ± 0.417
6.652ValLys: 6.652 ± 0.634
4.591ValLeu: 4.591 ± 0.441
1.374ValMet: 1.374 ± 0.202
4.772ValAsn: 4.772 ± 0.444
1.446ValPro: 1.446 ± 0.236
1.699ValGln: 1.699 ± 0.207
1.952ValArg: 1.952 ± 0.217
4.41ValSer: 4.41 ± 0.372
2.928ValThr: 2.928 ± 0.323
3.326ValVal: 3.326 ± 0.309
0.542ValTrp: 0.542 ± 0.154
2.639ValTyr: 2.639 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.398TrpAla: 0.398 ± 0.132
0.108TrpCys: 0.108 ± 0.063
0.687TrpAsp: 0.687 ± 0.195
0.868TrpGlu: 0.868 ± 0.184
0.615TrpPhe: 0.615 ± 0.273
0.578TrpGly: 0.578 ± 0.148
0.145TrpHis: 0.145 ± 0.068
0.687TrpIle: 0.687 ± 0.159
1.157TrpLys: 1.157 ± 0.306
0.904TrpLeu: 0.904 ± 0.214
0.072TrpMet: 0.072 ± 0.047
0.831TrpAsn: 0.831 ± 0.184
0.0TrpPro: 0.0 ± 0.0
0.325TrpGln: 0.325 ± 0.113
0.47TrpArg: 0.47 ± 0.126
0.615TrpSer: 0.615 ± 0.197
0.578TrpThr: 0.578 ± 0.169
0.759TrpVal: 0.759 ± 0.143
0.108TrpTrp: 0.108 ± 0.065
0.615TrpTyr: 0.615 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.808TyrAla: 1.808 ± 0.217
0.651TyrCys: 0.651 ± 0.15
3.687TyrAsp: 3.687 ± 0.37
3.434TyrGlu: 3.434 ± 0.334
2.603TyrPhe: 2.603 ± 0.313
3.001TyrGly: 3.001 ± 0.403
0.759TyrHis: 0.759 ± 0.144
4.302TyrIle: 4.302 ± 0.475
4.7TyrLys: 4.7 ± 0.469
4.519TyrLeu: 4.519 ± 0.481
1.012TyrMet: 1.012 ± 0.188
3.615TyrAsn: 3.615 ± 0.368
1.193TyrPro: 1.193 ± 0.223
1.229TyrGln: 1.229 ± 0.2
1.518TyrArg: 1.518 ± 0.257
3.687TyrSer: 3.687 ± 0.397
2.531TyrThr: 2.531 ± 0.335
2.603TyrVal: 2.603 ± 0.325
0.578TyrTrp: 0.578 ± 0.184
2.531TyrTyr: 2.531 ± 0.363
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 142 proteins (27663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski